logo


smoking


<p>It is easier that you might think to fool yourself with data. It is quantified so there is less bias right? This series of videos shows you an analysis using <a href="https://pandas.pydata.org/pandas-docs/stable/">pandas</a> that demonstrates why this might not be true.</p>


1 - The Dataset
2 - Cleaning
3 - Smoking is Good
4 - What about Age
5 - Smoking is Bad
6 - Quantifying the Effect
7 - Wrapping Up

This dataset is listed in the datasets portion of our website. You can also download it directly here.

We assume that it is saved in a downloads folder on a mac. Double check that you change the code if you save it somewhere else.

import pandas as pd
import matplotlib.pylab as plt

df = pd.read_csv("~/Downloads/smoking.csv")