Calmcode - smoking: the dataset

The Dataset


It is easier than you might think to fool yourself with data. It is quantified, so there is less bias, right? This series of videos walks through an analysis in pandas that demonstrates why this might not be true.

This dataset is listed in the datasets section of the calmcode website. You can also download it directly.

We assume that the file is saved in the Downloads folder on a Mac. If you save it somewhere else, change the path in the code to match.

import pandas as pd
import matplotlib.pyplot as plt

df = pd.read_csv("~/Downloads/smoking.csv")
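If the file lives somewhere other than the Downloads folder, it can help to check the path before reading it. The snippet below is a minimal sketch of that check, not part of the original lesson; the helper name `resolve_csv` is hypothetical and only the standard library is used.

```python
from pathlib import Path


def resolve_csv(path_str):
    """Expand '~' and confirm the file exists (hypothetical helper, not from the lesson)."""
    # Path.expanduser() turns "~/Downloads/..." into an absolute path in the home folder.
    path = Path(path_str).expanduser()
    if not path.is_file():
        # Fail early with a message that points at the likely cause.
        raise FileNotFoundError(
            f"No file found at {path}. Change the path to wherever you saved the CSV."
        )
    return path
```

Assuming the download sits in the default location, the result can be passed straight to pandas, for example `pd.read_csv(resolve_csv("~/Downloads/smoking.csv"))`.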