A university in Italy was doing research on the effect of sleep on programming performance. The question is, where do we draw the line? When is the difference in performance big enough that you can't say that it is due to chance? Also, can't we explain the effect with gpa?
You can download this dataset by download the file here. You can also fetch it from the command line by running;
With that out of the way. Let's write our first summary for this analysis.
import numpy as np import pandas as pd import matplotlib.pylab as plt df = pd.read_csv("/<path>/sleep_deprived_coding.csv") (df .groupby('sleep') .agg(n=('id', 'count'), mean_unit_tests=('passed_unit_tests', np.mean), mean_asserts=('passed_asserts', np.mean), mean_user_stories=('tackled_user_stories', np.mean),))
Feedback? See an issue? Feel free to mention it here.
If you want to be kept up to date, consider getting the newsletter.