logo

... sleep.


A university in Italy was doing research on the effect of sleep on programming performance. The question is, where do we draw the line? When is the difference in performance big enough that you can't say that it is due to chance? Also, can't we explain the effect with gpa?


Episode Notes

You can download this dataset by download the file here. You can also fetch it from the command line by running;

wget https://calmcode.io/datasets/sleep_deprived_coding.csv

The original research paper of the dataset can be found here. Feel free to check the datasets portion of the website to see some details about the dataset.

With that out of the way. Let's write our first summary for this analysis.

import numpy as np
import pandas as pd
import matplotlib.pylab as plt
df = pd.read_csv("/<path>/sleep_deprived_coding.csv")

(df
  .groupby('sleep')
  .agg(n=('id', 'count'),
      mean_unit_tests=('passed_unit_tests', np.mean),
      mean_asserts=('passed_asserts', np.mean),
      mean_user_stories=('tackled_user_stories', np.mean),))

Feedback? See an issue? Feel free to mention it here.

If you want to be kept up to date, consider getting the newsletter.