import pandas as pd
Lab 2 grades out. Good overall! Some general comments:
df.resample()
Week 3 survey - themes:
df = pd.DataFrame(index=range(1,6))
df["Surprised?"] = [2, 15, 10, 1, 0]
df["Surprised?"].plot.bar()
<AxesSubplot:>
df["Creeped out?"] = [5, 6, 11, 3, 1]
df["Creeped out?"].plot.bar()
<AxesSubplot:>
df["Accurate?"] = [0, 1, 4, 23, 2]
df["Accurate?"].plot.bar()
<AxesSubplot:>
Discuss any particularly notable findings that pertain to the questions you were asked to write about:
- How did your data compare to what you expected? Was there anything surprising, or creepy, or just plain strange? Describe the types of data that you see.
- How comprehensive was your download? Are you able to determine whether the company gave you everything they had, or were they more selective?
- What kinds of data science questions could someone answer about you based solely on this data? What kinds of data science questions could someone with access to millions of records like yours answer?
- Are you comfortable with the extent and/or accuracy of data collected? Does the company have controls for opting out of collection of the sorts of data you’d rather they not have? If not - or if the company suddenly decided tomorrow to remove those controls - what should our society do about this?
Were your reactions similar or different? Was this due to your attitudes, or due to differences in the data?
Decide on one most interesting thing (from any of the above discussion) to share with the class. The person who woke up latest this morning will be your group's spokesperson.