Lecture 14 / Lab 4 - Z Scores; Space-Time Cube Slicing

Announcements:

Goals

Z-Scores and Normalization

(a brief topic I ran out of time for on Tuesday)

In the NHANES dataset, heights and other length measurements are given in centimeters. I don't have intuition for what counts as a normal height in centimeters - if you're 160 cm tall, are you short? Tall? Average? One thing I could do is convert to feet and inches, which I do have intuition for. But sometimes none of the available units are intuitive.

To compute a $z$-score:

  1. Subtract the mean
  2. Divide by the standard deviation.
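
In symbols, the two steps combine into a single formula. The notation here is my own assumption, not something fixed by the dataset: $x$ is a raw data value, $\mu$ is the mean of the data, and $\sigma$ is its standard deviation.

$$
z = \frac{x - \mu}{\sigma}
$$

A value exactly at the mean gets $z = 0$; a value one standard deviation above the mean gets $z = 1$.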

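Now instead of the raw data value, you have an interpretable measure of how far each point is from the mean, in units of standard deviations: positive means above the mean, negative means below. If you have an approximately-Gaussian distribution, you also have a good idea of how unusual that point is! Roughly 68% of values fall within 1 standard deviation of the mean and roughly 95% fall within 2.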
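
Here is a minimal sketch of the computation in plain Python. The heights are made-up numbers for illustration, not NHANES values, and the function name `z_scores` is hypothetical. It also assumes the population standard deviation (`statistics.pstdev`); swap in `statistics.stdev` if you want the sample version.

```python
import statistics


def z_scores(values):
    """Return the z-score of each value: (value - mean) / standard deviation."""
    mean = statistics.mean(values)
    # Assumption: population standard deviation. Use statistics.stdev
    # instead for the sample standard deviation.
    std = statistics.pstdev(values)
    return [(v - mean) / std for v in values]


# Hypothetical heights in centimeters (illustrative only, not NHANES data).
heights_cm = [150.0, 160.0, 165.0, 170.0, 180.0]

for height, z in zip(heights_cm, z_scores(heights_cm)):
    print(f"{height:.0f} cm -> z = {z:+.2f}")
```

In this toy sample the mean is 165 cm and the standard deviation is 10 cm, so 160 cm comes out at $z = -0.5$: half a standard deviation below the mean, which is not very unusual.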