DATA 311 - Lecture 20: Baselines Worksheet

Names:

Baselines

For each of the following prediction scenarios, come up with the strongest baseline you can think of that does not require any machine learning.

  1. Predict whether an email message is spam. The training data contains equal numbers of spam (positive) and non-spam (negative) examples.

  2. Predict whether an MRI scan shows a tumor. The training data contains 90% non-tumor images (negative examples) and 10% tumor images (positive examples).

  3. Given all weather measurements from today and earlier, predict whether it will rain tomorrow.

  4. For the NHANES body measurement dataset, predict a person’s leg length given their height.
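
Once you have a candidate baseline, it helps to score it the same way you would score a model. The sketch below is one possible way to do that for a constant prediction, using accuracy for classification and mean absolute error for regression. All data values are made up for illustration (they are not from NHANES or any real dataset), and the helper names `accuracy`, `majority_class`, and `mean_absolute_error` are hypothetical. A constant prediction is only one candidate and may not be the strongest baseline for a given scenario.

```python
from collections import Counter
from statistics import mean


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def majority_class(y_train):
    """Most frequent label in the training labels."""
    return Counter(y_train).most_common(1)[0][0]


def mean_absolute_error(y_true, y_pred):
    """Average absolute difference between targets and predictions."""
    return mean(abs(t - p) for t, p in zip(y_true, y_pred))


# Classification: made-up labels with a 90/10 split (0 = negative, 1 = positive).
y_train = [0] * 90 + [1] * 10
y_test = [0] * 18 + [1] * 2

# Always predict the most common training label, ignoring the input entirely.
constant_label = majority_class(y_train)
baseline_accuracy = accuracy(y_test, [constant_label] * len(y_test))

# Regression: made-up target values; always predict the training mean.
targets_train = [80.0, 90.0, 100.0]
targets_test = [85.0, 95.0]

constant_value = mean(targets_train)
baseline_mae = mean_absolute_error(targets_test, [constant_value] * len(targets_test))

print(f"constant label: {constant_label}, accuracy: {baseline_accuracy:.2f}")
print(f"constant value: {constant_value:.1f}, MAE: {baseline_mae:.1f}")
```

Any rule that maps inputs to predictions can be scored with the same helpers, so you can compare several hand-written rules before deciding which one is strongest.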