Introduction
In this lab you will explore the normal distribution.
Normal distribution
The normal distribution is used to describe continuous data. It is a two parameter distribution defined by the mean (\(\mu\)) and standard deviation (\(\sigma\)). Every aspect of the curve is described using the following notation (\(N(\mu,~\sigma)\)) where the mean (\(\mu\)) can be any real number and the standard deviation (\(\sigma\)) any value greater then 0. In the exercise you will get to see the impact the mean and standard deviation have on the shape of the distribution curve. Also unlike the last lab the normal distribution is visualized with a line instead of vertical lines. This is because unlike the discrete distributions values between the integers are possible.
Exercise 1: Visualizing the normal distribution
Instructions The plot below shows three normal curves. Use the sliders to change the means and standard deviations to see the impact it has on the shape and position of the curve. Once you get a feel for whats happening, use the interactive plot to answer the quiz question. (You will have to adjust the X axis range if the mean is set far away from 0)
Parameters for The Green Distribution
Parameters for The Blue Distribution
Parameters for The Red Distribution
Quiz: Questions 1-4
Normal probability density function
There are an endless number of possible curves, but the one we will discuss the most in class is the “standard normal”. The standard normal distribution has a mean of 0 and a standard deviation of 1 (\(N(\mu=0,~\sigma=1)\)). The equation for the general pdf for a normal distribution is a little intimidating, but since the standard normal distribution is the simplest case the pdf simplifies to \(P(x) = \frac{1}{{\sqrt {2\pi } }}e^{ - \frac{{1}}{2}x^2}\). With modern computers and calculators the need to use the simplest case is not a major concern. In the exercise below you will use an interactive graph to find probabilities. The graph shades the area under the curve associated with the probability in red. The probability is shown on the plot in blue text.
Exercise 2: Calculating Probabilities
Instructions: The graph doesn’t seem to change but pay attention to the x axis as you change the mean or standard deviation. To answer the quiz questions select “Calculate Probability” from the drop down menu.
How to calculate the different probabilities:
- To calculate \(P(X<x)\) set the upper bound to \(x\).
- For example with \(N(\mu=150,\sigma=15)\) the \(P(X<120)=0.0228\) set the upper bound is set to \(120\) (The compliment probability is also shown).
- For example with \(N(\mu=150,\sigma=15)\) the \(P(X<120)=0.0228\) set the upper bound is set to \(120\) (The compliment probability is also shown).
- To calculate \(P(X>x)\) set the lower bound to \(x\).
- For example with \(N(\mu=150,\sigma=15)\) the \(P(X>120)=0.0228\) set the lower bound is set to \(120\) (The compliment probability is also shown).
- For example with \(N(\mu=150,\sigma=15)\) the \(P(X>120)=0.0228\) set the lower bound is set to \(120\) (The compliment probability is also shown).
- To calculate \(P(x1 < X < x2)\) set the lower bound to \(x1\) and the upper bound to \(x2\).
- For example with \(N(\mu=150,\sigma=15)\) the \(P(120 < X < 140)=0.2297\) set the lower bound to \(120\) and the upper bound to \(140\).
Set the Distribution Parameters
Set the lower and upper bound for the probability you want to calculate. You can check if you are doing it right using the example solutions in the instructions.
This returns the value for the specified percentile.
Quiz: Questions 5-8
Scenario 1: A ski gondola in Vail, Colorado carries skiers to the top of a mountain. It bears a plaque stating that the maximum capacity is 12 people or 2004 pounds. That capacity will be exceeded if 12 people have weights with a mean greater than \(\frac{2004}{12}=167lbs\). Because men tend to weigh more than women, a “worst case” scenario involved 12 passengers who are all men. Men have weights that are normally distributed with a mean of 172 lbs and a standard deviation of 29 lbs.
Scenario 2: Weights of newborn babies in the U.S. are normally distributed with a mean of 3420g and a standard deviation of 495g. A newborn weighing less than 2200g is considered to be at risk.
Scenario 3: It was found that the mean length of 100 parts produced by a lathe was 20.05 mm with a standard deviation of 0.2mm.
Inverse normal probability function
Finding the probability of randomly selecting a value from a distribution is pretty fun, but sometimes we want to know the value associated with a certain probability. For example among the how tall does a person have to be to be taller than 90% of other people. To find that value we do the inverse of what we did to calculate a probability. Instead of specifying a lower or upper bound we specify a probability (percentile) and find the value that corresponds.
Exercise 3: Using the inverse probability function
Instructions To answer the quiz questions select “Calculate Percentiles” from the drop down menu. (Hint: if you are having a hard time setting specific values with the slider. Click the slider and then use the arrow keys to increase or decrease the value)
Finding the \(x\) values that correspond to percentiles:
- To to find \(x\) such that \(P(X<x)=0.25\) set the percentile to \(0.25\).
- For example with \(N(\mu=0,~\sigma=1)\) the \(x\) such that \(P(X<x)=0.25\) is \(-0.6745\).
Set the Distribution Parameters
Set the lower and upper bound for the probability you want to calculate. You can check if you are doing it right using the example solutions in the instructions.
This returns the value for the specified percentile.
Quiz: Questions 9-11
Scenario 1: A ski gondola in Vail, Colorado carries skiers to the top of a mountain. It bears a plaque stating that the maximum capacity is 12 people or 2004 pounds. That capacity will be exceeded if 12 people have weights with a mean greater than \(\frac{2004}{12}=167lbs\). Because men tend to weigh more than women, a “worst case” scenario involved 12 passengers who are all men. Men have weights that are normally distributed with a mean of 172 lbs and a standard deviation of 29 lbs.
Scenario 2: Weights of newborn babies in the U.S. are normally distributed with a mean of 3420g and a standard deviation of 495g. A newborn weighing less than 2200g is considered to be at risk.
Scenario 3: It was found that the mean length of 100 parts produced by a lathe was 20.05 mm with a standard deviation of 0.2mm.
Summary
In this lab, you completed 3 exercises and answered 11 quiz questions.
The lab covered 3 topics:
- The normal distribution
- Normal probability function
- Inverse normal probability function
You are done with lab! Don’t worry if you are sad it is over you can always review one of the previous labs! Don’t forget to record your answers and take the eLC quiz to get credit