Lecture Notes 6

Econ 29000, Principles of Statistics

Kevin R Foster, CCNY

Fall 2011

Wrap-up and Review: Calculating Area under Normal Distribution

It might be useful, when thinking about Normal distributions, to consider much of the work as just a change of measurement units along the x-axis. A Standard Normal (with zero mean and standard deviation of one) can be thought of as just a change of units away from a Normal (with some non-zero mean and standard deviation other than one).

For example, the question, " For a Normal Distribution with mean 4 and standard deviation 6, what is area to the left of 1?" We can graph this as:

or it can be equivalently graphed with a change of units, by subtracting the mean (4) and dividing by the standard deviation (6):

It's the same area under either distribution – it doesn't matter whether we use the red units or the blue units; we get the same answer. The blue units are "standardized" while the red are not; for the blue units we use a standard normal distribution while for the red units we use a normal distribution with a different mean and standard deviation.

Examples of Standard Normal Calculations

Just Pictures

Find:

Given:

=0.79

Or, given

=0.29

Or, given

=0.21

Find

Given

=0.36

Or, given

=0.86

Or, given

=0.14

1. A random variable is distributed as a standard normal. Sketch the PDF in each case. Find the Z-statistics.

a. What is the probability that we could observe a value as far or farther than 1.3?

b. What is the probability that we could observe a value nearer than 1.8?

c. What value would leave 10% of the probability in the right-hand tail?

d. What value would leave 25% in both the tails (together)?

A random variable is distributed with mean of 7 and standard deviation of 9.

e. What is the probability that we could observe a value lower than 6?

f. What is the probability that we could observe a value higher than 12?

g. What is the probability that we'd observe a value between 6.5 and 7.5?

h. What is the probability that the standardized value lies between 0.5 and -0.5?

(these questions were on previous year's exams.)

a. What is the probability that we could observe a value as far or farther than 1.3? Sketch this picture:

emetrics_m_fig4

Using Excel, we can find the area to the left of -1.3, this is NORMSDIST(-1.3):

emetrics_m_fig4

Which is 0.0968. To find the area of both tails we just multiply by two since they're symmetric so the total area is 0.1936.

Could instead solve this by finding NORMSDIST(1.3) which is the area to the left of 1.3, so:

emetrics_m_fig4

Which is 0.9032. So subtract this from 1 to find the remaining area in the right-hand tail, which is 0.0968.

emetrics_m_fig4

Then multiply by two to find the area of both tails – getting the exact same answer as before.

A standard normal table might give the area of the right-hand tail.

b. What is the probability that we could observe a value nearer than 1.8?

Sketch as:

emetrics_m_fig4

Again using Excel gives NORMSDIST(-1.8) = 0.03593, which is this area:

emetrics_m_fig4

So multiply by 2 to find the area of both tails,

emetrics_m_fig4

Then subtract this combined area from 1 to find the remaining middle area: 1 – 0.03593 – 0.03593 = 0.9281.

c. What value would leave 10% of the probability in the right-hand tail?

Want to find some Z such that P{X>Z} = 0.10. Use the inverse function on Excel, NORMSINV(0.10) to find that this is -1.28.

Z

emetrics_m_fig4

But why a negative number? Again, because Excel is giving the area to the left; but the standard normal is symmetric so Z=1.28 gives the answer.

d. What value would leave 25% in both the tails (together)?

Now we want to find Z that leaves 25% in each tail; i.e. 12.5% or 0.125. Again use NORMSINV(0.125) = -1.15, so the area less than -1.15 and greater than +1.15 is 0.25.

-Z

+Z

emetrics_m_fig4

e. What is the probability that we could observe a value lower than 6?

This is for a Normal distribution with mean 7 and standard deviation of 9, not a standard normal (mean zero and stdev 1). We need to work back-and-forth between the two distributions, finding a way to "translate" between horizontal axes.

emetrics_m_fig4

First note how we could re-label the Standard Normal axis is the picture above, if we think of the numerals as representing standard deviations from the mean. So the mean is seven; translate zero as seven:

How do we fill in the rest? Each standard deviation has size 9, so 1 standard deviation above the mean is 7 + 9 = 16.

Then one standard deviation less than the mean is 7 – 9 = -2.

Then fill in above and below to get:

So to find the area below 6 is:

emetrics_m_fig4

The formula for this back-and-forth was given in previous lectures. If X has a Normal Distribution with mean µ and standard deviation σ, then Z has a Standard Normal distribution with mean of 0 and standard deviation of 1, where (re-write as , which is the formula used to "translate" the horizontal axis above).

So the area above, to the left of 6 (measured in X-units) is equivalently the area to the left of . Look this up with Excel: NORMSDIST(-1/9) = 0.4558. This seems about right since we know that the area to the left of zero is 0.5; this is just a bit to the left so it's just a bit less than 50%.

f. What is the probability that we could observe a value higher than 12?

-29 -20 -11 -2 7 16 25 34 43

emetrics_m_fig4

So translate 12 as equivalent to ; lookup in Excel as NORMSDIST(5/9) = 0.7107.

g. What is the probability that we'd observe a value between 6.5 and 7.5?

This is the little slice:

-29 -20 -11 -2 7 16 25 34 43

emetrics_m_fig4

Between 6.5, which translates as and 7.5 which is . We can calculate NORMSDIST(-1/18) = 0.4778; this means that the area between -1/18 and zero must be 0.5 – 0.4778 = 0.0222, so the symmetric area from zero to +1/18 is the same size so the total area must be 0.0443.

h. What is the probability that the standardized value lies between 0.5 and -0.5?

This doesn't require translation since it's already translated to standardized values; 0.5 standard deviations from 7 is 7 ± 4.5 so 2.5 and 11.5. The probability to the left of- 0.5 is NORMSDIST(-0.5) = 0.3085 so the area from -0.5 to zero must be 0.5 – 0.3085 = 0.1915. Twice this area is 0.3829.