A continuous random variable XX has the normal distribution with mean μμ and variance σ2σ2 if the pdf of XX is
f(x)=1√2πσ2e−(x−μ)22σ2f(x)=1√2πσ2e−(x−μ)22σ2
We say XX has the Normal(μ,σ2)(μ,σ2) distribution.
There is no closed from cdf for a normal distribution.
Claim: The function f(x)=1√2πσ2e−(x−μ)22σ2f(x)=1√2πσ2e−(x−μ)22σ2 is a pdf.
▼ Proof:
Since f(x)>0f(x)>0 for all x,x, the only thing that needs to be shown is that ∫∞−∞f(x)dx=1.∫∞−∞f(x)dx=1. First, substitute x−μx−μ for xx to eliminate the μμ in the exponent.
∫∞−∞1√2πσ2e−(x−μ)22σ2dx=∫∞−∞1√2σ2e−x22σ2dx∫∞−∞1√2πσ2e−(x−μ)22σ2dx=∫∞−∞1√2σ2e−x22σ2dx
It is easier to compute the square of the integral rather than the integral itself.
(∫∞−∞1√2πσ2e−x22σ2dx)2=(∫∞−∞1√2πσ2e−x22σ2dx)(∫∞−∞1√2πσ2e−x22σ2dx)=(∫∞−∞1√2πσ2e−x22σ2dx)(∫∞−∞1√2πσ2e−y22σ2dy)=12πσ2∫∞−∞∫∞−∞e−x22σ2e−y22σ2dxdy=12πσ2∫∞−∞∫∞−∞e−x2+y22σ2dxdy(∫∞−∞1√2πσ2e−x22σ2dx)2=(∫∞−∞1√2πσ2e−x22σ2dx)(∫∞−∞1√2πσ2e−x22σ2dx)=(∫∞−∞1√2πσ2e−x22σ2dx)(∫∞−∞1√2πσ2e−y22σ2dy)=12πσ2∫∞−∞∫∞−∞e−x22σ2e−y22σ2dxdy=12πσ2∫∞−∞∫∞−∞e−x2+y22σ2dxdy
Now use polar coordinates. Substitute r2r2 for x2+y2x2+y2 and rdrdθrdrdθ for dxdy.dxdy.12πσ2∫∞−∞∫∞−∞e−x2+y22σ2dxdy=12πσ2∫2π0∫∞0e−r22σ2rdrdθ=12πσ2∫2π0∫∞0re−r22σ2drdθ12πσ2∫∞−∞∫∞−∞e−x2+y22σ2dxdy=12πσ2∫2π0∫∞0e−r22σ2rdrdθ=12πσ2∫2π0∫∞0re−r22σ2drdθ
Substitute u=r2, so du=2rdr.12πσ2∫2π0∫∞0re−r22σ2drdθ=12πσ2∫2π0∫∞012e−u2σ2dudθ=12πσ2∫2π0−2σ22e−u2σ2|∞u=0dθ=12πσ2∫2π0σ2dθ=12πσ22πσ2=1
Since (∫∞−∞f(x)dx)2=1 and f(x)>0,∫∞−∞f(x)=1.
Claim: The expected value of X is E[X]=μ.
▼ Proof:
The result can be found by direct computation.
E[X]=∫∞−∞x√2πσ2e−(x−μ)22σ2dx=∫∞−∞y+μ√2πσ2e−y22σ2dy
where the last line follows by substituting y+μ for x. Splitting the integral over the y+μ we get
∫∞−∞y+μ√2πσ2e−y22σ2dy=∫∞−∞y√2πσ2e−y22σ2dy+∫∞−∞μ√2πσ2e−y22σ2dy=∫∞−∞y√2πσ2e−y22σ2dy+μ∫∞−∞1√2πσ2e−y22σ2dy
Now we compute each integral in turn. The left integral is
∫∞−∞y√2πσ2e−y22σ2dy=∫0−∞y√2πσ2e−y22σ2dy+∫∞0y√2πσ2e−y22σ2dy=−∫∞0y√2πσ2e−y22σ2dy+∫∞0y√2πσ2e−y22σ2dy=0
where the second line follows from a substitution of −y for y.
The second integral is an integral of a pdf of a normal random variable, so it is equal to 1.μ∫∞−∞1√2πσ2e−y22σ2dy=μ⋅1=μ
Claim: The variance of X is Var(X)=σ2.
The proof of this fact is reserved for later.
Standard Normal
A standard normal random variable has mean 0 and variance 1. The pdf for a standard normal random variable is
f(x)=1√2πe−x22
Claim: A normal random variable X with mean μ and standard deviation σ can be transformed to a standard normal random variable by subtracting the mean and dividing the standard deviation:
X−μσ
▼ Proof:
Let Y=X−μσ, and let FX(t) and FY(t) be the cdf's of X and Y, respectively. By computation, we can write FY(t) in terms of FX(t).FY(t)=P(X−μσ≤t)=P(X≤σt+μ)=FX(σt+μ)
Take the derivative of both sides to convert the cdf's to pdf's.
ddtFY(t)=ddtFX(σt+μ)⇒fY(t)=σfX(σt+μ)
So, the pdf of Y is
fY(t)=σ⋅1√2πσ2e−((σt+μ)−μ)22σ2=1√2πe−t22
which shows Y, or X−μσ, has the standard normal distribution.
Bell Curve
Normal random variables take the shape of a bell curve. The highest point on the curve is the mean. The curve is always symmetric about the mean. The curve will be narrow or wide depending on the standard deviation.
This graph shows the pdf of a standard normal random variable.
This graph shows the pdf of a Normal(0,2) random variable. The standard normal is in gray for reference. The higher variance makes the graph wider and flatter.
This graph shows the pdf of a Normal(1,1) random variable. The standard normal is in gray for reference. Raising the mean shifts the graph to the right.
Standard Normal Table
You can refer to the standard normal table when finding probabilities of normally distributed random variables.
▼ Standard Normal Table:
Example: Let X be normally distributed with mean 3 and standard deviation 2. Find the following:
P(X<4)
P(X>2)?
P(X<0)
Solution: For each case, transform X into a standard normal random variable by subtracting the mean and dividing by the standard deviation. Let Z be a standard normal random variable.
First, standardize the random variable.
P(X<4)=P(X−32<0.5)=P(Z<0.5)
In the table, look for the 0.5 and the .00 column, since added together this is 0.50. The number found there is 0.69146. Therefore,
P(X<4)≈0.69146
Standardize the normal variable.
P(X>2)=P(X−32>−0.5)=P(Z>−0.5)
By symmetry of the distribution, P(Z>−0.5)=P(Z≤0.5). Since Z is continuous, this is the same as P(Z<0.5). In part 1, we found P(Z<0.5)≈0.69146. So, P(X>2)≈0.69146.
Standardize the normal variable.
P(X<0)=P(X−32<−1.5)=P(Z<−1.5)
The standard normal table only shows the probability that Z is less than a positive value, so we need to rearrange the inequality.
By symmetry, P(Z<−1.5)=P(Z>1.5). Using the fact that P(A)=1−P(AC),P(Z>1.5)=1−P(Z≤1.5)
Since Z is continuous, P(Z≤1.5)=P(Z<1.5). This probability can be found in row 1.5 and column .00, since 1.5+.00=1.50. The number there is 0.93319. Therefore,
P(X<0)≈1−0.93319=0.06681
1. Let Z be a standard normal random variable. Find P(Z>0).
Unanswered
Solution: By symmetry of the standard normal, P(Z>0)=1−P(Z<0)=0.5.
2. Let X have the Normal(−2,4) distribution. Find P(X>0).
Unanswered
Solution: First standardize X, then use the table for the standard normal distribution.
P(X>0)=P(X+2>2)=P(X+22>1)=P(Z>1)
where Z is a standard normal random variable. To use the table, taking the complement of {Z>1}.P(Z>1)=1−P(Z≤1)=1−0.84134≈0.16
3. Let X and Y be normally distributed with mean 0. If Var(X)>Var(Y), which is true?
Unanswered
Solution: Let X have variance σ2 and Y have variance τ2 where σ2<τ2. The pdf of X is fX(x)=1√2πσ2e−x22σ2 and the pdf of Y is fY(y)=1√2πτ2e−y2τ2. Since X and Y are independent, their joint pdf is f(x,y)=12πστe−(x−μ)22σ2−(y−μ)22τ2. Using the joint pdf, we have
P(X<Y)=∫∞−∞∫∞y12πστe−x22σ2−y22τ2dxdy
Substituting −x for x and −y for y we get
∫∞−∞∫∞y12πστe−x22σ2−y22τ2dxdy=∫∞−∞∫y−∞12πστe−x22σ2−y22τ2dxdy=P(Y<X)
To see why the bounds on the integral are what they are, notice that the condition for the region we are integrating over changes from x<y to −x<−y, or x>y.
So, P(X<Y)=P(Y<X). Since X and Y are continuous, P(X=Y)=0, so P(X>Y)=0.5.
4. Let X be normally distributed with mean 1 and variance 2. What is P(0<X<2)?
Unanswered
Solution: First, normalize X.P(0<X<2)=P(−1√2<X−1√2<1√2)=P(−1√2<Z<1√2)
where Z is a standard normal distribution.
P(−1√2<Z<1√2)=P(Z<1√2)−P(Z≤−1√2)=P(Z<1√2)−(1−P(Z≤1√2))=2P(Z<1√2)−1≈2P(Z<0.71)−1
Using the standard normal distribution table, we get
2P(Z<0.71)−1≈2⋅0.76115−1≈0.52