Whenever a random experiment is replicated, the Random Variable that equals the average (or total) result over the replicates tends to have a normal distribution as the number of replicates becomes large.
It is one of the cornerstones of probability theory and statistics, because of the role it plays in the Central Limit Theorem, and because many real-world phenomena involve random quantities that are approximately normal (e.g., errors in scientific measurement).
It is also known by other names such as: Gaussian Distribution, Bell shaped Distribution.
It can be observed from the above graph that the distribution is symmetric about its center, which is also the mean (0 in this case). This makes the probability of events at equal deviations from the mean, equally probable. The density is highly centered around the mean, which translates to lower probabilities for values away from the mean.
The probability density function of the general normal distribution is given as:
In the above formula, all the symbols have their usual meanings, σ is the Standard Deviation and μ is the Mean.
It is easy to get overwhelmed by the above formula while trying to understand everything in one glance, but we can try to break it down into smaller pieces so as to get an intuition as to what is going on.
The z-score is a measure of how many standard deviations away a data point is from the mean. Mathematically,
The exponent of e in the above formula is the square of the z-score times -1 / 2. This is actually in accordance to the observations that we made above. Values away from the mean have a lower probability compared to the values near the mean. Values away from the mean will have a higher z-score and consequently a lower probability since the exponent is negative. The opposite is true for values closer to the mean.
This gives way for the 68-95-99.7 rule, which states that the percentage of values that lie within a band around the mean in a normal distribution with a width of two, four and six standard deviations, comprise 68%, 95% and 99.7% of all the values. The figure given below shows this rule-
The effects of μ and δ on the distribution are shown below. Here μ is used to reposition the center of the distribution and consequently move the graph left or right, and σ is used to flatten or inflate the curve-
Expectation
Expected value E[x] can be found by simply multiply the probability distribution function with x and integrate over all possible values
Let ‘X’ be a normal distributed random variable with parameters μ and σ2.
we know that area or the region inside normal distribution curve is 1 (because probability is 1)
therefore
writing x as (x - μ) + μ yields
letting y = x-μ
first one is symmetric about y-axis, hence value of that integral is 0.
E[x] = 0 + μ * 1
therefore ,
expectation E[x] = μ
variance = δ2
standard deviation =
In the General Normal Distribution, if the Mean is set to 0 and the Standard Deviation is set to 1, then the corresponding distribution obtained is called the Standard Normal Distribution.
The Probability Density function now becomes:
The cumulative density function of normal distribution does not give a closed formula. Hence precomputed values formulated in tables are used where-ever required. But these tables only contain data for the standard distribution. In order to find the cumulative probability for a general normal distribution, it is first standardized and then computed using the value tables.
This is beneficial in two ways:
If X is a normal random variable with E(X) = μ and V(X) = σ2,
the random variable is a normal random variable with E(Z) = 0 and V(Z) = 1.
That is, Z is a standard normal random variable.
Example: Suppose that the current measurements in a strip of wire are assumed to follow a normal distribution with a mean of 10 milliamperes and a variance of four (milliamperes)2. What is the probability that a measurement exceeds 13 milliamperes?
Solution: Let X denote the current in milliamperes. The requested probability can be represented as P (X > 13).
Let Z = (X ? 10) 2. With the Normal Distribution now standardized, the probability P(X > 13) = P(Z > 1.5) can now be easily computed.
Looking at the above table, first we find 1.5 in the X column, and then since there are no more digits of significance we look for 0.00 in the Y column. The corresponding cell gives us the value of P(Z ≤ 1.5) = 0.93319
So,
P(Z ≥ 1.5) = 1 - P(Z ≤ 1.5) = 1 - 0.93319 = 0.06681
Expected value, variance, standard deviation
The expected value of a standard normal random variable X is
expected value E[x] = 0
variance V[x] = 1
standard deviation = 1
65 videos|120 docs|94 tests
|
1. What is a normal distribution? |
2. How is the normal distribution characterized? |
3. What is the empirical rule for the normal distribution? |
4. How can the normal distribution be used in statistics? |
5. Can real-world data always be modeled by a normal distribution? |
|
Explore Courses for Civil Engineering (CE) exam
|