Module 3. Probability distributions

Lesson 12

NORMAL DISTRIBUTION

12.1 Introduction

In preceding two lessons discrete probability distributions viz., Binomial and Poisson distribution were discussed. We shall now take up another distribution, which is an important continuous probability distribution known as the normal distribution. Normal distribution is probably the most important and widely used theoretical distribution. Normal distribution unlike the Binomial and Poisson is a continuous probability distribution. It has been observed that a vast number of variables arising in studies of agricultural and dairying, social, psychological and economic phenomena tend to follow normal distribution. The normal distribution was first discovered by French Mathematician Abraham De-Moivre in 1733, who obtained this continuous distribution as a limiting case of the Binomial distribution. But it was later rediscovered and applied by Laplace and Karl Gauss. It is also known as Gaussian distribution after the name of Karl Friedrich Gauss.

12.2 Definition of Normal Distribution

A continuous random variable X is said to have a normal distribution with parameters µ (mean) and σ (standard deviation), if its density function is given by the probability law

where π and e are given by π = 22/7 and e=2.7183 (base of natural logarithms).

Remarks

1) A random variable X with mean µ and variance σ ² following the normal law given above is represented as X~N(µ, σ ²).

2) If X~N(µ, σ ²) , then , is defined as a standard normal variate with E(Z)=0 and Var (Z)=1and we write Z~N(0, 1)

3) The p.d.f. of a standard normal variate Z is given by

4) Normal distribution is a limiting form of the binomial distribution when

a) n, the number of trials is indefinite large, i.e. n→∞ and

b) neither p nor q is very small.

5) Normal distribution is a limiting form of Poisson distribution when its mean m is large and n is also large.

12.3 Chief Characteristics (Properties) of Normal Distribution

It has the following properties:

1. The graph of f(x) is bell shaped unimodal and symmetric curve as shown in the Fig. 12.1. The top of the bell is directly above the mean (µ).

Fig. 12.1 Normal probability curve

2. The curve is symmetrical about the line X = μ, (Z = 0) i.e., it has the same shape on either side of the line X = μ, (or Z = 0). This is because the equation of the curve ∅(z) remains unchanged if we change z to -z.

3. Since the distribution is symmetrical, mean, median and mode coincide. Thus, Mean = Median = Mode = μ

4. Since Mean = Median = Mode = μ, the ordinate at X = μ, (Z = 0) divides the whole area into two equal parts. Further, since total area under normal probability curve is 1, the area to the right of the ordinate as well as to the left of the ordinate at X = μ (or Z = 0) is 0.5

5. Also, by virtue of symmetry the quartiles are equidistant from median (µ), i.e,

6. Since the distribution is symmetrical, all moments of odd order about the mean are zero. Thus μ_2n+1 = 0; (n = 0,1,2,…) i.e. μ₁ = μ₃ = μ₅ = --- = 0.

7. The moments (about mean) of even order are given by

Putting n=1 and 2 we get

μ₂ = σ² and μ₄ = 3σ⁴

and

8. Since the distribution is symmetrical, the moment coefficient of skewness based on moments is given by

9. The coefficient of kurtosis is given by

10. No portion of the curve lies below the x-axis, since f(x) being the probability can never be negative.

11. Theoretically, the range of the distribution is from -∞ < to < ∞. But practically, range = 6σ

12. As x increases numerically [i.e. on either side of X = μ], the value of f(x) decreases rapidly, the maximum probability occurring at X = μ and is given by

Thus maximum value of f(x) is inversely proportional to the standard deviation. For large values of σ, f(x) increases, i.e., the curve has a normal peak.

13. Distribution is unimodal with the only mode occurring at X = μ.

14. X-axis is an asymptote to the curve i.e., for numerically large value of X (on either side of the line (X = μ)), the curve becomes parallel to the X-axis and is supposed to meet it at infinity.

15. A linear combination of independent normal variates is also a normal variate. If X_1,X₂, … ,X_n are independent normal variates with mean μ₁,μ₂, … ,μ_n and standard deviations σ₁,σ₂, … , σ_n respectively then their linear combination

Where a₁, a₂, … , a_n are constants, is also a normal variate with Mean = a₁μ₁+ a₂μ₂+ … + a_n μ_n and Variance = a₁² σ₁² + a₂² σ₂² + … + a_n² σ_n². In particular, if we take a₁ = a₂ = … = a_n =1 then we get “X₁+X₂ + … +X_n is a normal variate with mean μ₁+μ₂+ … +μ_n and variance σ₁² + σ₂² + … + σ_n² “.Thus, the sum of independent normal variates is also a normal variate with mean equal to sum of their means and standard deviation equal to square root of sum of the squares of their standard deviations. This is known as the ‘Re-productive or Additive Property’ of the Normal distribution.

16. Mean Deviation (M.D.) about mean or median or mode is given by

17. Quartiles are given (in terms of µ and σ) by

18. Quartile deviation (Q.D.) is given by

Also

19. We have (approximately):

From property 18 we also have

20. Points of inflexion of the normal curve are at X = μ ± σ i.e. they are equidistant from mean at a distance of σ and are given by :

21. Area property: One of the most fundamental property of the normal probability curve is the area property. If X ∽ N (µ, σ²), then the probability that random value of X will lie between X= μ and X= x₁ is given

∴ P (µ < x < x₁) = P (0 < z < z₁)

Where is the probability function of standard normal variate. The definite integral is known as Normal Probability integral and gives the area under standard normal curve between the ordinate z=0 and z = z₁. These areas have been provided in the form of table for different values of z₁ at the intervals of 0.01 which are available in any standard text books of statistics.

Particular Cases:

1. In particular, the probability that a random variable X lies in the interval (μ-σ, μ+σ) is given by

The area under the normal probability curve between the ordinates at X= μ-σ and X= μ+σ is 0.6826.In other words, the range X = μ±σ covers 68.26% of the observations (as shown in Fig.12.2). This is known as 1σ limit of normal distribution

Fig. 12.2 1σ, 2σ and 3σ under Normal Probability Curve

2. The probability that random variable X lies in the interval (μ-2σ, μ+2σ) is given by

The area under the normal probability curve between the ordinates at X= μ-2σ and X= μ+2σ is 0.95445. In other words, the range X = μ±2σ covers 95.445% of the observations (as shown in Fig. 12.2). This is known as 2σ limits of normal distribution and is considered as warning limit in case of statistical quality control which implies that it is a warning to the manufacturer that the manufacturing process is going out of control.

3. The probability that random variable X lies in the interval (μ−3σ, μ+3σ) is given by

The area under the normal probability curve between the ordinates at X=μ−3σ and X= μ+3σ is 0.9973.In other words, the range X = μ±2σ covers 99.73% of the observations (as shown in Fig. 12.2). This is known as 3σ limits of normal distribution and it implies the manufacturing process is out of control in case of statistical quality control.

Thus, the probability that a normal variate X lies outside the range µ ± 3σ is given as

Thus, in all probability, we should expect a normal variate to lie within the range µ ± 3σ though theoretically may range from −∞ to ∞.

12.4 Examples of Normal Distribution

(i) The age at first calving of cows belonging to the same breed and living under similar environmental conditions tend to normal frequency distribution.

(ii) The milk yield of cows in a large herd tends to follow a normal frequency distribution.

(iii) The chemical constituents of milk like fat, SNF, protein etc. for large samples follow normal distribution.

12.5 Computation of Area Under Normal Probability Curve

Probability that a continuous random variable X in any value between a and b is

which is the area bounded by the curve p(x), X-axis and the ordinates at X=a and X=b and is shown in Fig. 12.3.

Fig. 12.3

Similarly the area to the right of the ordinates at X=x₁ and x₁is less than mean (as shown in Fig. 12.4 )

Fig. 12.4

At X=X₁ Z=-Z₁ so P(X>X₁)=0.5+P(-Z₁<Z<0)=0.5+ P(0<Z<Z₁) , where P(0<Z<Z₁)can be read from the normal tables. The application of this distribution in solving problems is illustrated through following examples.

Example 1. Average lactation yield for 1000 cows maintained at a farm is 1700 kg and their standard deviation is 85 kg. A cow is considered as high yielder if it has a lactation yield greater than 1900 kg and poor yielder if it has lactation yield less than 1600 kg. Find the number of high yielding and poor yielding cows.

Solution:

Here, µ=1700 kg and σ = 85 kg and let X denote the lactation milk yield

Fig. 12.5

a) To find number of high yielder cows we first find the probability of cows yielding more than 1900 kg. i.e. P(X > 1900 kg) (Fig. 12.5). So, we first compute the value standard normal variate i.e. Z₁and then find area under shaded region using normaltables

At X₁ = 1900 kg.

P(X₁ > 1900 kg) = P (Z₁ >2.353) =0.5 – P (0≤ Z₁ ≤ 2.353) = 0.5–0.49069=0.00931

Number of high yielder cows = N xP(z₁ > 2.353) = 0.00931 ×1000 = 9.31 = 9 cows

b) To find number of low yielder cows, we first find the probability of cows yielding less than 1600 kg i.e. P(X₂ < 1600 kg). So, we first compute the value standard normal variate i.e. Z₂ and then find area under shaded region using normaltables

P(X₂ < 1600) = P (Z₂ < -1.18)=0.5 – P(0≤ Z₂ ≤ 1.18) = 0.5–0.38109=0.119

Number of low yielder cows = N xP(X₂ < 1600) = 0.119 × 1000 = 119 cows

Conclusion: Total number of high yielding & low yielding cows are 9 and 119 respectively.

Example 2. An Intelligence test was administrated to 1000 students. The average score of students was 42 with standard deviation of 24. Find

(a) Number of students exceeding a score of 50

(b) Number of students scoring between 30 & 58

Solution: In this problem µ=42 and σ = 24 and let X denote the score obtained

(a) Number of students exceeding score 50

Fig. 12.6

As shown in figure 12.6 we want to find P(X>50) i.e. probability of shaded portion

At X=50,

P(X>50) =P(Z > 0.334) = 0.5 – P(0≤ Z ≤ 0.334)= 0.5 – 0.1308== 0.3692

No of students = 1000 * 0.3692= 369.2 ~ 369 students

(b) Number of students scoring between 30 and 58

As shown in figure 12.7 we want to find P(30<X<58) i.e. probability of shaded portion

Fig. 12.7

P( Z₁> -0.5) = P(0≤Z₁ ≤ 0.5)= 0.1915

P(Z₂<0.6667)= P(0 ≤ Z₂≤ 0.6667)=0.2476

P(30<X<58)=P(-0.5≤ Z≤0.6667) =0.1915+0.2476=0.4391

No of students = 1000 * .4391 = 439.1 ~ 439 students

(c) Value of score exceeded by top 100 students. Let x₁ be the value of score exceeded by top 100 students, the probability of top 100 students = 100/N = 100/1000 = 0.1 such that P(X>x₁) = 0.1

At X= x₁, = Z₁. From Fig. 12.8 the P(X>x₁) shown as shaded region

P(X>x₁)=P(Z>Z₁)=0.1 ⇒P(0 ≤ Z ≤ Z₁)=0.4 = 1.286

x₁= 72.86 ~73

Fig. 12.8

Conclusion

(a) 369 students scored more than 50.

(b) 439 students scored between 30 & 58.

Example 3. Tins are filled by an automatic filling machine with ghee. Average quantity filled in tin is 1000g. It is found that 5% of tins had ghee less than 980 grams. Find the standard deviation.

Solution : Here µ=1000g and let X be quantity of ghee filled in a tin

Fig. 12.9

From fig. 12.9 P(X <980)=0.05 ⇒ P(Z < -Z₁) = -1.645

⇒ (980-1000/σ) -1.645 ⇒ -20 = -1.645 σ , Hence σ =12.1840~12.18 gm

Conclusion :

Standard derivation of the ghee filled in tin is 12.18 gm.

Example 4. In a normal distribution, 31% of items are under 45 and 8% are over 64. Find the mean & standard deviation.

Fig. 12.10

Solution:

Let X denotes the variable under consideration. We are given that P (X₁< 45) = 0.31 and P (X₂ > 64) = 0.08. If X has normal distribution with mean µ and standard deviation σ. Then standard normal variates corresponding to X₁= 45 and X₂ = 64 (from Fig 12.10) are

When X₁ = 45,

When X₂ = 64,

From the fig. 12.10, P (0 <Z < Z₂) = 0.42 ⇒ Z₂= 1.405 (from normal tables)

.........(Eq. 1)

P (-Z₁ <Z < 0) = 0.19 ⇒ P (0 <Z < Z₁) = -0.496(from normal tables)

.........(Eq. 2)

Solving equations (1) & (2) we get

μ= 49.95 ~ 50 and σ = 9.99 ~ 10

Conclusion

Mean & standard deviation of given distribution are 50 & 10 respectively.

Example 5. Average net weight of coffee complete powder is 250 g with a standard deviation of 3g. The powder is packed in polypack with an average weight of 5g with a standard deviation of 0.2g. Average weight of tin in which polypack is packed is 100g with a standard deviation of 1.5g. Individual weights of all items follow normal distribution. If 5% tins are classified as underweight tins then what would be the weight of filled in tin. Filled in tins are classified as overweight if their weight exceeds a weight of 360g. What proportion of tins are overweight tin?

Solution:

Let X₁ be the normal variate for weight of coffee with mean(μ₁)= 250g and s.d.(σ₁)=3g

X₂ be the normal variate for poly pack with mean ( μ₂)= 5g and s.d.(σ₂)=0.2 g

X₃ be the normal variate for tin with mean (μ₃)= 100g and s.d.(σ₃)=1.5 g

By using the reproductive property of normal distribution, the composite weight of tin comprising of coffee complete, polypack and tin will follow a normal distribution with mean (μ=μ₁+μ₂+μ₃)=250+5+100=355g and standard deviation () = =

=3.36 g i.e. (X=X₁₊ X₂₊ X₃)~N(355g,3.36 g) If 5% tins are classified as underweight, Let x₁ be the weight of tin considered as underweight, then we have : P(X<x₁)=0.05. Then the standard normal variate corresponding to x₁is

From fig.12.11 P(−Z₁< Z < 0)=0.45⇒P(0<Z< Z₁) = 0.45 ⇒ (x₁-355)/3.36 = -1.6415 ⇒ x₁ = 349.67g

Hence weight of underweight tins is 349.47 g.

Fig. 12.11

Filled in tins exceeding a weight of 360g are classified as overweight. To find the proportion of tins which are overweight we proceed as follows:

Fig. 12.12

As shown in figure 12.12, we want to find P(X>360) i.e. probability of shaded portion

At X=360,

P (X > 360) = P (Z > 1.488) = 0.5 – P (0≤ Z ≤ 1.488) = 0.5 – 0.4316== 0.0684

Hence, 6.84 % tins are overweight

Conclusion:

Weight of underweight tins is 349.47g & 6.84% tins are overweight

12.6 Importance of Normal Distribution

Normal distribution plays a very important role in statistics because

(i) Most of the discrete probability distributions occurring in practice e.g., Binomial and Poisson can be approximated to normal distribution as n number of trials tends to increase.

(ii) Even if a variable is not normally distributed, it can be sometimes be brought to normal by a simple mathematical transformation, if the distribution of X is skewed, the distribution of or log x might come out to be normal.

(iii) If X~N(μ,σ²) then P [μ−3σ < x < μ+3σ] =0.9973 ⇒ [|Z|˃3] = 1− 0.9973=0.0027. Thus the probability of standard normal variate going outside the limits ±3 is practically zero. This property of normal distribution forms the basis of entire large sample theory.

(iv) Many of the sampling distribution e.g., student’s t, Snedecor’s F, Chi square distributions etc tend to normality for large samples. Further, the proof of all the tests of significance in the sample is based upon the fundamental assumptions that the populations from which the samples have been drawn are normal.

(v) The whole theory of exact sample (small sample) tests viz. t , χ², F etc, is based on the fundamental assumption that the parent population from which the samples have been drawn follows normal distribution.

(vi) Normal distribution finds large applications in statistical quality control in industry for setting up of control limits.

(vii) Theory of normal curves can be applied to the graduation of the curve which is not normal.