Module 1. Descriptive statistics

Lesson 5

MEASURES OF SKEWNESS AND KURTOSIS

5.1 Introduction

In the preceding lessons, we have discussed the measures of central tendency and dispersion in case of frequency distribution. The measures of central tendency tell us about the concentration of the observations about the middle of the distribution and the measure of dispersion gives us an idea about the spread or scatter of the observations about some measure of central tendency. These measures, however, don’t adequately describe a frequency distribution in the sense that there could be two or more distributions with the same mean and standard deviation but still different from each other with regard to shape or pattern of distribution. Thus these two measures of central tendency and dispersion are inadequate to characterize a distribution completely and must be supported and supplemented by two more measures viz. skewness and kurtosis which we shall discuss in this lesson.

5.2 Skewness

Literal meaning of skewness is “lack of symmetry”. It measures the degree of departure of a distribution from symmetry and reveals the direction of scatterdness of the items.

A frequency distribution is said to be symmetrical when values of the variables equidistant from their mean have equal frequencies. If a frequency distribution is not symmetrical, it is said to be asymmetrical or skewed. Any deviation from symmetry is called skewness.

According to Morris Humberg “Skewness refers to the asymmetry or lack of symmetry in the shape of a frequency distribution”.

According to Croxton & Cowden “When a series is not symmetrical it is said to be asymmetrical or skewed”.

According to Simpson & Kafka “Measures of skewness tell us the direction and the extent of skewness. In a symmetrical distribution the mean, median and mode are identical. The more we move away from the mode, the larger the asymmetry or skewness”.

In the words of Riggleman and Frisbee “Skewness is the lack of symmetry when a frequency distribution is plotted on a chart, skewness present in the items tends to be dispersed more on one side of the mean than on the other”.

Thus the above definitions make it clear that the word skewness refers to the lack of symmetry. If a distribution is normal there would be no skewness in it and the curve drawn from the distribution would be symmetrical. In case of skewed distributions the curve drawn would be elongated either to the left or to the right. The concept of skewness gains importance from the fact that statistical theory is often based upon the assumption of the normal distribution. A measure of skewness is, therefore necessary in order to guard against the consequences of the assumption. The following three figures would give an idea about the shape of symmetrical and asymmetrical curves.

5.2.1 Symmetrical curve

The figure 5.1, given below, presents the shape of a symmetrical curve which is bell shaped having no skewness. The value of mean (M), median (Md) and mode (Mo) for such a curve would be identical.

Fig. 5.1 Symmetrical distribution

In a symmetrical distribution the values of mean, median and mode coincide. The spread of the frequencies is the same on both sides of the centre point of the curve. For a symmetrical distribution Mean = Median = Mode.

5.2.2 Positively skewed curve

A positively skewed curve has a longer tail towards the higher values of X i.e. the frequency curve gradually slopes down towards the higher values of X. In a positively skewed distribution the mean is greater than the median and then mode and the median lies in between mean and mode. The frequencies are spread over a greater range of values on the high value end of the curve (the right hand side) as is clear from the Fig 5.2.

Fig. 5.2 Positively skewed distribution

For a positively skewed distribution Mean > Median > Mode.

5.2.3 Negatively skewed curve

A negatively skewed curve has a longer tail towards the lower values of X i.e. the frequency curve gradually slopes down towards  the lower values  of X as shown in  Fig. 5.3.

Fig. 5.3 Negatively skewed distribution

In the negatively skewed distribution the mode is the maximum and mean is the least. The median lies in between mean and mode. The elongated tail in negatively skewed distribution is on the left hand side as would be clear from Fig 5.3. For a negatively skewed distribution, Mean < Median < Mode.

5.3 Measures of Skewness

Measures of skewness are meant to give an idea about the extent of asymmetry in a series. A distribution is said to be skewed if

·     The frequency curve of the distribution is not a symmetric bell shaped curve but stretched more to one side than to the other.

·      The values of mean (M), median (Md) and mode (Mo) fall at different points i.e they don’t coincide.

·      Quartiles Q1 and Q3 are not equidistant from the median.

·      The corresponding pairs of deciles and percentiles are not equidistant from the median.

If a particular distribution is found to be skewed, the next problem that arises is to measure the extent of skewness. To find out the direction and extent of asymmetry in a series statistical measures of skewness are employed.

 These measures can be absolute or relative. The absolute measures of skewness tell us the extent of asymmetry and whether it is positive or negative. The absolute skewness is based on the difference between mean and mode. Symbolically,

            Absolute skewness = Mean – Mode

Skewness will be positive, if the value of mean is greater than the mode and skewness will be negative, if the value of mean is less than the mode. The difference between the mean and the mode, whether positive or negative, indicates the distribution is positively skewed or negatively skewed. However, such an absolute measure of skewness is not adequate because it cannot be used for comparison of skewness in two distributions, if they are in different units, since difference between the mean and mode will be in terms of the units of distribution. Thus for comparison purposes we use the relative measures of skewness known as co-efficient of skewness.

5.3.1 Karl pearson’s coefficient of skewness

The first coefficient of skewness as defined by Karl Pearson is

           

This measure is based on the fact that the mean and the mode are drawn widely apart. Skewness will be positive if mean > mode and negative if mean < mode. There is no limit to this measure in theory and this is a slight drawback. But in practice the value given by this formula is rarely very high and its value usually lies between -1 and +1.

It may also be written as  as Mode = 3 Median – 2 Mean

This coefficient is a pure number without units since both numerator and denominator have the same dimensions. The value of this coefficient lies between -3 and +3.

5.3.2 Bowley’s Coefficient of Skewness

Prof. A.L. Bowley’s Coefficient of Skewness is based on quartiles and is given by:

           

This is also known as Coefficient of Skewness based on quartiles and is especially useful in situations where quartiles and median are used viz.

·      When the mode is ill-defined and extreme observations are present in the data.

·      When the distribution has open end classes or unequal class intervals.

This coefficient is a pure number without units since both numerator and denominator have the same dimensions. The value of this coefficient lies between -1 and +1.

5.3.3  Kelly’s Coefficient of Skewness

The drawback of Bowley’s Coefficient of Skewness is that it ignores the 50% of the data which can be partially removed by taking two deciles or percentiles equidistant from the median value. The refinement was suggested by Kelly.

           

5.3.4  Coefficient of Skewness based on moments

5.3.4.1  Moments

Moments are the general statistical measure used to describe and analyse the characteristics of a frequency distribution viz. central tendency, dispersion, skewness and kurtosis. Let us consider the variable X having a frequency distribution as given below:

X

X1

X2

X3

----

---

Xn

f

f1

f2

f3

----

---

fn

 Then,  is the arithmetic mean.

5.3.4.2  Moments about Mean

The rth moment about Mean  is denoted by µr and also known as central moment and is defined as

                                                                                                                                                                                          ………(Eq. 5.1)

Putting r=0 in equation (5.1) we get

                                                                                                                                                                                                                       ………( Eq. 5.2)     

Putting r=1 in equation (5.1) we get                                                                                                                                ………(Eq. 5.3)

because the algebraic sum of deviations of a given set of observations from their mean is zero. Thus the first moment about mean is always zero.

 Again taking r=2 we get                                                                                                                                            ………(Eq. 5.4)

 Hence second moment about mean gives the variance of the distribution.

                                                                                                                                                                                                                               ………(Eq. 5.5)

                                                                                                                                                                                             ………(Eq. 5.6) 

5.3.4.3  Moments about an arbitrary point A

The rth moment about any point A denoted by µr are also known as raw moment and is defined as

                                                                                                                                                                                ………(Eq. 5.7)

Putting r=0 and r=1 in equation (5.7) we get respectively

                                                                                                                                                                                                                         ………(Eq. 5.8)

                                                                                                     ………(Eq. 5.9)  

                  where  is the first moment about the arbitrary point A                                                                                             ………(Eq. 5.10)

 Taking r=2, 3, 4 in (5.7) we get respectively

 Second moment about the point A                                                                                                                                            ………(Eq. 5.11)

 Third moment about the point A                                                                                                                                             ………(Eq. 5.12)

 Fourth moment about the point A                                                                                                                                               ………(Eq. 5.13)

 5.3.4.4  Relation between moments about mean and moments about an arbitrary point A

In this section we shall try to obtain the expression for , the rth moment about mean as defined in (5.1) in terms of  , the rth moment about any arbitrary point A as defined in (5.7).

                                                                                                                                                                                    ………(Eq. 5.14)

From (5.9) we get  , substituting in (5.14) we get

             

Now by binomial theorem

               

               

           

 Putting r = 2, 3, 4 respectively, we get

               

                                       

                  

               

               

           

     =    

5.3.4.5  Pearson’s β and γ coefficients

Karl Pearson gave the following four coefficients calculated from the moments about mean and defined them as follows:

           

 These coefficients are pure numbers independent of units of measurement and as such can be conveniently used for comparative studies.

In terms of these coefficients, the coefficient of skewness is given exactly by        

               

 The importance of these coefficients lies in the fact that they give some idea about the shape of the curve obtained from the frequency distribution. For symmetrical distribution, all the moments of odd order about the mean vanish or (are zero) and therefore µ3 = 0 and hence β1 = 0. Thus, β1 gives a measure of departure from symmetry i.e. of skewness. Thus, the sign of skewness depends upon µ3. If µ3 is positive we get positive skewness and if µ3 is negative, we get negative skewness.

5.4  Kurtosis

The expression Kurtosis is used to describe the peakedness of a curve. Kurtosis is a Greek word means ‘bulginess’. In statistics kurtosis refers to the degree of flatness or peakedness in the region about the mode of a frequency curve. The degree of Kurtosis of a distribution is measured relative to the peakedness of normal curve. If we know the measures of central tendency, dispersion and skewness, we cannot still form a complete idea about the distribution as is clear from the figure 5.4.

Fig. 5.4  Type of kurtosis

All the three curves are symmetrical about mean and have same variation (range). In order to identify a distribution completely we need one more measure which Prof. Karl Pearson called ‘convexity of the curve’ or ‘kurtosis’. It is measured by β2 and γ2 given as under

             

Curve of type B which is neither flat nor peaked is known as normal curve and shape of its hump is accepted as a standard one. Curves with humps of the form of normal curve are said to have normal kurtosis and are termed as  Mesokurtic .The curve of type A , which are more peaked than the normal curve are known as Leptokurtic  and are said to lack kurtosis or to have negative kurtosis. Curves of type C , which are flatter than the normal curve are called  Platykurtic  and they are said to posses kurtosis in excess or have positive kurtosis.

 Example 1: Find different measures of skewness and kurtosis taking data given in example 1 of Lesson 3, using different methods.

Solution: Prepare the following table to calculate different measures of skewness and kurtosis using the values of Mean (M) = 1910, Median (Md) = 1890.8696, Mode (Mo) = 1866.3636, Variance  σ2 = 29500, Q1 = 1772.1053 and Q3 = 2030 as calculated earlier.

Calculation of moments about an arbitrary constant 2080

 Class  Interval

Mid-value  (Xi)

frequency

(fi)

1630-1730

1680

17

-400

-6800

2720000

-1088000000

435200000000

1730-1830

1780

19

-300

-5700

1710000

-513000000

153900000000

1830-1930

1880

23

-200

-4600

920000

-184000000

36800000000

1930-2030

1980

16

-100

-1600

160000

-16000000

1600000000

2030-2130

2080

14

0

0

0

0

0

2130-2230

2180

7

100

700

70000

7000000

700000000

2230-2330

2280

2

200

400

80000

16000000

3200000000

2330-2430

2380

2

300

600

180000

54000000

16200000000

Total

 

100

-400

-17000

5840000

-1724000000

647600000000

 

First moment about the point A=2080

               

 Second moment about the point A

                                                            

Third moment about the point A

             

Fourth moment about the point A

             

Compute the central moments using raw moments as follows:

           

             

                = (-17240000) – 3(58400)(-170) + 2(-170)3

                =2718000

               

=6476000000 – 4(-17240000)(-170) + 6(58400)(-170)2 – 3(-170)4

= 2373730000

Calculation of moments about mean

 

Class  Interval

Mid-value  (Xi)

frequency

(fi)

1630-1730

1680

17

-230

-3910

899300

-206839000

47572970000

1730-1830

1780

19

-130

-2470

321100

-41743000

5426590000

1830-1930

1880

23

-30

-690

20700

-621000

18630000

1930-2030

1980

16

70

1120

78400

5488000

384160000

2030-2130

2080

14

170

2380

404600

68782000

11692940000

2130-2230

2180

7

270

1890

510300

137781000

37200870000

2230-2330

2280

2

370

740

273800

101306000

37483220000

2330-2430

2380

2

470

940

441800

207646000

97593620000

Total

 

100

 

0

2950000

271800000

237373000000

 


First moment about mean :

 

                   

                     

Second moment about mean:

            .                                            

which is equal to variance as calculated in example 2.

Third moment about mean:

             

Fourth moment about mean:

             

Karl Pearson’s Coefficient of skewness (SK):

           

Bowley’s Coefficient of skewness (SK):

           

From the above calculation the coefficient of skewness and kurtosis can be calculated as under:

           

  
  

Hence this frequency distribution is positively skewed and platykurtic in nature.