Module 5. Test of significance

Lesson 20

F- TEST AND ITS APPLICATIONS

20.1 Introduction

A large number of research experiments are conducted to examine the effect of various factors on the production and quality attributes of milk and milk products. F-test is used either for testing the hypothesis about the equality of two population variances or the equality of two or more population means. The equality of two population means was dealt with t-test. Besides a t-test, we can also apply F-test for testing equality of two population means. Sir Ronald A. Fisher defined a statistic Z which is based upon ratio of two sample variances. In this lesson we will consider the distribution of ratio of two sample variances which was worked out by G.W. Snedecor.

20.2  F-Statistic

Let X1i (i=1,2,…,n1)  be a random sample of size n1from the first population with variance σ12 and X2j (j=1,2,…,n2) be another independent random sample of size n2 from the second normal  population with variance σ22. The F- statistic is defined as the ratio of estimates of two variances as given below:

               

where, S12 > S22 and are unbiased estimates of population variances which are given by:

               

               

It follows Snedecor’s F- distribution with (n1-1, n2-1) d.f. i.e., F~F (n1 – 1, n2 – 1). Further, if X is a χ2-variate with n1 d.f. and Y is another independent χ2-variate with n2 d.f., then F-statistic is defined as: i.e. F-statistic is the ratio of two independent Chi-square variates divided by their respective degrees of freedom. This statistic follows G.W. Snedecor F distribution with (n1,n2) d.f. The sampling distribution of F-statistic does not involve any population parameter and depends only on the degrees of freedom n1and n2.

20.3  Application of F- Distribution

F-distribution has a number of applications in statistics, some of which are given below

a)      F-test for equality of population variances

b)      F test for testing equality of several population  means

20.3.1  F-test for equality of population variances

Suppose we are interested to find if two normal populations have same variance. Let X1i (i=1,2,…,n1)  be a random sample of size n1from the first population with variance σ12 and X2j (j=1,2,…,n2) be another independent random sample of size n2 from the second normal  population with variance σ22. The two samples are independent of each other.

Under the null hypothesis H0: σ12 = σ22 = σ2 i.e., population variances are equal. In other words the two independent estimates of the common population variance do not differ. The test statistic  F = (S12/ S22) ~ F(n1 – 1, n2 – 1) (S12 > S22)

Where S12 and S22 are unbiased estimates of the common population variance σ2 and are given by

               

 

               

It follows Snedecor’s F- distribution with (n1-1, n2-1) d.f. i.e., F~F (n1 – 1, n2 – 1). Since F-test is based on the ratio of two variances it is also known as Variance Ratio Test. If calculated value of F is greater than tabulated value at α% level of significance H0 is rejected at α percent level of significance which implies that the two independent estimates of population variances are heterogeneous. On the other hand if calculated value of F is less than tabulated F then H0 is not rejected at α percent level of significance suggesting that the estimates of population variances are homogenous.

Remarks:

·       Since the available tables of the significant values of F are for the right –tail test, i.e., against an alternative σ12 > σ22, in numerical problems we will take greater of the variance S12 or S22 in the numerator and adjust for the degrees of freedom i.e., degree of freedom of the large variance must be taken in the numerator while computing F.

·       In numerical problems, usually sample variance s2 is given from which S2 can be obtained on using the relation n s2 = (n-1) S2.

The procedure is illustrated by following examples

Example 1: In a sample of 8 observations, the sum of squared deviations of items from the mean was 94.5. In another sample of 10 observations, the value was found to be 101.7. Test whether the difference is significant at 5% level of significance.

Solution:  Let us take the hypothesis that there is no difference in the variances of two samples i.e., H0: σ12 = σ22, with an alternative H0: σ12 ≠ σ22. we are given

              

              
             

           

Applying F test

           

Tabulated value of F (7, 9) d.f. at 5% level of significance is 3.29.

Calculated value of F is less than the tabulated value, it is not significant. Hence H0 may be accepted and conclude that the difference in the variances of two samples is not significant at 5% level of significance which implies that the two population variances are homogenous or the sample have been drawn from the population having same variances.

Example 2: Two random samples drawn from normal population are :

Sample I

20

16

26

27

23

22

18

24

25

19

 

 

Sample II

27

33

42

35

32

34

38

28

41

43

30

37

Obtain estimates of the variances of two populations and test whether two populations have same variances.

Solution: Let us take the hypothesis that the two populations have the same variance i.e.,

            H0: σ12 = σ22 = σ02. With an alternative H0: σ12 ≠ σ22.

Table 20.1

Sample I

X1

22

Sample II

X2

20

-2

4

27

-8.00

64

16

-6

36

33

-2.00

4

26

4

16

42

7.00

49

27

5

25

35

0.00

0

23

1

1

32

-3.00

9

22

0

0

34

-1.00

1

18

-4

16

38

3.00

9

24

2

4

28

-7.00

49

25

3

9

41

6.00

36

19

-3

9

43

8.00

64

 

 

 

30

-5.00

25

 

 

 

37

2.00

4

220

 

120

420

 

314

 

           

           

           

           

Applying F test

           

Tabulated value of F(11,9) d.f. at 5% level of significance is 3.10.

Calculated value of F is less than the tabulated value, it is not significant. Hence H0 may be accepted and conclude that the difference in the variances of two samples is not significant at 5% level of significance which implies that the two populations have the same variance.

20.3.2  F test for equality of several population means

This test is widely used in the technique of analysis of variance which plays a very important and fundamental role in Design of experiments which is discussed in details in next module.