LAB 6: Non-parametric Tests

In this lab, we will examine:

Two probability distributions used in non-parametric tests: and D
How and when to use the following tests:
How to calculate expected frequencies
Important cautions for the Chi-square tests

Recommended web links:
Hyperstat.com: A diagram of the Chi-square distribution
VassarStats: An applet showing how the Chi-square distribution changes with different degrees of freedom. Enter degrees of freedom in the dialogue box that pops up (between 1 and 20)
Examples and Applications: An overview and examples using the Chi-square distribution

Applied hypothesis testing...

There are two groups of inferential statistical tests: parametric and non-parametric tests. A parametric test requires that the underlying population for each sample is normally distributed, and that the data are either interval or ratio.

Non - parametric tests have been developed that use only frequency counts in the calculation of the test statistics. No assumptions are made about the shape of the underlying population distribution.

Introduction to Goodness of Fit Tests

'Goodness of Fit' tests compare the frequencies observed in a sample to a distribution of expected frequencies in order to infer similarity.

When we discuss the mechanics of the non-parametric Goodness of Fit tests, you will notice the word distribution used in two different contexts.

Frequency distributions refer to the distribution of data values in a sample or population (refresh my memory). The tests use frequencies in different ways:
- Chi-square() tests use simple frequency counts.
- Kolmogorov-Smirnov tests use cumulative relative frequencies.
Probability distributions ( and D) model the distribution of test statistics (refresh my memory). Two important probability distributions are introduced below.

The Chi-square () Distribution

Pronounced kai not chee This distribution begins at 0 and is positively skewed The distribution has no negative values.	Shape of the distribution
There are many distribution shapes, each associated with a different number of degrees of freedom (similar to the student’s t distribution). As the number of degrees of freedom increases, the distribution becomes less skewed This distribution is used for the one and multi-sample Chi-square tests.

The ‘D’ Distribution

The D distribution has a similar shape to the distribution.
This distribution also starts at 0 and has no negative values.
The D distribution is used specifically for the Kolmogorov-Smirnov Goodness of Fit test

Applications and Calculations

These tests use the hypothesis testing procedures detailed in Lab 5. For each test, we will outline:

the general uses of the test
how the test works
the test assumptions
the appropriate hypotheses
the appropriate probability distribution
how to calculate the test statistic
what is the decision rule
how to apply the test (using an example)

Chi-square tests

The chi-square tests are the most basic tests because they use nominal categories and simple frequency counts. They are flexible tests because they can be applied to most data. However, these tests treat all categories as nominal so information associated with ordinal, interval or ratio levels of measurement will be lost.

In this discussion, we are going to start with one and two sample chi-square tests and then examine the one and two sample Kolmogorov-Smirnov tests.

One Sample Chi-square () Test

Use

This test compares the frequencies observed in a sample to expected frequencies provided by the analyst. This test uses the sample frequency distribution to make inferences about the population frequency distribution.

How does it work?

This test focuses on how well the observed and expected frequency counts match for each nominal or ordinal category. The expected frequencies may be uniformly, randomly, or proportionally distributed (more details later in the lab). The analyst can choose an expected frequency distribution based on his/her needs.

This test uses the following statistical hypotheses:

:	there is no difference between the observed and expected frequencies (i.e. the sample is drawn from a population that follows the expected distribution).
:	there is a significant difference between the observed and expected frequencies (i.e. the sample is drawn from a population that does not follow the expected distribution).

The test statistic provides a measure of the amount of difference between the two frequency distributions. If the difference between the observed and expected distributions is small, will be small. If the difference is large, will be large.

Assumptions

Data are organized into nominal or ordinal categories. You can ‘downgrade’ interval or ratio data to the ordinal level by collapsing the data into categories.
Categories are mutually exclusive
Frequencies are absolute counts (i.e. not relative frequencies or percentages).
If only two categories, the expected frequency in each must be 5 or more.
If more than 2 categories, no category should have an expected frequency less than 1, and not more than one in 5 expected frequency counts should be less than 5.

Test Statistic

Formula:

where:
O_i = observed frequency in each category
E_i = expected frequency in each category

Critical Value

Critical values (

) are based on the significance level (

) and the number of degrees of freedom (k-1, where k=number of categories).

Decision rule

Reject

if the calculated value (

*) is greater than critical value (

).

If the

* is greater than

, it will fall in the red 'rejection region' on the chi-square probability distribution.

Example

Hydrogeologists are studying the occurrence of natural springs by rock type in Spring Valley. They count the number of springs by rock type within a similar-sized sample areas. The frequencies are summarized below.

RESEARCH QUESTION: Is the occurrence of natural springs influenced by rock type?

1. Select the appropriate test:
To investigate this research question, we need to compare our observed frequencies to a uniform frequency distribution. Why? If rock type is not important in determining the location of springs, we will expect the number of springs to be evenly distributed over rock type. If our observed frequencies are different from the expected uniform frequencies, we can infer that rock type affects the occurrence of natural springs. We will use the one sample chi-square test to answer our research question.

2. Check assumptions:

rock types are nominal categories
categories are mutually exclusive (each rock belongs to one category only)
frequencies are absolute counts (dataset does not contain relative frequencies)
expected frequencies:
To calculate the expected frequencies for a uniform distribution, take the total observed count and divide by the number of categories. In this example, we have observed 24 natural springs in three rock type categories. Therefore, we would expect 24/3 = 8 springs in each category if rock type did not influence the location of springs.

No category has an expected frequency of less than 5
You can apply this test to your data.

3. State your hypotheses:

4. Select significance level:
We will use the standard = 0.05 (95% confidence level)

5. Select probability distribution of test statistic:
This test uses the distribution for the test statistic

6. Establish the critical values:
At = 0.05, and degrees of freedom (k-1) = 2, the critical value () = 5.99.
(Recall that k is the number of categories; we have 3 rock types so k = 3)

7. Calculate test statistic:

Rock type	Observed frequency	Expected frequency	(Obs-Exp)²/Exp
Limestone	7	8	0.13
Calcareous Marl	3	8	3.13
Sandstone	14	8	4.50
Total	24		7.75

Therefore, * = 7.75

8. Make inference using the decision rule:

Rule: reject

* >

From above,

* (7.75) is greater than

(5.99), so reject

9. State conclusion:
We conclude with 95% confidence that the observed frequencies are significantly different from the expected (uniform) frequency distribution (i.e the springs are not uniformly distributed between rock types). We infer that rock type influences the occurrence of natural springs in Spring Valley.

Your turn!

The research presented above was repeated in an adjacent valley. The observed frequencies are listed below. RESEARCH QUESTION: Is there a significant difference in the occurrence of springs due to rock type in this valley? (check your answer).

Rock type	Number of Springs
Limestone	11
Calcareous Marl	5
Sandstone	14
Volcanics	6
Total	36

Two or more sample Chi-square () test

Uses

This test compares the observed frequency distributions of two or more samples to see if the samples are drawn from the same underlying population (i.e. do the samples have the same parent population or not?).