Type I and type II errors, level of significance, power of the test, p-value. Concept of standard error and confidence interval.
A hypothesis is a conjectural statement of the relation between two or more variables. For example, a study designed to look at the relationship between anxiety and test performance might have a hypothesis that states, "This study is designed to assess the hypothesis that anxious people will perform worse on a test than individuals who are not anxious."
Hypotheses are always in declarative sentence form, and they relate, either generally or specifically, variables to variables. There are two criteria for "good" hypotheses and hypothesis statements. First, hypotheses are statements about the relations between variables. For example: over-learning leads to performance decrement. Second, hypotheses carry clear implications for testing the stated relations. For example: groups A and B will differ on some characteristic. So a hypothesis can be tested and shown to be probably true or probably false.
A hypothesis has a further virtue: it directs investigation. There are important differences between a problem and a hypothesis. The problem is a question and is not directly testable, but a hypothesis is testable.
Sources of hypotheses
Hypotheses can be deduced from theory and from other hypotheses.
Hypothesis testing
Step 1: State the hypotheses and select a criterion for the decision. The standard logic that underlies hypothesis testing is that there are always (at least) two hypotheses: the null hypothesis and the alternative hypothesis.
The null hypothesis (H0) predicts that the independent variable (treatment) has no effect on the dependent variable for the population.
The alternative hypothesis (H1) predicts that the independent variable will have an effect on the dependent variable for the population. We'll talk more below about how specific this hypothesis may be.
The logic of hypothesis testing assumes that we are trying to reject the null hypothesis, not that we are trying to prove the alternative hypothesis.
Why? Generally, it is easier to show that something isn't true than to prove that it is. This is especially true when we are dealing with samples. Remember that we aren't testing every individual in the population, only a subset.
Example :
Hypothesis: All dogs have 4 legs.
To reject it: we need a sample that includes one or more dogs with more or fewer than 4 legs.
To accept it: we would need to examine every dog in the population and count its legs.
So part of the first step is to set up your null hypothesis and your alternative hypothesis. The other part of this step is to decide what criterion you are going to use to either reject or fail to reject (not accept) the null hypothesis.
Consider the problem that we have. We have a sample, and its descriptive statistics are different from the population's parameters (which may be based on the control group's sample statistics). How do we decide whether the difference that we see reflects a "real" difference (a difference between two populations) or is due to sampling error? To deal with this problem, the researcher must set a criterion in advance.
For example, think of the kinds of questions we were asking in the previous chapter. Given a population X with μ = 65 and σ = 10, what is the probability that our sample (of size n) will have a mean of 80? We're going to ask the same questions here, but take them a step further and say things like, "Gee, the probability that my sample has a mean of 80 is 0.0002. That's pretty small. I'll bet that my sample isn't really from this population, but is instead from another population."
Setting a criterion in advance is concerned with that part about saying "that's pretty small." When we set the criterion in advance, we are essentially asking: how small a chance is small enough to reject the null hypothesis? Or, in other words, how big a difference do I need to reject the null hypothesis? That's the big picture of setting the criterion; now let's look at the details:
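A quick way to put a number on "that's pretty small" is to compute the probability directly. Below is a minimal Python sketch using the μ = 65, σ = 10 example from above; the sample size n = 25 is an assumption for illustration (the passage leaves n unspecified, so the exact probability will differ).

```python
from math import sqrt
from statistics import NormalDist

# Population parameters from the example (mu = 65, sigma = 10);
# the sample size n = 25 is an illustrative assumption.
mu, sigma, n = 65, 10, 25
standard_error = sigma / sqrt(n)  # sigma_M = 10 / 5 = 2

# Sampling distribution of the mean, assuming H0 is true
sampling_dist = NormalDist(mu, standard_error)

# Probability of drawing a sample mean of 80 or higher if H0 is true
p = 1 - sampling_dist.cdf(80)
print(p)  # a vanishingly small probability with these assumed numbers
```

With a probability this small, we would bet the sample is not really from this population.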
What are the possible real-world situations?
- H0 is correct
- H0 is wrong
What are the possible conclusions?
- H0 is correct
- H0 is wrong
So this sets up four possibilities (2 * 2):
- 2 ways of making mistakes
- 2 chances to be correct
Experimenter's conclusion | H0 is actually correct | H0 is actually wrong
Reject H0 | Type I error (α) | Correct decision
Fail to reject H0 | Correct decision | Type II error (β)
- Type I error (α, alpha) - the H0 is actually correct, but the experimenter rejected it.
- - e.g., there really is only one population; even though the probability of getting such a sample was really small, you just got one of those rare samples.
- Type II error (β, beta) - the H0 is really wrong, but the experimenter didn't feel able to reject it.
- - e.g., your sample really does come from another population, but your sample mean is so close to the original population mean that you can't rule out the possibility that there is only one population.
The same logic applies to a jury's verdict:
Jury's verdict | Defendant is actually innocent | Defendant is actually guilty
Guilty | Type I error | Correct verdict
Not guilty | Correct verdict | Type II error
- Type I error - sending an innocent person to jail.
- Type II error - letting a guilty person go free.
- In scientific research, we typically take a conservative approach and set our criteria such that we try to minimize the chance of making a Type I error (concluding that there is an effect of something when there really isn't). In other words, scientists focus on setting an acceptable alpha level (α), or level of significance.
- The alpha level (α), or level of significance, is a probability value that defines the very unlikely sample outcomes when the null hypothesis is true. Whenever an experiment produces very unlikely data (as defined by alpha), we will reject the null hypothesis. Thus, the alpha level also defines the probability of a Type I error - that is, the probability of rejecting H0 when it is actually true.
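The alpha level can also be read as a long-run error rate: when H0 is true and we test at α = 0.05, we will reject H0 (a Type I error) about 5% of the time. A small simulation sketch, reusing μ = 65 and σ = 10 from the running example; the sample size and trial count are illustrative assumptions:

```python
import random
from math import sqrt
from statistics import NormalDist

# When H0 is true, a one-tailed test at alpha = 0.05 should reject
# about 5% of the time. The numbers below are illustrative.
random.seed(0)
mu, sigma, n, alpha = 65, 10, 25, 0.05
z_crit = NormalDist().inv_cdf(1 - alpha)   # one-tailed critical z, about 1.645
se = sigma / sqrt(n)

rejections = 0
trials = 20_000
for _ in range(trials):
    # Draw a sample from the ORIGINAL population, i.e., H0 really is true
    sample = [random.gauss(mu, sigma) for _ in range(n)]
    m = sum(sample) / n
    z = (m - mu) / se
    if z > z_crit:              # sample mean falls in the critical region
        rejections += 1

print(rejections / trials)      # close to alpha: every rejection here is a Type I error
```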
Let's look at this with pictures of distributions to try and connect this with what we've been talking about so far.
α = probability of making a Type I error.
With a general (nondirectional) alternative hypothesis, we use a two-tailed test: at α = 0.05, there is 0.025 in each tail (0.025 + 0.025 = 0.05).
With a specific (directional) alternative hypothesis, e.g., H1: there is a difference and the new group should have a higher mean, we use a one-tailed test: at α = 0.05, all 0.05 is in one tail.
- If our sample mean falls into the shaded areas, then we reject the H0. On the other hand, if our sample mean falls outside of the shaded areas, then we fail to reject the H0. These shaded regions are called the critical regions.
- The critical region is composed of extreme sample values that are very unlikely to be obtained if the null hypothesis is true. The size of the critical region is determined by the alpha level. Sample data that fall in the critical region will warrant the rejection of the null hypothesis.
Population distribution: the population has μ = 65 and σ = 10. Suppose that you take a sample of n = 25 (as in the next example) and give it the treatment.
Which distribution should you look at: the population, or the sample means?
Distribution of sample means: look at the distribution of sample means. Find your sample mean in that distribution and look up the probability of getting that mean or higher (see the last chapter). Let's assume α = 0.05. Let's also assume that our alternative hypothesis is that the treatment should improve performance (make the mean higher).
Now we need to find our standard error: σ_M = σ/√n = 10/√25 = 2.
What is our critical region? This is a one-tailed test, so look at the unit normal table and find the z that cuts off an area of α = 0.05: z = 1.65 (conservative; really 1.645).
Translate this into a critical sample mean: M = μ + z·σ_M = 65 + (1.65)(2) = 68.3. So if M = 69, then we reject the H0.
- Since we know that the z-score corresponding to the critical region is 1.65, we can instead compute the z-score corresponding to the sample mean and see whether it is higher or lower than this critical z-score:
- z = (M - μ)/σ_M = (69 - 65)/2 = 2.0, which exceeds 1.65, so we reject the H0.
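The one-tailed decision rule can be sketched in a few lines of Python. M = 69 and the critical z of 1.645 come from the example; the sample size n = 25 is an assumption (matching the next example):

```python
from math import sqrt

# One-tailed z test for the example: mu = 65, sigma = 10,
# n = 25 (assumed), sample mean M = 69, alpha = 0.05.
mu, sigma, n = 65, 10, 25
M, z_crit = 69, 1.645

se = sigma / sqrt(n)            # standard error = 10 / 5 = 2.0
z = (M - mu) / se               # observed z = (69 - 65) / 2 = 2.0

# Reject H0 if the observed z exceeds the critical z
reject_h0 = z > z_crit
print(z, reject_h0)             # 2.0 True
```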
Population distribution: again the population has μ = 65 and σ = 10. Suppose that you take a sample of n = 25, give them the treatment, and get a sample mean M.
Distribution of sample means: look at the distribution of sample means. Find your sample mean in that distribution and look up the probability of getting a mean that extreme (see the last chapter). Let's assume α = 0.05. Let's also assume that our alternative hypothesis is that the treatment should change performance, so we have a two-tailed test.
Now we need to find our standard error: σ_M = 10/√25 = 2. Then look at the unit normal table and find the z that leaves α/2 = 0.025 in each tail: z = 1.96.
Translate this into critical sample means: M = 65 ± (1.96)(2), i.e., 61.08 and 68.92. So if M > 68.92 or M < 61.08, we reject the H0.
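The two-tailed decision rule can be sketched the same way. μ = 65, σ = 10, n = 25, α = 0.05, and z = 1.96 follow the example; the sample means tested at the end are illustrative assumptions:

```python
from math import sqrt
from statistics import NormalDist

# Two-tailed z test: mu = 65, sigma = 10, n = 25, alpha = 0.05.
mu, sigma, n, alpha = 65, 10, 25, 0.05
se = sigma / sqrt(n)                              # standard error = 2.0

z_crit = NormalDist().inv_cdf(1 - alpha / 2)      # about 1.96
lower = mu - z_crit * se                          # lower critical mean
upper = mu + z_crit * se                          # upper critical mean

def decide(sample_mean):
    """Reject H0 if the sample mean falls in either tail's critical region."""
    return sample_mean > upper or sample_mean < lower

print(round(lower, 2), round(upper, 2))  # 61.08 68.92
print(decide(69), decide(64))            # True False
```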
This hypothesis test rests on four assumptions:
1) Random sample - the samples must be representative of the populations. Random sampling helps to ensure representativeness.
2) Independent observations -also related to the representativeness issue, each observation should be independent of all of the other observations. That is, the probability of a particular observation happening should remain constant.
3) σ is known and is constant - the standard deviation of the original population must stay constant. Why? More generally, the treatment is assumed to be adding (or subtracting) a constant to every individual in the population. So the mean of that population may change as a result of the treatment; however, recall that adding (or subtracting) a constant to every individual does not change the standard deviation.
4) the sampling distribution is relatively normal - either because the distribution of the raw observations is relatively normal, or because of the Central Limit Theorem (or both).
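Assumption (4) can be checked empirically: sample means drawn from a clearly non-normal population still pile up in a roughly normal shape around the population mean, with spread σ/√n. A small sketch; the uniform population and the sizes here are illustrative assumptions:

```python
import random
from statistics import mean, stdev

# Central Limit Theorem sketch: draw many sample means from a
# NON-normal (uniform) population and check their distribution.
# The population on [0, 100] and the sizes are illustrative.
random.seed(1)
n, num_samples = 25, 5_000

sample_means = [
    mean(random.uniform(0, 100) for _ in range(n))
    for _ in range(num_samples)
]

# The mean of the sample means sits near the population mean (50),
# and their spread is near sigma / sqrt(n) = (100/sqrt(12)) / 5, about 5.77.
print(round(mean(sample_means), 1))
print(round(stdev(sample_means), 1))
```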
Almost done, but we need to talk a bit about the other kind of error that we might make.
Recall the table of the experimenter's conclusions against the actual situation: when H0 is really wrong but we fail to reject it, we make a Type II error (β).
The power of a statistical test is the probability that the test will correctly reject a false null hypothesis. So power = 1 - β.
- So, the more "powerful" the test, the more readily it will detect a treatment effect.
- Power is the probability of obtaining sample data in the critical region when the null hypothesis is false.
So when there are two populations, the power will be related to how big a difference there is between the two.
When there is a big difference between the two populations, the shaded region (the part of the treated distribution that falls in the critical region) is large, so the chance of correctly rejecting the null hypothesis is good. When there is a smaller difference between the two populations, the shaded region is smaller, and the chance of correctly rejecting the null hypothesis is not nearly as good.
Several things influence power:
1) Increasing α increases power.
2) A one-tailed test has more power than a two-tailed test. With a one-tailed test at α = 0.05, all of the critical region (α) is on one side of the distribution. With a two-tailed test at α = 0.05, because a specific direction is not predicted, the critical region is spread out equally on both sides of the distribution; as a result, the power is smaller.
3) Increasing the sample size increases power. With a small n (at α = 0.05), the standard error is relatively large; with a larger n, the standard error is smaller, and as a result the power is greater.
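These effects can be computed directly: power is the probability that the sample mean lands in the critical region when the treated population really does have a different mean. A sketch, assuming a one-tailed test with μ = 65 and σ = 10 from the earlier example; the true treated mean of 70 and the sample sizes are illustrative assumptions:

```python
from math import sqrt
from statistics import NormalDist

# Power sketch: mu0 = 65 and sigma = 10 come from the earlier example;
# the true treated mean (70) and the sample sizes are illustrative.
mu0, mu_true, sigma, alpha = 65, 70, 10, 0.05
z_crit = NormalDist().inv_cdf(1 - alpha)          # one-tailed, about 1.645

def power(n):
    se = sigma / sqrt(n)
    m_crit = mu0 + z_crit * se                    # critical sample mean under H0
    # Power = P(M > m_crit) computed under the TRUE (treated) distribution
    return 1 - NormalDist(mu_true, se).cdf(m_crit)

for n in (4, 16, 25, 100):
    print(n, round(power(n), 3))                  # power grows with n
```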