Psychology 121, Lecture 5

by Hal S. Kopeikin, Ph.D. © 1998


Announcements

  1. the midterm is almost upon us. Check the syllabus for details, and review our discussion of it in your notes from our first class meeting.
  2. the test will be "hard" but not "tricky." Don't worry about computing formulas we have not discussed in class. For the slope of a line, correlation coefficients, KR20, etc. you should understand the formulas (what does the number mean? When should you use it?) rather than memorizing it. DO memorize formulas I present in class (e.g., mean, z scores).  And, learn the proportions in for a normal distribution in figure 2-7 (rounded to percents is fine).

Validity

Types of Validity

Face Validity

1. Content Validity

How well does the test represent a defined universe of information? E.g., test of arithmetic with high content validity will have questions on all of the central aspects of arithmetic and sampling will be proportional (i.e., will not over-represent on area vs. another).

Content validity is most relevant when test is a sample representing a well-defined domain. The quality of representation is the key. Content validity is generally established by logical means, statistical evidence is secondary. It is established by comparing items and patterns of answers to a test plan and definition of the underlying domain.
 
 

2. Criterion-Related Validity

Concern is how well the test estimates or predicts something else. The something else is a criterion, operationalized as a criterion measure. It is important to be able to distinguish a predictor (e.g., like the SAT) from a criterion, that is, what you are trying to predict (e.g., college achievement) from a criterion measure, the standard by which you measure the criterion (e.g., college GPA). Predictive validity involves estimating a future criterion. Concurrent validity relates to current status on a criterion.

Key Issues in Criterion-Related Validity

3. Construct Validity

What is the nature of the theoretical abstraction, how well is it represented?

This is most relevant for tests measuring theoretical abstractions. The test and the definition of what is measured are intimately linked. Note: if the theory is no good, then the test can't be very good either. The level of construct validity is based on how well the theory works and how well the test measures the abstraction in question.

Establishing construct validity is an ongoing process which continues as part of theory development.

Construct validity is established by a pattern of findings, showing the relationship of what the test measures to other measurements and analyzing those for fit with the theory.

Convergent and divergent evidence is utilized. Convergent evidence is demonstration that the test measures what it is supposed to measure. Divergent evidence shows that a test is unrelated to things that the theory says it is supposed to be unrelated to. A test should have both convergent and divergent evidence. For instance, we can expect that IQ scores will converge with verbal ability, grades in college, etc. And if IQ scores should not converge with hair color or ring size.

Some contend construct validity is the general case, relevant to all psychological tests. They maintain that content- and criterion-related validity are best considered as parts of the pattern necessary to establish construct validity.
 

4. How valid is valid enough?