Psychometrics



Psychometrics is the field of study concerned with the theory and technique of educational and psychological measurement, which includes the measurement of knowledge, abilities, attitudes, and personality traits. The field is primarily concerned with the study of differences between individuals and between groups of individuals. It involves two major research tasks, namely: (i) the construction of instruments and procedures for measurement; and (ii) the development and refinement of theoretical approaches to measurement.

Origins and background

Much of the early theoretical and applied work in psychometrics was undertaken in an attempt to measure intelligence. Francis Galton is often referred to as the father of psychometrics, having devised and used mental tests. However, the origin of psychometrics also has connections to the related field of psychophysics. Charles Spearman, a pioneer in psychometrics who developed approaches to the measurement of intelligence, studied under Wilhelm Wundt and was trained in psychophysics. The psychometrician L. L. Thurstone later developed and applied a theoretical approach to measurement referred to as the law of comparative judgment, an approach which has close connections to the psychophysical theory developed by Ernst Heinrich Weber and Gustav Fechner. In addition, Spearman and Thurstone both made important contributions to the theory and application of factor analysis, a statistical method that has been developed and used extensively in psychometrics.

More recently, psychometric theory has been applied in the measurement of personality, attitudes and beliefs, academic achievement, and in health-related fields. Measurement of these unobservable phenomena is difficult, and much of the research and accumulated art in this discipline has been developed in an attempt to properly define and quantify such phenomena. Critics, including practitioners in the physical sciences and social activists, have argued that such definition and quantification are impossibly difficult, and that such measurements are often misused. Proponents of psychometric techniques reply that their critics often misuse data by not applying psychometric criteria, and that various quantitative phenomena in the physical sciences, such as heat and forces, cannot be observed directly but must be inferred from their manifestations.

Figures who made significant contributions to psychometrics include Karl Pearson, L. L. Thurstone, Georg Rasch, Johnson O'Connor, Frederick M. Lord and Arthur Jensen.

Definition of measurement in the social sciences

The definition of measurement in the social sciences has a long history. A currently widespread definition, proposed by Stanley Smith Stevens (1946), is that measurement is "the assignment of numerals to objects or events according to some rule". This definition was introduced in the paper in which Stevens proposed four levels of measurement. Although widely adopted, this definition differs in important respects from the more classical definition of measurement adopted throughout the physical sciences, which is that measurement is the numerical estimation and expression of the magnitude of one quantity relative to another (Michell, 1997). Indeed, Stevens' definition of measurement was put forward in response to the British Ferguson Committee, whose chair, A. Ferguson, was a physicist. The committee was appointed in 1932 by the British Association for the Advancement of Science to investigate the possibility of quantitatively estimating sensory events. Although its chair and other members were physicists, the committee also comprised several psychologists. The committee's report highlighted the importance of the definition of measurement. While Stevens' response was to propose a new definition, which has had considerable influence in the field, this was by no means the only response to the report. Another, notably different, response was to accept the classical definition, as reflected in the following statement:

"Measurement in psychology and physics are in no sense different. Physicists can measure when they can find the operations by which they may meet the necessary criteria; psychologists have but to do the same. They need not worry about the mysterious differences between the meaning of measurement in the two sciences." (Reese, 1943, p. 49)


These divergent responses are reflected to a large extent within alternative approaches to measurement. For example, methods based on covariance matrices are typically employed on the premise that numbers, such as raw scores derived from assessments, are measurements. Such approaches implicitly entail Stevens' definition of measurement, which requires only that numbers are assigned according to some rule. The main research task, then, is generally considered to be the discovery of associations between scores, and of factors posited to underlie such associations. On the other hand, when measurement models such as the Rasch model are employed, numbers are not assigned based on a rule. Instead, in keeping with Reese's statement above, specific criteria for measurement are stated, and the objective is to construct procedures or operations that provide data which meet the relevant criteria. Measurements are estimated based on the models, and tests are conducted to ascertain whether it has been possible to meet the relevant criteria.

Instruments and procedures

The first psychometric instruments were designed to measure the concept of intelligence. The best known historical approach involves the Stanford-Binet IQ test, an American revision of the scale originally developed by the French psychologist Alfred Binet. Contrary to a fairly widespread misconception, there is no compelling evidence that it is possible to measure innate intelligence through such instruments, in the sense of an innate learning capacity unaffected by experience, nor was this the original intention when they were developed. Nevertheless, IQ tests are useful tools for various purposes. An alternative conception of intelligence is that cognitive capacities within individuals are a manifestation of a general component, or general intelligence factor, as well as cognitive capacity specific to a given domain.

Psychometrics is applied widely in educational assessment to measure abilities in domains such as reading, writing, and mathematics. The main approaches in applying tests in these domains have been Classical Test Theory and the more modern Item Response Theory and Rasch measurement models. These modern approaches permit joint scaling of persons and assessment items, which provides a basis for mapping of developmental continua by allowing descriptions of the skills displayed at various points along a continuum. Such approaches provide powerful information regarding the nature of developmental growth within various domains.

Another major focus in psychometrics has been personality testing. There has been a range of theoretical approaches to conceptualising and measuring personality. Some of the better known instruments and models include the Minnesota Multiphasic Personality Inventory, the Five-Factor Model (or "Big 5") and the Myers-Briggs Type Indicator. Attitudes have also been studied extensively in psychometrics. A common approach to the measurement of attitudes is the use of the Likert scale. An alternative approach involves the application of unfolding measurement models, the most general being the Hyperbolic Cosine Model (Andrich & Luo, 1993).

Theoretical approaches

Psychometric theory involves several distinct areas of study. First, psychometricians have developed a large body of theory used in the development of mental tests and analysis of data collected from these tests. This work can be roughly divided into classical test theory (CTT) and the more recent item response theory (IRT: Embretson & Reise, 2000; Hambleton & Swaminathan, 1985). An approach which is similar to IRT but also quite distinctive, in terms of its origins and features, is represented by the Rasch model for measurement. The development of the Rasch model, and the broader class of models to which it belongs, was explicitly founded on requirements of measurement in the physical sciences (Rasch, 1960).

Second, psychometricians have developed methods for working with large matrices of correlations and covariances. Techniques in this general tradition include factor analysis (finding important underlying dimensions in the data), multidimensional scaling (finding a simple representation for high-dimensional data) and data clustering (finding objects which are like each other). In these multivariate descriptive methods, users try to simplify large amounts of data. More recently, structural equation modeling and path analysis represent more sophisticated approaches to solving this problem of large covariance matrices. These methods allow statistically sophisticated models to be fitted to data and tested to determine if they are adequate fits.

One of the main deficiencies of the various forms of factor analysis is the lack of agreed cutting points for deciding how many factors to retain. A usual procedure is to stop factoring when eigenvalues drop below one, since a factor with an eigenvalue below one accounts for less variance than a single standardized variable. The lack of cutting points affects other multivariate methods, too. At bottom, psychometric spaces are Hilbertian but they are dealt with as if they were Cartesian. Therefore, the problem is more one of interpretation than of applying a method.
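
The stopping rule of retaining factors until eigenvalues drop below one can be sketched as follows. This is a minimal illustration, not a full factor analysis: the correlation matrix is hypothetical, the function names are assumptions made for this example, and the eigenvalues are obtained with a plain Jacobi rotation method chosen only to keep the sketch free of external libraries.

```python
from math import sqrt


def symmetric_eigenvalues(matrix, sweeps=50, tol=1e-20):
    """Eigenvalues of a symmetric matrix via cyclic Jacobi rotations."""
    a = [row[:] for row in matrix]  # work on a copy
    n = len(a)
    for _ in range(sweeps):
        # Stop once the off-diagonal mass has been rotated away.
        off = sum(a[i][j] ** 2 for i in range(n) for j in range(n) if i != j)
        if off < tol:
            break
        for p in range(n - 1):
            for q in range(p + 1, n):
                if abs(a[p][q]) < 1e-15:
                    continue
                # Rotation angle that zeroes the (p, q) entry.
                theta = (a[q][q] - a[p][p]) / (2.0 * a[p][q])
                sign = 1.0 if theta >= 0 else -1.0
                t = sign / (abs(theta) + sqrt(theta * theta + 1.0))
                c = 1.0 / sqrt(t * t + 1.0)
                s = t * c
                # Apply the rotation to columns p and q, then to rows p and q.
                for k in range(n):
                    akp, akq = a[k][p], a[k][q]
                    a[k][p] = c * akp - s * akq
                    a[k][q] = s * akp + c * akq
                for k in range(n):
                    apk, aqk = a[p][k], a[q][k]
                    a[p][k] = c * apk - s * aqk
                    a[q][k] = s * apk + c * aqk
    return sorted((a[i][i] for i in range(n)), reverse=True)


def kaiser_retained(eigenvalues):
    """Number of factors kept under the eigenvalue-greater-than-one rule."""
    return sum(1 for value in eigenvalues if value > 1.0)


# Hypothetical correlation matrix for four test scores (illustrative only).
correlations = [
    [1.0, 0.6, 0.1, 0.1],
    [0.6, 1.0, 0.1, 0.1],
    [0.1, 0.1, 1.0, 0.5],
    [0.1, 0.1, 0.5, 1.0],
]

eigenvalues = symmetric_eigenvalues(correlations)
retained = kaiser_retained(eigenvalues)
print(eigenvalues, retained)
```

Because the eigenvalues of a correlation matrix sum to the number of variables, the rule keeps only those factors that account for more than one variable's worth of variance. It gives a cutting point by convention, which is the interpretive difficulty the paragraph above describes.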

Key concepts

The key traditional concepts in classical test theory are reliability and validity. A reliable measure measures something consistently, while a valid measure measures what it is supposed to measure. A reliable measure may be consistent without necessarily being valid. For example, a measurement instrument like a broken ruler may under-measure a quantity by the same amount each time (consistently), but the resulting quantity is still wrong, that is, invalid. For another analogy, a reliable rifle will have a tight cluster of bullets in the target, while a valid one will center its cluster around the center of the target, whether or not the cluster is a tight one.

Both reliability and validity may be assessed mathematically. Internal consistency may be assessed by correlating performance on two halves of a test (split-half reliability); the value of the Pearson product-moment correlation coefficient is adjusted with the Spearman-Brown prediction formula to correspond to the correlation between two full-length tests. Other approaches include the intra-class correlation (the ratio of variance of measurements of a given target to the variance of all targets). A commonly used measure is Cronbach's α, which is equivalent to the mean of all possible split-half coefficients. Stability over repeated measures is assessed with the Pearson coefficient, as is the equivalence of different versions of the same measure (different forms of an intelligence test, for example). Other measures are also used.
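
The split-half procedure and Cronbach's α described above can be sketched as follows. This is a minimal illustration under stated assumptions: the response matrix is hypothetical, the halves are formed by an odd-even item split (one of many possible splits), and the function names are invented for this example.

```python
from math import sqrt


def mean(values):
    return sum(values) / len(values)


def sample_variance(values):
    m = mean(values)
    return sum((v - m) ** 2 for v in values) / (len(values) - 1)


def pearson(x, y):
    """Pearson product-moment correlation between two score lists."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def spearman_brown(r_half):
    """Step a half-test correlation up to the full test length."""
    return 2.0 * r_half / (1.0 + r_half)


def split_half_reliability(scores):
    """Odd-even split-half reliability, corrected with Spearman-Brown."""
    odd_half = [sum(row[0::2]) for row in scores]
    even_half = [sum(row[1::2]) for row in scores]
    return spearman_brown(pearson(odd_half, even_half))


def cronbach_alpha(scores):
    """Cronbach's alpha from item variances and total-score variance."""
    k = len(scores[0])
    item_variances = [sample_variance([row[j] for row in scores]) for j in range(k)]
    total_variance = sample_variance([sum(row) for row in scores])
    return k / (k - 1) * (1.0 - sum(item_variances) / total_variance)


# Hypothetical data: six test-takers (rows) by four dichotomous items (columns).
scores = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
]

split_half = split_half_reliability(scores)
alpha = cronbach_alpha(scores)
print(split_half, alpha)
```

A single split such as the odd-even one used here depends on which items land in which half, which is one reason a coefficient that does not depend on a particular split, such as α, is commonly reported.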

Validity may be assessed by correlating measures with a criterion measure known to be valid. When the criterion measure is collected at the same time as the measure being validated the goal is to establish concurrent validity; when the criterion is collected later the goal is to establish predictive validity. A measure has construct validity if it is related to other variables as required by theory. Content validity is simply a demonstration that the items of a test are drawn from the domain being measured. In a personnel selection example, test content is based on a defined statement or set of statements of knowledge, skill, ability, or other characteristics obtained from a job analysis.

Predictive or concurrent validity cannot exceed the square root of the correlation between two versions of the same measure, that is, the square root of its reliability.
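
In classical test theory this ceiling follows from the correction for attenuation, assuming measurement errors are uncorrelated with each other and with true scores. The symbols below are notation assumed for this sketch rather than taken from the text: r_{xy} is the observed validity coefficient, r_{xx'} and r_{yy'} are the reliabilities of the measure and of the criterion, and \rho_{T_x T_y} is the correlation between their true scores.

```latex
r_{xy} = \rho_{T_x T_y}\,\sqrt{r_{xx'}\, r_{yy'}}
\quad\Longrightarrow\quad
|r_{xy}| \;\le\; \sqrt{r_{xx'}\, r_{yy'}} \;\le\; \sqrt{r_{xx'}}
```

The first inequality holds because a correlation between true scores cannot exceed one in absolute value, and the second because the reliability of the criterion cannot exceed one.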

Item response theory models the relationship between latent traits and responses to test items. Among other advantages, IRT provides a basis for obtaining an estimate of the location of a test-taker on a given latent trait as well as the standard error of measurement of that location. For example, a university student's knowledge of history can be deduced from his or her score on a university test and then be compared reliably with a high school student's knowledge deduced from a less difficult test. Scores derived by classical test theory do not have this characteristic, and actual ability (rather than ability relative to other test-takers) must be assessed by comparing scores to those of a norm group randomly selected from the population. In fact, all measures derived from classical test theory are dependent on the sample tested, while, in principle, those derived from item response theory are not.
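
The idea of estimating a test-taker's location and its standard error can be sketched with the Rasch model, the simplest member of this family. This is a minimal illustration under stated assumptions: item difficulties are treated as already known, the values are hypothetical, the function names are invented for this example, and the estimate is a plain maximum-likelihood solution found by Newton-Raphson iteration.

```python
from math import exp, sqrt


def rasch_probability(theta, difficulty):
    """Probability of a correct response for ability theta (in logits)."""
    return 1.0 / (1.0 + exp(-(theta - difficulty)))


def estimate_ability(responses, difficulties, iterations=25):
    """Maximum-likelihood ability estimate and its standard error.

    Assumes known item difficulties and a response pattern that is
    neither all correct nor all incorrect (the estimate is not finite
    in those two cases).
    """
    raw_score = sum(responses)
    if raw_score == 0 or raw_score == len(responses):
        raise ValueError("no finite estimate for a perfect or zero score")
    theta = 0.0
    for _ in range(iterations):
        probs = [rasch_probability(theta, b) for b in difficulties]
        gradient = raw_score - sum(probs)              # observed minus expected score
        information = sum(p * (1.0 - p) for p in probs)  # test information at theta
        theta += gradient / information
    probs = [rasch_probability(theta, b) for b in difficulties]
    information = sum(p * (1.0 - p) for p in probs)
    return theta, 1.0 / sqrt(information)


# Hypothetical difficulties (logits) for an easier and a harder four-item test.
easier_test = [-1.5, -0.5, 0.5, 1.5]
harder_test = [0.5, 1.5, 2.5, 3.5]

# The same raw score of 2 out of 4 on each test.
theta_easy, se_easy = estimate_ability([1, 1, 0, 0], easier_test)
theta_hard, se_hard = estimate_ability([1, 1, 0, 0], harder_test)
print(theta_easy, se_easy)
print(theta_hard, se_hard)
```

The two test-takers obtain the same raw score, but the one who took the harder test is located higher on the common scale. This is the sense in which results from tests of different difficulty can be compared once the items have been jointly scaled.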

Standards of quality

The considerations of validity and reliability typically are viewed as essential elements for determining the quality of any test. However, professional and practitioner associations frequently have placed these concerns within broader contexts when developing standards and making overall judgments about the quality of any test as a whole within a given context. A consideration of concern in many applied research settings is whether or not the metric of a given psychological inventory is meaningful or arbitrary.[1]

Testing standards

In this field, the Standards for Educational and Psychological Testing [2] place standards about validity and reliability, along with errors of measurement and related considerations under the general topic of test construction, evaluation and documentation. The second major topic covers standards related to fairness in testing, including fairness in testing and test use, the rights and responsibilities of test takers, testing individuals of diverse linguistic backgrounds, and testing individuals with disabilities. The third and final major topic covers standards related to testing applications, including the responsibilities of test users, psychological testing and assessment, educational testing and assessment, testing in employment and credentialing, plus testing in program evaluation and public policy.

Evaluation standards

In the field of evaluation, and in particular educational evaluation, the Joint Committee on Standards for Educational Evaluation [3] has published three sets of standards for evaluations. The Personnel Evaluation Standards [4] was published in 1988, The Program Evaluation Standards (2nd edition) [5] was published in 1994, and The Student Evaluation Standards [6] was published in 2003.

Each publication presents and elaborates a set of standards for use in a variety of educational settings. The standards provide guidelines for designing, implementing, assessing and improving the identified form of evaluation. Each of the standards has been placed in one of four fundamental categories to promote educational evaluations that are proper, useful, feasible, and accurate. In these sets of standards, validity and reliability considerations are covered under the accuracy topic. For example, the student accuracy standards help ensure that student evaluations will provide sound, accurate, and credible information about student learning and performance.

Notes

1. ^ Blanton, H., & Jaccard, J. (2006). Arbitrary metrics in psychology. American Psychologist, 61(1), 27-41.
2. ^ The Standards for Educational and Psychological Testing
3. ^ Joint Committee on Standards for Educational Evaluation
4. ^ Joint Committee on Standards for Educational Evaluation. (1988). The Personnel Evaluation Standards: How to Assess Systems for Evaluating Educators. Newbury Park, CA: Sage Publications.
5. ^ Joint Committee on Standards for Educational Evaluation. (1994). The Program Evaluation Standards, 2nd Edition. Newbury Park, CA: Sage Publications.
6. ^ Joint Committee on Standards for Educational Evaluation. (2003). The Student Evaluation Standards: How to Improve Evaluations of Students. Newbury Park, CA: Corwin Press.

References

  • Andrich, D. & Luo, G. (1993) A hyperbolic cosine model for unfolding dichotomous single-stimulus responses. Applied Psychological Measurement, 17, 253-276.
  • Michell, J. (1997). Quantitative science and the definition of measurement in psychology. British Journal of Psychology, 88, 355-383.
  • Michell, J. (1999). Measurement in Psychology. Cambridge: Cambridge University Press.
  • Rasch, G. (1960/1980). Probabilistic models for some intelligence and attainment tests. Copenhagen: Danish Institute for Educational Research. Expanded edition (1980) with foreword and afterword by B. D. Wright. Chicago: The University of Chicago Press.
  • Reese, T.W. (1943). The application of the theory of physical measurement to the measurement of psychological magnitudes, with three experimental examples. Psychological Monographs, 55, 1-89.
  • Stevens, S. S. (1946). On the theory of scales of measurement. Science, 103, 677-80.
  • Thurstone, L.L. (1927). A law of comparative judgement. Psychological Review, 34, 278-286.
  • Thurstone, L.L. (1929). The Measurement of Psychological Value. In T.V. Smith and W.K. Wright (Eds.), Essays in Philosophy by Seventeen Doctors of Philosophy of the University of Chicago. Chicago: Open Court.
  • Thurstone, L.L. (1959). The Measurement of Values. Chicago: The University of Chicago Press.

This article is copied from an article on Wikipedia.org - the free encyclopedia created and edited by online user community. The text was not checked or edited by anyone on our staff. Although the vast majority of the wikipedia encyclopedia articles provide accurate and timely information please do not assume the accuracy of any particular article. This article is distributed under the terms of GNU Free Documentation License.