Statistical and Educational Services
Inferential Statistics Glossary (A-Z)
Inferential Statistics Glossary (A-Z)
The hypothesis that is accepted if the null hypothesis is rejected, usually represented by the symbol H1. Also known as the experimental or research hypothesis. The alternative hypothesis usually states that the independent variable has had an effect on the dependent variable that cannot be explained by chance alone. (N.B. You never prove that the alternative hypothesis is correct, even if the null hypothesis is rejected, there is always a chance that you have wrongly rejected it.)
Availability (convenience) sampling
A sample based on subjects who are available and willing to participate in the study. Many experiments in psychology use undergraduates as subjects as they are cheap and easy to find.
Between subjects design
An experimental design in which each is randomly assigned to only one of the treatment conditions.
A research method involving the detailed study of a single individual, used mainly in clinical psychology and neuropsychology.
The appropriate test to investigate independent frequency counts taken from a sample.
An independent variable created by the selection of subjects on the basis of a factor such as age or gender. It is a characteristic which is observed and classified rather than imposed by the experimenter.
One or more variables not under the control of the experimenter that vary systematically with the independent variable, decreasing the experimenter's ability to isolate cause and effect.
A group of subjects which does not receive the experimental treatment but in all other respects is treated in the same way as the experimental group, (so as to tease out the effects of the treatment itself). In medical studies involving the administration of the control group is known as the placebo group. A neutral substance ( placebo ) is administered to this group without the subjects knowing if it is an active drug or not.
A statistical relationship between two variables such that high scores on one factor tend to go with high scores on the other factor (positive correlation) or that high scores on one factor go with low scores on the other factor (negative correlation).
A type of research design in which patterns of corre
An arrangement of treatment conditions to neutralize practice effects the effects of fatigue, e.g. where group A completes Task 1 followed by Task and group B completes Task 2 followed by Task 1.
A technique used for studying developmental factors e.g. by studying a number of groups of children of differing ages (and thus at different stages of development) at the same point in time.
Features of the experimental situation which can affect the subjects' behavior, in particular when the subject has expectations about what he/she is required to or has worked out what the experimenter "wants" to happen.
The selected behavior which is measured to try to gauge the effect of the independent variable in an experimental design.
Data summarized in such as mean, median, mode. This forms the first stage of data analysis. Means, standard deviations standard errors are presented in the form of a table.
In order to control for experimenter effects and demand characteristics, neither the subjects nor the experimenter knows which group of subjects has received which experimental treatment until after the data has been collected. This type of design is of particular use in pharmacology research when new drugs are being tested.
From the Greek en, (in), (trial), meaning derived from careful observations or experiments rather than from speculation or theory.
An independent variable in which some aspect of the subject's physical surroundings manipulated.
A method of research which permits the inference of cause and effect. At least two groups of subjects are treated exactly alike in all ways except one, the independent variable. Differences in the behavior of the Experimental and Control group which cannot be accounted for by experimental error are then attributed to the effect of the experimental treatment.
The plan of the experiment which specifies the treatment conditions (independent variables ), what is to be measured (dependent variables ) and methods of assigning subjects to groups.
Uncontrolled sources of variability in the results which occur randomly during the experiment. Much of this error is due to individual differences among subjects.
See Alternative hypothesis.
Where the experimenter attempts to identify cause and effect.
Research designed to investigate an area on which little information exists. This includes the use of pilot studies, which are trial runs of an experiment. The aim is to gain more information before doing more thorough research.
The degree to which results of a study with a sample of subjects can be generalized to make statements about a much larger population of subjects.
A statistical index relating systematic variation in the data (caused by treatment effects plus random error) to unsystematic variability in the data (caused by random error alone). The effects of treatments plus error the numerator and the effect of error(chance) is the denominator of the F-ratio.
A term which usually refers to the independent variable. If two different independent variables are used they are referred to as Factor A and Factor B. Each factor may have different levels, e.g. if Factor A is a drug treatment then doses of differing amounts are the dose levels.
Experimental designs in which two or more independent variables are used. This permits the analysis of interactions between variables.
An inference made from a sample a population. The researcher attempts to extend the results of his/her study to a much larger group of people.
See Null hypothesis.
See Alternative Hypothesis.
The formal process by which a decision is made concerning the rejection or acceptance of the null hypothesis.
The variable manipulated by the experimenter. It is a feature of a task given to or a manipulation of the external or internal environment. Internal environment refers to attitudes, beliefs etc.
Procedures and measures used to make inferences about population characteristics from samples drawn from that population. The process of hypothesis testing is part of inferential statistics.
Measurements on a continuous numerical scale with equal intervals between points but where the zero point is arbitrary, as in, for example, measures of temperature in Celsius or Fahrenheit.
Jonckheere trend test
A statistical test which investigates whether a trend exists across a number of ordered between-subjects conditions.
Linear regression line
A "line of best fit" depicting the linear relationship between two variables. The line is characterized by two features, slope intercept. The formula for the best-fit straight line can be used to predict one variable from another.
A linear relationship between the independent and the dependent variable, a relationship which can be represented by a straight line.
A form of research often used to study, e.g., developmental issues where the group of subjects is studied over an extended period of time. Measurements are taken several times at regular intervals to look at the effect of time on the dependent variable.
A class of between-subjects design in which the subjects are matched on one or more relevant characteristics. This design is used to reduce between groups variability.
A measure of central tendency, giving an average of a set of scores (i.e. the sum of all the scores divided by the number of scores in the set).
of central tendency, giving the value of the middlemost score (above or below which half of all the scores lie). If there are an even number of scores the median is the average of the two middle scores.
A form of observational research in which the observer records information about naturally occurring behavior while attempting not to intervene or affect the behavior in any way. This research is also described as unobtrusive.
Data is allocated into different (often named or numbered) categories. For example, the allocation of books in a library catalog to different topics. Data on this scale cannot be meaningfully added and subtracted.
Research which lacks variable which is manipulated by the experimenter. Useful in situations where it is not possible or not ethical to manipulate the variable of interest.
Used when data is ordinal or nominal in so that operations like addition and subtraction cannot be meaningfully applied. These tests are less sensitive than parametric tests to trends in the data (e.g. differences between conditions) but can be used with a wider range of measures.
A theoretical data distribution which appears is symmetrical about the mean and has the most probable scores concentrated around the mean. Progressively less likely scores occur further away from the mean. 68.26% of the scores lie within one standard deviation either side of the mean, 95.44% of the scores lie within 2 standard deviations either side of the mean and 99.75% fall within three standard deviations.
This is usually a statement of "no effect", that is to that the independent variable will not have any effect on the dependent variable and that any differences between the experimental and control groups are attributable to chance. The null hypothesis is usually represented by the symbol and is stated in that it can be rejected as an explanation for the results of the experiment.
The systematic study of behavior as it occurs in the natural environment.
A scale of measurement where data are put in order, but where there is no fixed amount of difference between the points on the scale. For example, the rank order of premier league football teams, or World ranking of tennis players.
Logical systems made up of theories and research techniques which reflect a predominant way of thinking about a particular topic.
Can be carried out on data which is interval or ratio scale, and thus is suitable for arithmetic operations such as addition and subtraction. This enables parameters such as mean and standard deviation to be defined.
A form of observational research in which the observer's presence is known to the subject.
An inactive substance or dummy treatment administered to a control group to compare its' effects with a real substance, drug or treatment.
A positive or therapeutic benefit resulting from the administration of a placebo to someone who believes the treatment is real.
The total number of all possible subjects or elements which could be included in a study. If the data are valid, the results of research on a sample of subjects drawn from a much larger population can then be generalized to the population.
The probability of correctly rejecting the null hypothesis, i.e. rejecting the null hypothesis when it is false; defined as 1 minus the probability of a type II error (See type I and type II errors).
The systematic change (increase or decrease) in the subjects' performance over a series of treatment conditions in a repeated measures (within-subjects) design. A potential source of error usually neutralized by using a counterbalancing design.
Research designed to find out if the score on a particular measure or a test result corresponds with some other behavior of interest. e.g. do the results of an IQ test predict how well a person will perform in final exams?
From the Greek, psyche (mind) logos (study), the study of the nature and functions of the mind and of human behavior.
From the Greek, psyche (mind) metron (measure), testing of mental ability such as IQ, also includes the use of tests to measure interests, attitudes personality.
Random assignment of subjects
Procedure by which each subject has an equal probability of being assigned to each different treatment condition in an experiment.
Introduced by R. A. Fisher in 1926 so that inferential statistics could be carried out to analyze differences between groups of subjects.
A group of subjects randomly chosen from a defined population.
A procedure in which each member of the population has an equal chance of being included in the sample.
The consistency with which a measuring instrument (such as a psychometric test) performs its' function, gauged, for example, by comparing test scores from the same subjects at different times.
A subgroup selected from a larger group of potential subjects (population).
The number of subjects assigned to a treatment condition in an experiment or study.
The process of selecting subjects for research. See random sampling, availability sampling.
The frequency distribution of a statistic obtained from an extremely large number of random samples drawn from a specified population.
Is achieved when there is a low probability that the results of an experiment occurred by chance alone. In it is conventional that results are said to be significant if the probability of their occurrence by chance is equal to or less than 5 or 0.05.
The probability with which the experimenter is willing to reject the null hypothesis (in favor of the alternative hypothesis) when the null hypothesis is in fact correct. Also known as the probability of a type I error.
A measure of dispersion within a set of data, calculated from the square root of the variance, to give a value in the same range as raw scores. The standard deviation is the spread of scores around the mean of the sample.
The standard deviation of the sampling distribution of the mean. A statistical estimate of the population standard deviation based on the mean and standard deviation of one sample. Calculated by dividing the standard deviation of the sample by the square root of the number of subjects in the sample.
The loss of subjects during a research study (hopefully not due to death). Subjects may drop out of a study for a variety of reasons. It becomes a problem when is not random and occurs unequally across groups.
Research using questionnaires or interviews to poll or obtain information.
A parametric statistical test of the difference between the means of two samples.
An independent variable in which some aspect of a task is manipulated by the experimenter.
A set of propositions which summaries, organize, and explain a variety of known facts, e.g. Darwin's theory of evolution. Theories are intended to logically summaries information and to give a framework for the generation of new tests and ideas on the topic.
Type I error
An error of statistical inference when the null hypothesis is rejected when it is true. This is an error of "seeing too much in the data."
Type II error
An error of statistical inference when the null hypothesis is retained when it is false. This is an error of "not seeing enough in the data."
From the Latin (strong), the degree to which a measuring instrument measures what it is supposed to measure.
The degree to which differences exist among a set of scores. The standard deviation is usually used to describe the variability of scores in a sample.
A property that can take different values. In variables are classed as independent and dependent.
A measure of variability based on the variation of subjects treated alike in an experiment (i.e. the subjects are in the same group). The amount of within-groups variability gives a measure of experimental error.
An experimental design where all subjects receive all treatment conditions. Also called a repeated measures design.
The variable which is plotted on the X-axis of a graph (i.e. the horizontal axis). In an experiment the variable plotted on the X-axis is the independent variable.
The variable plotted on the Y-axis of a graph (vertical axis). In an experiment, the Y-variable is the dependent variable.
A score expressed in units of standard deviations from the mean. Also known as a standard score.