To conduct a correlational study, researchers must first define and operationalize two co-variables. These variables are often obtained from pre-existing data or through self-report measures like questionnaires.
The data is then plotted on a scattergraph to determine the direction of the relationship. If the points form a line from bottom-left to top-right, it is positive; top-left to bottom-right indicates a negative relationship.
Statistical tests are applied to calculate the coefficient (). A value close to or suggests a strong relationship, while a value closer to suggests a weak or non-existent relationship.
The most critical distinction in research is between correlation and causation. While an experiment can prove that one variable causes a change in another, a correlation only shows that they change together.
| Feature | Correlation | Experiment |
|---|---|---|
| Variables | Two co-variables | IV and DV |
| Manipulation | No manipulation | IV is manipulated |
| Goal | Identify relationships | Establish cause-effect |
| Control | Low control over extraneous variables | High control in lab settings |
Correlations are subject to the third variable problem. This occurs when an unmeasured 'intervening' variable is actually responsible for the relationship seen between the two co-variables.
When describing a correlation in an exam, never use causal language. Avoid words like 'causes', 'leads to', or 'affects'; instead, use phrases like 'is associated with' or 'there is a relationship between'.
Always check the sign and the value of the coefficient. A coefficient of is actually 'stronger' than a coefficient of , because the absolute value is higher, even though the direction is negative.
If asked to interpret a scattergraph, look for outliers. A single data point that sits far away from the general trend can significantly skew the correlation coefficient and lead to misleading conclusions.
A common mistake is assuming that a strong correlation implies a direct link. In reality, the relationship could be bi-directional, where both variables influence each other simultaneously.
Correlations are only effective for identifying linear relationships. If the relationship is curvilinear (e.g., anxiety improving performance up to a point, then decreasing it), a standard correlation coefficient may incorrectly suggest a zero correlation.