| Feature | Primary Data | Secondary Data |
|---|---|---|
| Source | Original collection by researcher | Pre-existing published/recorded data |
| Purpose | Specific to the current research | Collected for a different original purpose |
| Cost | High (requires staff, tools, time) | Low (often free or subscription-based) |
| Time | Long duration to collect and process | Rapidly available |
| Accuracy | High control over quality/bias | Quality depends on the original source |
Identify the 'Who' and 'Why': When presented with a scenario, always ask: 'Did the person currently using the data also collect it?' If yes, it is primary; if no, it is secondary.
Contextual Classification: Remember that the same dataset can be primary for one person and secondary for another. For example, a hospital's patient records are primary data for the hospital's administration but secondary data for a university researcher studying disease trends.
Evaluate Reliability: In exam questions regarding secondary data, always look for mentions of 'source credibility' or 'date of publication' as these are the primary criteria for verifying its usefulness.
Check for Bias: Primary data allows the researcher to minimize bias through design, while secondary data requires the researcher to identify and account for the biases of the original collector.
The 'Better' Fallacy: A common mistake is assuming primary data is always 'better' than secondary data. In reality, secondary data from a massive government census is often more accurate and representative than a small-scale primary survey conducted by an individual.
Ignoring Metadata: Students often forget that secondary data requires an understanding of how it was originally collected (metadata). Without knowing the original definitions and constraints, the data can be easily misinterpreted.
Confusing Method with Type: Just because data is in a digital database doesn't make it secondary. If you built the database through your own field research, it is still primary data.