| Feature | Primary Data | Secondary Data |
|---|---|---|
| Source | Direct collection by researcher | Existing records/publications |
| Cost | High (labor, materials, time) | Low (often free or subscription-based) |
| Time | Long duration to collect | Rapidly accessible |
| Accuracy | High (controlled by researcher) | Variable (depends on original source) |
| Relevance | High (tailored to specific goal) | Moderate (may not fit perfectly) |
Identify the 'Who' and 'When': In exam scenarios, always ask: 'Did the person solving the problem collect this data themselves?' If yes, it is primary. If they found it in a book or database, it is secondary.
Reliability Checks: When discussing secondary data, always mention the need to verify the reputation of the source and the date of publication to ensure the data is not obsolete.
Contextual Justification: If asked which to use, justify based on constraints. Use primary for niche, new, or highly specific problems; use secondary for broad trends, historical comparisons, or when budgets are limited.
The 'Accuracy' Trap: Students often assume primary data is always 'better.' However, primary data can be highly biased if the survey design is poor, while secondary data from a government census is often more accurate due to massive sample sizes.
Ignoring Original Purpose: A common mistake is using secondary data without considering why it was originally collected. If the original purpose was marketing, the data might be skewed compared to data collected for scientific research.
Confusing the Medium with the Source: Just because you find data on the internet does not automatically make it secondary if you are the one who programmed the web-scraper to collect original user interactions.