Core Idea
- Digital behavior (searches, clicks, online traces) reveals truth people hide in surveys — use this "digital truth serum" to understand real human behavior, not self-reported versions
- Right data beats big data — a small dataset of honest signals outperforms massive amounts of misleading information
Four Powers of Big Data
- Access new data types previously unavailable (search behavior, online traces)
- Get honest answers people won't share in surveys or face-to-face
- Zoom in on small subsets (geographic regions, age cohorts) with statistical clarity
- Run rapid randomized experiments (A/B testing) to prove causation, not just correlation
Testing & Experimentation
- A/B test headlines, messaging, ads before full rollout — Boston Globe increased clicks 38% through headline testing
- Test political/persuasion messages — different framings move different voter groups; validate messaging before launch
- Beware addiction loops — apps optimize engagement through A/B testing; recognize manipulative design patterns
Proving Causality
- Use natural experiments when controlled trials aren't possible — look for "cutoff" data in your field (e.g., school enrollment cutoffs)
- Cross-validate findings across multiple datasets — don't rely on one study
- Test for reverse causality — determine if X causes Y or Y causes X (e.g., does college make you rich, or do rich people attend college?)
Business Applications
- Personalize pricing using customer doppelgangers — match customers to similar profiles for dynamic pricing
- Analyze unfiltered customer reviews — real patterns in feedback beat surveys
- Test monetary incentives on your specific population — financial windfalls don't always improve outcomes
Research & Interviewing
- Ask open-ended questions — word choice reveals hidden attitudes better than structured questions
- Listen more than you speak — active listening predicts success in sales, negotiation, and relationships
Critical Pitfalls to Avoid
- Don't measure only what's easy — focus on what actually matters, not just quantifiable metrics
- Beware the "curse of dimensionality" — more variables = easier to find false patterns; always validate predictions on new data
- Avoid individual-level predictions — low accuracy, high ethical risk; use for groups instead
- Watch for corporate/government overreach — predictive data enables discrimination and manipulation
Action Plan
- Audit your current data sources — identify where you're relying on surveys/intuition instead of digital behavior signals
- Design one A/B test this month — on messaging, pricing, or product feature; validate assumptions with real data, not guesswork
- Cross-validate your biggest business assumption — test for reverse causality and find alternative datasets that support or contradict it
- Combine multiple imperfect data sources — don't rely on single studies; triangulate truth from surveys + behavior + feedback
- Validate findings on new data — always test hypotheses on out-of-sample data to avoid false patterns