Year 3

Data crunching is the practice of collecting, processing, and analyzing data to extract insights that drive decisions. In the context of entrepreneurship, it means making decisions based on evidence rather than gut feeling.

The process:

  1. Collect — gather data from relevant sources. User behavior, sales figures, surveys, market data, web analytics. The challenge is usually not having too little data but too much.
  2. Clean — raw data is messy. Missing values, duplicates, inconsistent formats. Cleaning data is boring but essential — garbage in, garbage out.
  3. Analyze — look for patterns, correlations, trends, and anomalies. This ranges from simple spreadsheet analysis to advanced statistical modeling.
  4. Visualize — turn numbers into pictures. A good chart communicates more than a table of numbers ever could. Charts are for understanding; tables are for lookup.
  5. Act — the whole point. Data without action is trivia. What decision does this analysis inform?

Key concepts:

  • Correlation ≠ causation — ice cream sales and drowning deaths both increase in summer. That doesn’t mean ice cream causes drowning. Be careful about causal claims.
  • Sample size matters — 5 users saying they like your product doesn’t mean the market does. Statistical significance is a thing.
  • Vanity metrics vs. actionable metrics — page views feel good but don’t tell you much. Conversion rate, retention, revenue per user — those drive decisions.
  • A/B testing — the gold standard for digital experimentation. Show group A one version, group B another, measure which performs better. Let the data decide.

Tools range from simple (Excel, Google Sheets) to advanced (Python with pandas, SQL databases, Tableau, R). Start simple. Upgrade tools when your questions outgrow them.

Related: Machine Learning, Data Agencies