The 4 Branches of Data Science

All data science projects fall within one or more of the following four areas:

Statistical Inference
Causal Inference
Machine Learning
Descriptive Statistics

Below are short descriptions of each, followed by examples.

Statistical Inference

Goal: measurement, typically optimizing for an unbiased and low variance estimator.

Examples:

Metrics based on samples, e.g. conducting human reviews of samples of content to measure the rate of violative content being seen by users on the platform
Hypothesis testing, e.g. testing whether user retention differs across demographics

Causal Inference

Goal: inferring the causal connection between two events

Examples:

A/B tests, e.g. the new UX design outperform the old one on conversion rate?
Observational studies, i.e. absent a randomized experiment, can we tell if one event caused another?

Machine Learning

Goal: learning prediction functions using data and algorithms

Examples:

Classifying violative content, e.g. videos, images, comments
Predicting the product that a customer will purchase next

Descriptive Statistics

Goal: summarize the data, primarily to tell the story of what is happening, or to help generate hypotheses. This is what is commonly meant by the term “analytics”.

Examples:

Creating a North Star metric for teams to optimize towards
Calculating the growth rates in different customer segments

The 4 Branches of Data Science

Statistical Inference

Causal Inference

Machine Learning

Descriptive Statistics

Published by Justin

Leave a Comment Cancel reply

Statistical Inference

Causal Inference

Machine Learning

Descriptive Statistics

Share this:

Published by Justin

Leave a Comment Cancel reply