10 questions · STAR-scored

Data Scientist Interview Questions

The questions data scientists actually get asked — with STAR-structured sample answers you can rewrite in your voice. Practice the rooms before you're in them.

The questions

1
Behavioral
Walk me through a model you put in production.
Show sample answer

S: Churn prediction for retention. T: Predict next-30-day churn well enough that retention team prioritises outreach. A: XGBoost on 90-day behavioural features, AUC 0.87, productionised with FastAPI + Docker. R: 3.2× lift on the targeted segment, $1.4M retained ARR.

2
Behavioral
Explain p-value to a stakeholder.
Show sample answer

P-value is the probability of seeing a result this extreme if the change had no real effect. So a p of 0.03 means: if our change did nothing, there's a 3% chance we'd see this much improvement by random variation. Not 'probability we're right' — that's a Bayesian framing.

3
Behavioral
How do you decide what to A/B test?
Show sample answer

Three filters: (1) is the metric we care about measurable in <4 weeks? (2) does the variant create enough delta that we can detect it with our traffic? (3) is the alternative (just shipping it) materially cheap if we're wrong?

4
Case
How would you detect bot traffic?
Show sample answer

Multi-layer: behavioural (event timing distribution, session depth), browser fingerprint (canvas, audio), IP reputation, post-hoc anomaly detection on conversion patterns. Start with behavioural — cheapest, catches most. Layer fingerprinting for the sophisticated cases.

5
Technical
When would you NOT use ML?
Show sample answer

When a heuristic captures 90%+ of the value at 10% of the operational cost. When the data is non-stationary in ways your model can't adapt to. When the explainability requirement is high enough that a simple rule beats a black-box accuracy gain.

6
Technical
How do you handle imbalanced classes?
Show sample answer

First — verify the imbalance is real (sometimes it's a labelling artifact). Then: class weights, SMOTE-like oversampling, undersampling the majority, or threshold tuning post-prediction. Often class weights + threshold tuning is the cheapest path to a good operating point.

7
Behavioral
Describe a stakeholder you struggled to convince.
Show sample answer

S: Exec didn't trust experiment results because variant looked worse on a *vanity* metric. T: Convince. A: Walked through the guardrail metrics, the segment analysis, the causal-inference robustness check. Brought the engineering lead to corroborate the deploy was clean. R: Shipped, +14% on the actual north star.

8
Behavioral
How do you stay current on the field?
Show sample answer

Quarterly: read 2-3 papers from NeurIPS / ICML in my area. Weekly: substack/podcast for 30 minutes. Daily: I subscribe to specific researchers on Twitter/X. Once a year I attend one conference in person.

9
Behavioral
What's a metric you've moved that you're most proud of?
Show sample answer

Took activation rate from 41% to 63% in 4 months. Owned the experiment design, ran 8 variants, the winning one was the boring-est option (defaults that pre-filled from signup) — humbling.

10
Technical
How do you measure model decay in production?
Show sample answer

Daily: tracking metric (e.g. AUC on the prior week's labelled data, with delay). Weekly: drift in input features (PSI). Monthly: a held-out canary of new data scored against the production model. Alert thresholds on any of the three.

How to prepare — the STAR rubric

Every strong behavioral answer follows the same four-part structure: Situation(the context — 2 sentences), Task (what success looked like — 1 sentence),Action (what you actually did, 3-5 specific steps), and Result(the measurable outcome). Most candidates over-invest in Situation and under-invest in Result. The Result is where the interviewer scores you.

Watch-outs specific to data scientist interviews

Run a data scientist mock interview — free.

Voice or text. Per-answer STAR scoring. Saved across devices.

Start free
Continue your Data Scientist prep
About this guide
The ApplyVita Career Team

The ApplyVita Career Team builds the resume-scoring and job-matching tools at the core of ApplyVita. Our guidance is grounded in the same four-component ATS rubric our product scores resumes on — content and impact, keyword match, formatting, and skills — and in current recruiter and hiring-manager practice. Every guide is checked against that rubric before it is published, and updated as hiring norms change.

Salary figures are estimates informed by publicly reported data from Glassdoor, Levels.fyi, AmbitionBox, LinkedIn Salary and others — negotiation anchors, not guarantees.Read our editorial standards, sourcing & corrections policy →