10 questions · STAR-scored

Data Scientist Interview Questions

The questions data scientists actually get asked — with STAR-structured sample answers you can rewrite in your voice. Practice the rooms before you're in them.

By The ApplyVita Career TeamUpdated June 2, 2026How we know this

The questions

Behavioral

Walk me through a model you put in production.

Show sample answer ▾

S: Churn prediction for retention. T: Predict next-30-day churn well enough that retention team prioritises outreach. A: XGBoost on 90-day behavioural features, AUC 0.87, productionised with FastAPI + Docker. R: 3.2× lift on the targeted segment, $1.4M retained ARR.

Behavioral

Explain p-value to a stakeholder.

Show sample answer ▾

P-value is the probability of seeing a result this extreme if the change had no real effect. So a p of 0.03 means: if our change did nothing, there's a 3% chance we'd see this much improvement by random variation. Not 'probability we're right' — that's a Bayesian framing.

Behavioral

How do you decide what to A/B test?

Show sample answer ▾

Three filters: (1) is the metric we care about measurable in <4 weeks? (2) does the variant create enough delta that we can detect it with our traffic? (3) is the alternative (just shipping it) materially cheap if we're wrong?

Case

How would you detect bot traffic?

Show sample answer ▾

Multi-layer: behavioural (event timing distribution, session depth), browser fingerprint (canvas, audio), IP reputation, post-hoc anomaly detection on conversion patterns. Start with behavioural — cheapest, catches most. Layer fingerprinting for the sophisticated cases.

Technical

When would you NOT use ML?

Show sample answer ▾

When a heuristic captures 90%+ of the value at 10% of the operational cost. When the data is non-stationary in ways your model can't adapt to. When the explainability requirement is high enough that a simple rule beats a black-box accuracy gain.

Technical

How do you handle imbalanced classes?

Show sample answer ▾

First — verify the imbalance is real (sometimes it's a labelling artifact). Then: class weights, SMOTE-like oversampling, undersampling the majority, or threshold tuning post-prediction. Often class weights + threshold tuning is the cheapest path to a good operating point.

Behavioral

Describe a stakeholder you struggled to convince.

Show sample answer ▾

S: Exec didn't trust experiment results because variant looked worse on a *vanity* metric. T: Convince. A: Walked through the guardrail metrics, the segment analysis, the causal-inference robustness check. Brought the engineering lead to corroborate the deploy was clean. R: Shipped, +14% on the actual north star.

Behavioral

How do you stay current on the field?

Show sample answer ▾

Quarterly: read 2-3 papers from NeurIPS / ICML in my area. Weekly: substack/podcast for 30 minutes. Daily: I subscribe to specific researchers on Twitter/X. Once a year I attend one conference in person.

Behavioral

What's a metric you've moved that you're most proud of?

Show sample answer ▾

Took activation rate from 41% to 63% in 4 months. Owned the experiment design, ran 8 variants, the winning one was the boring-est option (defaults that pre-filled from signup) — humbling.

Technical

How do you measure model decay in production?

Show sample answer ▾

Daily: tracking metric (e.g. AUC on the prior week's labelled data, with delay). Weekly: drift in input features (PSI). Monthly: a held-out canary of new data scored against the production model. Alert thresholds on any of the three.

How to prepare — the STAR rubric

Every strong behavioral answer follows the same four-part structure: Situation(the context — 2 sentences), Task (what success looked like — 1 sentence),Action (what you actually did, 3-5 specific steps), and Result(the measurable outcome). Most candidates over-invest in Situation and under-invest in Result. The Result is where the interviewer scores you.

Watch-outs specific to data scientist interviews

Use 'Data Scientist' as a literal phrase in your summary — many ATSes weight exact-title matches.
Include 'machine learning' and 'experimentation' both — different recruiters search differently.
List Python, SQL, R as separate skills — many ATSes filter on language presence.

Run a data scientist mock interview — free.

Voice or text. Per-answer STAR scoring. Saved across devices.

Start free

Continue your Data Scientist prep

Data Scientist Resume Example

Open

Data Scientist Cover Letter

Open

Data Scientist Salary Guide

Open

Put this into action — free, no signup

Score this résumé — free

Free ATS checker

Build a Data Scientist résumé

Free resume builder

Tailor it to a job

Match any JD

About this guide

The ApplyVita Career Team

The ApplyVita Career Team builds the resume-scoring and job-matching tools at the core of ApplyVita. Our guidance is grounded in the same four-component ATS rubric our product scores resumes on — content and impact, keyword match, formatting, and skills — and in current recruiter and hiring-manager practice. Every guide is checked against that rubric before it is published, and updated as hiring norms change.

Salary figures are estimates informed by publicly reported data from Glassdoor, Levels.fyi, AmbitionBox, LinkedIn Salary and others — negotiation anchors, not guarantees.Read our editorial standards, sourcing & corrections policy →