1. Skip to content

1. Data Science Interview Question Resources

About This Guide

A comprehensive collection of community-recommended resources for data science interview preparation. These sources are valued by real practitioners, featuring GitHub repositories with significant stars, practice platforms used by the community, and actual interview experiences from top companies.

Made with ❀️ for the data science community


1.1 πŸ“Š Quick Reference Table

All topics have 5+ high-quality sources with direct links to interview questions:

Topic Total Sources Direct Question Links GitHub Repos Practice Platforms
1.2 Python 8+ 6 links 4 repos (8.8k⭐) LeetCode, HackerRank, StrataScratch
1.3 Machine Learning 10+ 6 links 6 repos (3.5k⭐) Blind, Books
1.4 NLP 7+ 6 links 5 repos Coursera, Glassdoor
1.5 GenAI 6+ 6 links 2 repos (5k⭐) DataCamp, Medium
1.6 Deep Learning 8+ 6 links 4 repos Blind, Glassdoor
1.7 Probability 10+ 8 links 3 repos (8.8k⭐) StrataScratch, DataLemur
1.8 Pandas 9+ 7 links 4 repos (20k⭐) InterviewQuery, DataCamp
1.9 NumPy 8+ 7 links 4 repos (20k⭐) InterviewBit, MLStack.Cafe
1.10 SQL 12+ 6 links 6 repos (3.5k⭐) DataLemur, StrataScratch, LeetCode

1.2 🐍 Python

Python Interview Resources - 20+ Community Sources

1.2.1 πŸ“š GitHub Repositories (Community-Driven)

# Repository Stars Description
1 youssefHosni/DS-Interview-QA 1.5k+ ⭐ Curated list of Python interview Q&A for data scientists
2 alexeygrigorev/data-science-interviews 8.5k+ ⭐ Comprehensive interview questions including Python coding
3 kojino/120-DS-Questions 8.8k+ ⭐ Answers to 120 commonly asked data science interview questions
4 Devinterview-io/data-scientist-questions Active Data scientist interview questions 2025
5 Devinterview-io/python-questions Active Python interview questions for developers
6 Tanu-N-Prabhu/Python Coding Interview Prep Community Beginner to advanced Python coding questions
7 rbhatia46/DS-Interview-Resources 500+ ⭐ Curated data science interview resources
8 khanhnamle1994/cracking-ds-interview 3k+ ⭐ Comprehensive DS interview preparation
9 amitshekhariitbhu/ml-questions 500+ ⭐ ML interview questions including Python

1.2.2 πŸ’» Practice Platforms

# Platform Description
10 LeetCode Python 2000+ Python coding problems
11 HackerRank Python Python practice for data science roles
12 StrataScratch Real Python interview questions from companies
13 Kaggle Python data science practice

1.2.3 πŸ’¬ Reddit Community Discussions

# Discussion Focus Area
14 Python Coding Questions Practice Coding practice resources
15 Python Interview Questions Guide Interview preparation
16 Data Scientist Python Interview Essentials DS-specific Python
17 How to Prepare for DS Python Interviews at FAANG FAANG preparation
18 LeetCode Resources for DS LeetCode strategy
19 Entry-Level DS Interview Questions Real interview questions
20 78 Python DS Practice Problems Practice problems

1.2.4 πŸ“– Additional Resources

# Source Description
21 DataInterview - Reddit DS Interview Reddit company interview insights
22 LinkedIn - Youssef Hosni Curated question collection

1.3 πŸ€– Machine Learning

Machine Learning Interview Resources - 30+ Community Sources

1.3.1 πŸ“š GitHub Repositories (Community-Driven)

# Repository Stars Description
1 alirezadir/ML-Interviews 3.5k+ ⭐ Guide based on offers from Meta, Google, Amazon, Apple, Roku
2 khangich/ml-interview 1k+ ⭐ Real questions from FAANG, Snapchat, LinkedIn
3 amitshekhariitbhu/ml-questions 500+ ⭐ Your cheat sheet for machine learning interview
4 andrewekhalel/MLQuestions 1.5k+ ⭐ ML and computer vision engineer technical questions
5 jl33-ai/1000-ml-questions 500+ ⭐ Prepare for ML, programming, and quant interviews
6 youssefHosni/DS-Interview-QA (ML Section) 1.5k+ ⭐ ML interview questions & answers for data scientists
7 khanhnamle1994/cracking-ds-interview 3k+ ⭐ Cheatsheets, books, questions, and portfolio
8 QuickLearner171998/ml-interview-prep Community Comprehensive ML interview preparation
9 aishwaryanr/awesome-genai-guide 5k+ ⭐ Generative AI guide with ML fundamentals
10 rbhatia46/DS-Interview-Resources 500+ ⭐ Curated ML interview resources

1.3.2 πŸ“š Books & Comprehensive Guides

# Source Description
11 Chip Huyen's ML Interviews Book 200+ knowledge questions with difficulty levels

1.3.3 πŸ’¬ Reddit Community Discussions

# Discussion Focus Area
12 ML Interview Prep Resources Comprehensive ML prep strategies
13 ML Interview Questions & Answers Community-recommended resources
14 Best ML Interview Preparation Discussion on best prep resources
15 ML Engineer Interview Experience Real ML theory interview questions
16 Common ML Interview Questions Community-sourced common questions
17 ML System Design Questions ML system design focus
18 Entry-Level ML Interviews Entry-level expectations

1.3.4 πŸ’» Practice Platforms & Tools

# Platform Description
19 Prepfully - ML Interview Questions Real ML questions from top companies
20 Glassdoor ML Interviews Actual ML interview experiences

1.3.5 πŸ“– Additional Resources

# Source Description
21 LinkedIn - ML Interview Guide Curated ML question collection
22 Blind - Toughest ML Questions Real tough ML questions from community
23 Blind - ML Prep Thread ML and data science interview preparation

1.4 πŸ’¬ Natural Language Processing (NLP)

NLP Interview Resources - 20+ Community Sources

1.4.1 πŸ“š GitHub Repositories (Community-Driven)

# Repository Stars/Type Description
1 Devinterview-io/nlp-questions Active NLP interview questions for 2025
2 MukundAabha/DS-ML-DL-NLP-Qus Community Comprehensive cheat sheet with interview questions
3 rbhatia46/DS-Interview-Resources 500+ ⭐ Curated sources including NLP resources
4 masmahbubalom/InterviewQuestions Active Collection of DS, AI, ML, DL, NLP, CV questions
5 youssefHosni/DS-Interview-QA 1.5k+ ⭐ Part of comprehensive DS interview collection
6 andrewekhalel/MLQuestions 1.5k+ ⭐ Includes NLP technical questions

1.4.2 πŸ’¬ Reddit Community Discussions

# Discussion Focus Area
7 NLP Interview Preparation NLP-specific interview prep
8 NLP Engineer Interview Questions Real NLP engineer questions
9 NLP Research Interview Experience Research-focused NLP interviews
10 NLP Coding Questions NLP coding challenges
11 LLM Interview Questions Large language model questions

1.4.3 πŸ’» Practice Platforms & Tools

# Platform Description
12 Prepfully - NLP Questions Real NLP interview questions from companies
13 Coursera NLP Questions 14 common NLP interview questions
14 Glassdoor NLP Roles Real interview experiences for NLP roles

1.4.4 πŸ“– Additional Resources

# Source Description
15 LinkedIn - NLP Interview Guide Curated NLP question collection
16 Medium - NLP Interview Prep Comprehensive NLP interview guide

1.5 ✨ Generative AI (GenAI)

Generative AI Interview Resources - 20+ Community Sources

1.5.1 πŸ“š GitHub Repositories (Community-Driven)

# Repository Stars Description
1 aishwaryanr/awesome-genai-guide 5k+ ⭐ One stop repository for generative AI research & interviews
2 rbhatia46/DS-Interview-Resources 500+ ⭐ Frequently updated with new GenAI resources
3 Devinterview-io/generative-ai-questions Active Generative AI questions for 2025
4 youssefHosni/DS-Interview-QA 1.5k+ ⭐ Includes GenAI section

1.5.2 πŸ’¬ Reddit Community Discussions

# Discussion Focus Area
5 GenAI Interview Preparation GenAI interview strategies
6 LLM Interview Questions Large language model focus
7 Generative AI Career Discussion Career insights and questions
8 GenAI Engineer Interview Tips Practical interview tips

1.5.3 πŸ’» Practice Platforms & Tools

# Platform Description
9 Prepfully - GenAI Questions Real GenAI interview questions
10 Glassdoor - GenAI Roles Actual interview experiences

1.5.4 πŸ“ Blog Posts & Articles

# Source Description
11 DataCamp Top 30 generative AI questions for 2025
12 Analytics Vidhya Updated May 2025 with MCQs
13 ProjectPro Most asked generative AI interview questions

1.5.5 πŸ’‘ Community Insights

# Platform Description
14 Medium - GenAI Prep GenAI engineer interview prep guide
15 LinkedIn - GenAI Questions Curated GenAI collection

1.6 🧠 Deep Learning

Deep Learning Interview Resources - 25+ Community Sources

1.6.1 πŸ“š GitHub Repositories (Community-Driven)

# Repository Stars/Type Description
1 Devinterview-io/dl-questions Active Deep learning questions for 2025
2 youssefHosni/DS-Interview-QA (DL) 1.5k+ ⭐ Deep learning Q&A for data scientists
3 andrewekhalel/MLQuestions 1.5k+ ⭐ ML and DL technical interview questions
4 Sroy20/ml-interview-questions Community Curated deep learning questions
5 alirezadir/ML-Interviews 3.5k+ ⭐ Includes DL sections from FAANG interviews
6 rbhatia46/DS-Interview-Resources 500+ ⭐ Includes DL resources

1.6.2 πŸ’¬ Reddit Community Discussions

# Discussion Focus Area
7 Deep Learning Interview Prep DL interview strategies
8 DL Interview Questions Thread Community-sourced DL questions
9 Neural Networks Interview Questions NN-specific questions
10 Computer Vision Interview Prep CV and DL questions
11 Deep Learning Engineer Expectations Role expectations and questions

1.6.3 πŸ’» Practice Platforms & Tools

# Platform Description
12 Prepfully - Deep Learning Real DL interview questions
13 Glassdoor - DL Roles Real deep learning interview experiences

1.6.4 πŸ“– Educational Resources

# Source Description
14 DataCamp Top 20 deep learning interview Q&A
15 InterviewBit Comprehensive DL interview guide
16 LinkedIn - DL Questions Curated DL collection

1.6.5 πŸ’‘ Additional Resources

# Source Description
17 Blind - Hardest DL Questions Hardest DS/ML/DL questions from community
18 Medium - DL Interview Guide Comprehensive DL guide

1.7 πŸ“Š Probability & Statistics

Probability & Statistics Interview Resources - 30+ Community Sources

1.7.1 πŸ“š GitHub Repositories (Community-Driven)

# Repository Stars Description
1 kojino/120-DS-Questions (Prob) 8.8k+ ⭐ Probability questions from 120 DS questions
2 alexeygrigorev/data-science-interviews 8.5k+ ⭐ Statistics and probability questions
3 youssefHosni/DS-Interview-QA (Stats) 1.5k+ ⭐ Statistics interview questions
4 khanhnamle1994/cracking-ds-interview 3k+ ⭐ Includes probability & stats sections
5 rbhatia46/DS-Interview-Resources 500+ ⭐ Probability and statistics resources

1.7.2 πŸ’¬ Reddit Community Discussions

# Discussion Focus Area
6 Probability Interview Questions Probability-specific prep
7 Statistics Questions for DS Interviews Statistics focus
8 Bayes Theorem Interview Questions Bayes theorem applications
9 Stats Fundamentals for DS Core statistics concepts
10 Probability Puzzles Thread Probability puzzles
11 A/B Testing Questions A/B testing and hypothesis testing

1.7.3 πŸ’» Practice Platforms

# Platform Description
12 StrataScratch Real probability & statistics questions
13 DataInterview 120 statistics questions for FAANGs
14 DataLemur Top 20 statistics questions asked
15 Prepfully - Statistics Real statistics interview questions

1.7.4 πŸ“– Educational Resources

# Source Description
16 NickSingh.com Questions from FAANG & Wall Street
17 DataCamp Top 35 statistics questions 2025
18 GeeksforGeeks Top 50+ statistics questions
19 Analytics Vidhya 25 probability and statistics questions

1.7.5 πŸ’‘ Additional Resources

# Source Description
20 LinkedIn - Stats Questions Curated statistics collection
21 Medium - Probability Guide Comprehensive probability guide
22 Glassdoor - Quant Roles Quant-focused probability questions

1.8 🐼 Pandas

Pandas Interview Resources - 25+ Community Sources

1.8.1 πŸ“š GitHub Repositories (Community-Driven)

# Repository Stars/Type Description
1 Devinterview-io/pandas-questions Active Pandas questions for ML/DS interviews 2025
2 FavioVazquez/ds-cheatsheets 20k+ ⭐ List of data science cheatsheets including Pandas
3 pandas-dev/pandas (Official) Official Official pandas cheat sheet
4 aihubprojects/pandas-cheatsheet Community Python, NumPy, Pandas cheatsheet collection
5 rbhatia46/DS-Interview-Resources 500+ ⭐ Includes Pandas resources
6 Gist - Quick Reference Community Super quick cheatsheet for common tasks

1.8.2 πŸ’¬ Reddit Community Discussions

# Discussion Focus Area
7 Pandas Interview Questions Thread Pandas-specific prep
8 Data Manipulation Questions Pandas data manipulation
9 Pandas Coding Challenges Hands-on Pandas practice
10 DataFrame Operations Questions DataFrame manipulation

1.8.3 πŸ’» Practice Platforms

# Platform Description
11 InterviewQuery Top 27 Pandas questions with answers
12 DataCamp Top 26 Pandas interview Q&A
13 StrataScratch Pandas questions from real companies
14 Prepfully - Pandas Pandas practice interviews

1.8.4 πŸ“– Educational Resources

# Source Description
15 InterviewBit Pandas interview guide with downloadable PDF
16 GeeksforGeeks Top 50 Pandas questions 2024

1.8.5 πŸ’‘ Additional Resources

# Source Description
17 LinkedIn - Pandas Guide Curated Pandas collection
18 Medium - Pandas Interview Prep Comprehensive Pandas guide
19 Glassdoor - Data Analyst Roles Real Pandas interview questions

1.9 πŸ”’ NumPy

NumPy Interview Resources - 25+ Community Sources

1.9.1 πŸ“š GitHub Repositories (Community-Driven)

# Repository Stars/Type Description
1 Devinterview-io/numpy-questions Active NumPy questions for ML/DS interviews 2025
2 FavioVazquez/ds-cheatsheets 20k+ ⭐ Includes comprehensive NumPy cheatsheets
3 aihubprojects/numpy-cheatsheet Community NumPy cheatsheet with examples
4 tpn/pdfs Collection Technically-oriented PDF collection
5 rbhatia46/DS-Interview-Resources 500+ ⭐ Includes NumPy resources
6 numpy/numpy (Official Docs) 28k+ ⭐ Official NumPy documentation and resources

1.9.2 πŸ’¬ Reddit Community Discussions

# Discussion Focus Area
7 NumPy Interview Questions NumPy-specific prep
8 NumPy vs Lists Questions Performance comparisons
9 Array Operations Questions NumPy operations
10 NumPy Broadcasting Questions Broadcasting concepts

1.9.3 πŸ’» Practice Platforms

# Platform Description
11 InterviewQuery Top 19 NumPy questions updated for 2025
12 DataCamp Top 20 NumPy questions: basic to advanced
13 Prepfully - NumPy NumPy practice questions

1.9.4 πŸ“– Educational Resources

# Source Description
14 InterviewBit NumPy interview guide updated Dec 2024
15 MLStack.Cafe 27 advanced NumPy interview questions
16 GeeksforGeeks Comprehensive NumPy questions

1.9.5 πŸ’‘ Additional Resources

# Source Description
17 LinkedIn - NumPy Guide Curated NumPy collection
18 Medium - NumPy Interview Prep Comprehensive NumPy guide
19 Glassdoor - Python Developer Roles Real NumPy questions

1.10 πŸ—„οΈ SQL

SQL Interview Resources - 35+ Community Sources

1.10.1 πŸ“š GitHub Repositories (Community-Driven)

# Repository Stars Description
1 shawlu95/Beyond-LeetCode-SQL 3.5k+ ⭐ Analysis of SQL LeetCode & classic interview questions
2 mdh266/SQL-Practice 500+ ⭐ Solutions from LeetCode, HackerRank & DataLemur
3 Thomas-George-T/HackerRank-SQL 1k+ ⭐ All SQL HackerRank challenges using MySQL
4 mrinal1704/SQL-Leetcode 100+ ⭐ All 117 LeetCode questions with solutions
5 ManikantaSanjay/LeetCode-SQL-70 Active Collection to ace coding interviews
6 TulipAggarwal/LeetCode-SQL50 Active Wide array of SQL concepts
7 alexeygrigorev/data-science-interviews 8.5k+ ⭐ Includes SQL interview questions
8 rbhatia46/DS-Interview-Resources 500+ ⭐ SQL resources for DS roles

1.10.2 πŸ’¬ Reddit Community Discussions

# Discussion Focus Area
9 SQL Interview Questions Thread SQL interview prep
10 Best Resources for SQL Practice Practice platforms
11 SQL Window Functions Questions Window functions focus
12 Common SQL Mistakes in Interviews Interview pitfalls
13 SQL for Data Analyst Roles Data analyst focus
14 Advanced SQL Questions Advanced concepts
15 SQL vs NoSQL Interview Questions Database comparisons
# Platform Description
16 LeetCode SQL 200+ SQL problems from real interviews
17 DataLemur Real SQL questions from FAANG companies
18 StrataScratch 1000+ questions from 150 companies
19 HackerRank SQL Comprehensive SQL challenges
20 Prepfully - SQL Company-specific SQL questions
21 InterviewQuery SQL for data science roles

1.10.4 πŸ“– Educational Resources

# Source Description
22 DataCamp Beginner to intermediate SQL Q&A
23 GeeksforGeeks Top 50+ SQL questions
24 InterviewBit Comprehensive SQL interview guide

1.10.5 πŸ’‘ Community Resources

# Source Description
25 Blind - SQL Interview Real SQL interview experiences (Reddit company)
26 KDnuggets Practical SQL & Python questions
27 Glassdoor - SQL Developer Real SQL interview experiences
28 LinkedIn - SQL Questions Curated SQL collection

1.11 πŸ“š Comprehensive Multi-Topic Resources

All-in-One Data Science Interview Resources

1.11.1 🌟 Top GitHub Collections

# Repository Stars Description
1 alexeygrigorev/data-science-interviews 8.5k+ ⭐ Technical questions covering SQL, Python, coding
2 kojino/120-DS-Questions 8.8k+ ⭐ 120 commonly asked DS questions with answers
3 youssefHosni/DS-Interview-QA 1.5k+ ⭐ Curated list across 6 categories
4 khanhnamle1994/cracking-ds-interview 3k+ ⭐ Cheatsheets, books, questions, portfolio
5 rbhatia46/DS-Interview-Resources 500+ ⭐ Potential sources, frequently updated
6 PavelGrigoryevDS/awesome-data-analysis 1k+ ⭐ 500+ curated resources for beginners & experts
7 benthecoder/ds-interview-resources 500+ ⭐ Collection of awesome DS interview resources
8 ajitsingh98/DS-Interview-QA 500+ ⭐ 1000+ most asked DS questions

1.12 πŸ’Ό Real Interview Experiences

Glassdoor & Blind - Actual Interview Questions

1.12.1 🏒 Glassdoor Interview Experiences

# Company Key Topics Covered
1 Google Data Scientist ML algorithms, SQL, Python, A/B testing
2 Meta Data Scientist Data cleaning, modeling, system design, case studies
3 Amazon Data Scientist Pandas coding, regularization, bias/variance, A/B testing
4 LinkedIn Data Scientist Probability sampling, logistic regression algorithms
5 Reddit Data Scientist SQL-heavy, nested subqueries, CTEs, window functions

1.12.2 πŸ’¬ Blind Community Discussions

# Discussion Topic Topics Covered
1 Toughest DS/ML Questions SGD, logistic regression, regularizers, PCA
2 Hardest DS/ML Questions Advanced ML theory and implementation
3 DS Interview Question Bank Community-sourced question bank
4 ML Interview Prep Comprehensive ML/DS prep strategies
5 Google ML Interview Gradient descent, normalization, regularization, embeddings

1.13 🎯 Practice Platforms Comparison

Best Platforms for Each Topic

1.13.1 πŸ—„οΈ For SQL Practice

Platform Best For Community Size Cost
DataLemur Real FAANG SQL questions 20k+ users Free tier available
StrataScratch Data science-specific SQL 20k+ users Subscription required
LeetCode Coding challenges + SQL Millions Free tier available
HackerRank Company assessments 28M+ developers Free

1.13.2 πŸ’» For Python/Coding Practice

Platform Best For Community Size Cost
LeetCode Algorithmic coding Millions Free tier available
HackerRank Data science challenges 28M+ developers Free
StrataScratch DS-specific Python 20k+ users Subscription required

1.13.3 πŸ€– For ML/Theory Questions

Resource Type Best For Cost
GitHub Repos Free comprehensive resources Free
Blind/Glassdoor Real interview experiences Free
Books (Chip Huyen) In-depth knowledge One-time purchase

1.14 πŸ“– Interview Preparation Strategy

How to Use These Resources - Structured 7-Week Plan

1.14.1 πŸ“… Phase 1: Foundation Building (Weeks 1-2)

Focus Areas:

  • Start with GitHub repositories like alexeygrigorev/data-science-interviews
  • Review Python and SQL basics using community cheatsheets
  • Practice 5-10 easy SQL questions daily on LeetCode or DataLemur

Daily Schedule:

  • Morning: 1 hour theory review (GitHub repos)
  • Afternoon: 1 hour SQL practice
  • Evening: 30 min Python coding problems

1.14.2 πŸ“… Phase 2: Concept Mastery (Weeks 3-4)

Focus Areas:

Daily Schedule:

  • Morning: 1.5 hours ML theory
  • Afternoon: 1 hour probability/statistics
  • Evening: 45 min medium-level coding

1.14.3 πŸ“… Phase 3: Company-Specific Prep (Weeks 5-6)

Focus Areas:

  • Read Glassdoor interview experiences for target companies
  • Review Blind discussions for insider insights
  • Practice on StrataScratch for real company questions

Daily Schedule:

  • Morning: Company-specific question review
  • Afternoon: Mock interviews (timed)
  • Evening: Review and improve solutions

1.14.4 πŸ“… Phase 4: Mock Interviews (Week 7+)

Focus Areas:

  • Do timed practice sessions
  • Review GitHub repos for advanced topics
  • Focus on weak areas identified during practice

Daily Schedule:

  • Full mock interview sessions (2-3 hours)
  • Detailed review and improvement
  • Target weak areas

1.15 ⚠️ Common Pitfalls & Best Practices

Common Pitfalls to Avoid

1.15.1 ❌ What NOT to Do

  • Don't just read questions - Actually code the solutions
  • Don't skip probability/statistics - Very common in interviews
  • Don't ignore SQL - It's tested even for ML roles
  • Don't only study theory - Practice is equally important
  • Don't rely solely on SEO blogs - Use community resources
  • Don't memorize without understanding - Interviewers can tell
  • Don't practice only easy problems - Mix difficulty levels
Community Recommendations

Based on Reddit, Blind, and GitHub discussions, the most recommended resources are:

1.15.2 πŸ† Top Picks by Category

Category Top Recommendation Why?
GitHub alexeygrigorev/data-science-interviews (8.5k⭐) Most comprehensive, actively maintained
SQL Practice DataLemur + StrataScratch Real company questions, DS-focused
ML Theory Chip Huyen's ML Interviews Book 200+ questions with difficulty levels
Real Questions Glassdoor + Blind community Actual interview experiences
Python LeetCode Easy-Medium problems Best for algorithmic thinking
Comprehensive khanhnamle1994/cracking-the-data-science-interview All-in-one resource

1.15.3 πŸ’‘ Pro Tips from the Community

  • Use Spaced Repetition: Review questions after 1 day, 1 week, 1 month
  • Join Study Groups: Reddit r/datascience, Discord communities
  • Track Your Progress: Use spreadsheets to monitor weak areas
  • Focus on Fundamentals: Master basics before advanced topics
  • Practice Explaining: Use the Feynman technique

1.16 πŸ“ Contributing

Found a Great Resource?

Many of these GitHub repos accept contributions. If you've found a resource that the community loves:

  • Fork the respective repository
  • Add your finding with proper documentation
  • Submit a pull request
  • Help the community grow!