Skip to content

Data Science

Trending data science repos — analysis, visualization, and notebooks.

← All topics
1

12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all

84.9k

5y 1mo old

2
apache/supersetTypeScript

Apache Superset is a Data Visualization and Data Exploration Platform

72.2k

10y 10mo old

3

scikit-learn: machine learning in Python

65.6k

15y 10mo old

4

Deep Learning for humans

63.9k

11y 2mo old

5

The 30 Days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than 100 days. Follow your own pace. These videos may help too: https://www.youtube.com/channel/UC7PNRuno1rzYPb1xLa4yktw

60.4k

6y 5mo old

6

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

48.3k

15y 10mo old

7

Learn how to develop, deploy and iterate on production-grade ML applications.

47.1k

7y 6mo old

8

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

44.9k

11y 1mo old

9

Streamlit — A faster way to build and share data apps.

44.1k

6y 8mo old

10

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

42.2k

7y 4mo old

11

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

41.9k

9y 6mo old

12

10 Weeks, 20 Lessons, Data Science for All!

34.6k

5y 1mo old

13

💫 Industrial-strength Natural Language Processing (NLP) in Python

33.4k

11y 11mo old

14

500 AI Machine learning Deep learning Computer vision NLP Projects with code

32.6k

5y 3mo old

15

Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep learning.

31.2k

9y 3mo old

16

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

31.0k

7y 1mo old

17

Roadmap to becoming an Artificial Intelligence Expert in 2022

30.9k

5y 6mo old

18

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

29.0k

11y 4mo old

19

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

28.8k

5y 9mo old

20

:memo: An awesome Data Science repository to learn and apply for real world problems.

28.7k

11y 10mo old

21

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

28.5k

7y 7mo old

22

aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)

28.4k

13y 4mo old

23
fastai/fastbookJupyter Notebook

The fastai book, published as Jupyter Notebooks

24.8k

6y 2mo old

24

Data Apps & Dashboards for Python. No JavaScript Required.

24.4k

11y 1mo old

25

Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.

23.4k

2y 11mo old

26

🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

23.4k

5y 4mo old

27

matplotlib: plotting with Python

22.7k

15y 4mo old

28

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

22.0k

7y 10mo old

29

Best Practices on Recommendation Systems

21.6k

7y 7mo old

30

A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.

20.1k

2y 8mo old

Click any repo to view its star history on StarTrail