/Awesome-Data-Science-Resources

This repo contains resources on data science, machine learning, deep learning, data engineering, and SQL.

Primary language: Jupyter Notebook

🔶 AWESOME DATA SCIENCE RESOURCES 🔶

This repo contains resources on data science, AI, machine learning, deep learning, data engineering, and SQL.

✨ Data Science

📕 💯 Free Books for Data Science

📙 Articles Data Scientists Should Read

  • Transformers process all positions of a sequence in parallel rather than one step at a time.

  • BERT is an NLP model based on transformers.

  • StyleGAN is a generative adversarial network (GAN) introduced by Nvidia researchers in December 2018, with the source code made available in February 2019.

  • CLIP is a neural network trained on a variety of (image, text) pairs.

  • The game of Go with deep neural networks paper introduced AlphaGo, which defeated the European Go champion by 5 games to 0.

  • DNN for YouTube Recommendations paper describes the architecture of the deep learning models used for recommendations on YouTube.
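
To make the point about parallel sequence processing in the Transformers item more concrete, here is a minimal sketch of scaled dot-product self-attention in plain Python. It is an illustration only, not code from any of the linked articles: the `self_attention` function, the identity query/key/value projections, and the toy `sequence` are all assumptions chosen to keep the example short.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Scaled dot-product self-attention with identity projections.

    tokens: list of equal-length float vectors, one per sequence position.
    Queries, keys, and values are all the raw token vectors (an assumption;
    real models learn separate projection matrices for each).
    """
    dim = len(tokens[0])
    scale = math.sqrt(dim)
    outputs = []
    # Each output row depends only on the inputs, never on another output
    # row, so every position could be computed at the same time.
    for query in tokens:
        scores = [sum(q * k for q, k in zip(query, key)) / scale for key in tokens]
        weights = softmax(scores)
        mixed = [sum(w * value[i] for w, value in zip(weights, tokens)) for i in range(dim)]
        outputs.append(mixed)
    return outputs


# Hypothetical three-token sequence with two-dimensional embeddings.
sequence = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
attended = self_attention(sequence)
```

The loop runs sequentially here only because plain Python has no matrix operations; frameworks compute the same result for all positions with a few matrix multiplications.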

💪 Data Science Projects

🚀 Data Libraries Tutorial

🤖 Data Science Tools

  • lazypredict helps build a lot of basic models without much code and helps you understand which models work better without any parameter tuning.

  • PyCaret is an open-source, low-code machine learning library in Python that automates machine learning workflows.

  • FeatureSelectionGA uses a genetic algorithm to search for a feature subset that attains high accuracy.
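
As a rough illustration of the idea behind genetic-algorithm feature selection, the sketch below evolves binary feature masks with selection, crossover, and mutation. It does not use or reproduce the FeatureSelectionGA API: `genetic_feature_search`, `toy_fitness`, and the `INFORMATIVE` set are hypothetical names, and the toy fitness stands in for what would normally be a cross-validated model score.

```python
import random


def genetic_feature_search(n_features, fitness, generations=30, pop_size=20, seed=0):
    """Search for a binary feature mask (list of 0/1) that maximizes `fitness`."""
    rng = random.Random(seed)
    population = [[rng.randint(0, 1) for _ in range(n_features)] for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(population, key=fitness, reverse=True)
        parents = ranked[: pop_size // 2]   # selection: keep the fitter half
        children = [ranked[0][:]]           # elitism: carry the best mask over unchanged
        while len(children) < pop_size:
            mother, father = rng.sample(parents, 2)
            cut = rng.randrange(1, n_features)
            child = mother[:cut] + father[cut:]   # one-point crossover
            if rng.random() < 0.2:                # occasional single-bit mutation
                flip = rng.randrange(n_features)
                child[flip] = 1 - child[flip]
            children.append(child)
        population = children
    return max(population, key=fitness)


# Hypothetical stand-in for a model score: features 0, 3, and 5 are "useful",
# and every other selected feature costs half a point.
INFORMATIVE = {0, 3, 5}


def toy_fitness(mask):
    hits = sum(1 for i, bit in enumerate(mask) if bit and i in INFORMATIVE)
    noise = sum(1 for i, bit in enumerate(mask) if bit and i not in INFORMATIVE)
    return hits - 0.5 * noise


best_mask = genetic_feature_search(n_features=8, fitness=toy_fitness)
print(best_mask, toy_fitness(best_mask))
```

In practice the fitness function would train and score a model on the selected columns, which makes each generation far more expensive than in this toy setup.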

🔥 Github Repos for Data Science

✨ Machine Learning

🔥 Github Repos for ML

📕 💯 Free Books for ML

🚩 Machine Learning Courses

💪 Machine Learning Projects

🙌 Machine Learning Tools

MLOps Tools

✨ Machine Learning Engineering

🔥 ML Engineering GitHub Repos

✨ Deep Learning

📕 10 Best Deep Learning Books

🚩 💯 Free Deep Learning Courses

🙌 Deep Learning Frameworks

🔥 GitHub Repos for Deep Learning

😍 Stable Diffusion Examples

✨ Natural Language Processing

🔥 GitHub Repos for Learning NLP

  • Natural Language Processing Tutorial is a tutorial for those who are studying NLP using PyTorch.

  • NLP Recipes repo contains examples and best practices for building NLP systems, provided as Jupyter notebooks and utility functions.

  • NLP Course includes lecture and seminar materials about NLP for each week.

  • NLP in Python Tutorial covers NLP step by step across several Jupyter notebooks and uses a number of data science libraries along the way.

  • Awesome NLP repo contains a curated list of resources dedicated to Natural Language Processing.

  • Deep Learning Drizzle is an organized website where you can find free courses from top universities, with their links.

🚩 Courses for NLP

✨ Data Engineering

🔥 GitHub Repos for Learning Data Engineering

💪 Data Engineering Projects

  • HashtagCashtag shows how to build a big data pipeline for user sentiment analysis on the US stock market.

  • Building a Data Engineering Project in 20 Minutes covers scraping real-estate data from the web, uploading it to S3, working with Spark and Delta Lake, and adding data science with Jupyter.

  • Analyzing GitHub Repos

  • The goal of the Web Crawler For Online Inflation project is to calculate inflation rates from first principles.

  • This repo contains projects that apply data engineering principles.

  • Data Engineering Project is an implementation of a data pipeline that consumes the latest news from RSS feeds and makes it available to users via a handy API. The pipeline infrastructure is built using popular, open-source projects.

  • The aim of this repository is to help you develop and learn data engineering skills. It covers high-level topics such as Python data processing, SQL database table design, PySpark, and data cleaning.

✨ AI

🤖 💯 Free AI Tools

  • An AI Research Assistant: Elicit

✨ SQL

🚩 💯 Best Free Resources for Learning SQL

  • SQLZoo is an interactive, Wiki-based tutorial that provides lessons and projects for beginners in SQL.

  • SQLBolt offers easy-to-follow instructions, a simple interface, and interactive exercises to teach basic proficiency in SQL.

  • Kaggle provides tutorials, from introductory to advanced level, on using SQL to work with databases in Google BigQuery.

  • Codecademy teaches you how to communicate with relational databases through SQL.

  • PopSQL lets you share queries and store commonly used ones in a searchable library, and provides a visual interface for analysis.

  • Learning SQL book is for you if you prefer to learn from a book. It is available to read for free online via this PDF.

  • Khan Academy includes video-based content with detailed explanations.
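
If you want to try SQL from a notebook before picking one of the resources above, the sketch below runs a small query against an in-memory SQLite database using Python's built-in `sqlite3` module. The `courses` table and its rows are made up for illustration and do not come from any of the listed sites.

```python
import sqlite3

# An in-memory database disappears when the connection is closed.
conn = sqlite3.connect(":memory:")

# Hypothetical table of learning resources.
conn.execute("CREATE TABLE courses (name TEXT, topic TEXT, hours INTEGER)")
conn.executemany(
    "INSERT INTO courses VALUES (?, ?, ?)",
    [
        ("Intro to SQL", "sql", 4),
        ("Advanced SQL", "sql", 6),
        ("Pandas Basics", "python", 3),
    ],
)

# Count the courses and total hours per topic.
rows = conn.execute(
    "SELECT topic, COUNT(*), SUM(hours) FROM courses GROUP BY topic ORDER BY topic"
).fetchall()
conn.close()

print(rows)  # [('python', 1, 3), ('sql', 2, 10)]
```

The `?` placeholders let the driver handle quoting, which is the safer habit compared with building SQL strings by hand.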

💪 SQL Projects

💹 DataSets


😍 I'll update this repo whenever I find new sources, so don't forget to watch it. If you enjoy this repo, give it a star and share it.

👍 Let's connect! YouTube | Medium | Twitter | Instagram | Tiktok | Reddit