JPL-DSCS's Stars
ynqa/jnv
Interactive JSON filter using jq
pyenv/pyenv
Simple Python version management
ankurchavda/streamify
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!
DataTalksClub/machine-learning-zoomcamp
Learn ML engineering for free in 4 months!
mwouts/jupytext
Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts
zinggAI/zingg
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.
modin-project/modin
Modin: Scale your Pandas workflows by changing a single line of code
apache/superset
Apache Superset is a Data Visualization and Data Exploration Platform
google/python-fire
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
public-apis/public-apis
A collective list of free APIs
aFelipeSP/pdfme
Make PDFs easily
MTrajK/coding-problems
Solutions for various coding/algorithmic problems and many useful resources for learning algorithms and data structures
fbdesignpro/sweetviz
Visualize and compare datasets, target values and associations, with one line of code.
DataTalksClub/data-engineering-zoomcamp
Free Data Engineering course!
DeepLcom/deepl-python
Official Python library for the DeepL language translation API.
minitorch/minitorch
The full minitorch student suite.
sodadata/soda-core
⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
SeldonIO/alibi-detect
Algorithms for outlier, adversarial and drift detection
eugeneyan/applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
zhuowei/nft_ptr
C++ `std::unique_ptr` that represents each object as an NFT on the Ethereum blockchain
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
labmlai/labml
🔎 Monitor deep learning model training and hardware usage from your mobile phone 📱
peter-evans/autopep8
A GitHub action for autopep8, a tool that automatically formats Python code to conform to the PEP 8 style guide.
igorbarinov/awesome-data-engineering
A curated list of data engineering tools for software developers
pola-rs/polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
apache/datafusion
Apache DataFusion SQL Query Engine
roapi/roapi
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
yasoob/intermediatePython
angular/angular.js
AngularJS - HTML enhanced for web apps!