IndWalker's Stars
faif/python-patterns
A collection of design patterns/idioms in Python
satwikkansal/wtfpython
What the f*ck Python? 😱
ray-project/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
pola-rs/polars
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
apache/kafka
Mirror of Apache Kafka
freqtrade/freqtrade
Free, open source crypto trading bot
celery/celery
Distributed Task Queue (development branch)
rakyll/hey
HTTP load generator, ApacheBench (ab) replacement
joke2k/faker
Faker is a Python package that generates fake data for you.
scylladb/scylladb
NoSQL data store using the seastar framework, compatible with Apache Cassandra
modin-project/modin
Modin: Scale your Pandas workflows by changing a single line of code
wader/fq
jq for binary formats - tool, language and decoders for working with binary and text formats
sqlfluff/sqlfluff
A modular SQL linter and auto-formatter with support for multiple dialects and templated code.
pytoolz/toolz
A functional standard library for Python.
treeverse/lakeFS
lakeFS - Data version control for your data lake | Git for data
dry-python/returns
Make your functions return something meaningful, typed, and safe!
pyeve/cerberus
Lightweight, extensible data validation library for Python
blockchain-etl/ethereum-etl
Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ
emirozer/fake2db
create custom test databases that are populated with fake data
elementary-data/elementary
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
re-data/re-data
re_data - fix data issues before your users & CEO would discover them 😊
abhishek-ch/around-dataengineering
A Data Engineering & Machine Learning Knowledge Hub
dbt-checkpoint/dbt-checkpoint
:fishing_pole_and_fish: List of `pre-commit` hooks to ensure the quality of your `dbt` projects.
pydata/parallel-tutorial
Parallel computing in Python tutorial materials
mozilla/bigquery-etl
Bigquery ETL
apssouza22/big-data-pipeline-lambda-arch
A full big data pipeline (Lambda Architecture) with Spark, Kafka, HDFS and Cassandra.
mjirv/dbt-datamocktool
A dbt package for unit testing your SQL analytics models
GoogleCloudPlatform/dataflow-cookbook
infinitelambda/dq-tools
Make simple storing test results and visualisation of these in a BI dashboard
Casa-dos-Ventos/project_data_governance_casa_dos_ventos_google