monocongo's Stars
DataTalksClub/data-engineering-zoomcamp
Free Data Engineering course!
nltk/nltk
NLTK Source
DataTalksClub/machine-learning-zoomcamp
Learn ML engineering for free in 4 months!
joshpxyne/gpt-migrate
Easily migrate your codebase from one framework or language to another.
open-metadata/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
whylabs/whylogs
An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
UniversalDataTool/universal-data-tool
Collaborate & label any type of data, images, text, or documents, in an easy web interface or desktop app.
SharpAI/DeepCamera
Open-Source AI Camera. Empower any camera/CCTV with state-of-the-art AI, including facial recognition, person recognition (RE-ID), car detection, fall detection, and more.
joeyajames/Python
Python code for YouTube videos.
pyjanitor-devs/pyjanitor
Clean APIs for data cleaning. Python implementation of R package Janitor
google/earthengine-community
Tutorials and content created by Earth Engine users, for Earth Engine users
noahgift/Python-MLOps-Cookbook
This is an example of a Containerized Flask Application that can deploy to many target environments including: AWS, GCP and Azure.
monocongo/climate_indices
Climate indices for drought monitoring
Ouranosinc/xclim
Library of derived climate variables, i.e. climate indicators, based on xarray.
mebauer/data-analysis-using-python
Data Analysis Using Python: A Beginner’s Guide Featuring NYC Open Data.
MichaelisTrofficus/gpt4docstrings
Generating Python docstrings with OpenAI ChatGPT!!
pyOpenSci/software-submission
Submit your package for review by pyOpenSci here! If you have questions please post them here: https://pyopensci.discourse.group/
johnny-chivers/pyspark-glue-tutorial
oorb/oorb
An open-source orbit-computation package for Solar System objects.
godatadriven-dockerhub/pyspark
EverythingMe/pyretrace
A Python reimplementation of ProGuard's Retrace
Anant/example-airflow-and-spark
sschatts/conference_talks
Aaron-K-T-Berry/airflow-docker-boilerplate
mogalmahesh/binary-guy-aws
piyush-singhal/airflow-docker
Install Airflow using docker
Corey4005/get-usdm-shapefiles
This is a tool to help users download large quantities of US drought monitor shapefiles from the GIS database. Contains an example to get point descriptions from 18 sites for 20 years.
Anirudhann/PoC_data_validation_libs_python
Simple PoC on Python data validation libraries and their performance
clstoulouse/parquetCubeIngestion
This repository provides the source code and documentation for the Parquet Cube Ingestion described in the GMD publication "A Parquet Cube alternative to store gridded data for data analytics and modeling".
Global-Water-Security-Center/data-exploration
Repository to collect Python Scripts