martinsotir's Stars
patrick-kidger/torchtyping
Type annotations and dynamic checking for a tensor's shape, dtype, names, etc.
ddelange/mapply
Sensible multi-core apply function for Pandas
jmcarpenter2/swifter
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
dask/dask
Parallel computing with task scheduling
xhochy/fletcher
Pandas ExtensionDType/Array backed by Apache Arrow
duckdb/duckdb
DuckDB is an analytical in-process SQL database management system
feast-dev/feast
The Open Source Feature Store for Machine Learning
Epistimio/orion
Asynchronous Distributed Hyperparameter Optimization.
scikit-learn-contrib/sklearn-pandas
Pandas integration with sklearn
Lightning-Universe/lightning-flash
Your PyTorch AI Factory - Flash enables you to easily configure and run complex AI recipes for over 15 tasks across 7 data domains
mamba-org/mamba
The Fast Cross-Platform Package Manager
zenml-io/zenml
ZenML 🙏: Build portable, production-ready MLOps pipelines. https://zenml.io.
h2oai/wave
Realtime Web Apps and Dashboards for Python and R
JasonKessler/scattertext
Beautiful visualizations of how language differs among document types.
lux-org/lux
Automatically visualize your pandas dataframe via a single print! 📊 💡
joke2k/faker
Faker is a Python package that generates fake data for you.
PAIR-code/lit
The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.
h2oai/datatable
A Python package for manipulating 2-dimensional tabular data structures
Lightning-AI/pytorch-lightning
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
SirRob1997/Crowded-Valley---Results
This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Galileo-Galilei/kedro-mlflow
A kedro-plugin for integration of mlflow capabilities inside kedro projects (especially machine learning model versioning and packaging)
uwdata/arquero
Query processing and transformation of array-backed data tables.
dabl/dabl
Data Analysis Baseline Library
capeprivacy/cape-dataframes
Privacy transformations on Spark and Pandas dataframes backed by a simple policy language.
deepset-ai/haystack
:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
great-expectations/great_expectations
Always know what to expect from your data.
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
numpy/numpy
The fundamental package for scientific computing with Python.