dmpetrov
Creator of DVC - Data Version Control. Ex-Data Scientist at Microsoft. PhD in CS.
datachain.ai
San Francisco Bay Area, CA
dmpetrov's Stars
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
python-poetry/poetry
Python packaging and dependency management made easy
HarisIqbal88/PlotNeuralNet
LaTeX code for making neural network diagrams
deepset-ai/haystack
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
neondatabase/neon
Neon: Serverless Postgres. We separated storage and compute to offer autoscaling, code-like database branching, and scale to zero.
iterative/dvc
🦉 Data Versioning and ML Experiments
cvat-ai/cvat
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
DataTalksClub/mlops-zoomcamp
Free MLOps course from DataTalks.Club
activeloopai/deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
replicate/cog
Containers for machine learning
ydb-platform/ydb
YDB is an open source Distributed SQL Database that combines high availability and scalability with strong consistency and ACID transactions
jmespath/jmespath.py
JMESPath is a query language for JSON.
SnellerInc/sneller
World's fastest log analysis: λ + SQL + JSON + S3
sberbank-ai-lab/LightAutoML
LAMA - automatic model creation framework
iterative/mlem
🐶 A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one day🤞
grok-ai/nn-template
Generic template to bootstrap your PyTorch project.
dosyago/sirdb
👨 a simple, git diffable JSON database on yer filesystem. By the power of NodeJS
RandomFractals/vscode-data-preview
Data Preview 🈸 extension for importing 📤 viewing 🔎 slicing 🔪 dicing 🎲 charting 📊 & exporting 📥 large JSON array/config, YAML, Apache Arrow, Avro, Parquet & Excel data files
isidentical/refactor
AST-based fragmental source code refactoring toolkit for Python
astashov/tixi
ASCII charts editor
sscardapane/reprodl2021
Host repository for the "Reproducible Deep Learning" PhD course
iterative/terraform-provider-iterative
☁️ Terraform plugin for machine learning workloads: spot instance recovery & auto-termination | AWS, GCP, Azure, Kubernetes
kevin-hanselman/dud
A lightweight CLI tool for versioning data alongside source code and building data pipelines.
iterative/gto
🏷️ Git Tag Ops. Turn your Git repository into Artifact Registry or Model Registry.
baruchel/vim-notebook
A plugin for the Vim editor for handling any interpreter in a Notebook style
gennaro-tedesco/nvim-dvc
The long awaited dvc plugin for neovim
jcpsantiago/dvthis
R utilities for DVC pipelines.
lRomul/gramtion
Twitter bot for generating photo descriptions (alt text)
avatar-cli/avatar-cli
Magic wrapper to run containerized CLI tools (Mirror repository)
anonymous-conference2021/Co-evolution-of-ML-Pipelines