dmpetrov
Creator of DVC - Data Version Control. Ex-Data Scientist at Microsoft. PhD in CS.
datachain.ai
San Francisco Bay Area, CA
dmpetrov's Stars
Oulu-IMEDS/pytorch_bn_fusion
Batch normalization fusion for PyTorch
albumentations-team/albumentations
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Bloomberg-Beta/Manual
You were probably looking for our website... this is it. We moved our website here, so you can see the insides of how we work.
tdda/tdda
Test-Driven Data Analysis Functions
ternaus/TernausNet
UNet model with VGG11 encoder pre-trained on Kaggle Carvana dataset
facebookresearch/faiss
A library for efficient similarity search and clustering of dense vectors.
ryfeus/lambda-packs
Precompiled packages for AWS Lambda
creafz/pytorch-cnn-finetune
Fine-tune pretrained Convolutional Neural Networks with PyTorch
sourcerer-io/sourcerer-app
🦄 Sourcerer app makes a visual profile from your GitHub and git repositories.
ternaus/kaggle_dstl_submission
Code for a winning model (3rd place out of 419) in the Dstl Satellite Imagery Feature Detection challenge
pditommaso/awesome-pipeline
A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
jupyter/nbdime
Tools for diffing and merging of Jupyter notebooks.
elfi-dev/elfi
ELFI - Engine for Likelihood-Free Inference
metaphacts/ontodia
Ontodia data diagramming library
analysiscenter/batchflow
BatchFlow helps you conveniently work with random or sequential batches of your data and define data processing and machine learning workflows even for datasets that do not fit into memory.
Yorko/mlcourse.ai
Open Machine Learning Course
fchollet/deep-learning-with-python-notebooks
Jupyter notebooks for the code samples of the book "Deep Learning with Python"
c-smile/awesome-programming
Revelation from experienced kamikazes of programming
keras-team/keras
Deep Learning for humans
deeppavlov/ner
Named Entity Recognition
JohannesBuchner/imagehash
A Python Perceptual Image Hashing Module
torch/torch7
http://torch.ch
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
nanxstats/liftr
🐳 Containerize R Markdown documents for continuous reproducibility
derek73/python-nameparser
A simple Python module for parsing human names into their individual components
studioml/studio
Studio: Simplify and expedite the model building process
d3/d3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
networkx/networkx
Network Analysis in Python
gvyshnya/DVC_R_Ensemble
Materials of a case study to build a DVC-based ML pipeline for an R project with ensemble prediction
dmlc/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow