jyuu's Stars
scikit-learn/scikit-learn
scikit-learn: machine learning in Python
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
visenger/awesome-mlops
A curated list of references for MLOps
dotnet/machinelearning
ML.NET is an open source and cross-platform machine learning framework for .NET.
erincatto/box2d
Box2D is a 2D physics engine for games
iterative/cml
♾️ CML - Continuous Machine Learning | CI/CD for ML
microsoft/hummingbird
Hummingbird compiles trained ML models into tensor computation for faster inference.
weld-project/weld
High-performance runtime for data analytics applications
aappleby/smhasher
Automatically exported from code.google.com/p/smhasher
allisonhorst/stats-illustrations
R & stats illustrations by @allison_horst
microsoft/CDM
The Common Data Model (CDM) is a standard and extensible collection of schemas (entities, attributes, relationships) that represents business concepts and activities with well-defined semantics, to facilitate data interoperability. Examples of entities include: Account, Contact, Lead, Opportunity, Product, etc.
linkedin/dr-elephant
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
SheffieldML/GPyOpt
Gaussian Process Optimization using GPy
Azure/AzurePublicDataset
Microsoft Azure Traces
dabl/dabl
Data Analysis Baseline Library
echen102/COVID-19-TweetIDs
The repository contains an ongoing collection of tweets IDs associated with the novel coronavirus COVID-19 (SARS-CoV-2), which commenced on January 28, 2020.
zerostaticthemes/hugo-whisper-theme
Whisper is a minimal documentation theme for Hugo.
yaringal/DropoutUncertaintyCaffeModels
Dropout As A Bayesian Approximation: Code
EmilHvitfeldt/RStudioConf2020Slides
Links to slides for rstudio::conf 2020
rstudio-conf-2020/applied-ml
Code and Resources for "Applied Machine Learning"
mine-cetinkaya-rundel/covid19-r
Collection of analyses, packages, visualisations of COVID-19 data in R
carpentries/glosario
A multilingual glossary for computing and data science terms.
wch/chatstream
Example Shiny for Python app which talks to the OpenAI API
orcasound/orcadata
Development of bioacoustic tools for analyzing Orcasound data -- either post-processing of archived raw FLAC files or real-time analysis of the lossy stream and/or FLAC files.
revodavid/mlops-r
Resources for Machine Learning Operations with R
microsoft/Peregrine
Peregrine is a workload optimization platform for cloud query engines. The goal of Peregrine is three-fold: 1. make it easier to ingest and analyze query workload telemetry into a common engine-agnostic representation, 2. help developers to quickly build workload optimization applications to reduce overall costs and improve operational efficiency, and 3. providing better experience to the customers in the form of workload insights, actionable recommendations, and self-tuning capabilities.
shkumar64/AzureMachineLearningWorkshop
fvarga01/Azure-Talking-Points
Annotated Microsoft Azure documentation links used throughout day to day technical conversations.
paleolimbot/minimal-thesis-bookdown
ayat-khairy/tuneful-data