anuprulez
Machine learning researcher/Data scientist, University of Freiburg, Germany. I love building scalable, reproducible machine learning toolkits.
@galaxyproject @BackofenLab @conda-forge
Freiburg, Germany
anuprulez's Stars
mermaid-js/mermaid
Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown
nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
meta-llama/llama
Inference code for Llama models
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
psf/black
The uncompromising Python code formatter
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
eugeneyan/applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
fastai/fastbook
The fastai book, published as Jupyter Notebooks
mlflow/mlflow
Open source platform for the machine learning lifecycle
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
CopilotKit/CopilotKit
React UI + elegant infrastructure for AI Copilots, in-app AI agents, AI chatbots, and AI-powered Textareas 🪁
kedro-org/kedro
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
cleanlab/cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
jupyter/docker-stacks
Ready-to-run Docker images containing Jupyter applications
reactive-python/reactpy
It's React, but in Python
graviraja/MLOps-Basics
jupyterlab/jupyter-ai
A generative AI extension for JupyterLab
Beckschen/TransUNet
This repository includes the official project of TransUNet, presented in our paper: TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation.
stanford-crfm/BioMedLM
modularml/max
A collection of sample programs, notebooks, and tools which highlight the power of the MAX Platform
ML-Bioinfo-CEITEC/genomic_benchmarks
Benchmarks for classification of genomic sequences
gher-uliege/DIVAnd-jupyterhub
jupyterhub docker image with DIVAnd pre-installed
saiprasadbarke/covid19-nucleotide-sequence-mutation-prediction
Predicting interclade mutations in the SARS-CoV-2 genome using sequence to sequence transformers
uwwint/discourse-scraper
anuprulez/tool-resource-prediction
Tool Resource Prediction for Genomic Datasets
tuncK/nanosampler