denisshepelin

CPH BioScience 2021 Alumni, interested in synthetic biology, metabolic engineering and probabilistic programming. Currently DS/ML @ LabTwin

@labtwin-gmbh Berlin, Germany

denisshepelin's Stars

milvus-io/milvus
A cloud-native vector database, storage for next generation AI applications
Language:Go29.7k 276 11.8k2.8k
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python27.6k 225 4.6k4.1k
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
26.6k 286 412.2k
deepset-ai/haystack
:mag: AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Language:Python16.9k 134 3.5k1.9k
neuml/txtai
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
Language:Python8.8k 86 753580
stanfordnlp/stanza
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
Language:Python7.2k 140 894887
aimhubio/aim
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
Language:Python5.2k 44 1k316
mosaicml/composer
Supercharge Your Model Training
Language:Python5.1k 49 543415
NVIDIA/NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
Language:Python4k 33 353367
cirruslabs/tart
macOS and Linux VMs on Apple Silicon to use in CI and other automations
Language:Swift3.8k 39 339110
quarto-dev/quarto-cli
Open-source scientific and technical publishing system built on Pandoc.
Language:JavaScript3.8k 30 4.9k310
TDAmeritrade/stumpy
STUMPY is a powerful and scalable Python library for modern time series analysis
Language:Python3.6k 59 503317
salesforce/Merlion
Merlion: A Machine Learning Framework for Time Series Intelligence
Language:Python3.4k 55 86295
OpenNMT/CTranslate2
Fast inference engine for Transformer models
Language:C++3.3k 57 690287
jcrist/msgspec
A fast serialization and validation library, with builtin support for JSON, MessagePack, YAML, and TOML
Language:Python2.3k 19 36367
jalammar/ecco
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTA, T5, and T0).
Language:Jupyter Notebook2k 24 64168
ELS-RD/kernl
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
Language:Jupyter Notebook1.5k 29 17493
mit-han-lab/smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Language:Python1.2k 21 87137
unum-cloud/ucall
Web Serving and Remote Procedure Calls at 50x lower latency and 70x higher bandwidth than FastAPI, implementing JSON-RPC & REST over io_uring ☎️
Language:C1.1k 18 2940
evo-design/evo
Biological foundation modeling from molecular to genome scale
Language:Jupyter Notebook931 18 50112
pm4py/pm4py-core
Public repository for the PM4Py (Process Mining for Python) project.
Language:Python707 38 371277
koaning/embetter
just a bunch of useful embeddings
Language:Python458 8 5315
bigscience-workshop/biomedical
Tools for curating biomedical training data for large-scale language modeling
Language:Python455 30 388116
matrix-profile-foundation/matrixprofile
A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, accessible to everyone.
Language:Python362 18 6462
microsoft/UDOP
237 24 56
lmcinnes/glasbey
Algorithmically create or extend categorical colour palettes
Language:Python194 3 57
IBM/fastfit
FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes
Language:Python182 5 1513
AIRI-Institute/GENA_LM
GENA-LM is a transformer masked language model trained on human DNA sequence.
Language:Jupyter Notebook165 8 1916
rodrigo-arenas/pyworkforce
Standard tools for workforce management, queuing, scheduling, rostering and optimization problems.
Language:Python74 4 919
averkij/multipunct
Train punctuation and capitalization models for different languages
Language:Jupyter Notebook24 3 31

denisshepelin

denisshepelin's Stars

milvus-io/milvus

vllm-project/vllm

google-research/tuning_playbook

deepset-ai/haystack

neuml/txtai

stanfordnlp/stanza

aimhubio/aim

mosaicml/composer

NVIDIA/NeMo-Guardrails

cirruslabs/tart

quarto-dev/quarto-cli

TDAmeritrade/stumpy

salesforce/Merlion

OpenNMT/CTranslate2

jcrist/msgspec

jalammar/ecco

ELS-RD/kernl

mit-han-lab/smoothquant

unum-cloud/ucall

evo-design/evo

pm4py/pm4py-core

koaning/embetter

bigscience-workshop/biomedical

matrix-profile-foundation/matrixprofile

microsoft/UDOP

lmcinnes/glasbey

IBM/fastfit

AIRI-Institute/GENA_LM

rodrigo-arenas/pyworkforce

averkij/multipunct