denisshepelin
CPH BioScience 2021 alumnus, interested in synthetic biology, metabolic engineering, and probabilistic programming. Currently DS/ML @ LabTwin
@labtwin-gmbh Berlin, Germany
denisshepelin's Stars
milvus-io/milvus
A cloud-native vector database, storage for next generation AI applications
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
deepset-ai/haystack
🔍 AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
neuml/txtai
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
stanfordnlp/stanza
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
aimhubio/aim
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
mosaicml/composer
Supercharge Your Model Training
NVIDIA/NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
cirruslabs/tart
macOS and Linux VMs on Apple Silicon to use in CI and other automations
quarto-dev/quarto-cli
Open-source scientific and technical publishing system built on Pandoc.
TDAmeritrade/stumpy
STUMPY is a powerful and scalable Python library for modern time series analysis
salesforce/Merlion
Merlion: A Machine Learning Framework for Time Series Intelligence
OpenNMT/CTranslate2
Fast inference engine for Transformer models
jcrist/msgspec
A fast serialization and validation library, with builtin support for JSON, MessagePack, YAML, and TOML
jalammar/ecco
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTa, T5, and T0).
ELS-RD/kernl
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
mit-han-lab/smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
unum-cloud/ucall
Web Serving and Remote Procedure Calls at 50x lower latency and 70x higher bandwidth than FastAPI, implementing JSON-RPC & REST over io_uring ☎️
evo-design/evo
Biological foundation modeling from molecular to genome scale
pm4py/pm4py-core
Public repository for the PM4Py (Process Mining for Python) project.
koaning/embetter
just a bunch of useful embeddings
bigscience-workshop/biomedical
Tools for curating biomedical training data for large-scale language modeling
matrix-profile-foundation/matrixprofile
A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, accessible to everyone.
microsoft/UDOP
lmcinnes/glasbey
Algorithmically create or extend categorical colour palettes
IBM/fastfit
FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes
AIRI-Institute/GENA_LM
GENA-LM is a transformer masked language model trained on human DNA sequence.
rodrigo-arenas/pyworkforce
Standard tools for workforce management, queuing, scheduling, rostering and optimization problems.
averkij/multipunct
Train punctuation and capitalization models for different languages