mlecauchois

mlecauchois's Stars

huggingface/trl
Train transformer language models with reinforcement learning.
Language:Python9.6k1.2k
cambridgeltl/sapbert
[NAACL'21 & ACL'21] SapBERT: Self-alignment pretraining for BERT & XL-BEL: Cross-Lingual Biomedical Entity Linking.
Language:Python16935
aphp/edspdf
EDS-PDF is a generic, pure-Python framework for text extraction from PDF documents. It provides the machinery to use rule- or machine-learning-based approaches to classify text blocs between body and meta-data.
Language:Python386
cisnlp/simalign
Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)
Language:Python34747
facebookresearch/MUSE
A library for Multilingual Unsupervised or Supervised word Embeddings
Language:Python3.2k551
maidis/awesome-machine-translation
A list of awesome Machine Translation frameworks, libraries, software and papers
16923
whaleloops/KEPT
auto icd coding with prompt
Language:Jupyter Notebook4617
meta-llama/llama
Inference code for Llama models
Language:Python55.8k9.5k
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python36.6k4.5k
derailed/k9s
🐶 Kubernetes CLI To Manage Your Clusters In Style!
Language:Go26.7k1.7k
google-research/frame-interpolation
FILM: Frame Interpolation for Large Motion, In ECCV 2022.
Language:Python2.8k281
GanjinZero/ICD-MSMN
Code Synonyms Do Matter: Multiple Synonyms Matching Network for Automatic ICD Coding [ACL 2022]
Language:Python478
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Language:JavaScript18.4k2.3k
tinygrad/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
Language:Python26.4k2.9k
mckaywrigley/chatbot-ui
Come join the best place on the internet to learn AI skills. Use code "chatbotui" for an extra 20% off.
Language:TypeScript28.4k7.9k
Crataco/ai-guide
A guide for FOSS text generation frontends, models, and jargon.
1787
chaoyi-wu/PMC-LLaMA
The official codes for "PMC-LLaMA: Towards Building Open-source Language Models for Medicine"
Language:Python58452
calum-bird/howmanyparams.com
Compute-optimal LLMs
Language:TypeScript111
cado-security/masked-ai
Masked Python SDK wrapper for OpenAI API. Use public LLM APIs securely.
Language:Python11113
AndreasMadsen/python-textualheatmap
Create interactive textual heat maps for Jupiter notebooks
Language:Jupyter Notebook19614
jessevig/bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Language:Python6.8k772
AetherCortex/Llama-X
Open Academic Research on Improving LLaMA to SOTA LLM
Language:Python1.6k101
jalammar/ecco
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTA, T5, and T0).
Language:Jupyter Notebook2k168
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Language:Python141k26.6k
togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
Language:Python4.5k346
NVIDIA/NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
Language:Python4k367
sanchit-gandhi/whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
Language:Jupyter Notebook4.4k369
Stability-AI/StableLM
StableLM: Stability AI Language Models
Language:Jupyter Notebook15.8k1k
mosaicml/composer
Supercharge Your Model Training
Language:Python5.1k415
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python34.9k4.1k