mlecauchois's Stars
huggingface/trl
Train transformer language models with reinforcement learning.
cambridgeltl/sapbert
[NAACL'21 & ACL'21] SapBERT: Self-alignment pretraining for BERT & XL-BEL: Cross-Lingual Biomedical Entity Linking.
aphp/edspdf
EDS-PDF is a generic, pure-Python framework for text extraction from PDF documents. It provides the machinery to use rule- or machine-learning-based approaches to classify text blocs between body and meta-data.
cisnlp/simalign
Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)
facebookresearch/MUSE
A library for Multilingual Unsupervised or Supervised word Embeddings
maidis/awesome-machine-translation
A list of awesome Machine Translation frameworks, libraries, software and papers
whaleloops/KEPT
auto icd coding with prompt
meta-llama/llama
Inference code for Llama models
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
derailed/k9s
🐶 Kubernetes CLI To Manage Your Clusters In Style!
google-research/frame-interpolation
FILM: Frame Interpolation for Large Motion, In ECCV 2022.
GanjinZero/ICD-MSMN
Code Synonyms Do Matter: Multiple Synonyms Matching Network for Automatic ICD Coding [ACL 2022]
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
tinygrad/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
mckaywrigley/chatbot-ui
Come join the best place on the internet to learn AI skills. Use code "chatbotui" for an extra 20% off.
Crataco/ai-guide
A guide for FOSS text generation frontends, models, and jargon.
chaoyi-wu/PMC-LLaMA
The official codes for "PMC-LLaMA: Towards Building Open-source Language Models for Medicine"
calum-bird/howmanyparams.com
Compute-optimal LLMs
cado-security/masked-ai
Masked Python SDK wrapper for OpenAI API. Use public LLM APIs securely.
AndreasMadsen/python-textualheatmap
Create interactive textual heat maps for Jupiter notebooks
jessevig/bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
AetherCortex/Llama-X
Open Academic Research on Improving LLaMA to SOTA LLM
jalammar/ecco
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTA, T5, and T0).
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
NVIDIA/NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
sanchit-gandhi/whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
Stability-AI/StableLM
StableLM: Stability AI Language Models
mosaicml/composer
Supercharge Your Model Training
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.