cmcmaster1's Stars
MDK8888/GPTFast
Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.
leptonai/leptonai
A Pythonic framework to simplify AI service building
excalidraw/excalidraw
Virtual whiteboard for sketching hand-drawn like diagrams
crewAIInc/crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
taylorai/mlx_embedding_models
run embeddings in MLX
argmaxinc/WhisperKit
On-device Speech Recognition for Apple Silicon
eth-sri/lmql
A language for constraint-guided and efficient LLM programming.
lucidrains/self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
dwzhu-pku/PoSE
Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)
da-z/mlx-ui
A simple UI / Web / Frontend for MLX mlx-lm using Streamlit.
Codium-ai/AlphaCodium
Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""
microsoft/LLMLingua
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
TheMind-AI/fluid-db
Fluid Database
datamllab/LongLM
[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
theroyallab/tabbyAPI
An OAI compatible exllamav2 API that's both lightweight and fast
catid/oaillama3
Simple setup to self-host LLaMA3-70B model with an OpenAI API
databricks/lilac
Curate better data for LLMs
ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
arcee-ai/mergekit
Tools for merging pretrained large language models.
ContextualAI/HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
allenai/papermage
library supporting NLP and CV research on scientific papers
databricks/megablocks
huggingface/transformers.js
State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!
yuchenlin/LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut the weaknesses through ranking and integrate the strengths through fusing generation to enhance the capability of LLMs.
epfLLM/Megatron-LLM
distributed trainer for LLMs
VikParuchuri/marker
Convert PDF to markdown quickly with high accuracy