Lyngsoe's Stars
duriantaco/pykomodo
A Python-based parallel file chunking system designed for processing large codebases into LLM-friendly chunks.
noamgat/lm-format-enforcer
Enforce the output format (JSON Schema, Regex etc) of a language model
aphrodite-engine/aphrodite-engine
Large-scale LLM inference engine
axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
guidance-ai/guidance
A guidance language for controlling large language models.
JoakimEdin/medical-coding-reproducibility
S-LoRA/S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
tiantiantu/KSI
This repository contains codes for Knowledge Source Intergration (KSI) framework
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
OpenNMT/CTranslate2
Fast inference engine for Transformer models
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
VodLM/vod
End-to-end training of Retrieval-Augmented LMs (REALM, RAG)
josStorer/RWKV-Runner
A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use.
huggingface/text-generation-inference
Large Language Model Text Generation Inference
python-poetry/install.python-poetry.org
The official Poetry installation script
neuralmagic/deepsparse
Sparsity-aware deep learning inference runtime for CPUs
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—language models
ml6team/fondant
Production-ready data processing made easy and shareable
BlinkDL/SmallInitEmb
LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence
deepspeedai/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
BlinkDL/RWKV-LM
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.
python-poetry/poetry
Python packaging and dependency management made easy
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
qdrant/qdrant
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
eugeneyan/open-llms
đź“‹ A list of open LLMs available for commercial use.