singhranjodh's Stars
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
google/magika
Detect file content types with deep learning
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
OpenAccess-AI-Collective/axolotl
Go ahead and axolotl questions
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
pytorch/torchtune
A Native-PyTorch Library for LLM Fine-tuning
neuralmagic/deepsparse
Sparsity-aware deep learning inference runtime for CPUs
facebookresearch/jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
neuralmagic/sparseml
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
FasterDecoding/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
GoogleCloudPlatform/localllm
microsoft/SoM
Set-of-Mark Prompting for LMMs
sustcsonglin/flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
google-deepmind/long-form-factuality
Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".
google-research/adapter-bert
zju3dv/EfficientLoFTR
pytorch/ao
Native PyTorch library for quantization and sparsity
pacman100/LLM-Workshop
LLM Workshop by Sourab Mangrulkar
chiehwangs/gaussian-head
Official repository for 'GaussianHead: High-fidelity Head Avatars with Learnable Gaussian Derivation'
AI4Bharat/indicnlp_corpus
Description Describes the IndicNLP corpus and associated datasets
Vaibhavs10/fast-llm.rs
ParticleMedia/RAGTruth
Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"
elastic/connectors
Source code for all Elastic connectors, developed by the Search team at Elastic, and home of our Python connector development framework
speechnovateur/languagecodec_tmp
Temporary anonymous version
hyparam/hyllama
llama.cpp gguf file parser for javascript
aredden/torch-bnb-fp4
Faster Pytorch bitsandbytes 4bit fp4 nn.Linear ops