strngelet's Stars
tech-srl/RASP
An interpreter for RASP as described in the ICML 2021 paper "Thinking Like Transformers"
RulinShao/LightSeq
Official repository for LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers
admineral/GPT4-Vision-React-Starter
Early Alpha Release: Chat with Your Image - Leveraging GPT-4 Vision and Function Calls for AI-Powered Image Analysis and Description
THUDM/LongAlign
[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs
alibaba/Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
DaoD/INTERS
This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"
swj0419/detect-pretrain-code-contamination
supabase-community/vercel-ai-chatbot
A full-featured, Supabaseified Next.js AI chatbot built by Vercel Labs & Supabase
mistralai/mistral-inference
Official inference library for Mistral models
karpathy/llama2.c
Inference Llama 2 in one file of pure C
tairov/llama2.mojo
Inference Llama 2 in one file of pure 🔥
jquesnelle/yarn
YaRN: Efficient Context Window Extension of Large Language Models
Ki6an/fastT5
⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.
meta-llama/codellama
Inference code for CodeLlama models
microsoft/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration