sandkoan's Stars
karpathy/llm.c
LLM training in simple, raw C/CUDA
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
marimo-team/marimo
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
mistralai/mistral-finetune
nus-apr/auto-code-rover
A project-structure-aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% of tasks (pass@1) in SWE-bench lite and 46.2% of tasks (pass@1) in SWE-bench verified, with each task costing less than $0.7.
mlabonne/llm-datasets
High-quality datasets, tools, and concepts for LLM fine-tuning.
RManLuo/Awesome-LLM-KG
Awesome papers about unifying LLMs and KGs
EleutherAI/math-lm
SafeAILab/EAGLE
Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)
open-compass/MixtralKit
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI
mirage-project/mirage
Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA
facebookresearch/searchformer
Official codebase for the paper "Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping".
jondurbin/bagel
A bagel, with everything.
virattt/financial-datasets
Financial datasets for LLMs 🧪
isi-nlp/Zoph_RNN
C++/CUDA toolkit for training sequence and sequence-to-sequence models across multiple GPUs
HITsz-TMG/awesome-llm-attributions
A Survey of Attributions for Large Language Models
Vance0124/Token-level-Direct-Preference-Optimization
Reference implementation for Token-level Direct Preference Optimization (TDPO)
logix-project/logix
AI Logging for Interpretability and Explainability 🔬
walkerdb/supreme_court_transcripts
Max-Fu/tvl
nateraw/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
google-deepmind/pix2act
foobarbaz-inc/conversation-memory-streamlit
Demo of ConversationEntityMemory in Streamlit.
vroomai/live
🎧 Chat with Ableton Live in your browser.
cognitivecomputations/agenticworker
kozistr/triton-grpc-proxy-rs
Proxy server in Rust for a Triton gRPC server that runs embedding-model inference
symbolica-ai/gap-sys
Rust bindings to GAP (Groups, Algorithms, Programming)
talaviram/OpenSpoken
Real-Time Transcription for Apple Devices using Apple’s Speech Recognition
neefrehman/millzbot
A GPT-2 bot trained on my boss's tweets, and a guide to making your own