Pinned Repositories
1.5-Pints
A compact LLM pretrained in 9 days using high-quality data
Advanced-Python
Awesome-LLM-Synthetic-Data
A reading list on LLM-based Synthetic Data Generation 🔥
Book-Mathematical-Foundation-of-Reinforcement-Learning
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
carbs
Cost-aware hyperparameter tuning algorithm
chat_templates
Chat Templates for 🤗 HuggingFace Large Language Models
cluster-health
cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
crawl4ai
🔥🕷️ Crawl4AI: Open-source LLM-Friendly Web Crawler & Scraper
data_management_LLM
Collection of training data management explorations for large language models
musram's Repositories
musram/1.5-Pints
A compact LLM pretrained in 9 days using high-quality data
musram/Awesome-LLM-Synthetic-Data
A reading list on LLM-based Synthetic Data Generation 🔥
musram/Book-Mathematical-Foundation-of-Reinforcement-Learning
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
musram/chat_templates
Chat Templates for 🤗 HuggingFace Large Language Models
musram/crawl4ai
🔥🕷️ Crawl4AI: Open-source LLM-Friendly Web Crawler & Scraper
musram/data_management_LLM
Collection of training data management explorations for large language models
musram/distilabel
⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.
musram/DistillKit
An Open Source Toolkit For LLM Distillation
musram/dolma
Data and tools for generating and inspecting OLMo pre-training data.
musram/evaluate-llms
Examples of how to evaluate LLMs
musram/exo
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
musram/flash-attention
Fast and memory-efficient exact attention
musram/flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
musram/GPU-Puzzles
Solve puzzles. Learn CUDA.
musram/landwind
Responsive and clean landing page built with Tailwind CSS and Flowbite
musram/llama-agentic-system
Agentic components of the Llama Stack APIs
musram/llm-datasets
High-quality datasets, tools, and concepts for LLM fine-tuning.
musram/LLM-Pretraining-Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
musram/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
musram/LLMTest_NeedleInAHaystack
Simple retrieval from LLMs at various context lengths to measure accuracy
musram/LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
musram/ml-engineering
Machine Learning Engineering Open Book
musram/optillm
Optimizing inference proxy for LLMs
musram/regmix
🧬 RegMix: Data Mixture as Regression for Language Model Pre-training
musram/ring-attention-pytorch
Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in PyTorch
musram/rotary-embedding-torch
Implementation of Rotary Embeddings, from the RoFormer paper, in PyTorch
musram/sailcraft
Data Toolkit for Sailor Language Models
musram/Tau
Tau LLM made with Unity 6 ML Agent
musram/tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
musram/unsloth
Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory