koalazf99's Stars
karpathy/LLM101n
LLM101n: Let's build a Storyteller
unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
adbar/trafilatura
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
CosmosShadow/gptpdf
Using GPT to parse PDF
mistralai/mistral-finetune
microsoft/Phi-3CookBook
This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open sourced AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.
deepseek-ai/DeepSeek-Coder-V2
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
jackmpcollins/magentic
Seamlessly integrate LLMs as Python functions
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
OpenLLMAI/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
mlfoundations/dclm
DataComp for Language Models
GAIR-NLP/anole
Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation
magpie-align/magpie
Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!
google/aqt
bigcode-project/bigcodebench
BigCodeBench: Benchmarking Code Generation Towards AGI
leanprover/vscode-lean4
Visual Studio Code extension for the Lean 4 proof assistant
zhaoyu-li/DL4TP
[COLM 2024] A Survey on Deep Learning for Theorem Proving
keirp/OpenWebMath
xlang-ai/Spider2-V
[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
sail-sg/regmix
š§¬ RegMix: Data Mixture as Regression for Language Model Pre-training
GAIR-NLP/OlympicArena
This is the official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI"
epfml/schedules-and-scaling
Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"
ChenWu98/agent-attack
[Arxiv 2024] Adversarial attacks on multimodal agents
LLM360/k2-train
GAIR-NLP/MoPS
[ACL 2024] Code for "MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation"
koalazf99/Awesome-DataCentric-LLM
Trending projects & awesome papers about data-centric llm studies.
crux-eval/eval-arena
zhxieml/remiss-jailbreak
young-geng/tpu_pod_commander
TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.