koalazf99's Stars
ServiceNow/Fast-LLM
Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research
mangiucugna/json_repair
A Python module to repair invalid JSON, commonly used to parse the output of LLMs
likaixin2000/MMCode
[EMNLP 2024] Multi-modal reasoning problems via code generation.
e2b-dev/E2B
Secure open source cloud runtime for AI apps & AI agents
richards199999/Thinking-Claude
Let your Claude think
GAIR-NLP/Entropy-ABF
Official implementation for 'Extending LLMs’ Context Window with 100 Samples'
ekinakyurek/marc
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"
xu3kev/BARC
Bootstrapping ARC
anishathalye/auriga
Auriga is a minimalist LaTeX beamer presentation theme 📽
argilla-io/argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
OpenCoder-llm/OpenCoder-llm
The Open Cookbook for Top-Tier Code Large Language Models
bklieger-groq/g1
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
sail-sg/oat
🌾 OAT: Online AlignmenT for LLMs
NVIDIA/Cosmos-Tokenizer
A suite of image and video neural tokenizers
huggingface/llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
THUDM/Android-Lab
openai/Video-Pre-Training
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
etched-ai/open-oasis
Inference script for Oasis 500M
ayaka14732/tpu-starter
Everything you want to know about Google Cloud TPU
yixiaoer/tpux
A set of Python scripts that makes your experience on TPU better
sail-sg/zero-bubble-pipeline-parallelism
Zero Bubble Pipeline Parallelism
deanmalmgren/textract
extract text from any document. no muss. no fuss.
skypilot-org/skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
THUDM/GLM-4-Voice
GLM-4-Voice | End-to-end Chinese-English spoken dialogue model
zorazrw/agent-workflow-memory
AWM: Agent Workflow Memory
Xiao9905/AutoGLM
cxcscmu/Montessori-Instruct
Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning
mit-han-lab/vila-u
VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
HKUNLP/STRING
Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"
ranpox/awesome-computer-use
This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.