koalazf99

AI Research @GAIR-NLP | Ex @microsoft, @xlang-ai

Shanghai Jiao Tong UniversityShanghai

koalazf99's Stars

ServiceNow/Fast-LLM
Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research
Language:Python7312
mangiucugna/json_repair
A python module to repair invalid JSON, commonly used to parse the output of LLMs
Language:Python1.3k69
likaixin2000/MMCode
[EMNLP 2024] Multi-modal reasoning problems via code generation.
Language:Python17
e2b-dev/E2B
Secure open source cloud runtime for AI apps & AI agents
Language:HTML7.2k476
richards199999/Thinking-Claude
Let your Claude able to think
Language:TypeScript10.5k1.2k
GAIR-NLP/Entropy-ABF
Official implementation for 'Extending LLMs’ Context Window with 100 Samples'
Language:Python763
ekinakyurek/marc
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"
Language:Python26424
xu3kev/BARC
Bootstrapping ARC
Language:Python849
anishathalye/auriga
Auriga is a minimalist LaTeX beamer presentation theme 📽
Language:TeX34532
argilla-io/argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
Language:Python4.1k388
OpenCoder-llm/OpenCoder-llm
The Open Cookbook for Top-Tier Code Large Language Model
Language:Python1.5k91
bklieger-groq/g1
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
Language:Python4.1k371
sail-sg/oat
🌾 OAT: Online AlignmenT for LLMs
Language:Python746
NVIDIA/Cosmos-Tokenizer
A suite of image and video neural tokenizers
Language:Python1k26
huggingface/llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
Language:Python24624
THUDM/Android-Lab
Language:Python1768
openai/Video-Pre-Training
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Language:Python1.4k146
etched-ai/open-oasis
Inference script for Oasis 500M
Language:Python1.7k144
ayaka14732/tpu-starter
Everything you want to know about Google Cloud TPU
Language:Python50430
yixiaoer/tpux
A set of Python scripts that makes your experience on TPU better
Language:Python421
sail-sg/zero-bubble-pipeline-parallelism
Zero Bubble Pipeline Parallelism
Language:Python29514
deanmalmgren/textract
extract text from any document. no muss. no fuss.
Language:HTML3.9k611
skypilot-org/skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
Language:Python6.9k532
THUDM/GLM-4-Voice
GLM-4-Voice | 端到端中英语音对话模型
Language:Python2.5k200
zorazrw/agent-workflow-memory
AWM: Agent Workflow Memory
Language:Python22419
Xiao9905/AutoGLM
Language:JavaScript394
cxcscmu/Montessori-Instruct
Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning
Language:Python363
mit-han-lab/vila-u
VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Language:Python1893
HKUNLP/STRING
Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"
Language:Python673
ranpox/awesome-computer-use
This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.
1461

koalazf99

koalazf99's Stars

ServiceNow/Fast-LLM

mangiucugna/json_repair

likaixin2000/MMCode

e2b-dev/E2B

richards199999/Thinking-Claude

GAIR-NLP/Entropy-ABF

ekinakyurek/marc

xu3kev/BARC

anishathalye/auriga

argilla-io/argilla

OpenCoder-llm/OpenCoder-llm

bklieger-groq/g1

sail-sg/oat

NVIDIA/Cosmos-Tokenizer

huggingface/llm-swarm

THUDM/Android-Lab

openai/Video-Pre-Training

etched-ai/open-oasis

ayaka14732/tpu-starter

yixiaoer/tpux

sail-sg/zero-bubble-pipeline-parallelism

deanmalmgren/textract

skypilot-org/skypilot

THUDM/GLM-4-Voice

zorazrw/agent-workflow-memory

Xiao9905/AutoGLM

cxcscmu/Montessori-Instruct

mit-han-lab/vila-u

HKUNLP/STRING

ranpox/awesome-computer-use