THUKEG

ChatGLM, GLM-4, CogVLM, CodeGeeX, CogView, ImageReward, CogVideoX | CogDL, GraphMAE, AMiner | Zhipu.ai (Z.ai) & Knowledge Engineering Group (KEG)

FIT Building, Tsinghua University

Pinned Repositories

AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Language:Python2.8k 25 152195
AgentTuning
AgentTuning: Enabling Generalized Agent Abilities for LLMs
Language:Python1.5k 16 53106
CogDL
CogDL: A Comprehensive Library for Graph Deep Learning (WWW 2023)
Language:Python1.8k 41 118308
GLM
GLM (General Language Model)
Language:Python3.3k 45 193329
LongBench
LongBench v2 and LongBench (ACL 25'&24')
Language:Python968 8 107103
LongWriter
[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Language:Python1.7k 19 37168
P-tuning-v2
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
Language:Python2.1k 29 78204
slime
slime is a LLM post-training framework for RL Scaling.
Language:Python1.8k160
T1
RL Scaling and Test-Time Scaling (ICML'25)
113 13 01
WebGLM
WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)
Language:Python1.6k 24 70138

THUKEG's Repositories

THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Language:Python2.8k 25 152195
THUDM/slime
slime is a LLM post-training framework for RL Scaling.
Language:Python1.8k160
THUDM/LongWriter
[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Language:Python1.7k 19 37168
THUDM/WebGLM
WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)
Language:Python1.6k 24 70138
THUDM/SwissArmyTransformer
SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.
Language:Python1.1k 31 8298
THUDM/LongBench
LongBench v2 and LongBench (ACL 25'&24')
Language:Python968 8 107103
THUDM/ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
Language:Python664 3 2551
THUDM/LongCite
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
Language:Python503 12 1632
THUDM/WebRL
Building Open LLM Web Agents with Self-Evolving Online Curriculum RL
Language:Python455 15 5031
THUDM/LongAlign
[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs
Language:Python256 7 1521
THUDM/Android-Lab
Language:Python236 12 1615
THUDM/VisualAgentBench
Towards Large Multimodal Models as Visual Foundation Agents
Language:Python236 12 158
THUDM/T1
RL Scaling and Test-Time Scaling (ICML'25)
113 13 01
THUDM/Awesome-Parameter-Efficient-Fine-Tuning-for-Foundation-Models
Parameter-Efficient Fine-Tuning for Foundation Models
93 9 03
THUDM/CogKit
Finetuning and inference tools for the CogView4 and CogVideoX model series.
Language:Python878
THUDM/INFTY
INFTY Engine: An Optimization Toolkit to Support Continual AI
Language:Python66
THUDM/TreeRL
TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25
Language:Python603
THUDM/SWE-Dev
[ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.
Language:Python55
THUDM/WhoIsWho
KDD'23 Web-Scale Academic Name Disambiguation: the WhoIsWho Benchmark, Leaderboard, and Toolkit
Language:Python42 8 516
THUDM/DataSciBench
DataSciBench: An LLM Agent Benchmark for Data Science
Language:Python32 8 23
THUDM/MoELoRA_Riemannian
Source code of paper: A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models. (ICML 2025)
Language:Python32 1 00
THUDM/DeepDive
DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL
21
THUDM/scholar-profiling
Language:Jupyter Notebook17 9 41
THUDM/AndroidGen
Language:Python91
THUDM/ReST-RL
Efficient Two-Stage Reinforcement Learning for LLMs
Language:Python8
THUDM/BiPro
code and data for Paper: BIPro: Zero-shot Chinese Poem Generation via Block Inverse Prompting Constrained Generation Framework(ACL 2025 main)
Language:Python7 8 0
THUDM/AlignMMBench
code, data and model for Paper: AlignMMBench: Evaluating Chinese Multimodal Alignment in Large Vision-Language Models (ACL'25 main)
Language:Python51
THUDM/z-ai-sdk-typescript
Typescript SDK for Z.ai - Not yet released.
Language:TypeScript5
THUDM/BattleAgentBench
Language:Python4 8 0
THUDM/MobileRL
4