Xiao9905's Stars
guidance-ai/guidance
A guidance language for controlling large language models.
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
THUDM/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs
academicpages/academicpages.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
THUDM/CodeGeeX2
CodeGeeX2: A More Powerful Multilingual Code Generation Model
abhishekkrthakur/approachingalmost
Approaching (Almost) Any Machine Learning Problem
THUDM/CogVLM
A state-of-the-art-level open visual language model | Multimodal pre-trained model
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs
princeton-nlp/tree-of-thought-llm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
google-deepmind/alphageometry
THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
openai/simple-evals
neulab/prompt2model
prompt2model - Generate Deployable Models from Natural Language Instructions
THUDM/AgentTuning
AgentTuning: Enabling Generalized Agent Abilities for LLMs
facebookresearch/MetaCLIP
ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
THUDM/AutoWebGLM
An LLM-based Web Navigating Agent (KDD'24)
THUDM/LongBench
[ACL 2024] LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
salesforce/DialogStudio
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI
THUDM/AlignBench
A multi-dimensional Chinese alignment evaluation benchmark for large language models (ACL 2024)
thu-coai/BPO
thu-coai/SafetyBench
Official GitHub repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024]
ServiceNow/WorkArena
WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?
thu-coai/CritiqueLLM
THUDM/ChatGLM-Math
Eikor/InstructPLM
The first large protein language model trained to follow structure instructions.
THUDM/SciGLM
SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024)
THUDM/NaturalCodeBench
NaturalCodeBench (Findings of ACL 2024)
THUDM/Self-Contrast
Extensive Self-Contrast Enables Feedback-Free Language Model Alignment
benjamintenny/mongo-python-driver-proxy
PyMongo with proxy support