kevindragon221's Stars
OpenBMB/ChatDev
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
wgwang/awesome-LLMs-In-China
Large models
google/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
openai/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Neph0s/awesome-llm-role-playing-with-persona
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
Zoeyyao27/CoT-Igniting-Agent
Paper list accompanying the paper "Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents"
melaniewalsh/Intro-Cultural-Analytics
Introduction to Cultural Analytics & Python, course website and online textbook powered by Jupyter Book
thu-coai/COLDataset
The official repository of the paper: COLD: A Benchmark for Chinese Offensive Language Detection
thu-coai/SafetyBench
Official GitHub repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024]
xuyuzhuang11/OneBit
The homepage of OneBit model quantization framework.
THUNLP-MT/StableToolBench
A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.
Edward-Sun/RECITE
Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI
MiaoXiong2320/llm-uncertainty
Code repo for the ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"
SEACrowd/seacrowd-datahub
A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.
HannahKirk/prism-alignment
The Prism Alignment Project
THUNLP-MT/SKR
Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)
xuyuzhuang11/Werewolf
CLARIN-PL/personalized-nlp
UKPLab/maps
Multicultural Proverbs and Sayings
asaakyan/SocNormNLI
THUNLP-MT/CODIS
Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".
astrodrew/CDEval
THUNLP-MT/FIIG
Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions (EMNLP 2023 Findings)
THUNLP-MT/Brote
THUNLP-MT/symbol2language
Speak It Out: Solving Symbol-Related Problems with Symbol-to-Language Conversion for Language Models
zhilizju/Culture-mixup
JonathanQZheng/Stanceosaurus
THUNLP-MT/DEEM
THUNLP-MT/RiC