kevindragon221

Tsinghua University

kevindragon221's Stars

OpenBMB/ChatDev
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
Language:Shell25.5k 310 2623.2k
OpenBMB/MiniCPM
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
Language:Jupyter Notebook7.1k 75 207450
wgwang/awesome-LLMs-In-China
**大模型
5.4k 107 27449
google/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
Language:Python2.9k 51 151590
openai/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python2.4k 176 83783
Neph0s/awesome-llm-role-playing-with-persona
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
544 15 027
Zoeyyao27/CoT-Igniting-Agent
This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
338 5 027
melaniewalsh/Intro-Cultural-Analytics
Introduction to Cultural Analytics & Python, course website and online textbook powered by Jupyter Book
Language:Jupyter Notebook256 10 3786
thu-coai/COLDataset
The official repository of the paper: COLD: A Benchmark for Chinese Offensive Language Detection
210 2 518
thu-coai/SafetyBench
Official github repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024]
Language:Python152 2 86
xuyuzhuang11/OneBit
The homepage of OneBit model quantization framework.
Language:Python152 4 93
THUNLP-MT/StableToolBench
A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.
Language:Python109 3 2013
Edward-Sun/RECITE
Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI
Language:Python91 3 310
MiaoXiong2320/llm-uncertainty
code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"
Language:Python68 3 24
SEACrowd/seacrowd-datahub
A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.
Language:Python66 5 37358
HannahKirk/prism-alignment
The Prism Alignment Project
Language:Jupyter Notebook35 2 11
THUNLP-MT/SKR
Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)
Language:Python22 5 10
xuyuzhuang11/Werewolf
Language:Python21 1 20
CLARIN-PL/personalized-nlp
Language:Jupyter Notebook9 6 22
UKPLab/maps
Multicultural Proverbs and Sayings
Language:Python9 6 00
asaakyan/SocNormNLI
Language:Jupyter Notebook8 1 02
THUNLP-MT/CODIS
Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".
Language:JavaScript8 4 00
astrodrew/CDEval
7 1 00
THUNLP-MT/FIIG
Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions (EMNLP 2023 Findings)
7 5 2
THUNLP-MT/Brote
Language:Python6 4 01
THUNLP-MT/symbol2language
Speak It Out: Solving Symbol-Related Problems with Symbol-to-Language Conversion for Language Models
6 4 01
zhilizju/Culture-mixup
Language:Python4 1 00
JonathanQZheng/Stanceosaurus
Language:Python3 1 02
THUNLP-MT/DEEM
2 5 0
THUNLP-MT/RiC
1 4 00

kevindragon221

kevindragon221's Stars

OpenBMB/ChatDev

OpenBMB/MiniCPM

wgwang/awesome-LLMs-In-China

google/BIG-bench

openai/multiagent-particle-envs

Neph0s/awesome-llm-role-playing-with-persona

Zoeyyao27/CoT-Igniting-Agent

melaniewalsh/Intro-Cultural-Analytics

thu-coai/COLDataset

thu-coai/SafetyBench

xuyuzhuang11/OneBit

THUNLP-MT/StableToolBench

Edward-Sun/RECITE

MiaoXiong2320/llm-uncertainty

SEACrowd/seacrowd-datahub

HannahKirk/prism-alignment

THUNLP-MT/SKR

xuyuzhuang11/Werewolf

CLARIN-PL/personalized-nlp

UKPLab/maps

asaakyan/SocNormNLI

THUNLP-MT/CODIS

astrodrew/CDEval

THUNLP-MT/FIIG

THUNLP-MT/Brote

THUNLP-MT/symbol2language

zhilizju/Culture-mixup

JonathanQZheng/Stanceosaurus

THUNLP-MT/DEEM

THUNLP-MT/RiC