tongyx361

Senior undergraduate @ DCST, Tsinghua University. Research intern @hkust-nlp (previously: @THUDM). Interested in LLM & AI for Education/Research/Software Eng.

Tsinghua UniversityBeijing, China

tongyx361's Stars

twitter/the-algorithm
Source code for Twitter's Recommendation Algorithm
Language:Scala62.8k 341 97912.2k
xai-org/grok-1
Grok open release
Language:Python49.8k 597 2148.3k
dair-ai/ml-visuals
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
14.1k 116 501.4k
mozillazg/python-pinyin
汉字转拼音(pypinyin)
Language:Python5k 99 270617
xlang-ai/OpenAgents
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
Language:Python4.1k 47 100458
openai/transformer-debugger
Language:Python4.1k 26 14241
pyutils/line_profiler
Line-by-line profiling for Python
Language:Python2.8k 16 104121
openai/simple-evals
Language:Python2.2k 28 15186
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Language:Jupyter Notebook1.6k 8 156248
SakanaAI/evolutionary-model-merge
Official repository of Evolutionary Optimization of Model Merging Recipes
Language:Python1.3k 41 1195
openai/following-instructions-human-feedback
1.2k 135 7142
lxneng/xpinyin
Translate Chinese hanzi to pinyin (拼音) by Python, 汉字转拼音
Language:Python827 45 39177
ruixiangcui/AGIEval
Language:Python721 9 2748
sylinrl/TruthfulQA
TruthfulQA: Measuring How Models Imitate Human Falsehoods
Language:Jupyter Notebook644 8 1677
noxdafox/pebble
Multi threading and processing eye-candy.
Language:Python575 10 11654
FloridSleeves/LLMDebugger
LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step
Language:Python472 6 1547
meta-math/MetaMath
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
Language:Python404 7 2838
huggingface/datablations
Scaling Data-Constrained Language Models
Language:Jupyter Notebook329 33 719
OpenBMB/Eurus
Language:Python297 11 1114
web-arena-x/visualwebarena
VisualWebArena is a benchmark for multimodal agents.
Language:Python272 5 5152
lm-sys/arena-hard
Arena-Hard benchmark
Language:Jupyter Notebook200 7 1411
princeton-nlp/LLMBar
[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following
Language:Python120 7 48
OpenBMB/OlympiadBench
[ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.
Language:Python113 5 108
tongyx361/Awesome-LLM4Math
Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise descriptions to help readers get the gist as quickly as possible.
101 2 02
qtli/GSM-Plus
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
Language:Python54 1 35
tongyx361/Awesome-LLM-Research
Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise descriptions to help readers get the gist as quickly as possible.
45 1 10
liyucheng09/LatestEval
Latest Evaluation Toolkit (LatestEval). Assessing the language models with latest, uncontaminated materials.
Language:Python22 1 00
midas-research/mathify
An extensive mathematics dataset called MathQuest sourced from the 11th and 12th standard Mathematics NCERT textbooks.
Language:Jupyter Notebook6 3 0
nii-yamagishilab/mla
A Multi-Level Attention Model for Evidence-Based Fact Checking
Language:Python6 4 03
zhaochenyang20/data_mining
数据挖掘
Language:Python6 2 0

tongyx361

tongyx361's Stars

twitter/the-algorithm

xai-org/grok-1

dair-ai/ml-visuals

mozillazg/python-pinyin

xlang-ai/OpenAgents

openai/transformer-debugger

pyutils/line_profiler

openai/simple-evals

tatsu-lab/alpaca_eval

SakanaAI/evolutionary-model-merge

openai/following-instructions-human-feedback

lxneng/xpinyin

ruixiangcui/AGIEval

sylinrl/TruthfulQA

noxdafox/pebble

FloridSleeves/LLMDebugger

meta-math/MetaMath

huggingface/datablations

OpenBMB/Eurus

web-arena-x/visualwebarena

lm-sys/arena-hard

princeton-nlp/LLMBar

OpenBMB/OlympiadBench

tongyx361/Awesome-LLM4Math

qtli/GSM-Plus

tongyx361/Awesome-LLM-Research

liyucheng09/LatestEval

midas-research/mathify

nii-yamagishilab/mla

zhaochenyang20/data_mining