tongyx361

Senior undergraduate @ DCST, Tsinghua University. Research intern @hkust-nlp (previously: @THUDM). Interested in LLM & AI for Education/Research/Software Eng.

Tsinghua UniversityBeijing, China

Pinned Repositories

dart-math
[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*
Language:Jupyter Notebook86 1 63
ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
Language:Python1.2k 15 9465
Awesome-LLM-Research
Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise descriptions to help readers get the gist as quickly as possible.
38 1 10
Awesome-LLM4Math
Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise descriptions to help readers get the gist as quickly as possible.
97 2 02
nbdev-template-tongyx361
nbdev template customed by Yuxuan Tong
Language:Python10
OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Language:Python20
reward-by-prm800k
Language:Jupyter Notebook3 1 00
sample-difficulty-adaptive-tuning
Code from project *Sample-Difficulty-Adaptive Supervised Fine-Tuning for Mathematical Reasoning* at course ANN-23F@THU-CST
Language:Jupyter Notebook2 1 00
symeval
Evaluation utilities based on SymPy.
Language:Python9 1 00

tongyx361's Repositories

tongyx361/Awesome-LLM4Math
Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise descriptions to help readers get the gist as quickly as possible.
97 2 02
tongyx361/Awesome-LLM-Research
Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise descriptions to help readers get the gist as quickly as possible.
38 1 10
tongyx361/symeval
Evaluation utilities based on SymPy.
Language:Python9 1 00
tongyx361/reward-by-prm800k
Language:Jupyter Notebook3 1 00
tongyx361/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Language:Python20
tongyx361/sample-difficulty-adaptive-tuning
Code from project *Sample-Difficulty-Adaptive Supervised Fine-Tuning for Mathematical Reasoning* at course ANN-23F@THU-CST
Language:Jupyter Notebook2 1 00
tongyx361/dotfiles
Shawn Yuxuan Tong's dotfiles
Language:Shell1 1 00
tongyx361/execode
Execute code in text efficiently and safely.
Language:Jupyter Notebook10
tongyx361/legal-search
Code for lab of **legal search engine** in *Introduction to Search Engine* (24S) course by Prof. Qingyao Ai @ THU-CST
Language:Jupyter Notebook10
tongyx361/nbdev-template-tongyx361
nbdev template customed by Yuxuan Tong
Language:Python10
tongyx361/OS-2023A-Ex7-1-eBPF
Answer Repo to Ex7.1 @ THU-CST OS-2023A (by Prof. Xiang Yong)
Language:C1 1 00
tongyx361/agent-search
Flexible and efficient LLM agent searching algorithm implementation
Language:Jupyter Notebook
tongyx361/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Language:Jupyter Notebook
tongyx361/cod-lab2
Language:Scala
tongyx361/MixEval
The official evaluation suite and dynamic data release for MixEval.
tongyx361/nbdev-hello-world
nbdev hello world
Language:Jupyter Notebook
tongyx361/nbdev-test
Test repo for nbdev
Language:Python1 0
tongyx361/nextjs-dashboard
Language:TypeScript
tongyx361/OlympiadBench
An Olympiad-level bilingual multimodal scientific benchmark, featuring 8,952 questions from Olympiad-level mathematics and physics competitions, including the Chinese college entrance exam.
tongyx361/OS-2023A-Ex4-1-uCore-batch-OS-procedure-analysis
Answer to THU-CS OS-2023A-Ex4-1 uCore batch OS procedure analysis
1 0
tongyx361/OS-2023A-Ex6-1-Secondary-Page-Table
THU-CST OS-2023A（向勇老师开设）的课后练习 Ex6.1 “二级页表”的解答仓库
Language:Jupyter Notebook1 0
tongyx361/OS-Ex8-1-Python-Simulated-File-System
Answer Repo to Ex8.1 Python-Simulated File System @ THU-CST OS-2023A (by Prof. Yong Xiang)
Language:Python1 0
tongyx361/psp-lab1-viz-fourier-series
Lab 1 *Visualization of Fourier Series* in course *Principles of Signal Processing* by Prof. Jia Jia at DSCT, THU
Language:Jupyter Notebook
tongyx361/Qwen2.5-Math
A series of math-specific large language models of our Qwen2 series.
tongyx361/SpinalTemplateMill
A simple SpinalHDL demo project based on Mill
Language:Scala
tongyx361/tongyx361.github.io
(Shawn) Yuxuan Tong's Homepage
Language:Python1 0
tongyx361/VinePPO
Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"
tongyx361/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python
tongyx361/ZeroEval
A simple unified framework for evaluating LLMs
Language:Python
tongyx361/zhihu-md-pub
Markdown (and related files) to publish on Zhihu.