Pinned Repositories
astraios
Astraios: Parameter-Efficient Instruction Tuning Code Language Models
bigcode-dataset
bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
bigcodebench
[ICLR'25] BigCodeBench: Benchmarking Code Generation Towards AGI
SWE-Arena
SWE Arena
DataAug4Code
Source Code Data Augmentation for Deep Learning: A Survey.
ice-score
[EACL 2024] ICE-Score: Instructing Large Language Models to Evaluate Code
llm-benchmark
A list of LLM benchmark frameworks.
PyArmadillo
PyArmadillo: an alternative approach to linear algebra in Python
X-Repo2Run
X-Repo2Run: Configuraing Multilingual Docker Environment via Code Agent
terryyz's Repositories
terryyz/ice-score
[EACL 2024] ICE-Score: Instructing Large Language Models to Evaluate Code
terryyz/DataAug4Code
Source Code Data Augmentation for Deep Learning: A Survey.
terryyz/llm-benchmark
A list of LLM benchmark frameworks.
terryyz/PyArmadillo
PyArmadillo: an alternative approach to linear algebra in Python
terryyz/_peft
terryyz/arxiv-latex-cleaner
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
terryyz/asleep
terryyz/Awesome-LLMs-Evaluation-Papers
The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.
terryyz/bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
terryyz/DECAF
DECAF: Deep Extreme Classification with Label Features
terryyz/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
terryyz/otalign
Gromov-Wasserstein Alignment of Embeddings
terryyz/picard
PICARD - Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models
terryyz/rtt-rethinking
Rethinking Round-trip Translation for Machine Translation Evaluation
terryyz/SimCSE
EMNLP'2021: SimCSE: Simple Contrastive Learning of Sentence Embeddings
terryyz/terryyz
terryyz/AsleepKeyboardDataset
terryyz/big-fc
terryyz/blog
Public repo for HF blog posts
terryyz/evalplus
EvalPlus for rigourous evaluation of LLM-synthesized code
terryyz/I-S00N
terryyz/LLM4SE
Large Language Models for Software Engineering
terryyz/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
terryyz/PyCodeGPT
A pre-trained GPT model for Python code completion and generation
terryyz/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
terryyz/ViLPAct
terryyz/Virtual-FPGA-Board