rickyang1114
I'm a Ph.D. student at the State Key Lab of CAD&CG, Zhejiang University. I'm currently interested in Trustworthy AI and LLMs.
Zhejiang University, Hangzhou
rickyang1114's Stars
locuslab/tofu
Landing Page for TOFU
GAIR-NLP/ProX
Official Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
akoksal/LongForm
Reverse Instructions to generate instruction tuning data with corpus examples
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
hiyouga/LLaMA-Factory-Doc
LLaMA Factory Document
THUDM/LongCite
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
da03/Internalize_CoT_Step_by_Step
facebookresearch/rlfh-gen-div
This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity
allenai/OLMoE
OLMoE: Open Mixture-of-Experts Language Models
MadryLab/context-cite
Attribute (or cite) statements generated by LLMs back to in-context information.
RUC-GSAI/YuLan-Chat
YuLan: An Open-Source Large Language Model
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
xlang-ai/Spider2
Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
RUC-GSAI/Llama-3-SynE
Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | Continual pre-training to improve Llama-3's scientific reasoning and Chinese capabilities
GraySwanAI/nanoGCG
A fast + lightweight implementation of the GCG algorithm in PyTorch
xu1998hz/llm_self_bias
This project quantifies the issues with LLM self-evaluation
aqweteddy/ChatVector
Official code for the ACL 2024 paper: Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New Languages.
tianyi-lab/Superfiltering
[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
tianyi-lab/Cherry_LLM
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models
magpie-align/magpie
Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!
fe1ixxu/CPO_SIMPO
This repository combines the CPO and SimPO methods for better reference-free preference learning.
princeton-nlp/SimPO
SimPO: Simple Preference Optimization with a Reference-Free Reward
mtdvio/every-programmer-should-know
A collection of (mostly) technical things every software developer should know about
LALBJ/PAI
[ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs
NATSpeech/NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
hongtaoh/cv_emulate
Academic CVs that you can emulate
QwenLM/online_merging_optimizers
Implementations of the online merging optimizers proposed in "Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment"
iamgroot42/mimir
Python package for measuring memorization in LLMs.