rickyang1114

I'm a Ph.D. student at State Key Lab of CAD&CG, Zhejiang University. I'm currently insterested in Trustworthy AI and LLMs.

Zhejiang UniversityHangzhou

rickyang1114's Stars

locuslab/tofu
Landing Page for TOFU
Language:Python8318
GAIR-NLP/ProX
Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"
Language:Python1307
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
Language:Python4.5k393
akoksal/LongForm
Reverse Instructions to generate instruction tuning data with corpus examples
20310
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
Language:Python72059
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Language:Python3.8k406
hiyouga/LLaMA-Factory-Doc
LLaMA Factory Document
657
THUDM/LongCite
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
Language:Python27718
da03/Internalize_CoT_Step_by_Step
Language:Python947
facebookresearch/rlfh-gen-div
This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity
Language:Python365
allenai/OLMoE
OLMoE: Open Mixture-of-Experts Language Models
Language:Jupyter Notebook39830
MadryLab/context-cite
Attribute (or cite) statements generated by LLMs back to in-context information.
Language:Jupyter Notebook12011
RUC-GSAI/YuLan-Chat
YuLan: An Open-Source Large Language Model
Language:Python54649
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Language:Python10.2k2.3k
xlang-ai/Spider2
Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
Language:HTML833
RUC-GSAI/Llama-3-SynE
Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | 继续预训练提升 Llama-3 的科学推理和中文能力
213
GraySwanAI/nanoGCG
A fast + lightweight implementation of the GCG algorithm in PyTorch
Language:Python8723
xu1998hz/llm_self_bias
This is the project to quantify the issues with LLM's self evaluation
Language:Python5
aqweteddy/ChatVector
Official code for the ACL 2024 paper: Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New Languages.
Language:Python26
tianyi-lab/Superfiltering
[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
Language:Python1058
tianyi-lab/Cherry_LLM
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models
Language:Python28720
magpie-align/magpie
Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!
Language:Python42344
fe1ixxu/CPO_SIMPO
This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods.
Language:Python303
princeton-nlp/SimPO
SimPO: Simple Preference Optimization with a Reference-Free Reward
Language:Python66341
mtdvio/every-programmer-should-know
A collection of (mostly) technical things every software developer should know about
82.9k7.7k
LALBJ/PAI
[ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs
Language:Python511
NATSpeech/NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
Language:Python96699
hongtaoh/cv_emulate
Academic CVs that you can emulate
Language:TeX33241
QwenLM/online_merging_optimizers
Implementations of online merging optimizers proposed by Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment
Language:Python626
iamgroot42/mimir
Python package for measuring memorization in LLMs.
Language:Jupyter Notebook11017

rickyang1114

rickyang1114's Stars

locuslab/tofu

GAIR-NLP/ProX

huggingface/alignment-handbook

akoksal/LongForm

RLHFlow/RLHF-Reward-Modeling

open-compass/opencompass

hiyouga/LLaMA-Factory-Doc

THUDM/LongCite

da03/Internalize_CoT_Step_by_Step

facebookresearch/rlfh-gen-div

allenai/OLMoE

MadryLab/context-cite

RUC-GSAI/YuLan-Chat

NVIDIA/Megatron-LM

xlang-ai/Spider2

RUC-GSAI/Llama-3-SynE

GraySwanAI/nanoGCG

xu1998hz/llm_self_bias

aqweteddy/ChatVector

tianyi-lab/Superfiltering

tianyi-lab/Cherry_LLM

magpie-align/magpie

fe1ixxu/CPO_SIMPO

princeton-nlp/SimPO

mtdvio/every-programmer-should-know

LALBJ/PAI

NATSpeech/NATSpeech

hongtaoh/cv_emulate

QwenLM/online_merging_optimizers

iamgroot42/mimir