ZexiLee
2020 - 2025: PhD Student in Artificial Intelligence @ Zhejiang University
2016 - 2020: Bachelor in Engineering @ Zhejiang University
@ZhejiangUniversity
Hangzhou, Zhejiang, China
ZexiLee's Stars
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
afatcoder/LeetcodeTop
A collection of high-frequency LeetCode problems commonly asked in interviews at major internet companies 🔥
KindXiaoming/pykan
Kolmogorov Arnold Networks
state-spaces/mamba
Mamba SSM architecture
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
arcee-ai/mergekit
Tools for merging pretrained large language models.
TheNetAdmin/zjuthesis
Zhejiang University Graduation Thesis LaTeX Template
zjunlp/EasyEdit
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
KaiyangZhou/CoOp
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
THUDM/WebGLM
WebGLM: An Efficient Web-enhanced Question Answering System (KDD 2023)
SakanaAI/evolutionary-model-merge
Official repository of Evolutionary Optimization of Model Merging Recipes
AGI-Edgerunners/LLM-Adapters
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"
NUS-HPC-AI-Lab/Neural-Network-Parameter-Diffusion
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
princeton-nlp/ALCE
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
RAIVNLab/MRL
Code repository for the paper - "Matryoshka Representation Learning"
cccntu/minLoRA
minLoRA: a minimal PyTorch library that allows you to apply LoRA to any PyTorch model.
mlfoundations/model-soups
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
OpenMOSS/CoLLiE
Collaborative Training of Large Language Models in an Efficient Way
rui-ye/OpenFedLLM
wpeebles/G.pt
Official PyTorch Implementation of "Learning to Learn with Generative Models of Neural Network Checkpoints"
nlpyang/geval
Code for paper "G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment"
SonyResearch/COALA
COALA: A Practical and Vision-Centric Federated Learning Platform, accepted to ICML'24
abhishekpanigrahi1996/Skill-Localization-by-grafting
zjuchenlong/Thesis-latex
my Ph.D. thesis (Zhejiang University)
yifei-he/Localize-and-Stitch
Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic
didizhu-zju/Model-Tailor