zhengkid's Stars
thunlp/GNNPapers
Must-read papers on graph neural networks (GNN)
microsoft/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
triton-lang/triton
Development repository for the Triton language and compiler
hustvl/Vim
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
horseee/Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
nickgkan/3d_diffuser_actor
Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"
coastalcph/lex-glue
LexGLUE: A Benchmark Dataset for Legal Language Understanding in English
wxjiao/ParroT
The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human written translation and evaluation data.
shizhediao/R-Tuning
[NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't Know'"
raymin0223/fast_robust_early_exit
Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)
shiqiangw/iclr-scores
NiuTrans/Introduction-to-Transformers
An introduction to basic concepts of Transformers and key techniques of their recent advances.
MCG-NJU/CoMAE
[AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets
wangclnlp/DeepSpeed-Chat-Extension
This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).
rycolab/differentiable-subset-pruning
HITsz-TMG/Multi-agent-peer-review
Official implementation of our paper "Towards Reasoning in Large Language Models via Multi-Agent Peer Review Collaboration".
OSU-NLP-Group/In-Context-Reranking
Code for "Attention in Large Language Models Yeilds Efficient Zero-Shot Re-Rankers"
xiaotong/MTBook
a tutorial on machine translation (in Chinese)
XuezheMax/mega-image
Implementation of Mega on Image Classification
ZhiYuanZeng/fairseq-moe
zhengkid/PartialFormer
Official code of PartialFormer (ACL2024 Findings)
GeorgeVern/qe-fusion
This repo contains the code for the paper "Don't Rank, Combine! Combining Machine Translation Hypotheses Using Quality Estimation"
yangjiangeyjg/Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
yangjiangeyjg/Awesome-MIM
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)