yukang2017

PhD student at CUHK. Research on LLM, Efficient DL, and Computer Vision.

CUHKHong Kong

yukang2017's Stars

xai-org/grok-1
Grok open release
Language:Python48.9k 547 1958.3k
ludwig-ai/ludwig
Low-code framework for building custom LLMs, neural networks, and other AI models
Language:Python10.9k 193 1.1k1.2k
LargeWorldModel/LWM
Language:Python6.9k 67 64536
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
Language:Python4.1k 41 157382
dvlab-research/MiniGemini
Official implementation for Mini-Gemini
Language:Python2.7k 23 75256
FasterDecoding/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Language:Jupyter Notebook1.9k 34 75125
S-LoRA/S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Language:Python1.5k 24 3778
XueFuzhao/OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
Language:Python1.2k 14 861
deepseek-ai/DeepSeek-MoE
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Language:Python883 13 3237
uclaml/SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
Language:Python844 10 2671
mayuelala/FollowYourClick
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
751 50 825
horseee/DeepCache
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
Language:Python636 16 3831
jzhang38/EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
Language:Python457 9 2629
SkunkworksAI/hydra-moe
Language:Python403 22 1115
zhuzilin/ring-flash-attention
Ring attention implementation with flash attention
Language:Python380 9 1927
FranxYao/Long-Context-Data-Engineering
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
Language:Python363 8 1624
xingyaoww/code-act
Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.
Language:Python265 4 620
VIRL-Platform/VIRL
Code for V-IRL: Grounding Virtual Intelligence in Real Life
Language:Python262 12 27
HKUNLP/ChunkLlama
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
Language:Python230 8 1312
for-ai/parameter-efficient-moe
Language:Python222 17 313
hkust-nlp/AgentBoard
An Analytical Evaluation Board of Multi-turn LLM Agents
Language:SAS194 4 721
dyabel/AnyTool
Language:Python171 39 56
argilla-io/notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach
Language:Python154 6 512
OSU-NLP-Group/TravelPlanner
[ICML'24] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"
Language:Python141 9 1620
THUDM/LongAlign
LongAlign: A Recipe for Long Context Alignment Encompassing Data, Training, and Evaluation
Language:Python117 8 48
jeffreysijuntan/lloco
The official repo for "LLoCo: Learning Long Contexts Offline"
Language:Python91 3 29
Lucky-Lance/Expert_Sparsity
Language:Python51 2 04
lixin4ever/CUHK-PHD-Thesis-Template
CUHK PhD Thesis Template
Language:TeX49 3 019
EnVision-Research/DDSM
Denoising Diffusion Step-aware Models (ICLR2024)
Language:Python45 2 00
FengZicai/LSK3DNet
This is the official implementation of "LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels" (Accepted at CVPR 2024).
20 3 01

yukang2017

yukang2017's Stars

xai-org/grok-1

ludwig-ai/ludwig

LargeWorldModel/LWM

allenai/OLMo

dvlab-research/MiniGemini

FasterDecoding/Medusa

S-LoRA/S-LoRA

XueFuzhao/OpenMoE

deepseek-ai/DeepSeek-MoE

uclaml/SPIN

mayuelala/FollowYourClick

horseee/DeepCache

jzhang38/EasyContext

SkunkworksAI/hydra-moe

zhuzilin/ring-flash-attention

FranxYao/Long-Context-Data-Engineering

xingyaoww/code-act

VIRL-Platform/VIRL

HKUNLP/ChunkLlama

for-ai/parameter-efficient-moe

hkust-nlp/AgentBoard

dyabel/AnyTool

argilla-io/notus

OSU-NLP-Group/TravelPlanner

THUDM/LongAlign

jeffreysijuntan/lloco

Lucky-Lance/Expert_Sparsity

lixin4ever/CUHK-PHD-Thesis-Template

EnVision-Research/DDSM

FengZicai/LSK3DNet