crazyofapple's Stars
Re-Align/just-eval
A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.
HITsz-TMG/UMOE-Scaling-Unified-Multimodal-LLMs
Code for the paper "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
miso-belica/jusText
Heuristic-based boilerplate removal tool
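For orientation, a minimal Python sketch of jusText's documented usage; the URL is a placeholder and requests is only used to fetch the page:

    import requests
    import justext

    # Fetch an HTML page (placeholder URL) and strip boilerplate such as
    # navigation, ads, and footers using jusText's heuristics.
    response = requests.get("https://example.com/some-article.html")
    paragraphs = justext.justext(response.content, justext.get_stoplist("English"))
    for paragraph in paragraphs:
        if not paragraph.is_boilerplate:
            print(paragraph.text)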
princeton-nlp/USACO
Can Language Models Solve Olympiad Programming?
XuezheMax/megalodon
Reference implementation of the Megalodon 7B model
HITsz-TMG/ICL-State-Vector
thunlp/UltraChat
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
WJMacro/ContinualMT
A Continual Learning framework for Neural Machine Translation
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
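A hedged Python sketch of running alpaca_eval on a file of model generations; it assumes the package exposes evaluate at the top level and accepts a path to a JSON file with "instruction" and "output" fields, as the project docs describe, and the file path here is a placeholder:

    # Assumes `pip install alpaca-eval` and OPENAI_API_KEY set for the GPT-based annotator.
    from alpaca_eval import evaluate  # assumed top-level export, per the project docs

    evaluate(
        model_outputs="outputs.json",          # placeholder: your model's generations
        annotators_config="alpaca_eval_gpt4",  # built-in annotator config name
    )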
voidism/DoLa
Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"
stanfordnlp/pyreft
ReFT: Representation Finetuning for Language Models
Hritikbansal/dove
stanfordnlp/string2string
String-to-String Algorithms for Natural Language Processing
hiyouga/LLaMA-Factory
Unified, efficient fine-tuning of 100+ LLMs
likenneth/othello_world
Emergent world representations: Exploring a sequence model trained on a synthetic task
tatsu-lab/alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
HKUNLP/icl-ceil
[ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.
xai-org/grok-1
Grok open release
jihoontack/MAC
Online Adaptation of Language Models with a Memory of Amortized Contexts
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
deeplearning-wisc/args
IBM/ModuleFormer
ModuleFormer is a MoE-based architecture with two types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based language models (MoLM) ranging from 4 billion to 8 billion parameters.
UIC-Liu-Lab/CPT
[EMNLP 2022] Continual Training of Language Models for Few-Shot Learning
thunlp/ELLE
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
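A hedged sketch of running inference on the released OLMo checkpoint through Hugging Face transformers rather than the repository's own training code; it assumes the published allenai/OLMo-7B model ID and a transformers version that can load it (older versions needed the companion hf_olmo package):

    # Not the OLMo training/eval entry point; just a quick way to run inference
    # on the released checkpoint via Hugging Face transformers (assumption: the
    # installed transformers version supports OLMo or trust_remote_code is allowed).
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("allenai/OLMo-7B", trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained("allenai/OLMo-7B", trust_remote_code=True)

    inputs = tokenizer("Language modeling is ", return_tensors="pt")
    outputs = model.generate(**inputs, max_new_tokens=32)
    print(tokenizer.decode(outputs[0], skip_special_tokens=True))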
nathanhu0/CaMeLS
Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.
gmftbyGMFTBY/Rep-Dropout
[NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective
joeljang/continual-knowledge-learning
[ICLR 2022] Towards Continual Knowledge Learning of Language Models
EnnengYang/Awesome-Forgetting-in-Deep-Learning
A Comprehensive Survey of Forgetting in Deep Learning Beyond Continual Learning. arXiv:2307.09218.