Pinned Repositories
Attention-Alignment-Transformer-Length-Extrapolation
BookQA
EvoPress
GST
KERPLE
large_language_monkeys
llm-foundry
LLM training code for Databricks foundation models
Poly-Encoder
structured_dialogue_discourse_parsing
zero_shot_dialogue_disentanglement
chijames's Repositories
chijames/Poly-Encoder
chijames/GST
chijames/KERPLE
chijames/zero_shot_dialogue_disentanglement
chijames/structured_dialogue_discourse_parsing
chijames/Attention-Alignment-Transformer-Length-Extrapolation
chijames/BookQA
chijames/large_language_monkeys
chijames/llm-foundry
LLM training code for Databricks foundation models
chijames/pytorch-struct
Fast, general, and tested differentiable structured prediction in PyTorch
chijames/regular_gpt
chijames/RULER
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
chijames/T5-Attention-Alignment
chijames/Parallel-Context-Windows
use pcw for long context
chijames/PruneMe
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
chijames/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.