Issues
- 1
Weights are shared across the MLP layers
#8 opened by 0seba - 1
- 2
About 'Cross-Attention_head'
#6 opened by xinlong-yang - 2
OOM issue
#4 opened by ShivangiAg - 1
Eagle implementation
#5 opened by FatPigeorz - 5
The difference and connection between _grounded_proposal and _ungrounded_proposal
#3 opened by reflectionie - 1
how to train 30B model
#2 opened by MeJerry215 - 1
`hydra/data/partition_train_test.py` is missing
#1 opened by ttim