Issues
- 0
Model for genomic sequences
#36 opened by Prachiiitd - 0
Variable error with the full_bigbird_mask method in the multi head attention class
#35 opened by BetikuOluwatobi - 9
Pre-trained model for genomic sequences
#2 opened by ptynecki - 0
- 1
reproduce arxiv classification task
#20 opened by liuyang148 - 1
Why is BigBird Pegasus/Pegasus Repeating the Same Sentence for Summarization?
#29 opened by Kevin-Patyk - 5
I've added bigbird's attention to my model, but not seeing a decrease in memory
#33 opened by Currie32 - 0
Any plan to provide chinese pretrain model ?
#32 opened by DSXiangLi - 3
Export predictions for each example
#28 opened by jtfields - 0
Are encoder and decoder both implemented with sparse attention? How long is the verified output length for the decoder?
#30 opened by dongxinghua - 1
TFDS Custom Dataset Issue - normalizer.cc(51) LOG(INFO) precompiled_charsmap is empty. use identity normalization.
#27 opened by jtfields - 0
Differences between ETC and BigBird-ETC version
#26 opened by lhl2017 - 0
How is Prior Arts, which can only accept short text input, evaluated on long text datasets.
#25 opened by cmd0714 - 0
code error in version of tensorflow?
#24 opened by jaekyoungkim - 0
- 0
What's the difference of bigbr_base and bigbr_base_tf2 at the gs://bigbird-transformer/pretrain ?
#21 opened by liuyang148 - 3
- 1
- 1
- 0
- 0
- 0
- 1
- 1
- 0
- 0
- 1
Pegasus variables mapping
#9 opened by huseinzol05 - 0
- 1
Would you like to release the code about how to train a bigbird with other language
#5 opened by RyanHuangNLP - 0
- 1
Preprocessing code for TriviaQA dataset
#4 opened by sjy1203 - 4
Roberta Training
#3 opened by agemagician - 3
Question about pre-trained weights
#1 opened by patrickvonplaten