Issues
Searching DCLM-baseline
#101 opened by chtmp223 - 1
Cannot train
#97 opened by camilobrownpinilla - 4
Can bff read file formats other than jsonl?
#93 opened by ethany21 - 3
Training data of model-based filtering
#74 opened by Yu-Shi - 1
tokenization memory usage
#88 opened by brian-ham - 1
How to download pools for smaller scale tracks
#87 opened by arnavmdas - 1
How can I calculate expected-ngram-count?
#90 opened by ethany21 - 2
Reproducing experiments in the paper
#85 opened by normster - 1
buffer write is so slow
#67 opened by Yu-Shi - 2
Cannot Interpret result of bff deduplication
#80 opened by ethany21 - 4
Unable to ray up (part 2)
#79 opened by tonychenxyz - 7
fasttext cannot be found
#78 opened by tonychenxyz - 8
Unable to ray up
#69 opened by tonychenxyz - 2
deduplication removes 98% of my data
#71 opened by Yu-Shi - 9
Missing train_fasttext_classifier.py
#72 opened by yuzc19 - 1
Dedup methods
#54 opened by ch-shin - 2
TypeError: Couldn't cast array of type
#66 opened by shizhediao - 12
What is the pretrain scripts?
#68 opened by mathfinder - 2
Need multi-node training script example
#70 opened by LeoXinhaoLee - 2
Training on data with a fixed order
#65 opened by Yu-Shi - 2
Training crashes after some steps
#62 opened by Yu-Shi - 2
Local data processing (non-AWS)
#52 opened by ryoungj - 1
How to train and fine-tuning model
#34 opened by Jackjiayou - 2
Training variance
#48 opened by ttccxx - 4
What is the pearson correlation in lighteval scores between 1B/400M model and 7B model?
#46 opened by ZefanW - 1
Missing training model configs
#41 opened by ch-shin - 1
Release of Trained Models on DCLM-Baseline
#50 opened by m1k2zoo - 2
Missing FastText Config File
#40 opened by purefall