Issues
- 2
Cannot Interpret result of bff deduplication
#80 opened by ethany21 - 4
Unable to ray up (part 2)
#79 opened by tonychenxyz - 3
- 2
- 7
fasttext cannot be found
#78 opened by tonychenxyz - 8
Unable to ray up
#69 opened by tonychenxyz - 2
- 1
Training data of model-based filtering
#74 opened by Yu-Shi - 2
deduplication removes 98% of my data
#71 opened by Yu-Shi - 9
- 2
Missing train_fasttext_classifier.py
#72 opened by yuzc19 - 1
Dedup methods
#54 opened by ch-shin - 2
TypeError: Couldn't cast array of type
#66 opened by shizhediao - 1
buffer write is so slow
#67 opened by Yu-Shi - 12
What is the pretrain scripts?
#68 opened by mathfinder - 2
Need multi-node training script example
#70 opened by LeoXinhaoLee - 8
Ray Actor dies during tokenization process
#24 opened by humzaiqbal - 2
Training on data with a fixed order
#65 opened by Yu-Shi - 2
- 8
Training crashes after some steps
#62 opened by Yu-Shi - 2
- 5
- 2
- 2
- 4
- 1
- 6
Missing files or bugs in evaluation code?
#31 opened by ch-shin - 1
Local data processing (non-AWS)
#52 opened by ryoungj - 2
Any web demo?
#30 opened by MontaEllis - 1
How to train and fine-tuning model
#34 opened by Jackjiayou - 2
- 6
Training variance
#48 opened by ttccxx - 4
- 1
What is the pearson correlation in lighteval scores between 1B/400M model and 7B model?
#46 opened by ZefanW - 1
Missing training model configs
#41 opened by ch-shin - 1
Release of Trained Models on DCLM-Baseline
#50 opened by m1k2zoo - 2
- 1
- 1
Missing FastText Config File
#40 opened by purefall - 1
Missing scale configs?
#27 opened by ch-shin - 1
- 7
- 10
Unable to run `eval/eval_openlm_ckpt.py`
#22 opened by dwadden - 12
ArrowConversionError when running tokenization
#20 opened by humzaiqbal - 3
Causal Transformer for Perplexity
#11 opened by akshayg08 - 1
Would you share the 0.28T token dataset for achieve highest scores in 7B-2x experiment?
#21 opened by xinghuang2050 - 2
Tokenization file missing
#19 opened by humzaiqbal - 4
Accessing S3 bucket dcnlp-west
#16 opened by humzaiqbal - 2
Data download script
#18 opened by ch-shin - 2