Issues
- 4
The throughput is extremely low
#233 opened by zxgx - 1
tracking issue: initial version
#7 opened by NOBLES5E - 1
more examples
#214 opened by NOBLES5E - 1
Improve compile time of Rust Extension
#206 opened by williamstar - 1
feat: embedding model definition
#201 opened by williamstar - 1
feat: suppport fp16 embedding
#90 opened by williamstar - 1
PS features require access rules
#73 opened by nealgavin - 1
PS needs to be evicted by business policy
#72 opened by nealgavin - 1
terminating training by nats server signal
#33 opened by williamstar - 2
用honcho启动卡在`SingleMachine training context init done`这里
#232 opened by zxgx - 1
npy is used for small dataset in the example, what data format should be used for large-size data?
#216 opened by xpai - 2
Where is EmbeddingWorkerNatsServicePublisher?
#218 opened by zxgx - 2
请问这套架构为啥用rust写呢?
#217 opened by NHZlX - 1
Does it support Tensorflow?
#215 opened by rabintang - 0
- 2
API documentation
#23 opened by NOBLES5E - 0
organize tests with pytest and pass CI
#15 opened by NOBLES5E - 0
- 0
Generate e2e test from examples
#70 opened by williamstar - 0
More readable PersiaBatch construction
#183 opened by snowpeakz - 3
Generate criteo syn dataset
#196 opened by BLue1881euLB - 0
CI: multi-machine multi-gpu system test
#187 opened by snowpeakz - 2
Missing .env file?
#188 opened by Tokkiu - 0
Load exists checkpoint to continue training
#182 opened by williamstar - 0
- 1
- 0
support pure CPU mode
#21 opened by NOBLES5E - 1
add timeline tools to analyze bottleneck
#92 opened by snowpeakz - 0
feat: accelerate server model dump and load
#36 opened by NOBLES5E - 4
- 0
remove intent in config file
#61 opened by snowpeakz - 0
configurable start_deadlock_detection_thread
#118 opened by snowpeakz - 0
support low memory adagrad sparse optimizer
#91 opened by NOBLES5E - 1
torch ddp launch with tcp instead of env_file
#20 opened by williamstar - 1
feat: dump/load dense model use hdfs
#59 opened by snowpeakz - 1
- 1
ci: add python test and benchmark
#3 opened by NOBLES5E - 1
ci: add pytype check
#12 opened by NOBLES5E - 0
ci: add semver auto release and pypi upload
#1 opened by NOBLES5E - 0
docs: create API documentation website
#2 opened by NOBLES5E