yahoo/TensorFlowOnSpark
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
PythonApache-2.0
Issues
- 0
I have been trying to use TensorFlowOnSpark in Azure Synapse Analytics and I would like to ask if you have any information about its compatibility in this environment
#608 opened by 4ndresveg4 - 3
Model Saved with TF-2.5.0
#576 opened by doufs - 1
error while running mnist_tf_ds.py
#607 opened by jordanFisherYzw - 1
yarn mode error
#606 opened by jordanFisherYzw - 1
Evalator hangs while training
#589 opened by jiqiujia - 3
do we support scala & java code write tensorflow model with tenorflow-core-api ?
#588 opened by mullerhai - 3
can it run use ParameterServerStrategy
#587 opened by Coder-Yifan - 1
can it run on tensorflow-cpu?
#584 opened by ioslide - 3
- 12
How to integrate a model into Spark cluster
#579 opened by jahidhasanlinix - 1
Get stuck at "Added broadcast_0_piece0 in memory on" while runing Spark standalone cluster
#580 opened by icszhr - 3
Performance issues in examples/mnist/estimator (by P3)
#573 opened by DLPerf - 2
tensorflow.python.framework.errors_impl.UnimplementedError: File system scheme 'cosn' not implemented
#575 opened by doufs - 2
Retaining original columns after inference
#574 opened by dilta - 2
- 2
Performance issues in the program
#572 opened by DLPerf - 2
Writing checkpoints to HDFS takes long
#561 opened by sat2000pts - 7
TimeoutError: [Errno 110] Connection timed out
#559 opened by kwameali - 4
INFO:tensorflow:Waiting for model to be ready. Ready_for_local_init_op: Variables not initialized: hid_w, hid_b, sm_w, sm_b, global_step, hid_w/Adagrad, hid_b/Adagrad, sm_w/Adagrad, sm_b/Adagrad, ready: None
#549 opened by Berwin77 - 1
use tf.estimator
#547 opened by time-py - 2
Your dataset iterator ran out of data interrupting testing when adding validation dataset
#541 opened by FrancoisMasson1990 - 3
failed when Waiting for TFSparkNodes to start
#540 opened by OUCWIND - 3
Does TensorFlowOnSpark support overlap between computation and communication while training?
#539 opened by orwa-te - 1
set spark.task.cpus=cores in example submit script to allow tensorflow on spark utilize multi-thread environment.
#537 opened by cmxcn - 9
Wide and Deep example for new TensorFlowOnSpark
#531 opened by arunraman - 1
the doubt about the data policy
#570 opened - 3
pkg_resources.DistributionNotFound: The 'tensorflow' distribution was not found and is required by the application
#568 opened by Curry-whs - 2
MNIST example - Exception in TF background thread
#569 opened by Ipsedo - 2
when using mnist_spark.py , serializer.dump_stream Timeout while feeding partition
#562 opened by zhangqianjin - 11
- 5
1.4.4 exception: Timeout while feeding partition
#518 opened by MaQianheng - 2
Monitor Bandwidth Utilization of Nodes While Training
#548 opened by orwa-te - 2
TensorBoard files gets deleted, Profiler returns 0 Millis for communication time!
#550 opened by orwa-te - 27
- 1
- 5
- 13
fails to save model
#521 opened by OUCWIND - 2
requests.exceptions.ConnectionError: HTTPConnectionPool(host='storage.googleapis.com', port=80):
#535 opened by yolur - 1
Parameter server number confusion
#533 opened by orwa-te - 1
Splitting data between workers confusion
#532 opened by orwa-te - 16
A problem while invoking cluster.inference(dataRDD) (the process is hanging up and cannot end)
#508 opened by guoyuhaoaaa - 12
Stack on Yarn cluster mode
#519 opened by Alwaysproblem - 11
- 2
Running on AWS EMR gets failed
#513 opened by sydsim - 2
Handling big dataset with YARN and tfos
#516 opened by macro128 - 3
How to keep extra column when transform?
#520 opened by OUCWIND - 1
- 5
- 0
TypeError: interleave() missing 1 required positional argument: 'cycle_length' while running mnist example
#514 opened by siyu1992 - 5
Not able to run multiple tensorflowonspark tasks at the same time when TFOS_SERVER_PORT is configured
#510 opened by qzhong711