tensorflow/mesh

Mesh TensorFlow: Model Parallelism Made Easier

PythonApache-2.0

Issues

Error while importing Meshtensorflow
#396 opened a year ago by billygrahamram
0
Optimizer momentums not properly populated training model with DTensors
#393 opened 2 years ago by pentney
1
AttributeError: module 'tensorflow.python.framework.ops' has no attribute 'register_tensor_conversion_function'
#392 opened 2 years ago by Xnhyacinth
4
Does load-balanced loss help the loss converge？
#391 opened 2 years ago by mathfinder
0
Future of this project?
#181 opened 5 years ago by Mistobaan
2
When running BERT on GPU: Resource exhausted: failed to allocate memory
#383 opened 3 years ago by Currycurrycurry
1
Getting "NanLossDuringTrainingError: NaN loss during training."
#379 opened 3 years ago by dhruval-p
0
mask_1_flat and mask_2_flat applied to gates twice?
#378 opened 3 years ago by marhlder
0
Debug in mesh Tensorflow
#235 opened 4 years ago by patrickvonplaten
3
Mesh-tf model conversion to onnx?
#368 opened 3 years ago by b-analyst
2
About the mixture of expert model
#369 opened 3 years ago by fym0503
0
How to freeze embedding layers
#364 opened 3 years ago by lintangsutawika
0
Beam search
#362 opened 3 years ago by antonio-mastropaolo
0
the `model_executor.py` example is broken
#278 opened 4 years ago by XMaster96
0
Ability to add Custom Tensorflow Hooks
#352 opened 4 years ago by trisongz
0
[MOE-transformer] How do you build static graph of MOE-Model?
#330 opened 4 years ago by imyzx2017
0
How to use tf.contrib.opt.ScipyOptimizerInterface or tfp.optimizer.lbfgs_minimize with MeshTF ?
#328 opened 4 years ago by harshil-patel-code
0
How to assign values to specific slice of a data block on a specific GPU?
#324 opened 4 years ago by harshil-patel-code
0
performing the opposite of mtf.lowering
#318 opened 4 years ago by DavidPeleg6
1
Performance on GPUs and multiple GPU support
#80 opened 5 years ago by nict-wisdom
12
[Wrong Code Comments] In moe.py, there are two wrong code comments
#305 opened 4 years ago by xiaodathereal
0
MeshTF + pipeline parallelism?
#194 opened 4 years ago by eric-haibin-lin
0
OpenNMT-tf
#280 opened 4 years ago by vvjn
0
mtf.dropout is inverted
#162 opened 5 years ago by shawwn
0
Tensorflow Mesh needs documentation. Will this be provided anytime soon?
#276 opened 4 years ago by shyamalschandra
1
error when learning_rate_schedule is a callable
#265 opened 4 years ago by marton-avrios
0
different target score when using logits from sample_autoregressive
#262 opened 4 years ago by marton-avrios
0
Mesh tensorflow support for multi-node
#201 opened 4 years ago by assij
5
bias in selfAttention
#253 opened 4 years ago by wintersurvival
0
more memory occupation in first device
#243 opened 4 years ago by wintersurvival
1
AttributeError: module 'mesh_tensorflow' has no attribute 'auto_mtf'
#247 opened 4 years ago by zaccharieramzi
4
Does this supports tf 2 keras API?
#239 opened 4 years ago by GF-Huang
0
Memory issues when using the "distillation" class
#231 opened 4 years ago by danyaljj
1
Appropriate values for model_parallelism and tokens_per_batch to train a t5.small_ssm model on v3_512, v3_1024 and v3_2048 TPUs
#232 opened 4 years ago by sbhaktha
1
Predict vs Eval functionality
#223 opened 4 years ago by bhavanadalvi
0
Finetuning a `bfloat16` checkpoint with `float32`
#178 opened 5 years ago by saareliad
0
Preventing leak in packed sequences
#173 opened 5 years ago by saareliad
0
Communication Between TPU Cores and Encoder->Reduce->Decoder Pattern
#156 opened 5 years ago by hyang0129
0
PROBLEM=./mesh_tensorflow/transformer/gin/problems/lm1b.gin
#153 opened 5 years ago by ulapopov
0
README.md is outdated
#152 opened 5 years ago by ulapopov
0
Convolution layers in mesh tensorflow
#151 opened 5 years ago by taless474
0
tf2 in mesh_tensorflow/utils.py incompatible with tensor2tensor/rl
#146 opened 5 years ago by snowbabyjia
0
Split along layers
#144 opened 5 years ago by leogao2
0
[Bug] brackets missing
#137 opened 5 years ago by AsukiLiu
0
[Bug Fix] Evaluation and Prediction for Aligned model
#123 opened 5 years ago by agemagician
1
mixed precision support on GPUs
#101 opened 5 years ago by LiweiPeng
0
Capture performance profile using Tensorboard
#78 opened 5 years ago by mcompute
0
PermissionError: [WinError 32] The process cannot access the file because it is being used by another process
#70 opened 5 years ago by samaritanhu
0
SelfAttention & EncDecAttention in mesh transformer allow different values for query, key, value
#57 opened 5 years ago by desperadoola
0
Could you please set to False the default value of ignore_comments?
#38 opened 5 years ago by rodrigo-eai
0