Issues
- 0
- 3
- 1
Make `FileCache` able to detect updated remote files
#1602 opened by NeoLegends - 9
- 0
Torch print step info on crash
#1597 opened by albertz - 3
ConcatSeqsDataset with extended functionality
#1573 opened by Stefanwuu - 1
Make batch_size configurable for cross validation
#1567 opened by michelwi - 0
Torch: gradient_clip wrong when grad_scaler is used
#1590 opened by michelwi - 1
`rf.pack_padded` with PyTorch takes a lot of memory
#1584 opened by albertz - 1
- 16
- 1
Tensor deepcopy does not copy raw_tensor
#1541 opened by albertz - 0
Torch multiple simultaneous gradient_checkpoint_scope
#1583 opened by albertz - 0
Torch gradient_checkpoint_scope potential memory leak
#1582 opened by albertz - 4
- 0
RF parametrization breaks Conv
#1580 opened by albertz - 9
RF weight dropout and variational noise
#1518 opened by albertz - 1
- 7
Gradient checkpointing for weight noise etc in PyTorch
#1552 opened by albertz - 1
Torch: print model at log verbosity 3
#1575 opened by NeoLegends - 0
- 8
DistributeFilesDataset: _distribute_evenly_by_size suboptimal for multi-gpu sharding
#1570 opened by michelwi - 11
multiprocessing: OSError: AF_UNIX path too long
#1571 opened by michelwi - 2
Ignore a single broken gradient
#1568 opened by JackTemaki - 5
- 8
- 0
- 1
Hang in training (often with multi GPU training)
#1558 opened by albertz - 3
- 0
RF scaled_dot_product_attention
#1555 opened by albertz - 1
- 0
SlowMo (BMUF) support for PyTorch distributed training
#1553 opened by albertz - 2
- 18
- 5
- 0
`FileCache`: avoid cache-wide dir lock
#1548 opened by albertz - 0
- 5
Possible race condition in `FileCache`?
#1542 opened by NeoLegends - 6
`DistributeFilesDataset` with sharding on file level
#1531 opened by albertz - 10
- 0
RF BatchNorm running var small diff between TF-layers, pure RF and direct PyTorch, biased vs unbiased
#1539 opened by albertz - 33
Support for larger scale datasets
#1519 opened by albertz - 10
`ConcatFilesDataset` needs a better name
#1535 opened by NeoLegends - 2
- 6
RF torch `lstm` fails with torch amp option.
#1529 opened by LucaG1 - 6
- 5
- 0
PyTorch debug_add_check_numerics_ops
#1522 opened by albertz - 0
RuntimeError: CUDA error: unknown error
#1520 opened by albertz - 12