FluxML/Optimisers.jl
Optimisers.jl defines many standard optimisers and utilities for learning loops.
Julia, MIT license
Issues
`nothing` does not correspond to updating the state with a zero gradient.
#140 opened by CarloLucibello - 7
Rename or outsource `iswriteable`
#99 opened by ToucheSir - 2
Frozen parameters
#107 opened by mcabbott - 10
Allow keyword arguments for optimisers
#74 opened by theabhirath - 0
tag v1.0.0
#183 opened by CarloLucibello - 2
AdamW optimizer implemented incorrectly - weight decay does not incorporate learning rate
#182 opened by BioTurboNick - 7
Optimiser state not moving to GPU
#179 opened by vpuri3 - 2
GPU kernels for optimizers
#178 opened by vpuri3 - 4
Grokfast exponential moving average Optimizer
#176 opened by vpuri3 - 4
Restructure makes a copy
#146 opened by linusheck - 0
Document `destructure` handling shared parameters differently to ComponentArrays.jl
#161 opened by mcabbott - 1
`destructure` doesn't work on Dictionaries
#154 opened by mcabbott - 0
Documenter CI is failing
#169 opened by CarloLucibello - 7
Type instability in `Flux.setup`
#162 opened by Vilin97 - 8
Port over rule changes from Flux
#38 opened by ToucheSir - 2
`reset!(optimiser_state)`
#163 opened by Vilin97 - 4
How to handle long compile times?
#153 opened by DrChainsaw - 4
Adam(0) fails
#119 opened by cossio - 7
`OptimiserChain(..., ClipNorm)` fails on GPU
#127 opened by mcabbott - 7
Interface for gradient accumulation
#130 opened by chengchingwen - 5
Optimisers.update fails with gradient of type `CUDA.CUSPARSE.CuSparseMatrixCSC`
#141 opened by hsseung - 4
Documentation error
#131 opened by erlebach - 15
Investigate using a different AD for tests
#96 opened by ToucheSir - 3
Support dictionary of parameters
#114 opened by freddycct - 7
Optimizing scalars
#92 opened by samanklesaria - 2
update! is ambiguous with ComponentArrays
#91 opened by IlyaOrson - 2
support a nice way of changing the learning rate
#88 opened by SobhanMP - 6
Applying `Grads` fails
#76 opened by femtomc - 1
Chaining gpu and cpu models
#69 opened by lungd - 3
Register 0.2
#52 opened by mcabbott - 2
Correct the optimiser's eltype?
#55 opened by mcabbott - 7
Optimise a subset of parameters
#35 opened by mcabbott