Optimize Gradient Descent
Closed this issue · 5 comments
ZChosed commented
Possible options to increase speed of convergence using gradient descent:
- fine-tune the learning rate upward (this is dangerous if gradients can grow very large)
- use something like Adagrad to get convergence guarantees on all data sets, even with extreme gradients (see the sketch after this list)
- switch to Newton's method when descent slows past some threshold (defining this will be data dependent and may be hard to find) to converge to a nearby local min very quickly, since Newton's method is quadratically convergent near a minimum
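A minimal NumPy sketch of the Adagrad idea from the second bullet, not code from this repo: the names `grad_fn`, `theta0`, and the stopping tolerance are assumptions made for illustration. The point is that the per-parameter step size shrinks as squared gradients accumulate, which damps occasional huge gradients.

```python
import numpy as np

def adagrad_descent(grad_fn, theta0, lr=0.1, eps=1e-8, max_iters=1000, tol=1e-6):
    """Adagrad-style descent: per-parameter step sizes shrink as squared
    gradients accumulate, so rare very large gradients don't blow up the step."""
    theta = np.asarray(theta0, dtype=float).copy()
    accum = np.zeros_like(theta)          # running sum of squared gradients
    for _ in range(max_iters):
        g = grad_fn(theta)
        accum += g * g
        step = lr * g / (np.sqrt(accum) + eps)
        theta -= step
        if np.linalg.norm(step) < tol:    # stop once updates become tiny
            break
    return theta

# Hypothetical usage: minimize f(x) = x . x, whose gradient is 2x
if __name__ == "__main__":
    print(adagrad_descent(lambda x: 2 * x, np.array([5.0, -3.0])))
```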
ethanuppal commented
@zephanrs also suggested trying SGD
sidharthmrao commented
> @zephanrs also suggested trying SGD
I did too 🥺 😞
sidharthmrao commented
Maybe try sampling something like 7 sets of 4 points per step when computing the loss, as in the sketch below
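A minimal sketch of that sampling scheme, assuming a NumPy array of data points and a batch-gradient function; `points`, `grad_fn`, and the batch counts are placeholders for illustration, not names from this repo. It averages the gradient over 7 random mini-batches of 4 points instead of the whole dataset.

```python
import numpy as np

rng = np.random.default_rng(0)

def sampled_gradient(points, grad_fn, num_batches=7, batch_size=4):
    """Estimate the full-data gradient from a few small random batches
    (here 7 batches of 4 points each), rather than every point."""
    grads = []
    for _ in range(num_batches):
        idx = rng.choice(len(points), size=batch_size, replace=False)
        grads.append(grad_fn(points[idx]))   # gradient on one mini-batch
    return np.mean(grads, axis=0)            # average over sampled batches
```

Each descent step would then use `sampled_gradient` in place of the full-batch gradient, trading a bit of noise for much cheaper iterations.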
ethanuppal commented
> @zephanrs also suggested trying SGD
> I did too 🥺 😞
Well you don't matter
sidharthmrao commented
> @zephanrs also suggested trying SGD
> I did too 🥺 😞
> Well you don't matter