Documentation
Index
Constants
This section is empty.
Variables
This section is empty.
Functions
This section is empty.
Types
type AdaGrad
type AdaGrad struct {
	Config
}
AdaGrad assigns a different learning rate to each parameter, derived from the sum of squares of all of that parameter's historical gradients.
References
Adaptive Subgradient Methods for Online Learning and Stochastic Optimization: http://www.jmlr.org/papers/volume12/duchi11a/duchi11a.pdf