A Second look at Exponential and Cosine Step Sizes:
Simplicity, Adaptivity, and Performance
Xiaoyu Li 1 Zhenxun Zhuang 2 Francesco Orabona 1 2 3
Abstract

Stochastic Gradient Descent (SGD) is a popular tool in traini ...

typically scale better with the complexity of the predictors and the amount of training data than convex ones. One such example is deep neural networks. Over ...