
[PaddlePaddle Hackathon 2] 12. Add the OneCycleLR learning rate scheduler for Paddle #41825

Merged
merged 18 commits into from May 16, 2022

Conversation

Asthestarsfalll
Contributor

@Asthestarsfalll Asthestarsfalll commented Apr 14, 2022

PR types

Others

PR changes

APIs

Describe

Resolves issue #40322.
Adds the API paddle.optimizer.lr.OneCycleLR, a learning rate scheduler that, over the course of training, adjusts the learning rate from the initial learning rate up to the maximum learning rate and then down to the minimum learning rate.
Design doc: PaddlePaddle/community#29
Chinese documentation: PaddlePaddle/docs#4713
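
A minimal usage sketch (values below are illustrative only; the parameter names follow the API as it ends up in this PR):

```python
import paddle

# Warm up from max_learning_rate / divide_factor towards max_learning_rate,
# then anneal down to end_learning_rate over total_steps updates.
linear = paddle.nn.Linear(10, 10)
scheduler = paddle.optimizer.lr.OneCycleLR(
    max_learning_rate=1.0, total_steps=100,
    phase_pct=0.3, anneal_strategy='cos', three_phase=False)
sgd = paddle.optimizer.SGD(learning_rate=scheduler,
                           parameters=linear.parameters())

for step in range(100):
    x = paddle.uniform([4, 10])
    loss = paddle.mean(linear(x))
    loss.backward()
    sgd.step()
    sgd.clear_grad()
    scheduler.step()  # the learning rate should be updated every step
```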

@paddle-bot-old paddle-bot-old bot added contributor External developers status: proposed labels Apr 14, 2022
@paddle-bot-old

paddle-bot-old bot commented Apr 14, 2022

✅ This PR's description meets the template requirements!
Please wait for other CI results.

@paddle-bot-old
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

@Asthestarsfalll Asthestarsfalll changed the title from Onecyclelr to [PaddlePaddle Hackathon 2] 12. Add the OneCycleLR learning rate scheduler for Paddle on Apr 14, 2022
@paddle-bot-old
The format inspection passed. Your PR will be reviewed by experts of Paddle and developers from the open-source community. Stay tuned.

@@ -467,6 +535,25 @@ def test_scheduler(self):
with self.assertRaises(ValueError):
paddle.optimizer.lr.MultiStepDecay(
learning_rate=0.5, milestones=[1, 2, 3], gamma=2)
with self.assertRaises(TypeError):
Contributor

Please add comments noting which invalid-input case each of these tests targets.
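
For example, a commented negative test could look like this (the specific invalid value is an illustrative assumption, presuming the scheduler rejects unknown anneal_strategy values):

```python
# anneal_strategy only supports 'cos' and 'linear'; anything else should raise ValueError.
with self.assertRaises(ValueError):
    paddle.optimizer.lr.OneCycleLR(
        max_learning_rate=0.1, total_steps=20, anneal_strategy='exp')
```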

Contributor Author

OK.

Contributor Author

Fixed.


class OneCycleLR(LRScheduler):
r"""
Sets the learning rate according to the 1cycle learning rate scheduler.
Contributor

The documentation must not be copied wholesale from someone else's; please write and polish it in your own words.

Contributor Author

The API description referenced PyTorch's docs and those of other LRSchedulers; the parameter section was written from my own understanding. I will polish it further shortly.



class OneCycleLR(LRScheduler):
r"""
Contributor

If the code is based on someone else's implementation, you need to comply with the open-source license and add a note citing the source.

Contributor Author

Understood.

Contributor Author

Fixed.

which claims that “unpublished work has shown even better results by using only two phases”.
Set ``three_phase=True`` if you want the behaviour of this scheduler to be consistent with the paper.

Also note that you should update learning rate each step.

This implementation was adapted from PyTorch.
Contributor

The reference needs to point to a specific file or function line, e.g. "adapted from [file URL]".

Contributor Author

Fixed.

zhiboniu
zhiboniu previously approved these changes Apr 26, 2022
@TCChenlong
Contributor

Please add the Chinese documentation and put its link in the Describe section.

@Asthestarsfalll
Contributor Author

Asthestarsfalll commented Apr 29, 2022

Please add the Chinese documentation and put its link in the Describe section.

Added~

``final_divide_factor`` respectively.
total_steps (int, optional): Number of total training steps.
Note that one of total_steps and (epochs, steps_per_epoch) must be specified.
If ``total_steps`` is not specified, it will be determined by ``epochs`` and ``steps_per_epoch``. Default: None.
Contributor

Default: None, means xxx.

Contributor Author

If total_steps is not specified, it will be determined by epochs and steps_per_epoch.

This part already states the default behavior, so it is not repeated afterwards.

pct_start (float): The percentage of total steps, which is used to increase the learning rate. Default: 0.3.
anneal_strategy (str, optional): Strategy of adjusting learning rate.'cos' for cosine annealing,
'linear' for linear annealing. Default: 'cos'.
divide_factor (float, optional): Initial learning rate will be determined by initial_lr = max_lr/div_factor. Default: 25.
Contributor

max_lr -> max_learning_rate
div_factor -> divide_factor
Keep them consistent; the same applies elsewhere.

Contributor Author

Fixed.

anneal_strategy (str, optional): Strategy of adjusting learning rate.'cos' for cosine annealing,
'linear' for linear annealing. Default: 'cos'.
divide_factor (float, optional): Initial learning rate will be determined by initial_lr = max_lr/div_factor. Default: 25.
final_divide_factor (float, optional): Minimum learning rate will be determined by minimum = max_lr/final_divide_factor. Default: 1e4.
Contributor

Keep the formulas in the Chinese and English docs consistent.

Contributor Author

Fixed.


Examples:
.. code-block:: python
import paddle
Contributor

`import paddle` must be separated from the directive above by a blank line, otherwise there will be formatting problems; also add some blank lines throughout the example code to keep it readable~

Contributor Author

Fixed.

@Asthestarsfalll Asthestarsfalll dismissed stale reviews from DDDivano and TCChenlong via 98f9a9e May 10, 2022 13:22
@Asthestarsfalll
Contributor Author

@zhiboniu Updated.

three_phase=False,
last_epoch=-1,
verbose=False):
# Check type and value of end_lr
Contributor Author

In the parameter list, max_learning_rate was renamed to learning_rate to align with the other schedulers in Paddle;
the extra epochs and steps_per_epoch parameters were removed, since in practice users can simply call
OneCycleLR(learning_rate=0.1, total_steps=epochs*steps_per_epoch)
max_lr will be inferred from the scale_factor parameter;
min_lr directly takes a float.

Contributor

The learning_rate parameter does not need this change; please keep the original name.
This touches on what users actually care about in practice: with OneCycleLR, the learning rate setting that matters most is max_lr, because it directly affects training quality and whether the model converges, and it has a clear, concrete meaning. The initial and final learning rates are usually just very small values without precise significance. So I still recommend keeping max_learning_rate as the parameter name.
Please change this part back; the other changes look fine to me.

Contributor

It is fine to borrow good designs from others; we are not asking you to reject them entirely. Just make sure the main implementation code and the description are written independently.

Contributor Author

OK~ updated.

min_lr = float(end_lr)

if three_phase:
if phase_pct >= 0.5:
Contributor Author

Added an exception check that phase_pct must not exceed 0.5 in the three-phase case.
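
A minimal sketch of that check (the error message wording is illustrative):

```python
if three_phase:
    if phase_pct >= 0.5:
        raise ValueError(
            "When 'three_phase' is True, 'phase_pct' must be less than 0.5, "
            "but got {}".format(phase_pct))
```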

self._start_steps[2] - self._start_steps[1],
self._start_steps[3] - self._start_steps[2],
self._start_steps[3] - self._start_steps[2],
]
Contributor Author

The step sizes here are obtained by subtracting elements of self._start_steps, because re-writing them as separate arithmetic expressions would introduce discrepancies due to inexact floating-point arithmetic.
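
Roughly the pattern being described, as a standalone sketch; the boundary formulas and the helper name _phase_boundaries are illustrative, only the subtraction idea mirrors the diff above:

```python
def _phase_boundaries(total_steps, phase_pct):
    # Hypothetical three-phase boundaries (the exact expressions in the PR may differ).
    start_steps = [
        0,
        phase_pct * total_steps - 1,       # end of the warm-up phase
        2 * phase_pct * total_steps - 2,   # end of the first annealing phase
        total_steps - 1,                   # last step
    ]
    # Derive each phase length by subtracting adjacent boundaries rather than
    # re-evaluating the float expressions, so the lengths stay exactly
    # consistent with the boundaries despite floating-point rounding.
    step_sizes = [
        start_steps[1] - start_steps[0],
        start_steps[2] - start_steps[1],
        start_steps[3] - start_steps[2],
        start_steps[3] - start_steps[2],  # repeated for the last step
    ]
    return start_steps, step_sizes
```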

Sets the learning rate according to the one cycle learning rate scheduler.
The scheduler adjusts the learning rate from an initial learning rate to the maximum learning rate and then
from that maximum learning rate to the minimum learning rate, which is much less than the initial learning rate.

Contributor Author

I do not see a better way to phrase this description; it is already concise and clear.

break
start_step = end_step

return computed_lr
Contributor Author

The unit-test code was not changed much; it still follows the previous logic.

self.total_steps - 1,
self.total_steps - 1, # for the last step.
]
# step size of each phase.
Contributor Author

Made another small change here so that get_lr needs fewer computations; also added some comments.
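
For illustration, a sketch of how the per-step lookup can reduce to a walk over the precomputed boundaries; every name below (one_cycle_lr, boundary_steps, step_sizes, lr_points, anneal_func) is a placeholder, not necessarily what the PR uses:

```python
def one_cycle_lr(current_step, boundary_steps, step_sizes, lr_points, anneal_func):
    # Walk the precomputed phase boundaries; only one division and one
    # interpolation are needed per call, everything else was computed once.
    start_step = 0
    for i, end_step in enumerate(boundary_steps):
        if current_step <= end_step or i == len(boundary_steps) - 1:
            percent = (current_step - start_step) / step_sizes[i]
            return anneal_func(lr_points[i], lr_points[i + 1], percent)
        start_step = end_step
```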

zhiboniu
zhiboniu previously approved these changes May 12, 2022
Contributor

@zhiboniu zhiboniu left a comment

LGTM

@Asthestarsfalll
Contributor Author

@TCChenlong @DDDivano Please help with the follow-up review!

TCChenlong
TCChenlong previously approved these changes May 13, 2022
Contributor

@TCChenlong TCChenlong left a comment

LGTM

DDDivano
DDDivano previously approved these changes May 13, 2022
max_learning_rate,
total_steps,
divide_factor=25.,
end_lr=0.0001,
Contributor

I suggest using the full name here, since max_learning_rate above uses the full form; keep it consistent:
end_lr -> end_learning_rate

Contributor Author

Fixed~
I used end_lr here mainly because I noticed that some earlier schedulers, such as PolynomialDecay, also use end_lr.

Contributor

@XiaoguangHu01 XiaoguangHu01 left a comment

LG API

anneal_strategy='cos',
three_phase=False,
last_epoch=-1,
verbose=False):
Contributor

There are 11 parameters in total for the OneCycleLR API in the RFC, but only 9 here. Which is right? The RFC and the code must be consistent.

Contributor Author

I have submitted a new pull request to update the RFC file.
