TensorFlow vs. PyTorch: Learning Rate Decay

Original

2020-02-25 18:46

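When training a neural network, the learning rate sometimes needs to be adjusted as training progresses. This post introduces one way to do that in TensorFlow and one in PyTorch.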

tf.train.exponential_decay()

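TensorFlow provides exponential decay through tf.train.exponential_decay. This is the TensorFlow 1.x name; in TensorFlow 2.x the same function is available as tf.compat.v1.train.exponential_decay.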

tf.train.exponential_decay(learning_rate, global_step=global_step, decay_steps=100, decay_rate=0.99, staircase=True)

Formula:

decayed_learning_rate = learning_rate * decay_rate^(global_step / decay_steps)

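Parameters

learning_rate: the initial learning rate
global_step: a counter that is incremented by 1 on every update
decay_steps: the decay period, in steps
decay_rate: the decay factor
staircase: if True, global_step / decay_steps is an integer division, so the learning rate drops in a staircase pattern and changes only once every decay_steps steps; if False, global_step / decay_steps is a floating-point division, so the learning rate decays continuously and changes at every step.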
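The effect of staircase is easiest to see by evaluating the formula directly. The following is a minimal pure-Python sketch of the formula above, not the TensorFlow implementation; the function is a hypothetical helper that only borrows the TensorFlow argument names, and the initial rate of 0.1 is an illustrative assumption.

```python
import math


def exponential_decay(learning_rate, global_step, decay_steps, decay_rate, staircase=False):
    """Restates the decay formula in plain Python (hypothetical helper, not TensorFlow code)."""
    exponent = global_step / decay_steps
    if staircase:
        # Integer division: the exponent only changes once every decay_steps steps.
        exponent = math.floor(exponent)
    return learning_rate * decay_rate ** exponent


# Illustrative initial rate of 0.1; decay_steps and decay_rate match the call shown earlier.
for step in (0, 50, 100, 250):
    stepped = exponential_decay(0.1, step, decay_steps=100, decay_rate=0.99, staircase=True)
    smooth = exponential_decay(0.1, step, decay_steps=100, decay_rate=0.99, staircase=False)
    print(step, stepped, smooth)
```

With staircase=True the rate at step 50 is still the initial rate, while with staircase=False it has already started to shrink.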

torch.optim.lr_scheduler.StepLR()

PyTorch provides epoch-based learning rate schedulers. StepLR is only one of them.

torch.optim.lr_scheduler.StepLR(optimizer, step_size=100, gamma=0.99, last_epoch=-1)

Formula:

learning_rate * gamma^floor(epoch / step_size)

Here floor rounds down, so the learning rate changes only once every step_size epochs.

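Parameters

optimizer: the optimizer you have defined
step_size: the decay period, i.e. the learning rate is updated once every step_size epochs
gamma (float): the decay factor
last_epoch (int): the index of the last epoch; defaults to -1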

optimizer = torch.optim.SGD(model.parameters(), lr=learning_rate)
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=100, gamma=0.99)

# Afterwards, call scheduler.step() once per epoch, after optimizer.step(), to advance the schedule.
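To show where scheduler.step() fits, here is a minimal sketch of a training loop. The model, the random data, and the initial learning rate of 0.1 are hypothetical stand-ins, and step_size and gamma are shrunk from the values above so the decay is visible within a few epochs.

```python
import torch

# Hypothetical stand-ins for the model and initial learning rate used above.
model = torch.nn.Linear(4, 1)
learning_rate = 0.1

optimizer = torch.optim.SGD(model.parameters(), lr=learning_rate)
# Assumption for illustration: halve the rate every 2 epochs instead of 0.99 every 100.
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=2, gamma=0.5)

lrs = []
for epoch in range(6):
    # Record the learning rate in effect during this epoch.
    lrs.append(optimizer.param_groups[0]["lr"])

    # One dummy training step on random data.
    inputs, targets = torch.randn(8, 4), torch.randn(8, 1)
    loss = torch.nn.functional.mse_loss(model(inputs), targets)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

    # Advance the schedule once per epoch, after optimizer.step().
    scheduler.step()

print(lrs)
```

The recorded rates stay constant for two epochs at a time and are then multiplied by gamma, which matches the staircase behaviour of the formula above.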

Reference: https://www.cnblogs.com/happystudyeveryday/p/11144433.html
