Batch size, mini-batch, iterations and epoch

Gradient descent is an iterative algorithm that computes the gradient of a function and uses it to update the function's parameters in order to find a minimum (or maximum) of that function. In the case of neural networks, the function to be optimized (minimized) is the loss function, and the parameters are the weights and biases of the network.
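As a minimal sketch of what one update step looks like (the toy quadratic loss, starting point, and learning rate below are illustrative assumptions, not part of the original answer), gradient descent repeatedly moves the parameters a small step against the gradient:

    # Toy loss L(w) = (w - 3)^2, whose minimum is at w = 3 (illustrative example)
    def grad(w):
        return 2 * (w - 3)       # dL/dw

    w = 0.0                      # initial parameter value
    lr = 0.1                     # learning rate (step size)
    for _ in range(100):         # repeat the update until approximately converged
        w = w - lr * grad(w)     # step against the gradient to decrease the loss
    print(w)                     # approaches the minimizer w = 3

In a neural network the idea is the same, except that w collects all weights and biases and grad is computed by backpropagation over a batch of training instances.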

Number of iterations (n): The number of times the gradient is estimated and the parameters of the neural network are updated using a batch of training instances. The batch size B is the number of training instances used in one iteration.

When the total number of training instances (N) is large, a small number of training instances (B<<N) which constitute a mini-batch can be used in one iteration to estimate the gradient of the loss function and update the parameters of the neural network.

It takes n (= N/B) iterations to use the entire training set once; this constitutes one epoch. So the total number of times the parameters get updated is (N/B)*E, where E is the number of epochs.
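For example, with N = 10,000 training instances and a batch size of B = 100, one epoch takes n = 10,000 / 100 = 100 iterations; training for E = 20 epochs therefore performs 100 * 20 = 2,000 parameter updates.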

Three modes of gradient descent:

Batch mode: B = N, so one epoch is the same as one iteration.

Mini-batch mode: 1 < B < N, so one epoch consists of N/B iterations.

Stochastic mode: B = 1, so one epoch takes N iterations.

Note: the above assumes N is a multiple of B. Otherwise an epoch takes floor(N/B) + 1 iterations (i.e. ceil(N/B)), with the final mini-batch containing fewer than B instances.
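Below is a minimal sketch of mini-batch gradient descent in Python/NumPy for a linear model with squared-error loss (the synthetic data, learning rate, batch size, and epoch count are assumptions chosen for illustration, not part of the original answer). It reshuffles the data each epoch, performs ceil(N/B) iterations per epoch, and handles the case where N is not a multiple of B by taking a smaller final batch; setting B = N or B = 1 instead would give batch mode or stochastic mode respectively.

    import numpy as np

    # Illustrative data: N instances, D features, linear target plus noise (assumed setup)
    rng = np.random.default_rng(0)
    N, D = 1000, 3
    X = rng.normal(size=(N, D))
    true_w = np.array([2.0, -1.0, 0.5])
    y = X @ true_w + 0.1 * rng.normal(size=N)

    w = np.zeros(D)    # parameters to learn
    b = 0.0
    lr = 0.1           # learning rate
    B = 64             # mini-batch size (1 < B < N -> mini-batch mode)
    E = 20             # number of epochs

    iters_per_epoch = int(np.ceil(N / B))   # equals N/B when N is a multiple of B

    for epoch in range(E):
        perm = rng.permutation(N)           # reshuffle the training set once per epoch
        for i in range(iters_per_epoch):
            idx = perm[i * B:(i + 1) * B]   # the last batch may be smaller than B
            Xb, yb = X[idx], y[idx]
            # gradient of the mean squared error, estimated on the mini-batch
            err = Xb @ w + b - yb
            grad_w = 2 * Xb.T @ err / len(idx)
            grad_b = 2 * err.mean()
            # one iteration = one parameter update
            w -= lr * grad_w
            b -= lr * grad_b

    # total parameter updates performed: E * iters_per_epoch
    print(w, b)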
