Andrew Ng Deep Learning Notes - C2W3 - Hyperparameter Tuning, Batch Normalization, and Programming Frameworks - Quiz

C2W3 Quiz - Hyperparameter tuning, Batch Normalization, Programming Frameworks

Answer key (questions and answers are restated in full below):

1. False
2. False
3. C (the amount of computational power you can access)
4. B (sample on a log scale of 1 - β)
5. False
6. C (z^[l])
7. B (to avoid division by zero)
8. B, E
9. A (exponentially weighted averages of μ and σ² from training)
10. A, B (the first two statements)

1. If searching among a large number of hyperparameters, you should try values in a grid rather than random values, so that you can carry out the search more systematically and not rely on chance. True or False?

  •  False
  •  True

Note: Try random values rather than a grid search, because you don't know in advance which hyperparameters matter more than others.

To take an extreme example from the lecture: suppose hyperparameter two were the value ε in the denominator of the Adam algorithm. Your choice of α matters a lot, while your choice of ε hardly matters, so a grid would waste most of its trials re-testing the same few α values.
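To see what random sampling buys you, here is a minimal sketch (not part of the quiz; the search ranges for α and ε are assumptions chosen for illustration):

import numpy as np

rng = np.random.default_rng(0)

n_trials = 25
for _ in range(n_trials):
    alpha = 10 ** rng.uniform(-4, 0)       # learning rate, log-uniform in [1e-4, 1]
    epsilon = 10 ** rng.uniform(-8, -4)    # Adam epsilon, log-uniform in [1e-8, 1e-4]
    # train_and_evaluate(alpha, epsilon)   # hypothetical training call goes here
# 25 random trials test 25 distinct values of the important alpha;
# a 5 x 5 grid would test only 5 distinct alpha values.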

 

2. Every hyperparameter, if set poorly, can have a huge negative impact on training, and so all hyperparameters are about equally important to tune well. True or False?

  •  False
  •  True

Note: We've seen in lecture that some hyperparameters, such as the learning rate, are more critical than others.

 

3. During hyperparameter search, whether you try to babysit one model (“Panda” strategy) or train a lot of models in parallel (“Caviar”) is largely determined by:

  •  Whether you use batch or mini-batch optimization
  •  The presence of local minima (and saddle points) in your neural network
  •  The amount of computational power you can access
  •  The number of hyperparameters you have to tune

Ans: The amount of computational power you can access

 

4. If you think β (the hyperparameter for momentum) is between 0.9 and 0.99, which of the following is the recommended way to sample a value for β?

import numpy as np

r = np.random.rand()        # r is uniform in [0, 1)
beta = 1 - 10 ** (-r - 1)   # 1 - beta is log-uniform in (0.01, 0.1], so beta lies in [0.9, 0.99)
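Sampling on a log scale of 1 - β is the right choice because β controls an effective averaging window of roughly 1/(1 - β) examples, so sensitivity is highest when β is near 1. A quick endpoint check (illustration only; plain Python):

for r in (0.0, 1.0):                 # endpoints of the sampling range for r
    beta = 1 - 10 ** (-r - 1)
    window = 1 / (1 - beta)          # approximate number of values averaged
    print(f"beta = {beta:.2f} averages over ~{window:.0f} values")
# beta = 0.90 averages over ~10 values; beta = 0.99 averages over ~100.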

 

5. Finding good hyperparameter values is very time-consuming. So typically you should do it once at the start of the project, and try to find very good hyperparameters so that you don’t ever have to revisit tuning them again. True or false?

  •  False
  •  True

Note: Minor changes in your model could require you to find good hyperparameters again from scratch.

 

6. In batch normalization as presented in the videos, if you apply it on the lth layer of your neural network, what are you normalizing?

  • z^[l]

 

7. In the normalization formula z_norm^(i) = (z^(i) - μ) / √(σ² + ε), why do we use ε?

  • To avoid division by zero

 

8. Which of the following statements about γ and β in Batch Norm are true? (Only the correct options are listed below.)

  • They can be learned using Adam, Gradient descent with momentum, or RMSprop, not just with gradient descent.
  • They set the mean and variance of the linear variable z^[l] of a given layer.
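
Tying questions 6-8 together, here is a minimal numpy sketch of the batch norm forward pass for layer l (an illustration under assumed names and shapes, not the course's reference code):

import numpy as np

def batchnorm_forward(Z, gamma, beta, eps=1e-8):
    """Normalize the pre-activations Z = z^[l] over a mini-batch.

    Z: shape (n_units, m) for a mini-batch of m examples.
    gamma, beta: learnable scale/shift parameters, shape (n_units, 1).
    """
    mu = np.mean(Z, axis=1, keepdims=True)      # per-unit mean over the batch
    var = np.var(Z, axis=1, keepdims=True)      # per-unit variance over the batch
    Z_norm = (Z - mu) / np.sqrt(var + eps)      # eps avoids division by zero (question 7)
    Z_tilde = gamma * Z_norm + beta             # gamma/beta set the mean and variance (question 8)
    return Z_tilde, mu, var

# gamma and beta are learned by backpropagation with any optimizer:
# gradient descent, gradient descent with momentum, RMSprop, or Adam.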

 

9. After training a neural network with Batch Norm, at test time, to evaluate the neural network on a new example you should:

  • Perform the needed normalizations, use μ and σ^2 estimated using an exponentially weighted average across mini-batches seen during training.
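
One way those running estimates could be maintained, as a hedged sketch (the momentum value 0.9 and all names are assumptions for illustration):

import numpy as np

def update_running_stats(running_mu, running_var, mu_batch, var_batch, momentum=0.9):
    """Exponentially weighted averages of mini-batch statistics,
    updated after each training mini-batch."""
    running_mu = momentum * running_mu + (1 - momentum) * mu_batch
    running_var = momentum * running_var + (1 - momentum) * var_batch
    return running_mu, running_var

def batchnorm_test_time(z, running_mu, running_var, gamma, beta, eps=1e-8):
    """At test time, normalize a new example with the running estimates
    instead of statistics computed on the (possibly size-1) test batch."""
    z_norm = (z - running_mu) / np.sqrt(running_var + eps)
    return gamma * z_norm + beta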

 

10. Which of these statements about deep learning programming frameworks are true? (Check all that apply)

  •  A programming framework allows you to code up deep learning algorithms with typically fewer lines of code than a lower-level language such as Python.
  •  Even if a project is currently open source, good governance of the project helps ensure that it remains open in the long term, rather than becoming closed or modified to benefit only one company.
  •  Deep learning programming frameworks require cloud-based machines to run.

Ans: the first two statements. (The third is false: frameworks run on local machines as well and do not require the cloud.)
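
To illustrate the "fewer lines of code" point, here is a minimal TensorFlow 2 sketch in the spirit of the course's framework demo, minimizing J(w) = w² - 10w + 25 (an assumed setup, not the official assignment code):

import tensorflow as tf

w = tf.Variable(0.0)
optimizer = tf.keras.optimizers.SGD(learning_rate=0.1)

for _ in range(100):
    with tf.GradientTape() as tape:
        cost = w ** 2 - 10 * w + 25          # J(w) = (w - 5)^2, minimized at w = 5
    grads = tape.gradient(cost, [w])         # the framework derives gradients automatically
    optimizer.apply_gradients(zip(grads, [w]))

print(w.numpy())                             # approaches 5.0 after training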