【親測】神經網絡訓練時出現loss=nan或loss不變的解決辦法

原創

2020-06-09 04:53

今天用最原始的tensorfow.nn.conv2d構建一個三層CNN網絡並基於MNIST數據集訓練的時候出現了loss=nan的情況，折騰了一晚上，摸索出幾個的解決方案。

1.在loss函數某個位置添加了1e-10：

cross_entropy = tf.reduce_mean(-tf.reduce_sum(output * tf.log(prediction+1e-10), reduction_indices=[1]))

2.更換優化器

3.（最終解決辦法）對tf.nn.conv2d後的輸出進行tf.nn.relu的操作，因爲老版的tf.nn.conv2d不帶激活函數。

4. 當3行不通的時候用tf.nn.sigmoid。

5. 調整學習率，一般是調小，小概率是調大。

6. 檢查是否在最後一層全連接加了激活函數，若是，去掉

7. 多等一會，有的訓練器碧如SGD下降慢

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

詭異BUG

上傳文件一直未待識別：後臺處理異常，但是沒有返回到前端密碼不支持@字符，密碼是支持特殊字符的，但是不支持@字符（特殊字符的全角、半角格式）不同渠道修改密碼：登錄頁面修改密碼，與個人中心修改的密碼，保存的密碼不一樣，校驗規則不

xvnmeng

2020-07-08 08:23:44

error conversion from double to non-scalar type std string requested

error: conversion from ‘double’ to non-scalar type ‘std::string’ requested 源代碼： string t_strScore = m_MTaskRedis->M

寻风度陌

2020-07-07 14:48:29

error: passing ‘const std::string‘ as ‘this‘ argument of

error: passing ‘const std::string’ as ‘this’ argument of ‘std::basic_string<_CharT, _Traits, _Alloc>& std::basic_st

寻风度陌

2020-07-07 14:48:29

spark消費nc的數據 bug彙總

1.Exception in thread "main" java.lang.NoSuchMethodError: scala.collection.immutable.HashSet$.empty()Lscala/collection/

我要用代码向我喜欢的女孩表白

2020-07-06 07:10:12

本地IDEA運行sparkStreaming消費kafka出錯 Connection with localhost/127.0.0.1 disconnected

可以看到報錯第一句顯示：Connection with localhost/127.0.0.1 disconnected 但是我明明在application.yml中配置了我的Kafka Server的地址是：192.168.52.131

我要用代码向我喜欢的女孩表白

2020-07-06 07:10:12

mybatis&數據庫：BadSqlGrammarException

org.springframework.jdbc.BadSqlGrammarException: ### Error updating database. Cause: com.mysql.jdbc.exceptions.jd

life4what

2020-07-05 18:38:59

Parameter 'item' not found. Available parameters are [families, param1]

org.mybatis.spring.MyBatisSystemException: nested exception is org.apache.ibatis.binding.BindingException: Paramete

life4what

2020-07-05 18:38:59

Springboot：pom.xml出現unknown錯誤

在<properties>標籤裏添加<maven-jar-plugin.version>3.1.1</maven-jar-plugin.version>即可！ <properties> <java.version>1.8</j

life4what

2020-07-05 18:38:59

ClassNotFound：javax.xml.bind.JAXBException

https://blog.csdn.net/hadues/article/details/79188793

life4what

2020-07-05 18:38:59

sqlSession出錯；java.lang.IllegalStateException: Failed to load ApplicationContext

java.lang.IllegalStateException: Failed to load ApplicationContext at org.springframework.test.context.cache.Defau

life4what

2020-07-05 18:38:59

SpringMVC：攔截器和handlerExceptionResolver異常處理有衝突

spring.xml  <!-- <mvc:interceptors> 多個攔截器,順序執行 <mvc:interceptor> <mvc:mapping path="/**" />

life4what

2020-07-05 18:38:59

解決 error C3679: “operator”後應有一個文本後綴標識符

1>c:\program files (x86)\microsoft visual studio\2017\enterprise\vc\tools\msvc\14.16.27023\include\string(645): err

mycn027

2020-07-05 17:14:55

有關OOM問題

1、OOM類型 OOM，即OutOfMemory，內存溢出，原因是：分配的太少；用的太多；用完沒釋放。內存泄漏：內存用完沒有被釋放。大量的內存泄漏就會導致OOM，也就是內存溢出。常見的OOM情況有三種： java.lang.

゛Smlie。

2020-07-05 07:26:59

MyBatis 引號分割字符串

runProperty 參數會有多個，比如“卡班車，網點車” 傳到xml, <if test="runProperty != null and runProperty != ''"> AND RUN_PROPERTY I

゛Smlie。

2020-07-05 07:26:59

報錯信息：load: id=gralloc != hmi->id=gralloc

項目調試過程中。出現閃退。最後根據logcat日誌分析來分析去，目標鎖定到了這句話上：load: id=gralloc != hmi->id=gralloc 。根據度娘說，這個錯誤一般出現在初始化UI時，錯誤被手機給攔截了;還有種說法是

蓝天逐日者

2020-07-04 02:57:23

24小時熱門文章

【親測】神經網絡訓練時出現loss=nan或loss不變的解決辦法

leetcode 827: Making A Large Island 深度優先搜索and二維數組分塊技術（C++）

Binarized Neural Networks:Training Deep Neural Networks with Weights and Activations -1,1

【親測】神經網絡訓練時出現loss=nan或loss不變的解決辦法

論文筆記：Dorefa-Net

2019 CVPR: A main/subsidiary network framework for simplifying binary neural networks

Mac下配置sublime實現LaTeX

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結