关于神经网络的输出神经元个数的思考

原創

陈知鱼

2019-07-30 13:33

博主对于神经网络的输出神经元个数的问题，起源于“识别手写数字的神经网络为什么需要10个输出而不是四个？”.

实际上，这是两种不同的编码方式，两种的网络架构都是可行的，但是我们选择十个神经元而不是四个神经元来表达各类别，是因为这是经验上的选择，从效果来说，输出为十个的效果更好。

具体理由如下：

如果输出为四个，那么输出层的每个神经元需要学习的是“1和2的手写体之间的区别”之类的断言；

如果输出为十个，那么输出层的每个神经元需要学习的只是“判断一幅图片是不是1”这样的断言。

而描述一个图片是不是某个数字比描述两个数字之间的区别容易的多。

（问题来自Neural networks and deep learning）

You might wonder why we use 10 output neurons. After all, the goal of the network is to tell us which digit (0,1,2,…,9) corresponds to the input image. A seemingly natural way of doing that is to use just 44 output neurons, treating each neuron as taking on a binary value, depending on whether the neuron's output is closer to 0 or to 1. Four neurons are enough to encode the answer, since 24=16 is more than the 10 possible values for the input digit. Why should our network use 10 neurons instead? Isn't that inefficient? The ultimate justification is empirical: we can try out both network designs, and it turns out that, for this particular problem, the network with 1010output neurons learns to recognize digits better than the network with 4 output neurons. But that leaves us wonderingwhyusing 1010output neurons works better. Is there some heuristic that would tell us in advance that we should use the 10-output encoding instead of the 4-output encoding?

……

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

关于神经网络的输出神经元个数的思考

使用c#强大的表达式树实现对象的深克隆之解决循环引用的问题

free AI online tools All In One

痞子衡嵌入式：恩智浦i.MX RT1xxx系列MCU启动那些事（12.A）- uSDHC eMMC启动时间(RT1170)

linux安装cuda和cudnn

Mellanox网卡开启SR-IOV

模拟手机设备：使用 Playwright 实现移动端自动化测试

HTML 00 Tutorial

全面系统的AI学习路径，帮助普通人也能玩转AI

从零开始：使用 Playwright 脚本录制实现自动化测试

uni-app实现上拉加载

電腦硬件基礎知識及購買指南

Ubuntu下nvidia驅動安裝方式

樹莓派之人臉檢測

常見開源腦影像分析產品

各種池化的實現

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結