2021 年将是“人工智能硬件年”

原創

2021-01-20 10:48

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"在本文中，我将讨论专为机器学习 \/ 人工智能应用开发的硬件，以及该领域的机遇。并简要介绍英伟达是如何在机器学习硬件领域实现近乎垄断的地位，以及为什么几乎没有人能成功挑战它。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"在过去的 10 年中，专用于机器学习应用的硬件研究迅猛发展，硬件与机器学习栈的每个部分都有关系。这种硬件可加速训练和推理，减少延迟时间，降低功耗，并降低这些设备的零售成本。当前通用的机器学习硬件解决方案是英伟达 GPU，这使得英伟达在市场上占据主导地位，并使其估值超越英特尔。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"随着前景广阔的研究不断涌现，英伟达继续通过出售 GPU 和它的专有 CUDA 工具箱来主导这个领域。不过，我认为有四个因素将挑战英伟达的统治地位，并且最快今年，也肯定会在 2~3 年内改变机器学习硬件的格局。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"numberedlist","attrs":{"start":1,"normalizeStart":1},"content":[{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":1,"align":null,"origin":null},"content":[{"type":"text","text":"这个领域的学术研究正在成为主流。"}]}]},{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":2,"align":null,"origin":null},"content":[{"type":"text","text":"摩尔定律已死。随着它的消亡，“技术和市场力量正在将计算推向相反的方向，使得计算机处理器不再是通用的，而是更加专业化的。”（"},{"type":"link","attrs":{"href":"https:\/\/poseidon01.ssrn.com\/delivery.php?ID=211117027007028109012099007123091067026021000060079050028086075010069007025112025105058055039060103003114025068072124026100029114044064023023011030000001096118000084057073052125100086112090110071018005011108079091010104083101111125088093082073127085122&EXT=pdf&INDEX=TRUE","title":"","type":null},"content":[{"type":"text","text":"出处"}]},{"type":"text","text":"）"}]}]},{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":3,"align":null,"origin":null},"content":[{"type":"text","text":"投资人和创始人都认识到，人工智能不仅能开辟新的领域，而且能增加他们的预算。"}]}]},{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":4,"align":null,"origin":null},"content":[{"type":"text","text":"人工智能产生的碳排放量过高，而且越来越高。我们需要让计算更加节能。"}]}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"背景"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"下面是典型的机器学习管道的样子："}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https:\/\/static001.infoq.cn\/resource\/image\/85\/d4\/854b80586b989708b78ed03bab9173d4.jpg","alt":null,"title":"","style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":"","fromPaste":false,"pastePass":false}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"对于大多数数据科学工作流而言，在训练和部署大型模型之前，通用芯片，如 CPU，就已经足够了。GPU 在“深度学习”（涉及视觉和自然语言处理等任务的神经网络体系结构）中几乎总是必不可少的。为深度学习提供 GPU 工作站的 Lambda Labs 公司估计，包括英伟达的顶级 GPU 集群在内，"},{"type":"link","attrs":{"href":"https:\/\/lambdalabs.com\/blog\/demystifying-gpt-3\/","title":"","type":null},"content":[{"type":"text","text":"训练 GPT-3 的费用大约为 460 万美元"}]},{"type":"text","text":"。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"使用 GPU 的主要优点是，与传统 CPU 相比， GPU 可以并行地执行计算，数据吞吐量更高。计算过程中，机器学习的核心计算部分是矩阵乘法，并行运行时能大大提高运算速度。专有的英伟达"},{"type":"link","attrs":{"href":"https:\/\/developer.net\/cuda-toolkit","title":"","type":null},"content":[{"type":"text","text":"CUDA"}]},{"type":"text","text":"提供了 API 和工具，以便开发者可以利用这种并行化。像 TensorFlow 和 PyTorch 这样的流行库将其抽象出来，其中一行代码会自动检测 GPU，然后利用 CUDA 后端。若要设计一种新的算法或库，需要利用并行计算的优势，CUDA 提供的工具会使这一工作更加简单。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"上世纪 90 年代初，英伟达作为一家视频游戏公司起家，希望能提供能快速绘制 3D 图像的图像芯片。它在这一业务上取得了成功，在与另一家显卡制造商 AMD 的不断交锋中，始终如一地制造出一些最强大的 GPU。巧合的是，同样的图形硬件竟然成了深度学习腾飞不可或缺的因素。CUDA 让英伟达比其他 GPU 更有优势。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https:\/\/static001.infoq.cn\/resource\/image\/b5\/0e\/b53cf94e674d83a4820f1a7603b0850e.jpg","alt":null,"title":"","style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":"","fromPaste":false,"pastePass":false}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"2006 年，英伟达发布了第一个 CUDA 工具包，它提供了一个 API，可以让使用 GPU 变得非常简单。3 年后，2009 年，斯坦福大学人工智能教授吴恩达及其合作者发表了一篇题为《"},{"type":"link","attrs":{"href":"http:\/\/robotics.stanford.edu\/~ang\/papers\/icml09-LargeScaleUnsupervisedDeepLearningGPU.pdf","title":"","type":null},"content":[{"type":"text","text":"使用图形处理器的大规模无监督式深度学习"}]},{"type":"text","text":"》（"},{"type":"text","marks":[{"type":"italic"}],"text":"Large-scale Deep Unsupervised Learning using Graphics Processors"},{"type":"text","text":"）的论文，指出如果 GPU 用于训练，大规模的深度学习就有可能实现。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"一年后，吴恩达和斯坦福大学的另一位教授，Google X 的共同创始人，Sebastian Thrun，向拉里·佩奇提出了在谷歌成立深度学习研究团队的想法，该团队后来成为 Google Brain。伴随着 Google Brain 的崛起和“"},{"type":"link","attrs":{"href":"https:\/\/qz.com\/1034972\/the-data-that-changed-the-direction-of-ai-research-and-possibly-the-world\/","title":"","type":null},"content":[{"type":"text","text":"Imagenet 时刻"}]},{"type":"text","text":"”的到来，英伟达的 GPU 已经成为人工智能 \/ 机器学习行业事实上的计算标准。如需更多信息，请参阅这篇文章《"},{"type":"link","attrs":{"href":"https:\/\/www.forbes.com\/sites\/aarontilley\/2016\/11\/30\/nvidia-deep-learning-ai-intel\/?sh=6a1602d27ff1","title":"","type":null},"content":[{"type":"text","text":"新的英特尔：英伟达如何从驱动视频游戏到革新人工智能"}]},{"type":"text","text":"》（"},{"type":"text","marks":[{"type":"italic"}],"text":"The New Intel: How Nvidia Went From Powering Video Games To Revolutionizing Artificial Intelligence"},{"type":"text","text":"）。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"heading","attrs":{"align":null,"level":3},"content":[{"type":"text","text":"概述：现状"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"bulletedlist","content":[{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"英伟达凭借其 GPU 在深度学习硬件领域占据主导地位，这在很大程度上要归功于 CUDA。据"},{"type":"link","attrs":{"href":"https:\/\/www.forbes.com\/sites\/paulteich\/2019\/06\/17\/nvidia-dominates-the-market-for-cloud-ai-accelerators-more-than-you-think\/?sh=30a782ac5edb","title":"","type":null},"content":[{"type":"text","text":"福布斯报道"}]},{"type":"text","text":"，“2019 年 5 月，前四大云计算供应商在 97.4% 的基础设施即服务（IaaS）计算实例类型中部署了英伟达 GPU，并配备了专用加速器”。面对"},{"type":"link","attrs":{"href":"https:\/\/www.datacenterknowledge.com\/deals\/nvidia-7-billion-what-it-takes-dominate-ai-hardware","title":"","type":null},"content":[{"type":"text","text":"竞争"}]},{"type":"text","text":"，它也"},{"type":"link","attrs":{"href":"https:\/\/nvidianews.nvidia.com\/news\/nvidia-to-acquire-arm-for-40-billion-creating-worlds-premier-computing-company-for-the-age-of-ai\/","title":"","type":null},"content":[{"type":"text","text":"没有坐以待毙"}]},{"type":"text","text":"。"}]}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"bulletedlist","content":[{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"谷歌早在 2015 年就开发了专门为神经网络开发的人工智能加速器芯片 TPU。在其作为特定领域加速器的狭义用例中，TPU 比 GPU 更快，也更便宜，但在谷歌的 GCP 生态系统中，TPU 被隔离起来，仅有 TensorFlow 和 PyTorch 支持（其他库需要自己编写 TPU 编译器）。"}]}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"bulletedlist","content":[{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"AWS 正在对自己的芯片下赌注，尤其是机器学习。到目前为止，AWS Inferentia 芯片"},{"type":"link","attrs":{"href":"https:\/\/arstechnica.com\/gadgets\/2020\/11\/amazon-begins-shifting-alexas-cloud-ai-to-its-own-silicon\/","title":"","type":null},"content":[{"type":"text","text":"似乎是最成功的"}]},{"type":"text","text":"。这在很大程度上取决于开发者从 CUDA 切换到亚马逊 Inferentia 和其他芯片的工具包的难易程度。"}]}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"bulletedlist","content":[{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"2019 年 12 月，英特尔以 20 亿美元的价格收购了 Habana Labs，这是一家以色列公司，为训练和推理工作负载制造芯片和硬件加速器。英特尔的投资似乎得到了回报，上个月，"},{"type":"link","attrs":{"href":"https:\/\/habana.ai\/habana-gaudi-ai-processors-to-bring-lower-cost-to-train-to-amazon-ec2-customers\/","title":"","type":null},"content":[{"type":"text","text":"AWS 宣布"}]},{"type":"text","text":"将提供运行 Habana 芯片的新 EC2 实例，“与当前基于 GPU 的 EC2 实例相比，为机器学习工作负载提供高达 40% 的价格性能”。英特尔还推出了新的 Xeon CPU 系列，它认为可与英伟达的 GPU 竞争。"}]}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"bulletedlist","content":[{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"Xilinx 是一家发明 FPGA 的上市公司，最近又涉足人工智能加速器芯片领域，2020 年 10 月被 AMD 收购。"}]}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"bulletedlist","content":[{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"对人工智能计算能力的需求正在加速。"}]}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"变化与机遇"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"正如我在上面提到的，我的设想是，到 2021 年及以后，英伟达的主导地位将会受到越来越多的挑战和侵蚀。造成这种情况的原因有四个："}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"heading","attrs":{"align":null,"level":3},"content":[{"type":"text","text":"1. 学术研究变成真正的产品"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"学术界和工业界研究人员创立的一些初创公司已经开始研究机器学习专用硬件，而且还有更多的开发空间。在这个领域发表的论文并不只是提出理论上的保证，它还展示了真正的硬件原型，这些原型实现了比商业可用选项更好的指标。（"},{"type":"link","attrs":{"href":"https:\/\/eyeriss.mit.edu\/","title":"","type":null},"content":[{"type":"text","text":"实例 1"}]},{"type":"text","text":"、"},{"type":"link","attrs":{"href":"https:\/\/news.mit.edu\/2020\/thousands-artificial-brain-synapses-single-chip-0608","title":"","type":null},"content":[{"type":"text","text":"实例 2"}]},{"type":"text","text":"和"},{"type":"link","attrs":{"href":"https:\/\/ieeexplore.ieee.org\/document\/8416814","title":"","type":null},"content":[{"type":"text","text":"实例 3"}]},{"type":"text","text":"）"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"芯片和硬件加速器的种类很多，每一种都有其蓬勃发展的研究社区。简单地列举一些："}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"bulletedlist","content":[{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"专用集成电路（ASIC）。谷歌 TPU 和 AWS Inferentia 都是 ASIC 的例子。ASIC 产品的研发和生产成本可能高达 5000 万美元，但是复制产品的边际成本通常很低。ASIC 可以被设计成低功耗的，而且不会对性能有太大的影响。"}]}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"bulletedlist","content":[{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"现场可编程逻辑门阵列（FPGA）。FPGA 对于高频交易者来说已稀松平常，但在机器学习方面的例子包括微软的 Brainwave 和英特尔的 Arria。单个 FPGA 的生产成本较低，但多个 FPGA 的"},{"type":"link","attrs":{"href":"https:\/\/resources.pcb.cadence.com\/blog\/2019-fpga-vs-asic-differences-and-choosing-best-for-your-business","title":"","type":null},"content":[{"type":"text","text":"生产边际成本要高于 ASIC"}]},{"type":"text","text":"。"}]}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"bulletedlist","content":[{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"神经形态计算。该领域试图对人脑的生物结构进行建模，并将其转换成硬件。尽管神经形态学的思想可以追溯到 20 世纪 80 年代，但该领域仍处于起步阶段。在《自然》上有一篇很好的"},{"type":"link","attrs":{"href":"https:\/\/www.nature.com\/articles\/s41586-019-1677-2","title":"","type":null},"content":[{"type":"text","text":"综述性论文"}]},{"type":"text","text":"。"}]}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"bulletedlist","content":[{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"更多内容请参阅此项调查报告《"},{"type":"link","attrs":{"href":"https:\/\/arxiv.org\/pdf\/2009.00993.pdf","title":"","type":null},"content":[{"type":"text","text":"机器学习加速芯片综述"}]},{"type":"text","text":"》（"},{"type":"text","marks":[{"type":"italic"}],"text":"Survey of Machine Learning Accelerators"},{"type":"text","text":"），并关注"},{"type":"link","attrs":{"href":"https:\/\/www.iscas2020.org\/","title":"","type":null},"content":[{"type":"text","text":"ISCAS"}]},{"type":"text","text":"。"}]}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"使用上述研究结果的一些有前途的初创公司："}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"bulletedlist","content":[{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"Blaize 于 2019 年"},{"type":"link","attrs":{"href":"https:\/\/www.blaize.com\/products\/","title":"","type":null},"content":[{"type":"text","text":"宣称"}]},{"type":"text","text":"已经开发出一种完全可编程的低功耗处理器，可实现 10 倍的低延迟，并且“系统效率最高可提高 60%”。"}]}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"bulletedlist","content":[{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"SambaNova Systems 是由斯坦福大学教授和甲骨文前高管创立的初创公司，由谷歌风投和英特尔资本出资组建。它"},{"type":"link","attrs":{"href":"https:\/\/sambanova.ai\/press\/sambanova-systems-ushers-in-new-era-of-computing-with-availability-of-sambanova-datascale-built-for-ai\/","title":"","type":null},"content":[{"type":"text","text":"刚刚宣布"}]},{"type":"text","text":"了一项新产品，该产品是一个“完整、集成的软件和硬件系统平台，可以对从算法到芯片的数据流进行优化”。"}]}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"bulletedlist","content":[{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"Graphcore 是一家英国初创公司，由红杉、微软、宝马和 DeepMinds 创始人领投。"}]}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"heading","attrs":{"align":null,"level":3},"content":[{"type":"text","text":"2. 摩尔定律已死，但无论如何，专用硬件都是未来趋势"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https:\/\/static001.infoq.cn\/resource\/image\/e4\/35\/e41604c155dcab2ecb7bd54641f95535.jpg","alt":null,"title":"","style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":"","fromPaste":false,"pastePass":false}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"摩尔定律预测，集成电路上的晶体管数量每两年就会增加一倍。自 20 世纪 70 年代以来，这在经验上一直是正确的，并且是我们从那时起所看到的技术进步的代名词：个人计算革命、传感器和摄像头的改进、移动设备的兴起，以及为人工智能提供充足资源的崛起，凡是你能想到的一切。唯一的问题是，摩尔定律即将结束，如果它还没有结束的话。“缩小芯片的难度越来越大，这已经不是什么秘密了，而且这样做的好处也今非昔比了。去年，英伟达的创始人黄仁勋直言不讳地认为，‘摩尔定律已不再可能了’。”《"},{"type":"link","attrs":{"href":"https:\/\/www.economist.com\/technology-quarterly\/2020\/06\/11\/the-cost-of-training-machines-is-becoming-a-problem","title":"","type":null},"content":[{"type":"text","text":"经济学人"}]},{"type":"text","text":"》（The Economist）写道。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"麻省理工学院经济学家 Neil Thompson 在《"},{"type":"link","attrs":{"href":"https:\/\/www.technologyreview.com\/2020\/02\/24\/905789\/were-not-prepared-for-the-end-of-moores-law\/","title":"","type":null},"content":[{"type":"text","text":"麻省理工科技评论"}]},{"type":"text","text":"》（MIT Technology Review）上解释说：“软件和专业架构方面的进步现在将开始有选择地针对特定的问题和商业机会，对那些有充足资金和资源的人有利，而不是像摩尔定律那样‘水涨船高’，通过提供速度更快、成本更低的芯片来普及。”一些人，包括 Thomspon 在内的，都"},{"type":"link","attrs":{"href":"https:\/\/papers.ssrn.com\/sol3\/papers.cfm?abstract_id=3287769","title":"","type":null},"content":[{"type":"text","text":"认为"}]},{"type":"text","text":"，“这是一个消极的发展，因为计算硬件将开始分裂为“‘快车道’应用和‘慢车道’应用程序，前者使用功能强大的定制芯片，而后者则被卡在使用通用芯片上，而且其进展缓慢。”"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"对于这个问题，分布式计算常常是一种解决方案：让我们使用功能更少、成本更低的资源，但要使用大量的资源。但是，就连这种方案也越来越昂贵（更别提分布式梯度下降算法的复杂性了）。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"那么，接下来会发生什么呢？2018 年，CMU 的研究人员在《自然》上发表了一篇论文，题为《"},{"type":"link","attrs":{"href":"https:\/\/www.nature.com\/articles\/s41928-017-0005-9","title":"","type":null},"content":[{"type":"text","text":"摩尔定律末期的科学研究政策"}]},{"type":"text","text":"》（"},{"type":"text","marks":[{"type":"italic"}],"text":"Science and research policy at the end of Moore’s law"},{"type":"text","text":"），该论文指出，私营部门将重点放在短期盈利上，这使得摩尔定律很难找到通用的继承者。他们呼吁公私合作，共同创造计算硬件的未来。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https:\/\/static001.infoq.cn\/resource\/image\/f7\/0a\/f775a0548617e62d45da26163852ae0a.jpg","alt":null,"title":"","style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":"","fromPaste":false,"pastePass":false}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"虽然我并不反对公私合作（给予他们更多的权利），但我认为未来的计算硬件将是专用芯片的集合，当它们协同工作时，它们比现在的 CPU 更能胜任通用任务。我相信"},{"type":"link","attrs":{"href":"https:\/\/www.apple.com\/newsroom\/2020\/06\/apple-announces-mac-transition-to-apple-silicon\/","title":"","type":null},"content":[{"type":"text","text":"苹果向自己的芯片过渡"}]},{"type":"text","text":"是朝着这个方向迈出的一步，这证明了软硬件集成系统将优于传统芯片。特斯拉也在自动驾驶中采用了"},{"type":"link","attrs":{"href":"https:\/\/www.theverge.com\/2019\/4\/22\/18511594\/tesla-new-self-driving-chip-is-here-and-this-is-your-best-look-yet","title":"","type":null},"content":[{"type":"text","text":"自己的硬件"}]},{"type":"text","text":"。我们需要的是大量的新玩家涌入硬件生态系统，这样专业芯片的好处就可以实现大众化，并分布在昂贵的笔记本电脑、云服务器和汽车之外。（我敢说……是时候打造了吗？）"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"heading","attrs":{"align":null,"level":3},"content":[{"type":"text","text":"3. 创始人和投资者担心成本上涨"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"Andreessen Horowitz 的 Martin Casado 和 Matt Bornstein 在去年年初发表了一篇题为《"},{"type":"link","attrs":{"href":"https:\/\/a16z.com\/2020\/02\/16\/the-new-business-of-ai-and-how-its-different-from-traditional-software\/","title":"","type":null},"content":[{"type":"text","text":"人工智能的新业务（及其与传统软件的区别"}]},{"type":"text","text":"》（"},{"type":"text","marks":[{"type":"italic"}],"text":"The New Business of AI (and How It’s Different From Traditional Software)"},{"type":"text","text":"）的文章，他们认为人工智能的业务与传统软件是不同的。说到底，一切都与利润有关。“云计算基础设施对人工智能公司来说是一个巨大的成本，有时甚至是隐性成本”。正如我所提到的那样，训练人工智能模型可能需要花费数千美元（如果你是 OpenAI，你就得花数百万美元），但成本并不止于这些。人工智能系统必须得到持续监控和改进。如果你的模型是“离线”训练的，那么它很容易出现概念漂移，即现实世界中的数据分布随着时间的推移与你训练的数据发生变化。这种情况可能是自然发生的，也可能是对抗性的，比如当用户试图欺骗信用风险算法时。出现这种情况时，就必须对模型进行再训练。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"对于降低概念漂移和创建与现有模型具有相同性能保证的更小的模型有一些积极的研究，但这是另一篇文章的主题。同时，该行业也正在推进更大的模型和更大的计算支出。更便宜、更专业的人工智能芯片无疑会降低这些成本。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"heading","attrs":{"align":null,"level":3},"content":[{"type":"text","text":"4. 训练大型模型有助于气候变化"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https:\/\/static001.infoq.cn\/resource\/image\/6e\/97\/6ec0c85f75b5ee173c078de7f02e6997.jpg","alt":null,"title":"","style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":"","fromPaste":false,"pastePass":false}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"由马萨诸塞大学阿默斯特分校进行的"},{"type":"link","attrs":{"href":"https:\/\/arxiv.org\/pdf\/1906.02243.pdf","title":"","type":null},"content":[{"type":"text","text":"一项研究"}]},{"type":"text","text":"发现，训练一个现成的自然语言处理模型所产生的碳排放量相当于从旧金山飞往纽约的一次航班。在三大云计算供应商中，只有谷歌的数据中心超过 50% 的能源来自可再生能源。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"但我认为，我不必列出我们为什么要减少人工智能的碳排放。我想说的是，现有的芯片耗电量过大，而且研究表明，其他类型的硬件加速器，如 FPGA 和超低能耗芯片（如谷歌 TPU Edge），对于机器学习和其他任务来说，"},{"type":"link","attrs":{"href":"https:\/\/arxiv.org\/pdf\/1906.11879.pdf","title":"","type":null},"content":[{"type":"text","text":"可以更加节能"}]},{"type":"text","text":"。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"即使是地理也会影响到人工智能的碳排放。"},{"type":"link","attrs":{"href":"https:\/\/hai.stanford.edu\/blog\/ais-carbon-footprint-problem","title":"","type":null},"content":[{"type":"text","text":"斯坦福大学的研究人员估计"}]},{"type":"text","text":"，“在主要依赖页岩油的爱沙尼亚举行一次会议，其产生的碳排放量是在魁北克举行的会议的 30 倍，而魁北克主要依靠水力发电。”"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"已露端倪"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"我已经提到了人工智能的硬件，但是人工智能的硬件怎么样？谷歌最近"},{"type":"link","attrs":{"href":"https:\/\/patents.google.com\/patent\/US20200279163A1\/en","title":"","type":null},"content":[{"type":"text","text":"申请了一项专利"}]},{"type":"text","text":"，该专利是关于一种利用强化学习来确定跨多个硬件设备的机器学习模型操作的位置的方法。这项专利背后的研究人员之一是"},{"type":"link","attrs":{"href":"https:\/\/www.technologyreview.com\/innovator\/azalia-mirhoseini\/","title":"","type":null},"content":[{"type":"text","text":"Azalea Mirhoseini"}]},{"type":"text","text":"，她在 Google Brain 负责机器学习硬件 \/ 系统的登月计划。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"strong"}],"text":"作者介绍："}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"Andrei Kozyrev，康奈尔大学攻读计算机科学与政治学。研究机器学习中的公平性、隐私性和可解释性。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"strong"}],"text":"原文链接："}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"https:\/\/fairlydeep.substack.com\/p\/2021-will-be-the-year-of-ai-hardware"}]}]}

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

相關文章

Stable Diffusion中的embedding

Stable Diffusion中的embedding 嵌入，也稱爲文本反轉，是在 Stable Diffusion 中控制圖像樣式的另一種方法。在這篇文章中，我們將學習什麼是嵌入，在哪裏可以找到它們，以及如何使用它們。什麼是嵌入embe

2024-04-25 21:31:13

AI从入门到入门之手写数字识别模型java方式Dense全连接神经网络实现

前言：授人以魚不如授人以漁.先學會用，在學原理，在學創造，可能一輩子用不到這種能力，但是不能不具備這種能力。這篇文章主要是介紹算法入門Helloword之手寫圖片識別模型java中如何實現以及部分解釋。目前大家對於人工智能-機器學習-神經網

2024-04-19 23:17:21

Pinecone: 大模型时代的智能索引与搜索解决方案

隨着人工智能技術的飛速發展，大模型（Large Models）已成爲衆多領域的重要工具。無論是自然語言處理、圖像識別還是其他複雜任務，大模型都展現出了強大的性能。然而，隨着模型規模的不斷擴大，數據量的激增，如何有效地管理、索引和搜索這些模型

2024-04-19 11:29:43

软件测试从自动化到智能化，大模型开始加入

隨着科技的飛速發展，軟件行業也在不斷地演進和創新。作爲軟件行業的關鍵環節之一，軟件測試行業也在經歷着前所未有的變革。從最初的手動測試，到自動化測試，再到如今的智能化測試，軟件測試行業正在經歷一場深刻的技術革命。在這場革命中，Testin雲測

2024-04-19 00:53:25

裁员了！别错过2024年大数据工程师必备的10项技能

在當今快速發展的世界中，數據被視爲新的石油。隨着對數據驅動洞察的日益依賴，大數據工程師的角色比以往任何時候都更爲關鍵。這些專業人員在管理和優化組織內的數據操作中扮演着至關重要的角色。在本文中，我們將探索2024年大數據工程師必須具備的十

2024-04-16 11:00:53

DevOps已死？2024年的DevOps将如何发展

隨着我們進入2024年，DevOps也隨之發生變化。新興的技術、變化的需求和發展的方法正在重新定義有效實施DevOps實踐。 IDC預測顯示，未來五年，支持DevOps實踐的產品市場繼續保持健康且快速增長，2022年-2027年的複合年增長

2024-04-08 12:51:44

从模型到部署，教你如何用Python构建机器学习API服务

本文分享自華爲雲社區《Python構建機器學習API服務從模型到部署的完整指南》，作者：檸檬味擁抱。在當今數據驅動的世界中，機器學習模型在解決各種問題中扮演着重要角色。然而，將這些模型應用到實際問題中並與其他系統集成，往往需要構建API

2024-04-08 10:33:17

测试左移已经开始影响DevOps的发展？

在軟件開發的早期，該過程通常是開發人員編寫代碼，再將其交給質量保證（QA）進行測試。這種瀑布開發方法可能會導致質量問題和延遲，因爲問題是在週期後期發現的。一、瞭解DevOps和測試左移 DevOps是Development和Operati

2024-04-07 12:48:37

黑盒Prompt优化：提升大模型反馈效果的新思路

隨着人工智能技術的快速發展，大模型在各種應用場景中發揮着越來越重要的作用。然而，如何提升大模型的反饋效果，使其更加準確、高效地爲用戶提供服務，一直是研究者和開發者關注的焦點。本文提出了一種新的思路——黑盒Prompt優化，旨在通過改進輸入提

2024-03-29 00:01:17

分布式数据库技术的演进和发展方向

這些年大家都在談分佈式數據庫，各大企業也紛紛開始做數據庫的分佈式改造。那麼，所謂的分佈式數據庫到底是什麼？採用什麼架構？優勢在哪？爲什麼越來越多企業選擇它？分佈式數據庫技術會向什麼方向發展？帶着這些疑問，一探究竟吧！參與文末的話題互動

2024-03-26 11:34:43

利用RAG技术打破大模型幻觉

隨着人工智能技術的不斷進步，大模型在各個領域中發揮着越來越重要的作用。然而，大模型幻覺問題一直是制約其進一步發展的瓶頸。爲了解決這一問題，研究者們不斷探索新的技術和方法。近年來，一種名爲RAG（檢索增強生成）的技術備受關注，它通過結合知識圖

2024-03-21 00:28:34

与 NVIDIA 再次合作、深度参与 GTC，Zilliz 与全球顶尖开发者共迎 AI 变革时刻！

Zilliz 與全球的頂尖開發者齊聚 GTC 2024。近日，備受關注的 NVIDIA GTC 2024 已拉開序幕，來自世界各地的頂尖 AI 開發者齊聚美國加州聖何塞會議中心，共同探索行業未來。作爲去年被 NVIDIA CEO 黃仁

2024-03-19 21:26:53

多模态+大模型会带来哪些“化学反应”？

導語：沒人懷疑，2024 年，AI 依然將是科技界的主角。上個月，OpenAI 推出了可以生成 60 秒高清視頻的視頻生成模型 Sora，掀起了對多模態模型的進一輪討論。多模態大模型技術的最新進展如何？這一波新技術，對於行業和消費者的體驗會

2024-03-15 13:45:01

妇女节：打开 AI 视界，成就“她力量”

根據國內招聘平臺獵聘發佈的《2024 女性人才數據洞察報告》，從 2023 年 3 月到 2024 年 2 月，女性在 AIGC 領域的求職人次同比增長了 190.49%。隨着人工智能時代的降臨，女性正以前所未有的姿態，在技術的助力下，蛻變

2024-03-09 01:06:57

AI安全白皮书 | “深度伪造”产业链调查以及四类防御措施

以下內容，摘編自頂象防禦雲業務安全情報中心正在製作的《“深度僞造”視頻識別與防禦白皮書》，對“深度僞造”感興趣的網友，可前往頂象留言，在該白皮書完成後，會爲您免費寄送一份電子版。 “深度僞造”就是創建高度逼真的虛假視頻或虛假錄音，然

2024-03-08 00:45:22

24小時熱門文章

最新文章

2021 年將是“人工智能硬件年”

最新評論文章