解讀2020:聊聊芯片技術趨勢

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"今年我們看到很多芯片領域出現的“新現象”,也有不少會讓你“陷入沉思”的時刻。對我個人來說,蘋果M1可能是衝擊最大的一個。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"我從04年開始做手機SoC,08年開始接觸ARM,斷斷續續做到16年。在蘋果M1之前,我從來沒想過基於ARM的架構可能在性能上超越X86。當然,M1並不是傳統意義上的CPU,而是一顆SoC,所以直接的比較也許並不完全公平。但如果從應用程序的角度來看,運行速度加快,功耗降低,都是實實在在的。對M1的分析很多,細看下來,大部分架構優化的思路和獲得的結果也都是非常自然的(可能也有一些我們不知道的蘋果黑科技)。但爲什麼蘋果之前沒人這麼做,或者說做不到呢?"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"問題的關鍵在於"},{"type":"text","marks":[{"type":"strong"}],"text":"蘋果掌握了完整生態和供應鏈,在有充分的技術積累和豐厚利潤之後,可以做更多的“縱向優化”"},{"type":"text","text":"。這個事件本身還不只是技術上的勝利,未來會給整個產業帶來深遠的影響。當然,從這句話的限定詞來看,能成功複製者可能寥寥無幾。後面的新聞說微軟也打算給Surface自研芯片來看,不知是不是有戲。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"之前的文章對芯片架構和技術趨勢有不少討論,“通用”到“專用”("},{"type":"link","attrs":{"href":"http:\/\/mp.weixin.qq.com\/s?__biz=MzI3MDQ2MjA3OA==&mid=2247484962&idx=1&sn=c9fb09a3fa174146a8a6b5f4ce1ddd0a&chksm=ead1fb33dda67225a40d47cfd4e993be3c312d1e129225193ade44585f75d8c961488565f7af&scene=21#wechat_redirect","title":"","type":null},"content":[{"type":"text","text":"後通用芯片時代:專用芯片興起背後的經濟學"}]},{"type":"text","text":"),“同構”到“異構”,“先進工藝”到“先進封裝”("},{"type":"link","attrs":{"href":"http:\/\/mp.weixin.qq.com\/s?__biz=MzI3MDQ2MjA3OA==&mid=2247484890&idx=1&sn=a9464f059dab39c0f2d77c086ccccd66&chksm=ead1f8cbdda671ddd48789e1636839b930a7466004faa61119c5cfb76bb43f1efa667558f29c&scene=21#wechat_redirect","title":"","type":null},"content":[{"type":"text","text":"從AI Chip到AI Chiplet"}]},{"type":"text","text":"),“分而治之”到“跨邊界優化”("},{"type":"link","attrs":{"href":"http:\/\/mp.weixin.qq.com\/s?__biz=MzI3MDQ2MjA3OA==&mid=2247485406&idx=1&sn=2b737c212e4e6cc25efd36c0c434bfce&chksm=ead1facfdda673d9fd9ff830023aec85dbd8191e2421b1e6a7b0f398c50ecc714af4b9977de4&scene=21#wechat_redirect","title":"","type":null},"content":[{"type":"text","text":"AI芯片技術發展"}]},{"type":"text","text":")。爲什麼會有這些趨勢,簡單說就是被逼的,一方面摩爾定律放緩和Dark Silicon問題,另一方面特定算力需求激增。一句話,要求高了,只有精耕細作了。但問題是,即使這些趨勢大家都認可,具體該怎麼做呢?真沒那麼簡單。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"先說說異構和專用的問題。我們可以看看已經“精耕細作”了很多年的手機芯片。手機SoC是非常典型的異構系統,包括大量的專用處理器,發展到今天,已經極爲複雜。下圖是我隨便找的一個高通手機芯片的框圖,大家可以看一看,除了CPU,GPU,各種DSP(“"},{"type":"link","attrs":{"href":"http:\/\/mp.weixin.qq.com\/s?__biz=MzI3MDQ2MjA3OA==&mid=2247483910&idx=1&sn=f03a311b55a5517336a65a6457314a84&chksm=ead1ff17dda67601cfdc86b269311ce8ba6a1528e7ec8124a0f44e1c5984903d19cbb9067fc4&scene=21#wechat_redirect","title":"","type":null},"content":[{"type":"text","text":"高通的Hexagon DSP"}]},{"type":"text","text":"”)和硬件加速器,可以說把異構做到極致了。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https:\/\/static001.geekbang.org\/infoq\/5a\/5aeaef4230f91b6e6eb419529c4ad473.png","alt":null,"title":null,"style":null,"href":null,"fromPaste":true,"pastePass":true}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"source: anandtech"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"爲什麼手機的主處理器在智能手機剛出現的時候就SoC的形式?也是被手機尺寸,功耗,成本這些限制給逼的,不可能照搬桌面PC獨立CPU的架構。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"這種架構的優勢是將整個系統都放在一顆芯片上,而不僅僅是通用的CPU。其中很多硬件模塊都是根據特定功能需求定製的,比如通信基帶,ISP,Connectivity,多媒體,語音,定位,可能是基於DSP,也可以是更專用的硬件加速器。這其實也就是我們今天常說的Domain-Specific的概念(可以參考“"},{"type":"link","attrs":{"href":"http:\/\/mp.weixin.qq.com\/s?__biz=MzI3MDQ2MjA3OA==&mid=2247484980&idx=1&sn=0d1b15f17b1b02bda1bf4d81fd6a01ad&chksm=ead1fb25dda672334c3d72a6d958b2a2b3202c2c5dc9d221df6367ef4dec1023e5f079ae5742&scene=21#wechat_redirect","title":"","type":null},"content":[{"type":"text","text":"專用處理器設計"}]},{"type":"text","text":"”)。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"另一方面,模塊間的通信是在片上解決,效率要高很多。這種SoC\/DSA架構的優勢是硬件PPA可以極致優化;缺點除了架構複雜度和不同類型硬件的優化之外,異構編程和軟件優化也會比較困難。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"先不看蘋果,由於安卓系統的出現,獨立的手機芯片廠商可以把精力集中在底層,反正這些DSP和硬件加速器一般也不開放,都是芯片廠商自己來優化相關軟件(常說的firmware)。這樣的架構和生態已經發展了十幾年,現存已經不多的獨立手機芯片廠商已經很有經驗。即便如此,芯片廠商提供的一些新算法的加速庫往往還不能令人滿意,也從側面說明了異構專用架構軟件優化的難度。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"再看蘋果,其技術上的優勢,很大程度得益於在“封閉”的系統(操作系統也是自己的),有可能實現“縱向優化”。在技術上的優勢轉化爲巨大的利潤之後,蘋果的芯片就進入良性循環。實際上,在蘋果開始自研芯片之後,可以說整個半導體供應鏈,從Foundry到EDA\/IP都是按照蘋果新機發布的節奏在進步。積累到今天,蘋果已經具備了挑戰傳統桌面系統架構的能力。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"對於傳統的桌面和數據中心處理器,簡單的說“不思進取”也不是很公平,CPU本身還是芯片技術的高峯。在它能做到的範圍內(主要是硬件),工藝演進,封裝演進,架構優化,硬件性能繼續提升還是可以預期的。但它多年形成的生態特徵(包括歷史包袱)和蘋果(還包括其他一些巨頭)這樣的“封閉”生態大不相同,蘋果們能做到的事情它還真做不到。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"討論到這裏,我們也不難看出,未來在技術上的優化,傳統的分而治之的策略已經越來越困難(或者並不經濟),打破邊界有可能會實現“降維打擊”的效果。雖然好處顯而易見,但成功與否的決定因素可能不在技術,而是背後掌控生態和供應鏈的能力。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"另一方面,拋開技術原因,越來越複雜的競爭關係,也是推動產業鏈重新整合的動力。今年,我們看到Nvidia收購ARM,AMD收購Xilinx,Intel推oneAPI,其實都是這方面的嘗試。對於芯片廠商來說,儘量在自己的核心領域實現生態的“壟斷”是最高目標。但是,對於複雜芯片來說,構建軟件生態可能要比設計或實現出芯片本身更爲複雜。特別是在一些專用領域,單芯片硬件架構趨向簡化,但軟件棧的實現和產品化挑戰很大。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"因此,從目前整體狀況來看,相對傳統芯片公司,科技巨頭在軟件和系統生態上具有先天的優勢的。我之前分析過Google情況(“"},{"type":"link","attrs":{"href":"http:\/\/mp.weixin.qq.com\/s?__biz=MzI3MDQ2MjA3OA==&mid=2247485562&idx=1&sn=4b97ba5ab0a1210639e74209c4f35f44&chksm=ead1f56bdda67c7db2c8007c71446e357f54325e4327dfa8def788cde3dc0a98aeba7d945d51&scene=21#wechat_redirect","title":"","type":null},"content":[{"type":"text","text":"Google的『泛芯片』科技藍圖"}]},{"type":"text","text":"”),有很多芯片廠商很難做的事情,在Google自己的生態裏面相對容易(其實是和蘋果M1的情況類似)。還有一個例子是這篇文章裏JAX這個研究性框架可以很好的發揮TPU的性能,“"},{"type":"link","attrs":{"href":"http:\/\/mp.weixin.qq.com\/s?__biz=MzI3MDQ2MjA3OA==&mid=2247485453&idx=1&sn=1436ec1278df2b900f656b9cd397e8ae&chksm=ead1f51cdda67c0a633ed8bbf8bb44f23301f7aba02530c6ae1dd45050b25a8b100a8fc3a2b4&scene=21#wechat_redirect","title":"","type":null},"content":[{"type":"text","text":"AI訓練芯片巔峯對決,如何正確“圍觀”?”。"}]},{"type":"text","marks":[{"type":"strong"}],"text":"未來我們應該能看到更多由科技巨頭髮起的發自系統需求的跨層次優化"},{"type":"text","text":"。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"回顧歷史,分而治之(分層優化)是計算機系統和通信系統這樣的複雜系統幾十年快速發展的關鍵,未來實現徹底扁平或端到端的優化是不太可能的(這個超越人類能力的問題可能得留給AI),但找到更合理的分層或分塊方式是完全有可能的。比如,我們看到深度學習框架和編譯器領域就有很多嘗試,一方面打破傳統的圖和算子的層次進行統一優化是一個廣泛關注的路徑;另一方面,通過多層IR和編譯工具來更合理的實現“分而治之”也很有前景。蘋果M1的例子再次給我們證明了現在確實是“架構的黃金時代”,芯片,軟件,系統都還有很多優化空間。重新思考非常必要,甚至"},{"type":"text","marks":[{"type":"strong"}],"text":"“重新設計輪子”有可能是非常合理且有效的選擇"},{"type":"text","text":"。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"最後說兩句“先進封裝”的問題。“Chiplet”是今年的熱詞,歷史更久且更準確的技術詞彙應該是異構集成(Heterogeneous Integration),有興趣的朋友可以看看HIR(Heterogeneous Integration Roadmap),有這個領域非常完整的分析和預測。總的來說,這個技術趨勢是比較清晰明確的,而且第一階段的Chiplet形態(比如HBM)在技術上已經比較成熟,除了成本比較高,在很多高端芯片上已經使用。而且這個技術涉及的供應鏈比較簡單,可以說是花錢就可以解決的問題。當然,這個方向還是有很多技術挑戰的,最終是要走向Monolithic 3D(如下圖)。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https:\/\/static001.geekbang.org\/infoq\/aa\/aaa71454acda01381893ebcc1646cdee.png","alt":null,"title":null,"style":null,"href":null,"fromPaste":true,"pastePass":true}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"source: ERI summit 2020"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"另外一個和異構集成關係很大的是光互聯,這兩件事要是解決了,芯片和系統架構可能發生產生根本性的變化。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"italic"},{"type":"strong"}],"text":"本文經原作者授權,轉載自StarryHeavensAbove公衆號,原文鏈接:"},{"type":"link","attrs":{"href":"https:\/\/mp.weixin.qq.com\/s\/u9KVll9SKFpRWjd_id-hyA","title":"xxx","type":null},"content":[{"type":"text","marks":[{"type":"italic"}],"text":"https:\/\/mp.weixin.qq.com\/s\/u9KVll9SKFpRWjd_id-hyA"}],"marks":[{"type":"italic"},{"type":"strong"}]},{"type":"text","marks":[{"type":"italic"},{"type":"strong"}],"text":",如需轉載請聯繫作者。"}]}]}
發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章