Al與推薦技術在騰訊QQ瀏覽器的應用

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"口述 | 徐羽"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"整理 | 王強"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"blockquote","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"在 "},{"type":"link","attrs":{"href":"https:\/\/aicon.infoq.cn\/2021\/beijing?utm_source=infoq&utm_medium=conference","title":"xxx","type":null},"content":[{"type":"text","text":"AICon 全球人工智能與機器學習技術大會(2021)北京站"}]},{"type":"text","text":"上,騰訊信息平臺與服務線 CTO、PCG 事業羣推薦與 AI 中臺負責人徐羽帶來了主題爲《Al 與推薦技術在騰訊 QQ 瀏覽器的應用》的分享,詳細介紹了 QQ 瀏覽器近年來在 AI 推薦技術方面走過的技術旅程。本文由 InfoQ 根據徐羽的演講內容進行整理,希望對你有所啓發。  "}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https:\/\/static001.infoq.cn\/resource\/image\/13\/0c\/13cbyy6e8a067ef909082220e28fb30c.png","alt":null,"title":"騰訊信息平臺與服務線 CTO、PCG 事業羣推薦與 AI 中臺負責人徐羽","style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":"","fromPaste":false,"pastePass":false}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"面向超大規模用戶羣體的推薦技術是近年來 AI 落地的熱門場景之一。在國內,騰訊是較早大規模實踐 AI 推薦技術的頭部廠商。騰訊旗下的 QQ 瀏覽器應用月活用戶超過 4 億,過去幾年來在 AI 和推薦技術方面經歷了三次大規模重構和迭代,其實戰經歷具有很高的參考與學習價值。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"QQ 瀏覽器月活用戶超過 4 億,是典型的超大規模用戶羣平臺產品。QQ 瀏覽器用戶羣體分佈與中國互聯網用戶羣體分佈相似,且用戶日均使用次數較多,能夠爲推薦系統提供高質量的大樣本數據集,爲許多形態複雜、內容多樣的推薦模型構建基礎。 "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https:\/\/static001.infoq.cn\/resource\/image\/c9\/33\/c9ba6292f561026e06ab8bbd15593d33.png","alt":null,"title":"","style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":"","fromPaste":false,"pastePass":false}},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"QQ 瀏覽器推薦系統架構"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"在 QQ 瀏覽器中,推薦算法應用的主場景是信息流。信息流主 feed 是圖文、短視頻、小視頻混合形態。用戶點擊圖文會進入詳情頁,點擊豎屏小視頻會進入小視頻沉浸式頁面,點擊橫版視頻會進入短視頻沉浸式頁面。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"在 2015 年之前,騰訊推薦系統大多爲偏單目標和淺層的機器學習模型。2015 年微軟提出了 DSSM 雙塔召回模型,2016 年穀歌提出了 Wide&Deep 模型,之後騰訊開始大規模使用隱性特徵做召回和排序,大大增強了推薦模型的泛化能力,一定程度緩解了推薦系統中飽受詬病的信息繭房問題。2018 年穀歌發佈 MMOE 模型後,騰訊在 2020 年也發佈了自研 PLE 模型,令推薦模型可以同時做多個目標。最近騰訊還在積極探索 AutoML 應用,希望使用超參數爲模型自動化尋參。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https:\/\/static001.infoq.cn\/resource\/image\/fc\/b9\/fc8f3b2bc0058f89c0f372b9326988b9.png","alt":null,"title":"","style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":"","fromPaste":false,"pastePass":false}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"縱觀 QQ 瀏覽器整體推薦架構,從下往上來看,下層要計算的數據更多,負責將盡可能多的內容篩選出來向上層傳遞;上層模型更加複雜,特徵更加精細化,負責儘可能給出最優排序結果。目前各個層面都有新的技術趨勢和演進,其中召回層面正在引入更多圖模型召回、更多常序類召回;排序層面正在引入更多多模態特徵、長 session 級別特徵;多目標層面納入了點擊率、時長、互動等目標,希望實現更充分的融合;混排層面,QQ 瀏覽器需要在每次用戶刷新後決定新頁面中顯示的圖文、短視頻和小視頻的比例,現在團隊在使用強化學習技術處理相關排布,希望爭取能夠讓系統獲得最大獎勵。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"該架構在邊緣計算方面也有創新。QQ 瀏覽器終端有小排序模型,是雲端和終端聯動模型。用戶在沉浸式視頻流中看視頻時,終端模型會實時將用戶反饋數據上傳雲端,雲端根據當前視頻完播率、關注度等數據調整後續視頻排序,爲用戶在下次刷新時推薦更符合瀏覽興趣的視頻。"}]},{"type":"heading","attrs":{"align":null,"level":3},"content":[{"type":"text","text":"召回模塊"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"該推薦系統召回模塊分爲顯式召回和隱性召回。隱性召回如今佔據主曝光量 90% 以上,是最重要的召回方式。隱性召回涉及算法模型較多,其中對曝光量影響最大的兩種分別爲 ICF(基於物品的屬性召回)與 UCF(基於用戶序列的召回)。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"ICF 方面,騰訊早期使用了經典的關聯召回,其實現簡單,效果尚可,但缺乏物品相似度計算,用戶體驗欠佳。關聯召回要求每個物品有較多用戶行爲才能給出較準確計算結果,很難處理好系統中的新到達內容,更傾向於推薦熱門商品。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"爲此團隊引入 NLP 中經典的 Word2vec,以用戶觀看減肥視頻爲例,Word2vec 能夠關聯推薦健康食品相關視頻,這樣在提升用戶體驗時顯著提升了點擊率。針對物品冷啓動問題,現在整個召回模型使用 GraphSage 圖模型算法結構。該結構要更多考慮圖模型中邊的信息與權重,與每個節點的特徵信息。GraphSage 能夠在物品沒有積累很多用戶行爲的情況下,計算出非常準確的 Embedding,方便召回。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"例如用戶觀看主題爲深圳看海的短視頻後,兩個創作者上傳了杭州森林、昆明田野的視頻。過去的 ICF 召回無法對這樣的新內容進行有效推薦,但 GraphSage 可以準確計算其 Embedding,實時向用戶推薦。這一改進不僅提升了推薦效率,還有助於平臺建設更良好生態,更好地扶持新創作者。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https:\/\/static001.infoq.cn\/resource\/image\/3b\/e2\/3b9c6d132d5aca9c5db7edb4067699e2.png","alt":null,"title":"","style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":"","fromPaste":false,"pastePass":false}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"UCF 召回方面,團隊使用了經典的 RNN 召回。在演進過程中,團隊加入 Attention,將用戶點擊作爲輸入,算出每個用戶的向量,結合物品向量計算相似度以進行召回。但 RNN 模型是串行計算方式,只能計算用戶過去幾十次,最多一兩百次的行爲;而平臺很多老用戶使用 QQ 瀏覽器很多年,行爲積累幾千甚至幾萬之多。針對這一方面需求,團隊將模型升級爲多層 RNN 的 Memory 的存儲建模,模型中有非常多層 RNN 網絡,每一層代表用戶不同時間的序列,如有些層代表用戶永久點擊,有些層代表用戶長期點擊,有些層是近期點擊,等等。每層網絡中間存儲和更新計算均不相同,從而最大化節省存儲空間和提升推理速度。經過這樣的改進,系統得以做到網絡線上實時推理,同時召回用戶永久興趣、長期興趣、中期興趣和短期興趣,從而大幅提升用戶體驗,使人均點擊率提升 20% 以上。"}]},{"type":"heading","attrs":{"align":null,"level":3},"content":[{"type":"text","text":"排序模塊"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"排序模塊的精排層是特徵最多最複雜的模型結構。團隊早期使用 LR 模型,泛化能力相對較差。之後團隊演進到 Wide&Deep 模型,泛化能力大大增強,但這種模型缺乏特徵之間的交叉和融合方案。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"最近團隊將模型演進到名爲“多種算法模型融合多任務學習”的排序模型。模型最下層有用戶特徵、物品特徵和多模態特徵,從中生成一些 Embedding 向量。之後對向量調權,做 Embedding Reweight;再使用 FM、CIN 等方式對 Embedding 特徵做高維度交叉。模型中間是專家網絡層,之前使用類似谷歌 MMOE 的實現,現在則轉用騰訊內部發布的 PLE 模型。PLE 模型中,除了每個目標有自己單獨的專家網絡之外,還共享了一些專家設計,相較 MMOE 模型有更明確的效果。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"模型最上層引入分塔技術,該技術與業務場景和使用行爲高度相關。在瀏覽器首頁,新用戶和老用戶的行爲有所不同,因此新用戶和老用戶會各自單獨建模,分兩個塔,各有自己的輸入參數以提高效果。以視頻播放場景爲例,團隊發現短視頻、長視頻和中視頻的物理 Label 分佈不一樣,所以在這個層面上按照視頻物理時長做分塔切片可以提升推薦效果。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https:\/\/static001.infoq.cn\/resource\/image\/30\/69\/300b79f6yy1b1dec1fbe044490891169.png","alt":null,"title":"","style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":"","fromPaste":false,"pastePass":false}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"團隊還打造了特色的 pointwise+pairwise 聯合學習。其中 pointwise 一般用來預估 CPR 做 pCTR 輸出,但用戶序列學習效果不佳,所以往往 AUC 計算較好,但 GAUC 偏低。而 pairwise 的一般排序比 pointwise 更好,但前者無法生成 pCTR,亦即 CTR 用戶值,但混排在上面的載控各層需要該值。如果 pointwise、pairwise 分別部署,兩套模型需要兩倍機器。爲此團隊提出改進,在輸入時通過一些矩陣轉制同時生成 pointwise 和 pairwise 樣本,兩者共享一些共同的輸入特徵、共享部分網絡模型。上層 pointwise 和 pairwise 有各自獨立的模型和計算,這樣無需增加任何服務器成本就可以在一次樣本流裏同時輸出兩個結果,最後 pairwise 輸出的 pCTR 用在其他場景,而 pairwise 排序結果作爲精排結果輸出。"}]},{"type":"heading","attrs":{"align":null,"level":3},"content":[{"type":"text","text":"多目標"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"在瀏覽器主 feeds 場景中,用戶點擊視頻時,系統首先要預測各個視頻點擊率、卡片點擊率;用戶點擊卡片進入視頻沉浸流後,系統要預測第二層目標,預測各個視頻完播率,預測用戶是否會下滑,預估用戶滑多少條視頻、整體時長有多少;第三層目標還要預估用戶在滑動過程中可能關注、收藏、分享哪些視頻;第四層目標需要預估用戶是否會觀看視頻評論或撰寫評論。最終,系統會預估十多個目標,而多目標融合技術就負責將這些目標融合成合理的方式排序。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"早期團隊在做多目標融合的時候主要使用加法或者乘法結合超參數,核心是尋找最合理的超參數,現在則演進爲用機器自動尋參。尋參方式從經典的網格計算演進到非個性化尋參,加入 BS 優化;最近模型演化到個性化尋參,每個用戶都有自己獨立的超參數。團隊使用進化算法實現該體系,每次迭代後參數都會更靠近最優值。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"調參結果會通過 AB 實驗系統對接線上流量,每一次並行做幾百上千組實驗,以最快時間獲得線上反饋,完成整體超參數優化。目前這套體系比之前的調參方法節約 50% 以上研發時間。"}]},{"type":"heading","attrs":{"align":null,"level":3},"content":[{"type":"text","text":"PCG 技術中臺“無量”大規模分佈式機器學習平臺"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"QQ 瀏覽器平臺使用的大規模分佈式機器學習平臺名爲“無量”。由於 QQ 瀏覽器信息流每天曝光量達到百億級,因而需要非常強大的計算平臺。在這樣的背景下誕生的騰訊 PCG 推薦中臺超大規模深度學習系統—無量。無量平臺能夠訓練百億級樣本,生成 TB 級別模型,在線上達到毫秒級推理。 "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https:\/\/static001.infoq.cn\/resource\/image\/0e\/c4\/0e1f187b5d0a318cf601af1107fa54c4.png","alt":null,"title":"","style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":"","fromPaste":false,"pastePass":false}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"系統原有的模型遷移至無量後,各方面效果平均提升 20% 以上。無量採用參數服務器的設計理念,其針對平臺的推薦、搜索算法特點,每一個 batch 只會激活較少的稀疏特徵。底層可能是稀疏參數,規模千億級,上層可能是百萬級較稠密的參數。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"無量是 worker 和 server 分佈式架構,其中 server 負責 TB 級模型的存儲更新和推理,worker 負責百億級樣本的分佈式訓練。分佈式計算涉及巨量網絡通訊,所以無量引入了零字節網絡通訊拷貝、混合精度壓縮等技術,降低 worker 和 server 之間的網絡通信量。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"無量平臺不僅能支持騰訊自研模型,還支持 TensorFlow、PyTorch 等衆多第三方框架,方便現有模型代碼遷移。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"無量平臺在推理方面遇到很多挑戰,其中每次請求需要訪問多達十幾萬次 key 的數量級,系統峯值達到幾十億次 QPS 訪問量。傳統系統的讀寫鎖、主備設計無法滿足如此高的性能要求,因此騰訊自研了一套基於內存的模型存儲系統,實現多線程無鎖、單個模型定型讀取、主副版本讀寫分離等,滿足了超高規格的性能需求。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"針對大規模模型的線上更新挑戰,團隊採用了 TB 級大模型全量更新每 4 到 8 小時一次,增量模型 10 分鐘一次,實模型和 KB 級模型毫秒級更新的策略。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"團隊還將推薦模型的訓練和推理工作都遷移到了 GPU 上。爲了解決 GPU 顯存較小的挑戰,團隊應用了大量優化。訓練階段需要非常高的吞吐量,所以團隊將每一次訓練樣本中需要用到的高頻參數樣本放入 GPU 顯存,下一次可能使用的參數特徵放入 CPU 內存,其他參數放入 SSD。推理階段需要更低的延時和更高的併發,因此團隊對線上千億級特徵實時統計訪問頻率,將高頻參數放入推理的 GPU 顯存,中頻參數放入內存,低頻參數放入分佈式內存 serving 集羣。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"線上推理遷移到 GPU 後節省了一半成本,改進非常顯著。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"NLP、CV 與視頻理解"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https:\/\/static001.infoq.cn\/resource\/image\/c8\/84\/c82935ea8a50b1455e271447e33e3e84.png","alt":null,"title":"","style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":"","fromPaste":false,"pastePass":false}},{"type":"heading","attrs":{"align":null,"level":3},"content":[{"type":"text","text":"興趣點圖譜"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"在 QQ 瀏覽器內容算法中,興趣點圖譜的本質是打標籤。團隊面臨的挑戰是如何將標籤打全,打準,打出標籤與標籤之間的組合關係,打出一些概念標籤,像人類一樣打出感情的標籤,等等。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"2019 年,團隊主要聚焦語義領域,解決概念標籤問題。例如一篇手機評測文章沒有直接列舉手機價格、描述拍照好壞,而系統可以通過概念抽象提煉文章標籤,總結出“這是高性價比拍照功能特別好的手機”這樣的主題。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"2020 年,團隊主要開發聚集話題、事件的關聯能力。例如系統通過話題聚合和事件生成能力把和東京奧運相關的成千上萬視頻文章,聚合到事件脈絡和每個話題中,形成事件和話題的圖譜體系。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"今年團隊聚焦小視頻標籤領域,希望解決更深層次的語義提煉問題,希望 NLP 像人類一樣打情感類標籤。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"此外,系統現在可以基於用戶喜歡的明星做推薦,或者通過“高顏值”這樣的概念標籤進行推薦。現在平臺的興趣點圖譜有接近 100 萬個興趣點,全面覆蓋所有中文內容幾乎每一個垂直領域。"}]},{"type":"heading","attrs":{"align":null,"level":3},"content":[{"type":"text","text":"神舟預訓練模型"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https:\/\/static001.infoq.cn\/resource\/image\/6b\/e4\/6bbbabc1edf24cdd0cd4bfab7eed7de4.png","alt":null,"title":"","style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":"","fromPaste":false,"pastePass":false}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"QQ 瀏覽器團隊在兩個月前推出了神舟預訓練模型,不僅在權威的中文語言理解測評基準拿到冠軍,還在二十多個分類和語義標籤任務上第一次超過了人類專家標註的水平,是比較有標誌性意義的技術。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"預訓練的業務在推薦技術,搜索業務裏都有大規模使用。之前的 BERT 主要優化英文,對漢語支持不佳,所以團隊做了字詞混合力度表達輸入到特徵裏;BERT 使用了絕對未知編碼,而團隊發現漢語裏用相對未知編碼效果更好,能讓語義相關的 Embedding 離得更近,而不相似語義的 Embedding 離得更遠;針對 BERT 的網絡退化問題,團隊做了跨層衰減的殘差連接,以優化整個網絡的信息傳遞;同時團隊還做了模型蒸餾技術,做了數據增強,引入世界知識。QQ瀏覽器內部有知識圖譜團隊,有百萬千萬級經過人工專家標註過的優質準確世界知識數據,因此把它引入到預訓練裏做增強有明顯效果。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"在實踐中,團隊將所有的 NLP 任務都使用神舟做預訓練,結合微調和一些混合粒度優化,結果使下游每任務的語料標註數量減少 40% 以上。同時團隊研發了一套專門基於預訓練加 fintune 的流程,讓整個 NLP 的研發週期減少 40%。"}]},{"type":"heading","attrs":{"align":null,"level":3},"content":[{"type":"text","text":"視頻多模型預訓練"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"做完 NLP 預訓練,下一步要做視頻多模型預訓練。首先要做抽真,抽真以後基於圖像級別預訓練,生成視覺方面的預訓練模型。神舟已經生成了視頻標題、簡介、字幕、各種語音 OCR 的預訓練模型,再加上視頻裏一些語音或音樂等信息,實現了視覺、圖文、聽覺方面的語義對齊,最後生成多模態預訓練模型。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"上述所有能力的底層現在都換成了多模態預訓練框架,效果和效率都有較大提升。多模態預訓練還可用在圖文和視頻的聯合 Embedding 檢索方面。圖文正文底部要做一些視頻相關推薦,系統可以用文章生成的向量和視頻的多模態向量做關聯召回檢索,顯著提升圖文視頻的用戶滲透率。"}]},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"PCG AI 與推薦中臺"}]},{"type":"image","attrs":{"src":"https:\/\/static001.infoq.cn\/resource\/image\/f2\/b3\/f27642ec928e639cd631e7cb4a16e1b3.png","alt":null,"title":"","style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":"","fromPaste":false,"pastePass":false}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"PCG 的 AI 中臺是騰訊官方比較大的中臺項目。AI 中臺包括七個方向,本次分享主要介紹 CV、NLP、推薦系統,此外還有 AI 管線、AutoML、特徵中臺、索引中臺。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"騰訊中臺設計和行業傳統有一些差異。很多傳統設計方案裏,中臺是實體化架構組織,所有輪子都收到中臺裏更好地融合,實體化架構和業務組織互相隔離。因此很多業務需求在中臺聽不到,或者中臺做了很多新技術很難落地到業務。而騰訊採用的方式是類似開源組織的虛擬中臺設計。騰訊引入兩個新的角色設計:"},{"type":"text","marks":[{"type":"strong"}],"text":"主建方和共建方。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"QQ 瀏覽器是 AI 中臺主建設方,承擔中臺的技術 OKR;"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"各個業務部是共建方,他們也要派團隊一起參與到 AI 中臺的建設裏,這樣 QQ 瀏覽器團隊就能夠及時聽到這些業務端對 AI 的需求和想法。共建方也要把中臺新做的一些項目技術落地到他們的業務裏面,這是偏開源的協作方式。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"騰訊內部也有類似開源團隊的各種委員會,有 committer\/contributer 的機制設計,在此基礎上給中臺加上了一些考覈權利、物質激勵能力、資源方面的傾斜等等。團隊用這些考覈和資源能力幫助開源方式的中臺保證完成技術 OKR,幫助各個業務得到更好的發展。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"AI 中臺已經運營三年時間,在騰訊 PCG 事業羣裏應用比較廣泛,現在整個 AI 中臺的 PaaS 化數量超過 300 個,每天有超過一萬個模型在 AI 中臺訓練、部署和上線,每天調用量超過 2000 億次,意味着中臺比較好地支持了整個事業羣業務的發展。"}]},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"小結"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"總結本次分享內容,共包含以下三點見解:"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"numberedlist","attrs":{"start":null,"normalizeStart":1},"content":[{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":1,"align":null,"origin":null},"content":[{"type":"text","text":"AI 新範式——“厚積薄發”:預訓練模型能夠更好地沉澱海量知識,fintune+AutoML 能夠更好幫助下游任務更輕巧、更快地應對各種新應用。"}]}]},{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":2,"align":null,"origin":null},"content":[{"type":"text","text":"大算力,超大規模模型可以幫助我們探索人工智能和機器學習的新邊界。"}]}]},{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":3,"align":null,"origin":null},"content":[{"type":"text","text":"推薦系統是沒有最優解的技術體系,我們需要不斷進化,不斷去理解我們的用戶,做更多用戶洞察,持續探索內容推薦的新趨勢和方向,爲用戶帶來更多驚喜。"}]}]}]}]}
發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章