Kyligence技術合夥人祝海林:希望通過語言這種底層技術讓AI和大數據更加普世化 | DIVE 基礎軟件大會專訪

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"在當前數字化轉型的大背景下,數據和AI已然成爲了企業生存和發展的必備生產要素。作爲支撐所有上層技術的根基,編程語言近年來越來越趨向領域化,而大數據和 AI 的蓬勃發展,必然需要一個更加爲之量身定製的語言,以 SQLFlow、MLSQL 爲代表的這類語言的誕生正是順應了這個趨勢。本文,InfoQ 有幸採訪到了"},{"type":"link","attrs":{"href":"https:\/\/dive.infoq.cn\/2021\/beijing\/","title":null,"type":null},"content":[{"type":"text","text":"【DIVE全球基礎軟件創新大會2021】"}],"marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}]},{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"編程語言新風向專場講師、Kyligence技術合夥人\/資深數據架構師、MLSQL作者祝海林,聽他分享當前大背景下我們爲什麼需要一門面向大數據和AI的編程語言。"}]},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"MLSQL從無到有,都經歷了什麼?"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"blockquote","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"MLSQL語言是人工智能、大數據、雲計算三者融合之下的時代需求產物,傳統語言難以勝任這些新的需求。 "}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"InfoQ:首先請您介紹下現在所負責的工作吧?"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"我現在在Kyligence 主要負責MLSQL開源相關的工作,目前已經構建起了一支專職的開源團隊。隨着團隊的全職人員不斷擴充,也說明了我們公司對開源投入很大,也很有信心。另外,這個團隊我們還在繼續招人。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"InfoQ:作爲MLSQL的作者,您設計這門語言的初衷是什麼?它誕生的背景是怎樣的?"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"祝海林:"},{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"天下武功,唯“快”不破。 這個快又包含了兩個層面。第一個是,我的軟件夠快,比如Kyligence旗下的開源項目Kylin就在OLAP層面做到了極致的快,能對大數據做到亞秒級的查詢速度。第二個是,軟件使用者效率快,也就是基於這個系統,使用者能夠高效率地完成自己的任務和工作,原本要一週能完成的事情,現在可以兩小時就完成,從而讓生活變得更加簡單。 "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"MLSQL的出發點是第二個,我們希望隨着人工智能、大數據、雲計算的發展,在這個大的技術發展環境下,用戶可以用一個語言,一個引擎就能在雲上、雲下涵蓋AI\/BigData的所有任務,比如OLAP查詢、ETL處理、流批一體、機器學習等等。這個語言必須足夠簡單,能夠讓大多數人接受,這個引擎也必須足夠簡單,能夠輕易被維護。而在MLSQL之前,用戶往往需要衆多的系統和語言進行組合,才能完成複雜的大數據和AI任務的,這極大地降低了使用者的效率,提高了維護成本,使得大數據和AI難以真正走入尋常百姓家。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"InfoQ:MLSQL的定位是一門"},{"type":"text","marks":[{"type":"color","attrs":{"color":"#40485b","name":"user"}},{"type":"strong"}],"text":"面向大數據和AI的語言,那它和其他語言有什麼區別?"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"祝海林:"},{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"MLSQL語言是人工智能、大數據、雲計算三者融合之下的時代需求產物,傳統語言難以勝任這些新的需求。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"首先,MLSQL這個語言天然是分佈式執行的,大部分傳統語言比如Python、Java必須依賴框架以及新的編程範式才能完成分佈式。典型的如Java的虛擬機(執行引擎)就是單機的,語言也是單機的,你必須依賴Spark 這種框架,學習它的分佈式編程範式,你才能進行分佈式數據處理。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"第二,傳統語言對於用戶來說入門太難了,原因在於它還是太“底層了”。真正效率高的是什麼語言? 是聲明式語言,比如SQL,還有比如命令行。聲明式語言,不用你在編程的時候關注和業務邏輯沒關係的東西,你只要描述你的任務是什麼,其他的比如任務如何生成,如何拆解和執行,都是底層語言引擎需要做的事情。這樣才能讓更多的人使用起來。畢竟,會SQL的人肯定比會Python的人多,對不對?"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"第三,新時代,我們不再要求語言必須有那種納秒、微秒級別的響應時間了。比如在做大數據分析、ETL、亦或是AI的時候,我們大體都是毫秒級,秒級或者以上其實都是可以接受的。這個時候,分佈式的解釋型語言就很有價值了,雖然沒辦法做到納秒、微秒級別的執行速度,但是在可接受範圍內,它比傳統語言更易用。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"第四,傳統語言都沒有在語言層面提供安全能力。比如哪些語法,哪些特性可以針對指定用戶開啓或關閉。MLSQL在設計之初就提供了這種能力,比如用戶可不可以使用MLSQL 裏的load語句加載數據,能加載什麼數據,加載的數據的哪些列,哪些行,要不要執行脫敏策略等等,都被內置在語言層面。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"還有就是擴展性問題,傳統語言語法以及對語言引擎(VM)做擴展,是有很高的門檻的。MLSQL 執行引擎是使用Scala\/Java開發的,而且設計的時候就是插件內核,我們可以很輕易的通過插件機制擴展MLSQL的語法,或者增強執行引擎的能力。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"MLSQL就是這麼一門面向大數據和AI設計的聲明式雲原生語言,以SQL爲主體,保留了其分佈式,聲明式特性,同時彌補了原生SQL可編程性的不足。在保證足夠簡單,同時又可以高度靈活的情況下,讓用戶更好的玩轉數據,所見即所得。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"InfoQ:MLSQL發展至今,經過了哪些迭代?"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"祝海林:"},{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"最早MLSQL是爲了使用SQL解決ETL而誕生的,後面我們覆蓋到了流,到後來,我們發現還可以做OLAP查詢服務,再後來,我們發現我們還可以完成機器學習的工作,很驚喜的是,不僅可以做到,效果還相當的好。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"在這期間,我們給SQL添加變量,分支語句,函數,模塊等標準語言的支持,同時還擁抱了Python 語言(語言內置了Python腳本執行引擎)從而具備擁抱更大的AI生態的能力。到這個時候,我們就發現,MLSQL 已經有一個語言的完整形態了,可以解決大數據和AI領域的大多數問題。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"MLSQL的技術挑戰及未來規劃"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"blockquote","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"開源還是挺費錢的,尤其是人力方面。"}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"InfoQ:您一門編程語言並不容易,您在做MLSQL時是否遇到了技術上的挑戰?這期間有沒有讓您印象深刻的事件?"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"祝海林:"},{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"挑戰肯定是有的。一個比較典型的例子是,MLSQL涉及的技術點太多了,作爲開發者來說要把這些技術點都做好,這就非常有挑戰了。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"比如在開發MLSQL的過程中,爲了展示MLSQL的能力,我們需要一個好的Web端,這個時候我就重新去學JavaScript\/TypeScript,使用ReactJs去開發了一套Web Console,爲了能夠在桌面上使用,我學習VSCode擴展開發,開發了VSCode MLSQL Lang 插件,爲了能夠把它更好地部署到K8s上,我學習Go語言,開發了mlsql-deploy, 爲了能夠支持分支語句,我用Scala手寫了一套表達式引擎,從詞法分析,到語法分析,到最後生成目標代碼全部自己實現了一遍。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"除此之外,各種大數據技術你都要有一定的掌控,融入進來,這才能做到讓大家開箱即得,這包括諸如CDC(我們支持MySQL\/HBase)、數據湖(Delta Lake)、物化視圖、聚合下推等等大數據相關技術。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"印象中,倒是沒有遇到技術上比較深刻的事情,但是有一個很深的感覺就是,現代軟件的複雜度太高了,一個人的時間是怎麼着都不夠的,無論你的精力和速度有多快。除非你是做那種小而美的軟件,解決一個非常領域化的問題,纔可能持續單兵作戰。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"InfoQ:那MLSQL又是如何做推廣的?單純依靠社區的力量是不夠的吧?是不是在資金上也投入巨大?"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"祝海林:"},{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"我們的推廣其實非常之少,開源後,如果這門語言確實解決了一些問題,並且能夠給用戶帶來收益,那麼就會有更多的用戶願意去嘗試。很多用戶都用了幾年後,突然某天在羣裏提了一嘴,我才知道,竟然有用這麼久的用戶了。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"在我加入Kyligence前的這些年,主要時間都花在了架構設計和代碼開發上,用於宣傳的時間非常少,偶爾去參加一些meetup做個分享,或者給一些公司做個佈道,這期間也得到了很多國內媒體的幫助,他們也願意支持像我這樣的項目,很是感謝。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"MLSQL的推廣單純依靠社區的自然生長確實是不夠的,而且宣傳這個東西,是需要有一定的財力支持的。比如我做官網,自己掏錢買了雲主機,買了域名,然後我發現我買不起https認證,儘管我一直想搭建一個線上的體驗服務,讓用戶可以直接體驗MLSQL語言,但是考慮到昂貴的費用(大數據嘛,機器配置要求略高)一直沒有買,這個時候雖然時不時會有用戶會提供機器支援,但很不穩定,這對於宣傳其實是很負面的。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"所以我個人認爲,開源項目最好還是後面能有一家公司做支持。國外的很多開源項目雖然沒有公司支持,但是有基金會支持。"},{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"總之,開源還是挺費錢的,尤其是人力方面。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"InfoQ:您對MLSQL的未來規劃是怎樣的?"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"祝海林:"},{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"遠景目標是,MLSQL能夠成爲大數據和AI的標準語言,同時實現數據的平民化,幫助中小企業非常低成本的落地大數據和AI。我們現在正在努力將其打造成一個可信賴的產品,這包括易用性、性能、穩定性、安全等方方面面。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"加入Kyligence,從朋友圈好友走向事業合夥人"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"blockquote","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"我們希望從語言這種更底層的方式,去更普世化地解決一些問題,讓大數據和 AI 變得更簡單,更加平民化。"}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"InfoQ:前些年您一直在埋頭苦幹搞MLSQL,那又是什麼樣的契機讓您選擇加入Kyligence?"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"祝海林:"},{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"MLSQL是我16年在北京的時候開源出來的,後面來到了杭州,經過五年的努力,MLSQL已經有一定的成熟度,也有不少小創新,在很多公司得到了應用實踐,不少互聯網公司也借鑑了MLSQL的很多理念。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"這個時候我覺得MLSQL如果一直作爲一個個人開發者的作品,會影響它的發展,我希望它能踏入一個新的快車道。我和Kyligence CEO 韓卿先生是很多年的“微信朋友圈好友”,這個時候他經過長時間的觀察,也頗看好MLSQL, 而在我看來,Kyligence 也是一家非常優秀的擁有開源社區運營經驗的公司,所以我們很快一拍即合,最後2021年的時候,我以技術合夥人的身份加入了Kyligence。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"InfoQ:您加入Kyligence後,MLSQL在Kyligence的業務板塊裏發揮了怎樣的作用?"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"祝海林:"},{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"Kyligence 一直都對開源項目有着較大的投入,Kylin 已經進入 Apache 很多年,而且也獲得很大的成功。總體來看,Kylin 以強大的分析能力和豐富的周邊生態而備受歡迎,Kylin 其實是非常好地覆蓋了 BI 這一部分,部分覆蓋 ETL\/ML。那 MLSQL 則是很好地覆蓋了 ETL\/ML,部分覆蓋了 BI ,所以兩者具有很好的互補性。第二點,我們公司也嘗試從語言層面去解決大數據和 AI 領域的問題。可能以前大部分還是從應用產品或者組件的方式去切入,這種方式的缺點是解決問題更像打地鼠,出現一個問題,用一個框架或者產品去解決。我們現在是希望能夠通過這種更底層的方式,去更普世化地解決一些問題,讓大數據和 AI 變得更簡單,更加平民化。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"InfoQ:目前MLSQL的市場接受情況是怎樣的?是否達到了您的預期?"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"祝海林:對於MLSQL,"},{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"用過的都說好。好些用戶都是三年以上老用戶了,唯一不足的是,一線互聯網公司很少有用的。不過因爲之前在社區方面沒有做太多工作,甚至從來沒有組織過任何活動,屬於完全放養狀態,所以用戶數也在預料之中。今年(2021年)開始,在公司的支持下,我已經在規劃這方面的工作了。在十月、十一月,我們很快會有很多新的社區工作落地,這包括新的官網、10月底線下的MLSQL meetup、線上體驗版,還有專門的公衆號,知乎專欄等等。我對未來還是充滿信心的,這五年技術的發展也讓我愈發相信MLSQL的技術路線是走對的。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}},{"type":"strong"}],"text":"嘉賓介紹:"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"link","attrs":{"href":"https:\/\/dive.infoq.cn\/2021\/beijing\/presentation\/4003","title":null,"type":null},"content":[{"type":"text","text":"祝海林"}],"marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}]},{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":",Kyligence技術合夥人\/資深數據架構師、MLSQL作者,擁有13+年研發經驗。最近幾年專注在Data + AI 融合方向上,致力於幫助企業更好落地Data\/AI。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"祝海林老師將在InfoQ舉辦的首屆"},{"type":"link","attrs":{"href":"https:\/\/dive.infoq.cn\/2021\/beijing\/","title":null,"type":null},"content":[{"type":"text","text":"【DIVE全球基礎軟件創新大會2021】"}],"marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}]},{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"上分享《面向Data和AI的語言MLSQL》的主題演講。無論是企業還是一線研發人員,都可以通過MLSQL更好地去落地大數據和AI。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"當前,數據庫、操作系統、中間件、編程語言……各個行業不斷變化,但時勢造英雄。基礎軟件發展的“黃金時代”中,怎麼成爲行業“英雄”?"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":" "}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":"阿里雲李三紅、PingCAP黃東旭、MegaEase陳皓,還有信通院、統信軟件、極狐、清華大學、紅杉資本等各個行業的專家學者,一起講述行業變化,更要明析時代大勢下蘊含的無限機會!點擊"},{"type":"link","attrs":{"href":"https:\/\/dive.infoq.cn\/2021\/beijing\/","title":"xxx","type":null},"content":[{"type":"text","text":"此次"}]},{"type":"text","marks":[{"type":"color","attrs":{"color":"#494949","name":"user"}}],"text":",看看還有誰在等着你吧!"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https:\/\/static001.geekbang.org\/infoq\/3f\/3ff4b0120ac0aeeeb094cd139b4fb29e.png","alt":null,"title":null,"style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":null,"fromPaste":true,"pastePass":true}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}}]}
發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章