Apache Arrow:一種適合異構大數據系統的內存列存數據格式標準

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"本文介紹一種內存列存數據格式:Apache Arrow,它有一個非常大的願景:提供內存數據分析 (in-memory analytics) 的開發平臺,讓數據在異構大數據系統間移動、處理地更快。同時,比較特別的是這個項目的啓動形式與其他項目也不相同,Arrow 項目的草臺班子由 5 個 Apache Members、6 個 PMC Chairs 和一些其它項目的 PMC 及 committer 構成,他們直接找到 ASF 董事會,徵得同意後直接以頂級 Apache 項目身份啓動。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"本文從以下幾個方面來介紹 Arrow 項目:"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"bulletedlist","content":[{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"Arrow 項目的來源"}]}]},{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"Arrow 如何表示定長、變長和嵌套數據"}]}]},{"type":"listitem","attrs":{"listStyle":null},"content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"內存列存數據格式與磁盤列存數據格式的設計取捨"}]}]}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"注:Arrow 即可以指內存列存數據格式,也可以指 Apache Arrow 項目整體,因此下文中將用 「Arrow」 表示格式本身,「Arrow 項目」表示整體項目。"}]},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"link","attrs":{"href":"https:\/\/tech.ipalfish.com\/blog\/2020\/12\/08\/apache_arrow_summary\/#Arrow-%E9%A1%B9%E7%9B%AE%E7%AE%80%E4%BB%8B","title":"","type":null}},{"type":"text","text":"Arrow 項目簡介"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}}]}
發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章