5分鐘看完企業數據挑戰的簡史

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"strong"}],"text":"本文最初發佈於hassenchaieb.com網站,經原作者授權由InfoQ中文站翻譯並分享。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"如今,數據生態系統正在蓬勃發展,流行名詞隨處可見,每天都有新產品面世發佈。身在其中的人們很難看清“廬山真面目”。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"在這篇文章中,我會退後一步,試着解讀當前生態系統的源頭。爲什麼我們擁有如此衆多的產品,它們在現代企業中又各自適合哪些位置?當然,我會做很多簡化。實際上,每家公司都是獨特的,有着自己獨有的需求。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"2000年代初:互聯網的崛起和數據量的增長"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"隨着互聯網的興起,企業不得不處理越來越多的數據源。公司數據被存儲在許多各自不同的關係數據庫中。這讓公司無法快速獲得關於客戶、銷售等領域的數據分析結果和可行見解。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"一種解決方案是數據倉庫,它將所有彼此孤立的關係數據庫整合到一個單一的事實來源中,用來提供客戶數據的360°全景視圖。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https:\/\/static001.infoq.cn\/resource\/image\/4e\/e5\/4ebae839010e8cyy48e120af15e848e5.gif","alt":null,"title":"","style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":"","fromPaste":false,"pastePass":false}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"然後,許多大型科技公司開始收集海量數據,因此需要全新的數據存儲和處理方式。這些工作再也不是單臺計算機可以應付的了。2006年,Hadoop誕生。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"Hadoop是一組軟件工具,可對龐大的數據集進行分佈式處理(多臺計算機)。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"接下來,許多工程師離開了這些巨型公司,開始了自己的大數據創業,並獲得了風險投資的資助。到2010年,大數據熱潮來臨。"}]}]}
發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章