Cloudera’s Distribution for Apache Hadoop

記得從0.1版本就使用過,當時還是用的是Apache Hadoop,現在都已經有自己的增強版本了,真的不錯。

 

HDFS – Self healing distributed file system

MapReduce – Powerful, parallel data processing framework

Hadoop Common – a set of utilities that support the Hadoop subprojects

HBase – Hadoop database for random read/write access

Hive – SQL-like queries and tables on large datasets

Pig – Dataflow language and compiler

Oozie – Workflow for interdependent Hadoop jobs

Sqoop – Integrate databases and data warehouses with Hadoop

Flume – Highly reliable, configurable streaming data collection

Zookeeper – Coordination service for distributed applications

Hue – User interface framework and SDK for visual Hadoop applications

 

 

下載:http://www.cloudera.com/downloads/

Hadoop 介紹:http://www.sfbayacm.org/wp/wp-content/uploads/2010/01/amr-hadoop-acm-dm-sig-jan2010.pdf

 

發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章