A Good Collection of Data Resources

(1) Hadoop Installation and Deployment

1. Deploying Hadoop on Windows under Cygwin:

 http://lib.open-open.com/view/1333428291655

http://blog.csdn.net/ruby97/article/details/7423088

http://blog.csdn.net/savechina/article/details/5656937

2. Hadoop pseudo-distributed installation:

http://www.thegeekstuff.com/2012/02/hadoop-pseudo-distributed-installation/

3. Hadoop fully distributed installation tutorial:

http://hi.baidu.com/leejun_2005/item/367da95bd69f4e0ce6c4a581

4. Hands-on: remotely debugging Hadoop on Linux from Eclipse on Windows 7

http://my.oschina.net/leejun2005/blog/122775

5. Fifteen-minute tutorial: installing Hadoop and Hive on a single server

http://rdc.taobao.com/team/top/tag/hadoop-hive-%E5%8D%81%E5%88%86%E9%92%9F%E6%95%99%E7%A8%8B/

ssh-keygen -t dsa -f ~/.ssh/id_dsa

cat ~/.ssh/id_dsa.pub >> ~/.ssh/authorized_keys

http://blogread.cn/it/article/6103?f=wb

Note:

On CentOS, the steps above alone are not enough; the following additional steps are required:

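sudo vi /etc/ssh/sshd_config

RSAAuthentication yes
PubkeyAuthentication yes
AuthorizedKeysFile     .ssh/authorized_keys

service sshd restart

Note: ssh can support both publickey and password authentication at the same time. publickey is not enabled in the default configuration described here and must be set to yes.
If the client has no private key (for example .ssh/id_rsa), password authentication is used; if one exists, publickey authentication is used.
If publickey authentication fails, ssh still falls back to password authentication. Do not set PasswordAuthentication no: it disables password login, so a client without a working key can no longer log in remotely.

However, at this point an error is still reported:
Permission denied (publickey,gssapi-keyex,gssapi-with-mic).

Then:
vi /etc/selinux/config
SELINUX=disabled

chmod 700 ~/.ssh
chmod 600 ~/.ssh/authorized_keys

Finally, reboot Linux and run ssh localhost.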
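The user-level part of the steps above (key pair, authorized_keys, permissions) can be sketched as one small shell function. This is only an illustration under stated assumptions: setup_ssh_dir is a hypothetical helper name, and it generates an RSA key rather than the DSA key used earlier, because recent OpenSSH releases no longer accept DSA keys by default. It does not touch sshd_config or SELinux, which need root.

```shell
#!/bin/sh
# Sketch only: prepare passwordless SSH to localhost for one user.
# setup_ssh_dir is a hypothetical helper, not part of OpenSSH.
# Assumption: an RSA key is used instead of the DSA key shown earlier.
setup_ssh_dir() {
    home_dir="$1"
    ssh_dir="$home_dir/.ssh"
    mkdir -p "$ssh_dir"

    # Generate a key pair with an empty passphrase only if none exists yet.
    if [ ! -f "$ssh_dir/id_rsa" ]; then
        ssh-keygen -q -t rsa -N "" -f "$ssh_dir/id_rsa"
    fi

    # Append the public key to authorized_keys only if it is not already
    # there, so the function can be run more than once.
    touch "$ssh_dir/authorized_keys"
    pub_key=$(cat "$ssh_dir/id_rsa.pub")
    if ! grep -qxF -- "$pub_key" "$ssh_dir/authorized_keys"; then
        printf '%s\n' "$pub_key" >> "$ssh_dir/authorized_keys"
    fi

    # sshd ignores keys whose directory or file is accessible to others.
    chmod 700 "$ssh_dir"
    chmod 600 "$ssh_dir/authorized_keys"
}
```

A typical use would be to call setup_ssh_dir "$HOME" and then run ssh localhost to check that no password prompt appears. The sshd_config and SELinux changes listed above still have to be applied separately.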
References:

http://www.linuxidc.com/Linux/2012-11/74603.htm

http://www.360doc.com/content/12/0324/12/9324714_197225609.shtml

https://www.centos.org/modules/newbb/viewtopic.php?topic_id=33048

http://flysnowxf.iteye.com/blog/1567570

8. Summary of setting up a Hadoop cluster

http://www.cnblogs.com/beanmoon/archive/2012/11/12/2767010.html

9. Hadoop for Windows

http://dongxicheng.org/mapreduce/hadoop-for-windows/


(2) Hive

1. Hands-on log statistics with Hive:

http://www.csdn.net/article/2010-11-28/282620

2. Hive example: the ten most common CSDN passwords

http://my.oschina.net/leejun2005/blog/81662

http://superlxw1234.iteye.com/blog/1528688 (installation steps)

3. Official Hive tutorial:

https://cwiki.apache.org/confluence/display/Hive/GettingStarted

4. Hive Notes (4): Hive QL

http://www.alidata.org/archives/581   # JOIN

http://wenku.baidu.com/view/242260c489eb172ded63b709.html

5. Five tips for writing good Hive programs

http://www.alidata.org/archives/622  # sorting

6. Introduction to Hive, the Hadoop data warehouse tool (Baidu)

http://wenku.baidu.com/view/90dad7659b6648d7c1c7460e.html

7. Hive sharing session (Taobao)

http://wenku.baidu.com/view/4e4a801ca76e58fafab003b1.html

8. Introduction to Hive (Meilishuo)

http://wenku.baidu.com/view/0f252121a5e9856a56126025.html

9. Hive study notes (Alibaba)

http://wenku.baidu.com/view/233308340b4c2e3f5727632a.html

10. Hive: a petabyte-scale data warehouse using Hadoop (paper)

http://wenku.baidu.com/view/b5aebfe9998fcc22bcd10d8a.html

11. Hive: SQL for Hadoop (An Essential Tool for Hadoop-based Data Warehouses)

http://polyglotprogramming.com/papers/Hive-SQLforHadoop.pdf

12. Programming Hive

http://www.itpub.net/thread-1724707-1-1.html

13. Hive Notes (6): Hive extension features:

File Format, SerDe, Map/Reduce scripts (Transform), UDF, UDAF

http://www.alidata.org/archives/604

14. Summary of data skew in Hive

http://www.alidata.org/archives/2109

15. Querying complex JSON data with Hive

http://blog.cloudera.com/blog/2012/09/analyzing-twitter-data-with-hadoop/

https://github.com/rcongiu/Hive-JSON-Serde

16. Hive SQL optimization, summarized by a colleague

http://hbase.iteye.com/blog/1488745

http://superlxw1234.iteye.com/blog/1564456

http://slaytanic.blog.51cto.com/2057708/1295222 

17. Querying the Hive data warehouse from Python through the Thrift interface

http://slaytanic.blog.51cto.com/2057708/734106

18. Querying the Hive data warehouse from PHP through the Thrift interface (with an introduction to phpHiveAdmin)

http://slaytanic.blog.51cto.com/2057708/766230

http://slaytanic.blog.51cto.com/2057708/818721

http://slaytanic.blog.51cto.com/2057708/1071263

https://cwiki.apache.org/Hive/hiveclient.html

http://csgrad.blogspot.com/2010/04/to-use-language-other-than-java-say.html

19. Notes on using Hive SQL and loading data

http://slaytanic.blog.51cto.com/2057708/782175

20. Hive optimization: controlling the number of map and reduce tasks in a Hive job

http://superlxw1234.iteye.com/blog/1582880

21. Some practical Hive tips

http://superlxw1234.iteye.com/blog/1565774

22. Data warehouse data models: extreme storage with history zipper tables

http://superlxw1234.iteye.com/blog/1567320

23. Reading notes on Programming Hive

http://www.gemini5201314.net/hadoop/programing-hive%E8%AF%BB%E4%B9%A6%E7%AC%94%E8%AE%B0.html

24. Overview of data development technologies (Etao data department)

http://blog.linezing.com/wp-content/uploads/2012/12/%E6%95%B0%E6%8D%AE%E5%BC%80%E5%8F%91%E6%8A%80%E6%9C%AF-%E5%86%B7%E5%B7%9D.pdf

25. Hive r0.9.0 documentation in Chinese (2): join queries

http://myeyeofjava.iteye.com/blog/1703815

26. A Hadoop-based internal massive data service platform (Taobao)

http://www.infoq.com/cn/presentations/hadoop-internal-data-service-platform

27. Hive configuration parameters explained

http://blog.csdn.net/chaoping315/article/details/8500407

http://www.blogjava.net/changedi/archive/2013/08/13/402741.html

http://www.blogjava.net/changedi/archive/2013/08/15/402857.html

28. Hive tuning (Hortonworks)

http://www.slideshare.net/adammuise/2013-jul-23thughivetuningdeepdive

29. Hive: Sort Merge Bucket Map Join (bucket join)

http://superlxw1234.iteye.com/blog/1545150

30. In-depth study of Programming Hive: Tuning

http://flyingdutchman.iteye.com/blog/1871983

31. Using SemanticAnalyzerHook to filter out Hive queries that lack a partition condition

http://blog.csdn.net/lalaguozhe/article/details/11988047


(3) Pig

1. Pig in practice

http://www.cnblogs.com/xuqiang/archive/2011/06/06/2073601.html

2. Official Pig tutorial

http://pig.apache.org/

3. Collection of Apache Pig tutorials in Chinese

http://www.codelast.com/?p=4550

4. Programming Pig

http://ofps.oreilly.com/titles/9781449302641/index.html

http://www.google.com.hk/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CCcQFjAA&url=http%3A%2F%2Fbigdata.googlecode.com%2Ffiles%2FOreilly.Programming.Pig.Sep.2011.pdf&ei=DLGDUNbcI4aTiQfus4HADQ&usg=AFQjCNGzTHIYcc2GuU6ko0TgIKm3UN9T5Q&sig2=2DZtn3yP4KVqro7xt_qAOA

5. PigFly: design of a unified Hadoop data analysis platform (Taobao)

http://www.docin.com/p-344188827.html

http://coderplay.iteye.com/blog/1233865

6. Processing a million songs with Apache Pig (Cloudera)

http://blog.cloudera.com/blog/2012/08/process-a-million-songs-with-apache-pig/

7. Pig Latin: A Not-So-Foreign Language for Data Processing (Stanford University paper)

http://infolab.stanford.edu/~usriv/papers/pig-latin.pdf

8. Lecture 09: Parallel Databases, Big Data, Map/Reduce, Pig-Latin

http://www.cs.washington.edu/education/courses/csep544/11au/lectures/lecture09-parallel-db.pdf

9. Pig Queries Parsing JSON on Amazons Elastic Map Reduce Using S3 Data

http://eric.lubow.org/2011/hadoop/pig-queries-parsing-json-on-amazons-elastic-map-reduce-using-s3-data/

https://github.com/a-b/elephant-bird/tree/master/javadoc

10. Pig cookbook: performance tuning

http://pig.apache.org/docs/r0.7.0/cookbook.html

http://pig.apache.org/docs/r0.10.0/perf.html#Replicated-Joins

11. Using Pig streaming:

http://wiki.apache.org/pig/PigStreamingFunctionalSpec

http://www.slideshare.net/charmalloc/hadoop-streaming-tutorial-with-python

12. Analyzing Big Data with Twitter

UC Berkeley Course Lectures: Analyzing Big Data With Twitter

http://blogs.ischool.berkeley.edu/i290-abdt-s12/   watch online (bring your own proxy or VPN)

http://www.kuaipan.cn/file/id_102542674904481817.htm  download from Kingsoft Kuaipan

13. Apache Pig performance optimization

http://hbtc2012.hadooper.cn/subject/track1daijianyong3.pdf

http://www.cnblogs.com/kemaswill/p/3226754.html

14. Advanced Pig syntax on Hadoop

http://www.cnblogs.com/siwei1988/archive/2012/08/06/2624912.html

15. Embedding Pig in Java: Embedding Pig In Java Programs

http://wiki.apache.org/pig/EmbeddedPig

16. Highlights of user questions from the Pig mailing list

http://hakunamapdata.com/football-zero-apache-pig-hero-the-essence-from-hundreds-of-posts-from-apache-pig-user-mailing-list/



(4) Hadoop Internals and Programming

1. A few small details when using Hadoop

http://blog.csdn.net/needle2/article/details/6182515

2. Understanding the processes and concepts of MapReduce in Hadoop (browse the blog's table of contents for more)

http://hi.baidu.com/shirdrn/item/085a5518be8bfa797b5f25aa

4. IBM developerWorks: Distributed parallel programming with Hadoop, Parts 1 to 3

http://www.ibm.com/developerworks/cn/opensource/os-cn-hadoop1/

http://www.ibm.com/developerworks/cn/opensource/os-cn-hadoop2/index.html

https://www.ibm.com/developerworks/cn/opensource/os-cn-hadoop3/

5. Introduction to Hadoop, an open-source distributed computing framework

http://www.infoq.com/cn/articles/hadoop-intro

6. Hadoop basic workflow and application development (Java)

http://www.infoq.com/cn/articles/hadoop-process-develop 

7. Hadoop source code analysis

http://caibinbupt.iteye.com/?page=2

8. Analysis of Hadoop data flow and job submission

http://www.cnblogs.com/spork/archive/2010/01/11/1644346.html

9. Ten best practices for Hadoop administrators

http://www.infoq.com/cn/articles/hadoop-ten-best-practice

10. Hadoop and Hive source code analysis and usage notes

http://www.oratea.net/?cat=7#

11. Using and configuring the Hadoop Capacity Scheduler (as opposed to the default FIFO queue scheduler)

http://www.cnblogs.com/ggjucheng/archive/2012/07/25/2608817.html

12. A brief analysis of scheduling strategies in Hadoop

http://www.ibm.com/developerworks/cn/opensource/os-hadoop-scheduling/index.html

http://dongxicheng.org/mapreduce/hadoop-schedulers/

Analysis of the Fair Scheduler algorithm in Hadoop 0.20.2

http://dongxicheng.org/mapreduce/hadoop-fair-scheduler/

Analysis of the Hadoop Capacity Scheduler algorithm

http://dongxicheng.org/mapreduce/hadoop-capacity-scheduler/

Notes on configuring and using the Hadoop Capacity Scheduler

http://www.cnblogs.com/panfeng412/archive/2013/03/22/hadoop-capacity-scheduler-configuration.html

Hadoop mapred-queue-acls: multi-queue scheduling configuration

http://yaoyinjie.blog.51cto.com/3189782/872294

Introduction to resource-aware schedulers in Hadoop

http://my.oschina.net/leejun2005/blog/96113

13. Hadoop job tuning parameters and the principles behind them

http://blog.sina.com.cn/s/blog_ae33b83901015cm9.html

14. A fairly complete Hadoop source code analysis

http://hbase.iteye.com/blog/1024737

15. How to write MapReduce programs on Hadoop

http://dongxicheng.org/mapreduce/writing-hadoop-programes/

16. Hadoop study notes (2): data flow from map to reduce

http://www.cnblogs.com/beanmoon/archive/2012/12/08/2805636.html

17. Managing jobs through the Hadoop API

http://blog.csdn.net/dajuezhao/article/details/6591058

18. Demystifying InputFormat: a powerful tool for controlling MapReduce task execution

http://www.infoq.com/cn/articles/HadoopInputFormat-map-reduce

19. Hadoop MapReduce development best practices (Part 1)

http://www.infoq.com/cn/articles/MapReduce-Best-Practice-1

20. Hadoop example: second-degree connections and friend recommendation

http://my.oschina.net/u/176897/blog/99761

21. Exploring big data analytics and Hadoop

http://www.ibm.com/developerworks/cn/training/kp/os-kp-hadoop/index.html

22. The problem of processing large numbers of small files in Hadoop, and solutions

http://www.csdn.net/article/2010-11-22/282301?1290758216

23. Introduction to next-generation Hadoop YARN: advantages of YARN over MRv1

http://my.oschina.net/leejun2005/blog/97802

24. Summary of HDFS fundamentals

http://www.cnblogs.com/beanmoon/archive/2012/11/23/2783966.html

http://www.cnblogs.com/beanmoon/archive/2012/12/11/2809315.html

25. Storing and retrieving massive numbers of small files: Facebook's photo storage architecture

http://www.importnew.com/3292.html

26. Hadoop: the MapReduce process

http://blog.sina.com.cn/s/blog_61ef49250100uul8.html

27. MapReduce: the shuffle process in detail

http://my.oschina.net/leejun2005/blog/73708     Shuffle process analysis and performance optimization

http://474731198.iteye.com/blog/1635043

http://my.oschina.net/leejun2005/blog/85974

http://samuschen.iteye.com/blog/859975 shuffle and sort

http://www.blogjava.net/shenh062326/archive/2011/01/14/342959.html   part of the execution flow

http://wikidoop.com/wiki/Hadoop/MapReduce/Reducer     Hadoop/MapReduce/Reducer wiki

28. Hadoop MapReduce job performance tuning: changing the number of map and reduce tasks

http://irwenqiang.iteye.com/blog/1535809

http://samuschen.iteye.com/blog/859971

How many reduce tasks are appropriate when Hive runs a job

http://jiedushi.blog.51cto.com/673653/602458

29. Research on and optimization of the reliability of the Hadoop Distributed File System (HDFS) (master's thesis)

http://www.docin.com/p-523453291.html

30. Apache Avro compared with Thrift

http://www.tbdata.org/archives/1307

31. Hadoop Job Tuning

http://www.searchtb.com/2010/12/hadoop-job-tuning.html

32. Secondary sort in MapReduce (SecondarySort)

http://www.cnblogs.com/xuxm2007/archive/2011/09/03/2165805.html

33. Hadoop study summary: the Map-Reduce process explained

http://blog.csdn.net/keda8997110/article/details/8474326

34. Overview of Hadoop platform optimization (1)

http://dongxicheng.org/mapreduce/hadoop-optimization-0/

      Overview of Hadoop platform optimization (2)

http://dongxicheng.org/mapreduce/hadoop-optimization-1/

35. Notes on upgrading Hadoop from 0.20.2 to 1.0.3

http://blog.pureisle.net/archives/1845.html

36. MapReduce: introduction to the user programming interface

http://www.importnew.com/4259.html

Hadoop beginner tutorial (4): submitting and monitoring MR jobs, controlling input and output, and using MR features

http://www.importnew.com/4736.html

37. Quick Introduction To Apache Hadoop MapReduce Java API

http://www.slideshare.net/AdamKawa/apache-hadoop-java-api

38. Optimizing small and medium-sized Hadoop clusters

http://blog.csdn.net/azhao_dn/article/details/6955671

http://blog.csdn.net/cloudeagle_bupt/article/details/8983435

39. Introduction to the key internal data structures of the NameNode

http://blogread.cn/it/article/2746?f=wb

40. MapReduce/Hadoop in testing at Taobao

      Building a load-testing tool with MapReduce
http://www.taobaotest.com/blogs/2515
      A brief analysis of HDFS performance load-testing tools
http://www.taobaotest.com/blogs/2517
      Monitoring cloud computing with cloud storage
http://www.taobaotest.com/blogs/2519




(5) Data Warehousing and Data Mining

1. Basic data warehouse training

http://wenku.baidu.com/view/c788400cba1aa8114431d95b.html

http://wenku.baidu.com/view/412b09e96294dd88d0d26bff.html

Data warehouse layering specification

http://wenku.baidu.com/view/5809061da300a6c30c229f67.html

2. Data warehouse ODS basics

http://wenku.baidu.com/view/bb3e6263caaedd3383c4d3bf.html

3. HBDW-PM: data warehouse basics

http://wenku.baidu.com/view/e25bd14769eae009581bec5d.html

4. Mahout in Action

http://net.pku.edu.cn/~course/cs402/2012/book/%5BMahout.in.Action(2011)%5D.Sean.Owen.pdf

5. Data warehousing: a casual discussion of ETL

http://superlxw1234.iteye.com/blog/1666960

6. The difference between data analysis and data mining

http://superlxw1234.iteye.com/blog/1708718


(6) Oozie Workflow

1. Introduction to Oozie

http://www.infoq.com/cn/articles/introductionOozie 

2. Learning Oozie by example

http://www.infoq.com/cn/articles/oozieexample

3. Extending Oozie

http://www.infoq.com/cn/articles/ExtendingOozie

4. Oozie installation, configuration, and troubleshooting examples

http://guoyunsky.iteye.com/category/187923

5. Oozie summary

http://dirlt.com/oozie.html


(7) HBase

1. The official HBase guide and performance tuning

http://hbase.apache.org/book/performance.html

http://blog.linezing.com/2012/03/hbase-performance-optimization  Summary of HBase performance optimization methods

http://database.51cto.com/art/201301/376723.htm   Four key points of HBase performance optimization

http://kenwublog.com/hbase-performance-tuning    Tuning HBase performance parameters

2. Introduction to HBase technology

http://www.searchtb.com/2011/01/understanding-hbase.html

3. Getting started with HBase, part 2: examples of using HBase from Java

http://www.javabloger.com/article/apache-hbase-shell-and-java-api-html.html

4. HBase basic concepts and common HBase shell commands

http://www.cnblogs.com/flying5/archive/2011/09/15/2178064.html

5. Introduction to HBase

http://blog.csdn.net/leeqing2011/article/details/7608261

6. Official HBase documentation (Chinese edition)

http://www.yankay.com/wp-content/hbase/book.html  (0.90)

http://abloz.com/hbase/book.html                            (0.95)

8. HBase system architecture and data structures

http://blog.csdn.net/a221133/article/details/6894717

9. [Translation] HBase storage architecture

http://www.spnguru.com/2010/07/%E7%BF%BB%E8%AF%91-hbase%E5%AD%98%E5%82%A8%E6%9E%B6%E6%9E%84/

10. Overview of HBase storage file formats

http://forchenyun.iteye.com/blog/828549

11. Introduction to HBase, Hive and Pig (Kent State University)

http://www.cs.kent.edu/~jin/Cloud12Spring/HbaseHivePig.pptx

12. Example of calling HBase from Python

http://hbase.iteye.com/blog/1178063

13. Summary of HBase applications and optimization at Taobao

http://walkoven.com/hbase%20optimization%20and%20apply%20summary%20in%20taobao.pdf

14. HBase pseudo-distributed installation guide:

http://my.oschina.net/leejun2005/blog/91952

15. Bucket Cache: a solution for CMS, GC fragmentation, and large caches in HBase

http://zjushch.iteye.com/blog/1751387   

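Note: the author is from Alibaba; read performance reportedly improves by an order of magnitude, and the patch has been accepted by the HBase community.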

16. Some HBase tips

http://www.blogjava.net/changedi/archive/2012/12/28/393577.html

http://www.blogjava.net/changedi/archive/2013/01/02/393697.html  application design tips

17. Some HBase issues summarized by the Alibaba test team:

(1) Notes on analyzing HBase production issues http://www.taobaotest.com/blogs/2158

(2) How many HBase bugs do you know? http://www.taobaotest.com/blogs/2156

(3) A few easy mistakes to make when using HBase http://www.taobaotest.com/blogs/2312

18. Setting up highly available multiple master nodes for HBase

http://www.importnew.com/3020.html

19. HBase secondary indexes and joins

http://rdc.taobao.com/team/jm/archives/951

20. Summary of HBase secondary index approaches

http://blog.sina.com.cn/s/blog_4a1f59bf01018apd.html

21. HBase storage architecture (compiled notes)

http://asyty.iteye.com/blog/1250301

22. Introduction to the HBase framework (compiled notes)

http://asyty.iteye.com/blog/1250273

23. Advanced HBase column family configuration

http://blog.sina.com.cn/s/blog_ae33b83901018euz.html

24. HBase Administration, Performance Tuning

http://www.packtpub.com/article/hbase-basic-performance-tuning

25. HBase business design practice at Alibaba

http://club.alibabatech.org/resource_detail.htm?topicId=89

26. HBase in business practice (Taobao)

http://rdc.taobao.org/?p=457

27. HBase Architecture (translation)

http://duanple.blog.163.com/blog/static/70971767201191661620641/    Part 1

http://duanple.blog.163.com/blog/static/709717672011923111743139/  Part 2

http://duanple.blog.163.com/blog/static/709717672011925102028874/  Part 3

28. In-depth analysis of HBase performance

http://www.programmer.com.cn/7246/



(8) Flume

1. Flume log collection: principles and practice

http://my.oschina.net/longniao/blog/93662

Configuring Flume in truly distributed mode

http://hi.baidu.com/izouying/item/6e7f87248df30a0b76272c24

Flume: installation and configuration

http://blog.chinaunix.net/uid-26711636-id-3155236.html

http://log.medcl.net/item/2012/03/flume-build-process/

http://f.dataguru.cn/thread-48324-1-1.html

Overall plan for building a Flume cluster

http://wenku.baidu.com/view/5f457188a0116c175f0e48a0.html

2. Official documentation:

http://flume.apache.org/FlumeUserGuide.html

3. Flume NG configuration

http://marsorp.iteye.com/blog/1561286

http://blog.csdn.net/hijk139/article/details/8308224

http://heipark.iteye.com/blog/1617995

4. Flume concepts

http://www.verydemo.com/demo_c89_i41415.html

5. How to derive the HDFS file name from the source file name in flume-ng

http://abloz.com/2013/02/19/flume-ng-output-according-to-the-source-file-name-to-the-hdfs-file-name.html

6. ETL tasks on Hadoop: using and optimizing Flume (iPinYou)

http://wenku.baidu.com/view/ab3dfe26dd36a32d7375818c.html



(9) Sqoop

1. Introduction to installing, configuring, and using Sqoop

http://blog.csdn.net/leeqing2011/article/details/7630690?utm_source=weibolife

2. Sqoop examples

http://baiyunl.iteye.com/blog/964254

3. Transferring data between HDFS and an RDBMS with Sqoop

http://www.linuxidc.com/Linux/2011-10/45080.htm

4. Sqoop User Guide (v1.4.2)

http://sqoop.apache.org/docs/1.4.2/SqoopUserGuide.html?utm_source=weibolife#_introduction

5. Exchanging data between MySQL and HDFS with Sqoop

http://abloz.com/2012/07/19/data-between-the-mysql-and-hdfs-system-of-mutual-conductance-using-sqoop.html

6. MySQL <-> Sqoop <-> HDFS data exchange experiment

http://leonarding.blog.51cto.com/6045525/1092764

7. Reading data from MySQL directly in MapReduce

http://superlxw1234.iteye.com/blog/1880712


(10) ZooKeeper

1. ZooKeeper Administrator's Guide

http://zookeeper.apache.org/doc/r3.4.3/zookeeperAdmin.html

2. Quick ZooKeeper setup

http://nileader.blog.51cto.com/1381108/795230

3. ZooKeeper administrator's guide: deploying and managing ZooKeeper

http://blogread.cn/it/article/5917?f=sinat

4. How ZooKeeper works

http://blogread.cn/it/article/4603?f=sa

5. ZooKeeper, a distributed service framework: managing data in a distributed environment

http://www.ibm.com/developerworks/cn/opensource/os-cn-zookeeper/


(11) NoSQL

1. Redis resource roundup

http://blog.nosqlfan.com/html/3537.html

2. MongoDB resource roundup

http://blog.nosqlfan.com/html/3548.html

3. Notes on NoSQL databases

http://sebug.net/paper/databases/nosql/Nosql.html

4. Getting started with Redis (series)

http://www.cnblogs.com/xhan/archive/2011/02/08/1949867.html

5. Lessons from using Redis

http://www.programmer.com.cn/14577/

6. Three challengers take on SQL: analyzing NoSQL reliability and scaling operations

http://www.csdn.net/article/2013-01-07/2813498-availability-and-operational

7. Distributed caching: Memcached

http://blog.sina.com.cn/s/blog_493a845501013ei0.html

8. Redis design and implementation

http://www.redisbook.com/en/latest/

9. SQL to MongoDB Mapping Chart

http://docs.mongodb.org/manual/reference/sql-comparison/

10. Redis basics

https://github.com/springside/springside4/wiki/redis

11. NoSQL anti-patterns: document databases

http://www.yankay.com/nosql-anti-pattern-document/

12. Shifting your thinking from SQL to NoSQL

http://blogread.cn/it/article/3130?f=wb




(12) Hadoop Monitoring and Management

1. Three powerful tools for managing cloud computing platforms: Nagios, Ganglia, and Splunk

http://www.programmer.com.cn/11477/

2. A different kind of HBase monitoring system

http://walkoven.com/?p=140

3. JMX monitoring of Hadoop and HBase clusters

http://slaytanic.blog.51cto.com/2057708/1179108

4. Hadoop patching and upgrading

http://blog.csdn.net/cloudeagle_bupt/article/details/8621078   applying a patch to Hadoop 0.20.2

http://hi.baidu.com/hovlj_1130/item/c1ed42cc0dbbeb0dac092f5b   upgrading Hadoop

5. Analyzing Data with Hue and Hive

http://blog.cloudera.com/blog/2013/04/demo-analyzing-data-with-hue-and-hive/

6. Using Hue to Access Hive Data Through Pig

http://blog.cloudera.com/blog/2013/08/demo-using-hue-to-access-hive-data-through-pig/



(13) Storm

1. Introduction to Storm and a single-machine installation guide

http://my.oschina.net/leejun2005/blog/147607

2. Storm getting-started tutorial

http://blog.linezing.com/category/storm-quick-start

3. Notes on using Storm

http://www.cnblogs.com/panfeng412/tag/Storm/


(14) YARN & Hadoop 2.0

1. Comparing resource management in Hadoop 1.0 and Hadoop 2.0

http://dongxicheng.org/mapreduce-nextgen/hadoop-1-and-2-resource-manage/


