Downloading and Installing Mahout
Go to the $MAHOUT_HOME/bin/ directory and run:
chmod a+x mahout
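The same step can be wrapped in a small guarded helper so it fails loudly when the path is wrong. This is only a sketch: `make_mahout_executable` is a hypothetical name, not something shipped with Mahout, and its argument is assumed to be the installation directory that the text calls $MAHOUT_HOME.

```shell
#!/bin/sh
# Sketch only: make_mahout_executable is a hypothetical helper, not part of Mahout.
# $1 is assumed to be the Mahout installation directory ($MAHOUT_HOME in the text).
make_mahout_executable() {
  script="$1/bin/mahout"
  if [ ! -f "$script" ]; then
    # Report the missing launcher instead of letting chmod fail obscurely.
    echo "launcher not found: $script" >&2
    return 1
  fi
  # Same permission change as the manual step: add execute for user, group and others.
  chmod a+x "$script"
}
```

It would be called as `make_mahout_executable "$MAHOUT_HOME"`, and it returns a non-zero status if bin/mahout does not exist under that directory.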
Setting Up a Mahout Rebuild Environment in Eclipse
The only problem this causes is that if you have another project outside Mahout and you make it depend on one of these projects, that project will not get built. This is all I've come across. So: just ignore it, or report it to m2e.
<pluginManagement>
  <plugins>
    <!-- This plugin's configuration is used to store Eclipse m2e settings only. It has no influence on the Maven build itself. -->
    <plugin>
      <groupId>org.eclipse.m2e</groupId>
      <artifactId>lifecycle-mapping</artifactId>
      <version>1.0.0</version>
      <configuration>
        <lifecycleMappingMetadata>
          <pluginExecutions>
            <pluginExecution>
              <pluginExecutionFilter>
                <groupId>org.apache.maven.plugins</groupId>
                <artifactId>maven-antrun-plugin</artifactId>
                <versionRange>[1.6,)</versionRange>
                <goals>
                  <goal>run</goal>
                </goals>
              </pluginExecutionFilter>
              <action>
                <ignore />
              </action>
            </pluginExecution>
          </pluginExecutions>
        </lifecycleMappingMetadata>
      </configuration>
    </plugin>
  </plugins>
</pluginManagement>
The fix is to add the following just before the </build> tag:
<pluginManagement>
  <plugins>
    <!-- Ignore/Execute plugin execution -->
    <plugin>
      <groupId>org.eclipse.m2e</groupId>
      <artifactId>lifecycle-mapping</artifactId>
      <version>1.0.0</version>
      <configuration>
        <lifecycleMappingMetadata>
          <pluginExecutions>
            <!-- copy-dependency plugin -->
            <pluginExecution>
              <pluginExecutionFilter>
                <groupId>org.apache.maven.plugins</groupId>
                <artifactId>maven-dependency-plugin</artifactId>
                <versionRange>[1.0.0,)</versionRange>
                <goals>
                  <goal>copy-dependencies</goal>
                </goals>
              </pluginExecutionFilter>
              <action>
                <ignore />
              </action>
            </pluginExecution>
          </pluginExecutions>
        </lifecycleMappingMetadata>
      </configuration>
    </plugin>
  </plugins>
</pluginManagement>
Rebuilding Mahout
The Mahout Entry Point
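MAHOUT_HEAPSIZE: the maximum Java heap size, in MB.
HADOOP_CONF_DIR: the Hadoop configuration directory, mainly where the *-site.xml files are located.
MAHOUT_OPTS: extra Java options passed to the JVM when Mahout runs.
MAHOUT_CONF_DIR: the directory containing the per-class properties files, each named after the short name of its class; a default location is used when it is not set.
MAHOUT_LOCAL: whether to run locally. If this variable is set, Hadoop is not used at all, and the HADOOP_CONF_DIR and HADOOP_HOME settings are ignored.
MAHOUT_CORE: whether to use the core from the development environment instead of the released core.
The -core option: specifies the path of the referenced packages.
$CLASS is org.apache.mahout.driver.MahoutDriver.
$@ is the full list of arguments passed to the script.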
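To make the interaction of these variables concrete, here is a simplified sketch of the launch decision described above. It is not the real bin/mahout script: `heap_flag`, `run_mode` and `build_command` are hypothetical helper names, `mahout-job.jar` is a placeholder for the job jar, and the sketch prints the command instead of executing it. The real script also handles classpath construction, MAHOUT_CONF_DIR and MAHOUT_CORE, which are omitted here.

```shell
#!/bin/sh
# Simplified sketch of the launch decision; NOT the real bin/mahout.
# heap_flag, run_mode and build_command are hypothetical helper names.

CLASS=org.apache.mahout.driver.MahoutDriver

# Turn MAHOUT_HEAPSIZE (in MB) into a JVM flag; print nothing when it is unset.
heap_flag() {
  if [ -n "${MAHOUT_HEAPSIZE:-}" ]; then
    echo "-Xmx${MAHOUT_HEAPSIZE}m"
  fi
}

# Any non-empty MAHOUT_LOCAL selects local mode, whatever the Hadoop variables say.
run_mode() {
  if [ -n "${MAHOUT_LOCAL:-}" ]; then
    echo "local"
  else
    echo "hadoop"
  fi
}

# Print (rather than execute) the command that would be launched with "$@".
build_command() {
  if [ "$(run_mode)" = "local" ]; then
    # Unquoted expansions are intentional: empty values simply disappear.
    echo java $(heap_flag) ${MAHOUT_OPTS:-} "$CLASS" "$@"
  else
    # mahout-job.jar is a placeholder name for the Mahout job jar.
    echo hadoop jar mahout-job.jar "$CLASS" "$@"
  fi
}
```

With MAHOUT_LOCAL set to any non-empty value, `build_command kmeans` prints a plain `java` command line that ends with the driver class followed by the arguments; with it unset, the printed command goes through `hadoop jar` instead.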