關於google的word2vec項目,這裏就不用介紹了,接下來需要很乾貨的分享給大夥,在CentOS下安裝word2vec的linux教程。關於如何在各個linux操作系統上安裝word2vec,google的word2vec項目首頁並未過多講,或是詳細的描述安裝的linux教程。
word2vec項目的官網上,只有這麼幾段簡短的關於使用的描述和linux教程:
Quick start
Download the code: svn checkout http://word2vec.googlecode.com/svn/trunk/
Run ‘make’ to compile word2vec tool
Run the demo scripts: ./demo-word.sh and ./demo-phrases.sh
For questions about the toolkit, see http://groups.google.com/group/word2vec-toolkit
詳細的安裝word2vec項目的linux如下:
當然,如上所說,該操作是在CentOS系統下的。
1、確保gcc爲最新版
2、安裝subversion,用來獲取word2vec源碼
3、指定一個任意目錄爲word2vec工作目錄,通過svn指令獲取word2vec源碼
svn checkout http: //word2vec .googlecode.com /svn/trunk/ |
4、使用make指令進行安裝
可以發現在安裝的時候,會出現如下error:
gcc word2vec.c -o word2vec -lm -pthread -0fast -march=native -Wall -funroll-loops -Wno-unused-result
cc1: error: invalid option argument ?0fast?
cc1: warning: unrecognized command line option “-Wno-unused-result”
make: *** [word2vec] Error 1
這是因爲該系統的gcc不支持0fast選項的原因。
5、編輯makefile文件,更改gcc安裝選項
vi ./makefile
word2vec的makefile源碼如下:
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
|
CC = gcc #The -Ofast might not work with older versions of gcc; in that case , use -O2 CFLAGS = -lm -pthread -Ofast -march=native -Wall -funroll-loops -Wno-unused-result all: word2vec word2phrase distance word-analogy compute-accuracy word2vec : word2vec.c $(CC) word2vec.c -o word2vec $(CFLAGS) word2phrase : word2phrase.c $(CC) word2phrase.c -o word2phrase $(CFLAGS) distance : distance.c $(CC) distance.c -o distance $(CFLAGS) word-analogy : word-analogy.c $(CC) word-analogy.c -o word-analogy $(CFLAGS) compute-accuracy : compute-accuracy.c $(CC) compute-accuracy.c -o compute-accuracy $(CFLAGS) chmod +x *.sh clean: rm -rf word2vec word2phrase distance word-analogy compute-accuracy |
注意觀察,第二行:
#The -0fast might not work with older versions of gcc; in that case, use -02(是0)
google的工程師這裏有寫到相關注意事項,告訴開發者,-0fast安裝選項,可能對於舊版本的gcc不支持,如果在這種情況下,需要將-0fast改爲-02
所以將第三行
CFLAGS = -lm -pthread -0fast -march=native -Wall -funroll-loops -Wno-unused-result
更改爲
CFLAGS = -lm -pthread -02-march=native -Wall -funroll-loops -Wno-unused-result
6、使用make指令安裝通過
安裝通過後的內容爲:
gcc word2vec.c -o word2vec -lm -pthread -02 -march=native -Wall -funroll-loops -Wno-unused-result
gcc word2phrase.c -o word2phrase -lm -pthread -02 -march=native -Wall -funroll-loops -Wno-unused-result
gcc distance.c -o distance -lm -pthread -O2 -march=native -Wall -funroll-loops -Wno-unused-result
gcc word-analogy.c -o word-analogy -lm -pthread -02 -march=native -Wall -funroll-loops -Wno-unused-result
gcc compute-accuracy.c -o compute-accuracy -lm -pthread -02 -march=native -Wall -funroll-loops -Wno-unused-result
chmod +x *.sh
關於CentOS下安裝word2vec的linux教程介紹到此,應該沒有問題,親自嘗試後,通過。