Chinese Word Segmentation in .NET
This uses:
Lucene.Net.dll http://www.apache.org/dist/incubator/lucene.net/binaries/2.9.4g-incubating/
PanGu.dll http://pangusegment.codeplex.com/releases/view/50811
PanGu.Lucene.Analyzer.dll
and the dictionary files http://pangusegment.codeplex.com/releases/view/31531
Sample code:
using System;
using System.IO;
using Lucene.Net.Analysis;
using Lucene.Net.Analysis.PanGu;

namespace FcCApp {
    class Program {
        static void Main(string[] args) {
            // Input sentence to segment (Chinese text with an embedded Latin word).
            string text = "基於java語言開發的輕量級的中文分詞工具包";
            Analyzer analyzer = new PanGuAnalyzer(); // use the PanGu segmenter
            StringReader reader = new StringReader(text);
            TokenStream ts = analyzer.ReusableTokenStream("", reader);
            Token t = null;
            // Print each term followed by a "|" separator.
            while ((t = ts.Next()) != null) {
                Console.Write(t.TermText() + "|");
            }
        }
    }
}
Result:
基於|java|語言|開發|的|輕量級|的|中文|分詞|工具包|
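The sample writes a "|" after every term, so the result line ends with a trailing separator. A minimal sketch of one way to avoid that is to collect the terms into a list first and join them afterwards. It assumes the same Lucene.Net 2.9.x calls used in the sample (ReusableTokenStream, Next, TermText); SegmentHelper and JoinDemo are hypothetical names introduced here for illustration and are not part of PanGu or Lucene.Net.

```csharp
using System;
using System.Collections.Generic;
using System.IO;
using Lucene.Net.Analysis;
using Lucene.Net.Analysis.PanGu;

namespace FcCApp {
    // Hypothetical helper, not part of PanGu or Lucene.Net.
    static class SegmentHelper {
        // Runs the analyzer over the text and returns the terms in order.
        public static List<string> Segment(Analyzer analyzer, string text) {
            List<string> terms = new List<string>();
            TokenStream ts = analyzer.ReusableTokenStream("", new StringReader(text));
            Token t;
            // Same token loop as the sample, but collecting instead of printing.
            while ((t = ts.Next()) != null) {
                terms.Add(t.TermText());
            }
            return terms;
        }
    }

    class JoinDemo {
        static void Main(string[] args) {
            List<string> terms = SegmentHelper.Segment(
                new PanGuAnalyzer(), "基於java語言開發的輕量級的中文分詞工具包");
            // Joining places "|" only between terms, not after the last one.
            Console.WriteLine(string.Join("|", terms.ToArray()));
        }
    }
}
```

Returning a list also lets the caller count or filter the terms before printing them, which the inline Console.Write loop does not allow.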
Sample download:
http://download.csdn.net/detail/lijun7788/4412762