書中-就職演說語料庫這個小程序統計詞彙america和citizen隨時間推移的使用情況這個小程序有點bug,我電腦python版本(Python 3.6.1)
原書中代碼是
>>> cfd=nltk.ConditionalFreqDist(
... (target,file[:4])
... for fileid in inaugural.fileids()
... for w in inaugural.words(fileid)
... for target in ['america','citizen']
... if w.lower().startswith(target))
>>>
>>> cfd.plot()
這裏改成下面的就可以運行了file[:4] -> fileid[:4]:
>>> cfd=nltk.ConditionalFreqDist(
... (target,fileid[:4])
... for fileid in inaugural.fileids()
... for w in inaugural.words(fileid)
... for target in ['america','citizen']
... if w.lower().startswith(target))
>>>
>>> cfd.plot()
運行後的結果: