[Python][Windows]mecabで形態素解析

Androidで形態素解析する方法を以前書きました。
今回は、Windows/Pythonで形態素解析を行う方法です。

Pythonで形態素解析はMeCabを使って行えます。

Python用ライブラリのインストール

コマンドプロンプトを開き、以下を入力して" mecab-python3"をインストールします。
pip install mecab-python3

続いて辞書ライブラリをインストールします。

pip install unidic

辞書ライブラリに辞書自体が含まれないので以下のようにしてインストールします。

python -m unidic download

辞書をインストールしないとMeCab実行時に以下のようなエラーが出るので注意してください。

------------------- ERROR DETAILS ------------------------
arguments:
[ifs] no such file or directory: I:\python\Lib\site-packages\unidic\dicdir\mecabrc
----------------------------------------------------------

Pythonでの形態素解析の実行

MeCabとライブラリ、辞書のインストールが終わったら以下のコードで動作確認をしましょう。

import MeCab

tagger = MeCab.Tagger() 
result = tagger.parse("MeCabとライブラリ、辞書のインストールが終わったら以下のコードで動作確認をしましょう。")

print(result)
以下の結果となります。
MeCab	名詞,普通名詞,一般,,,
と	助詞,格助詞,,,,,ト,と,と,ト,と,ト,和,"","","","","","",格助,ト,ト,ト,ト,"","名詞%F1,動詞%F1,形容詞%F2@-1","",7099014038299136,25826
ライブラリ	名詞,普通名詞,一般,,,,ライブラリー,ライブラリー-library,ライブラリ,ライブラリ,ライブラリ,ライブラリ,外,"","","","","","",体,ライブラリ,ライブラリ,ライブラリ,ライブラリ,"1","C1","",10895902566916608,39639
、	補助記号,読点,,,,,,、,、,,、,,記号,"","","","","","",補助,,,,,"","","",6605693395456,24
辞書	名詞,普通名詞,一般,,,,ジショ,辞書,辞書,ジショ,辞書,ジショ,漢,"","","","","","",体,ジショ,ジショ,ジショ,ジショ,"1","C3","",4907129018393088,17852
の	助詞,格助詞,,,,,ノ,の,の,ノ,の,ノ,和,"","","","","","",格助,ノ,ノ,ノ,ノ,"","名詞%F1","",7968444268028416,28989
インストール	名詞,普通名詞,サ変可能,,,,インストール,インストール-install,インストール,インストール,インストール,インストール,外,"","","","","","",体,インストール,インストール,インストール,インストール,"4","C1","",758121890980352,2758
が	助詞,格助詞,,,,,ガ,が,が,ガ,が,ガ,和,"","","","","","",格助,ガ,ガ,ガ,ガ,"","動詞%F2@0,名詞%F1","",2168520431510016,7889
終わっ	動詞,非自立可能,,,五段-ラ行,連用形-促音便,オワル,終わる,終わっ,オワッ,終わる,オワル,和,"","","","","","",用,オワッ,オワル,オワッ,オワル,"0","C2","",1483524754514564,5397
たら	助動詞,,,,助動詞-タ,仮定形-一般,タ,た,たら,タラ,た,タ,和,"","","","","","",助動,タラ,タ,タラ,タ,"","動詞%F2@1,形容詞%F4@-2","M2@1",5948916285711073,21642
以下	名詞,普通名詞,一般,,,,イカ,以下,以下,イカ,以下,イカ,漢,"","","","","","",体,イカ,イカ,イカ,イカ,"1","C1","",442836931715584,1611
の	助詞,格助詞,,,,,ノ,の,の,ノ,の,ノ,和,"","","","","","",格助,ノ,ノ,ノ,ノ,"","名詞%F1","",7968444268028416,28989
コード	名詞,普通名詞,一般,,,,コード,コード-code,コード,コード,コード,コード,外,"","","","","","",体,コード,コード,コード,コード,"1","C1","",3658083809239552,13308
で	助詞,格助詞,,,,,デ,で,で,デ,で,デ,和,"","","","","","",格助,デ,デ,デ,デ,"","動詞%F2@0,名詞%F1","",7014343053025792,25518
動作	名詞,普通名詞,サ変可能,,,,ドウサ,動作,動作,ドーサ,動作,ドーサ,漢,"","","","","","",体,ドウサ,ドウサ,ドウサ,ドウサ,"1,0","C1","",7435730884370944,27051
確認	名詞,普通名詞,サ変可能,,,,カクニン,確認,確認,カクニン,確認,カクニン,漢,"","","","","","",体,カクニン,カクニン,カクニン,カクニン,"0","C2","",1708924571099648,6217
を	助詞,格助詞,,,,,ヲ,を,を,オ,を,オ,和,"","","","","","",格助,ヲ,ヲ,ヲ,ヲ,"","動詞%F2@0,名詞%F1,形容詞%F2@-1","",11381878116459008,41407
し	動詞,非自立可能,,,サ行変格,連用形-一般,スル,為る,し,シ,する,スル,和,"","","","","","",用,シ,スル,シ,スル,"0","C5","",5370298291593857,19537
ましょう	助動詞,,,,助動詞-マス,意志推量形,マス,ます,ましょう,マショー,ます,マス,和,"","","","","","",助動,マショウ,マス,マショウ,マス,"","動詞%F4@1","M1@1",9812325267808865,35697
。	補助記号,句点,,,,,,。,。,,。,,記号,"","","","","","",補助,,,,,"","","",6880571302400,25
EOS

0 件のコメント :

コメントを投稿