by yorwba 2883 days ago
When I do echo '邪推してしまう' | mecab I get

  邪推	名詞,サ変名詞,*,*,邪推,じゃすい,代表表記:邪推/じゃすい カテゴリ:抽象物
  して	動詞,*,サ変動詞,タ系連用テ形,する,して,代表表記:する/する 付属動詞候補(基本) 自他動詞:自:成る/なる
  しまう	接尾辞,動詞性接尾辞,子音動詞ワ行,基本形,しまう,しまう,連語
  EOS
and I have no idea why jisho.org completely ignores the しまう.
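
In case anyone wants to check the same thing programmatically, here is a rough Python sketch that splits output like the above into surface forms and feature fields. It assumes the layout shown here (surface, a tab, then comma-separated features, with EOS ending the sentence), which depends on the dictionary in use. The name parse_mecab_output is just something I made up for illustration, not part of MeCab, and the sample is pasted in as a string so the snippet runs without MeCab installed.

```python
def parse_mecab_output(text):
    """Split MeCab-style output into (surface, features) pairs.

    Assumes each token line is "surface<TAB>feature1,feature2,..."
    and that a line containing only "EOS" marks the end of a sentence.
    """
    tokens = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line == "EOS":
            continue
        surface, _, features = line.partition("\t")
        tokens.append((surface, features.split(",")))
    return tokens


# Sample copied from the output above (tabs written as \t).
sample = (
    "邪推\t名詞,サ変名詞,*,*,邪推,じゃすい,代表表記:邪推/じゃすい カテゴリ:抽象物\n"
    "して\t動詞,*,サ変動詞,タ系連用テ形,する,して,代表表記:する/する 付属動詞候補(基本) 自他動詞:自:成る/なる\n"
    "しまう\t接尾辞,動詞性接尾辞,子音動詞ワ行,基本形,しまう,しまう,連語\n"
    "EOS\n"
)

tokens = parse_mecab_output(sample)
for surface, features in tokens:
    # Print the surface form and its top-level part of speech.
    print(surface, features[0])
```

With the sample above this yields three tokens, so しまう does come out of MeCab as its own token (tagged 接尾辞) rather than being dropped at the segmentation stage.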

The other problems do seem to be related to the segmentation approach MeCab takes. I've usually found it to be satisfactory, but maybe I just haven't been looking hard enough for errors. I might try and see how well your Ichiran works as a replacement. Thanks!