Japanese morphological analysis engine.
Meta
Author: Tomoko Uchida
Classifiers
Development Status
- 4 - Beta
License
- OSI Approved :: Apache Software License
Natural Language
- Japanese
Programming Language
- Python :: 3.7
- Python :: 3.8
- Python :: 3.9
- Python :: 3.10
Janome is a Japanese morphological analysis engine written in pure Python.
General documentation:
https://mocobeta.github.io/janome/en/ (English)
https://mocobeta.github.io/janome/ (Japanese)
Requirements
Python 3.7+ is required.
Install
[Note] Building the package consumes about 500 MB of memory.
(venv) $ pip install janome
Run
(venv) $ python
>>> from janome.tokenizer import Tokenizer
>>> t = Tokenizer()
>>> for token in t.tokenize('すもももももももものうち'):
...     print(token)
...
すもも 名詞,一般,*,*,*,*,すもも,スモモ,スモモ
も 助詞,係助詞,*,*,*,*,も,モ,モ
もも 名詞,一般,*,*,*,*,もも,モモ,モモ
も 助詞,係助詞,*,*,*,*,も,モ,モ
もも 名詞,一般,*,*,*,*,もも,モモ,モモ
の 助詞,連体化,*,*,*,*,の,ノ,ノ
うち 名詞,非自立,副詞可能,*,*,*,うち,ウチ,ウチ
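Each output line above is a surface form followed by nine comma-separated features in the MeCab-IPADIC layout: four part-of-speech levels, inflection type, inflection form, base form, reading, and phonetic form. As a minimal sketch that uses only the standard library, one printed line can be split into labelled fields as follows. The names ParsedToken and parse_token_line are hypothetical helpers for illustration and are not part of Janome, and the sketch assumes the surface form itself contains no whitespace.

```python
from typing import List, NamedTuple


class ParsedToken(NamedTuple):
    """One printed token line split into labelled fields (hypothetical helper type)."""
    surface: str
    part_of_speech: List[str]  # four levels; '*' marks an unused level
    infl_type: str
    infl_form: str
    base_form: str
    reading: str
    phonetic: str


def parse_token_line(line: str) -> ParsedToken:
    """Split 'surface<whitespace>f1,...,f9' into a ParsedToken.

    Assumption: the surface form contains no whitespace, so the first
    whitespace run separates it from the feature string.
    """
    surface, features = line.split(None, 1)
    fields = features.strip().split(",")
    if len(fields) != 9:
        raise ValueError(f"expected 9 features, got {len(fields)}: {line!r}")
    return ParsedToken(
        surface=surface,
        part_of_speech=fields[0:4],
        infl_type=fields[4],
        infl_form=fields[5],
        base_form=fields[6],
        reading=fields[7],
        phonetic=fields[8],
    )


parsed = parse_token_line("うち 名詞,非自立,副詞可能,*,*,*,うち,ウチ,ウチ")
print(parsed.surface, parsed.part_of_speech[0], parsed.reading)
```

When working with the tokenizer directly, parsing the printed text is usually unnecessary, since the token objects returned by tokenize() carry this information as attributes such as surface and part_of_speech.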
License
Janome is licensed under the Apache License 2.0 and uses the MeCab-IPADIC dictionary/statistical model.
See LICENSE.txt and NOTICE.txt for license details.
Acknowledgement
Special thanks to @ikawaha, @takuyaa, @nakagami and @janome_oekaki.
Copyright
Copyright(C) 2015-2023, Tomoko Uchida. All rights reserved.
Release history
0.5.0 (Jul 01, 2023)
0.4.2 (Feb 23, 2022)
0.4.1 (Sep 21, 2020)
0.4.0 (Aug 23, 2020)
0.3.10 (Nov 03, 2019)
0.3.9 (May 12, 2019)
0.3.8 (Apr 03, 2019)
0.3.7 (Dec 10, 2018)
0.3.6 (Dec 07, 2017)
0.3.5 (Aug 06, 2017)
0.3.4 (Jul 29, 2017)
0.3.3 (Jul 22, 2017)
0.3.2 (Jul 05, 2017)
0.3.1 (Jul 01, 2017)
0.3.0 (Jun 29, 2017)
0.2.8 (May 07, 2016)
0.2.7 (Mar 05, 2016)
0.2.6 (Oct 26, 2015)
0.2.5 (May 11, 2015)
0.2.4 (May 03, 2015)
0.2.3 (May 03, 2015)
0.2.2 (Apr 24, 2015)
0.2.0 (Apr 23, 2015)
0.1.4 (Apr 11, 2015)
0.1.3 (Apr 07, 2015)
0.1.2 (Apr 07, 2015)
Janome has no dependencies.