{ "info": { "author": "Hajime Nakagami", "author_email": "nakagami@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 3 - Alpha", "Programming Language :: Python", "Programming Language :: Python :: 3" ], "description": "=================\njanomecabdic\n=================\n\nThis is an add-on package for janome https://mocobeta.github.io/janome/ .\n\nJanome can use compiled dictionaries for MeCab https://taku910.github.io/mecab/\nvia janomecabdic.\n\nRequirements\n-------------------------\n\n- MeCab + MeCab dictionary (utf-8 encoded)\n- Python 3.5+\n- Cython available environment (C++ compiler)\n- janome\n\nInstration\n-------------------------\n\nInstall MeCab and MeCab dictionary.\n(e.g.: Ubuntu/Debian)\n\n::\n\n sudo apt install mecab mecab-ipadic-utf8\n\nInstall C++ compiler for Cython.\n(e.g.: Ubuntu/Debian)\n\n::\n\n sudo apt install build-essential\n\nInstall janome and janomecabdic.\n\n::\n\n pip install janome janomecabdic\n\n\nExample (use /etc/mecabrc defined dictionary)\n------------------------------------------------------------\n\nSample code\n+++++++++++++++\n\n::\n\n >>> from janome.tokenizer import Tokenizer\n >>> from janomecabdic import MeCabDictionary\n >>> t = Tokenizer()\n >>> t.sys_dic = MeCabDictionary()\n >>> for token in t.tokenize('\u3059\u3082\u3082\u3082\u3082\u3082\u3082\u3082\u3082\u306e\u3046\u3061'):\n ... print(token)\n ...\n \u3059\u3082\u3082 \u540d\u8a5e,\u4e00\u822c,*,*,*,*,\u3059\u3082\u3082,\u30b9\u30e2\u30e2,\u30b9\u30e2\u30e2\n \u3082 \u52a9\u8a5e,\u4fc2\u52a9\u8a5e,*,*,*,*,\u3082,\u30e2,\u30e2\n \u3082\u3082 \u540d\u8a5e,\u4e00\u822c,*,*,*,*,\u3082\u3082,\u30e2\u30e2,\u30e2\u30e2\n \u3082 \u52a9\u8a5e,\u4fc2\u52a9\u8a5e,*,*,*,*,\u3082,\u30e2,\u30e2\n \u3082\u3082 \u540d\u8a5e,\u4e00\u822c,*,*,*,*,\u3082\u3082,\u30e2\u30e2,\u30e2\u30e2\n \u306e \u52a9\u8a5e,\u9023\u4f53\u5316,*,*,*,*,\u306e,\u30ce,\u30ce\n \u3046\u3061 \u540d\u8a5e,\u975e\u81ea\u7acb,\u526f\u8a5e\u53ef\u80fd,*,*,*,\u3046\u3061,\u30a6\u30c1,\u30a6\u30c1\n\n\nNeologd example (tokenize with a specific dictionary)\n--------------------------------------------------------------------\n\nInstall Neologd (e.g. Ubuntu/Debian)\n+++++++++++++++++++++++++++++++++++++++++\n\n::\n\n sudo apt install mecab mecab-ipadic-utf8 libmecab-dev\n git clone --depth 1 git@github.com:neologd/mecab-ipadic-neologd.git\n cd mecab-ipadic-neologd\n echo `mecab-config --dicdir`\"/mecab-ipadic-neologd\"\n ./bin/install-mecab-ipadic-neologd -n\n\n\n\nSample code\n+++++++++++++++\n\n::\n\n >>> from janome.tokenizer import Tokenizer\n >>> from janomecabdic import MeCabDictionary\n >>> t = Tokenizer()\n >>> t.sys_dic = MeCabDictionary('/usr/lib/mecab/dic/mecab-ipadic-neologd')\n >>> for token in t.tokenize('8\u67083\u65e5\u306b\u653e\u9001\u3055\u308c\u305f\u300c\u4e2d\u5c45\u6b63\u5e83\u306e\u91d1\u66dc\u65e5\u306e\u30b9\u30de\u30a4\u30eb\u305f\u3061\u3078\u300d(TBS\u7cfb)\u3067\u30011\u65e5\u305f\u3063\u305f5\u5206\u3067\u307d\u3063\u3053\u308a\u304a\u306a\u304b\u3092\u89e3\u6d88\u3059\u308b\u3068\u3044\u3046\u30c0\u30a4\u30a8\u30c3\u30c8\u65b9\u6cd5\u3092\u7d39\u4ecb\u3002\u30ad\u30f3\u30bf\u30ed\u30fc\u3002\u306e\u30c0\u30a4\u30a8\u30c3\u30c8\u306b\u3082\u5bc6\u7740'):\n ... print(token)\n ...\n 8\u67083\u65e5 \u540d\u8a5e,\u56fa\u6709\u540d\u8a5e,\u4e00\u822c,*,*,*,8\u67083\u65e5,\u30cf\u30c1\u30ac\u30c4\u30df\u30c3\u30ab,\u30cf\u30c1\u30ac\u30c4\u30df\u30c3\u30ab\n \u306b \u52a9\u8a5e,\u683c\u52a9\u8a5e,\u4e00\u822c,*,*,*,\u306b,\u30cb,\u30cb\n \u653e\u9001 \u540d\u8a5e,\u30b5\u5909\u63a5\u7d9a,*,*,*,*,\u653e\u9001,\u30db\u30a6\u30bd\u30a6,\u30db\u30fc\u30bd\u30fc\n \u3055 \u52d5\u8a5e,\u81ea\u7acb,*,*,\u30b5\u5909\u30fb\u30b9\u30eb,\u672a\u7136\u30ec\u30eb\u63a5\u7d9a,\u3059\u308b,\u30b5,\u30b5\n \u308c \u52d5\u8a5e,\u63a5\u5c3e,*,*,\u4e00\u6bb5,\u9023\u7528\u5f62,\u308c\u308b,\u30ec,\u30ec\n \u305f \u52a9\u52d5\u8a5e,*,*,*,\u7279\u6b8a\u30fb\u30bf,\u57fa\u672c\u5f62,\u305f,\u30bf,\u30bf\n \u300c \u8a18\u53f7,\u62ec\u5f27\u958b,*,*,*,*,\u300c,\u300c,\u300c\n \u4e2d\u5c45\u6b63\u5e83\u306e\u91d1\u66dc\u65e5\u306e\u30b9\u30de\u30a4\u30eb\u305f\u3061\u3078 \u540d\u8a5e,\u56fa\u6709\u540d\u8a5e,\u4e00\u822c,*,*,*,\u4e2d\u5c45\u6b63\u5e83\u306e\u91d1\u66dc\u65e5\u306e\u30b9\u30de\u30a4\u30eb\u305f\u3061\u3078,\u30ca\u30ab\u30a4\u30de\u30b5\u30d2\u30ed\u30ce\u30ad\u30f3\u30e8\u30a6\u30d3\u30ce\u30b9\u30de\u30a4\u30eb\u30bf\u30c1\u30d8,\u30ca\u30ab\u30a4\u30de\u30b5\u30d2\u30ed\u30ce\u30ad\u30f3\u30e8\u30fc\u30d3\u30ce\u30b9\u30de\u30a4\u30eb\u30bf\u30c1\u30d8\n \u300d( \u8a18\u53f7,\u4e00\u822c,*,*,*,*,\u300d(,*,*\n TBS \u540d\u8a5e,\u56fa\u6709\u540d\u8a5e,\u4e00\u822c,*,*,*,TBS,\u30c6\u30a3\u30fc\u30d3\u30fc\u30a8\u30b9,\u30c6\u30a3\u30fc\u30d3\u30fc\u30a8\u30b9\n \u7cfb \u540d\u8a5e,\u63a5\u5c3e,\u4e00\u822c,*,*,*,\u7cfb,\u30b1\u30a4,\u30b1\u30a4\n ) \u8a18\u53f7,\u4e00\u822c,*,*,*,*,),*,*\n \u3067 \u52a9\u52d5\u8a5e,*,*,*,\u7279\u6b8a\u30fb\u30c0,\u9023\u7528\u5f62,\u3060,\u30c7,\u30c7\n \u3001 \u8a18\u53f7,\u8aad\u70b9,*,*,*,*,\u3001,\u3001,\u3001\n 1\u65e5 \u540d\u8a5e,\u56fa\u6709\u540d\u8a5e,\u4e00\u822c,*,*,*,1\u65e5,\u30c4\u30a4\u30bf\u30c1,\u30c4\u30a4\u30bf\u30c1\n \u305f\u3063\u305f \u526f\u8a5e,\u52a9\u8a5e\u985e\u63a5\u7d9a,*,*,*,*,\u305f\u3063\u305f,\u30bf\u30c3\u30bf,\u30bf\u30c3\u30bf\n 5\u5206 \u540d\u8a5e,\u56fa\u6709\u540d\u8a5e,\u4e00\u822c,*,*,*,5\u5206,\u30b4\u30d5\u30f3,\u30b4\u30d5\u30f3\n \u3067 \u52a9\u8a5e,\u683c\u52a9\u8a5e,\u4e00\u822c,*,*,*,\u3067,\u30c7,\u30c7\n \u307d\u3063\u3053\u308a\u304a\u306a\u304b \u540d\u8a5e,\u56fa\u6709\u540d\u8a5e,\u4e00\u822c,*,*,*,\u307d\u3063\u3053\u308a\u304a\u306a\u304b,\u30dd\u30c3\u30b3\u30ea\u30aa\u30ca\u30ab,\u30dd\u30c3\u30b3\u30ea\u30aa\u30ca\u30ab\n \u3092 \u52a9\u8a5e,\u683c\u52a9\u8a5e,\u4e00\u822c,*,*,*,\u3092,\u30f2,\u30f2\n \u89e3\u6d88 \u540d\u8a5e,\u30b5\u5909\u63a5\u7d9a,*,*,*,*,\u89e3\u6d88,\u30ab\u30a4\u30b7\u30e7\u30a6,\u30ab\u30a4\u30b7\u30e7\u30fc\n \u3059\u308b \u52d5\u8a5e,\u81ea\u7acb,*,*,\u30b5\u5909\u30fb\u30b9\u30eb,\u57fa\u672c\u5f62,\u3059\u308b,\u30b9\u30eb,\u30b9\u30eb\n \u3068\u3044\u3046 \u52a9\u8a5e,\u683c\u52a9\u8a5e,\u9023\u8a9e,*,*,*,\u3068\u3044\u3046,\u30c8\u30a4\u30a6,\u30c8\u30e6\u30a6\n \u30c0\u30a4\u30a8\u30c3\u30c8\u65b9\u6cd5 \u540d\u8a5e,\u56fa\u6709\u540d\u8a5e,\u4e00\u822c,*,*,*,\u30c0\u30a4\u30a8\u30c3\u30c8\u65b9\u6cd5,\u30c0\u30a4\u30a8\u30c3\u30c8\u30db\u30a6\u30db\u30a6,\u30c0\u30a4\u30a8\u30c3\u30c8\u30db\u30a6\u30db\u30fc\n \u3092 \u52a9\u8a5e,\u683c\u52a9\u8a5e,\u4e00\u822c,*,*,*,\u3092,\u30f2,\u30f2\n \u7d39\u4ecb \u540d\u8a5e,\u30b5\u5909\u63a5\u7d9a,*,*,*,*,\u7d39\u4ecb,\u30b7\u30e7\u30a6\u30ab\u30a4,\u30b7\u30e7\u30fc\u30ab\u30a4\n \u3002 \u8a18\u53f7,\u53e5\u70b9,*,*,*,*,\u3002,\u3002,\u3002\n \u30ad\u30f3\u30bf\u30ed\u30fc\u3002 \u540d\u8a5e,\u56fa\u6709\u540d\u8a5e,\u4e00\u822c,*,*,*,\u30ad\u30f3\u30bf\u30ed\u30fc\u3002,\u30ad\u30f3\u30bf\u30ed\u30fc,\u30ad\u30f3\u30bf\u30ed\u30fc\n \u306e \u52a9\u8a5e,\u9023\u4f53\u5316,*,*,*,*,\u306e,\u30ce,\u30ce\n \u30c0\u30a4\u30a8\u30c3\u30c8 \u540d\u8a5e,\u30b5\u5909\u63a5\u7d9a,*,*,*,*,\u30c0\u30a4\u30a8\u30c3\u30c8,\u30c0\u30a4\u30a8\u30c3\u30c8,\u30c0\u30a4\u30a8\u30c3\u30c8\n \u306b \u52a9\u8a5e,\u683c\u52a9\u8a5e,\u4e00\u822c,*,*,*,\u306b,\u30cb,\u30cb\n \u3082 \u52a9\u8a5e,\u4fc2\u52a9\u8a5e,*,*,*,*,\u3082,\u30e2,\u30e2\n \u5bc6\u7740 \u540d\u8a5e,\u30b5\u5909\u63a5\u7d9a,*,*,*,*,\u5bc6\u7740,\u30df\u30c3\u30c1\u30e3\u30af,\u30df\u30c3\u30c1\u30e3\u30af", "description_content_type": "", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/nakagami/janomecabdic/", "keywords": "janome,MeCab", "license": "LGPL2,BSD", "maintainer": "", "maintainer_email": "", "name": "janomecabdic", "package_url": "https://pypi.org/project/janomecabdic/", "platform": "", "project_url": "https://pypi.org/project/janomecabdic/", "project_urls": { "Homepage": "https://github.com/nakagami/janomecabdic/" }, "release_url": "https://pypi.org/project/janomecabdic/0.0.1/", "requires_dist": null, "requires_python": "", "summary": "MeCab dictionary access library for janome", "version": "0.0.1" }, "last_serial": 5998457, "releases": { "0.0.1": [ { "comment_text": "", "digests": { "md5": "0ca5208b00aa2115aecc072d918d94e9", "sha256": "a37045d0b1e25d7d58618372bf3abab7082af1a5bd164e86fd7decb385f09a65" }, "downloads": -1, "filename": "janomecabdic-0.0.1.tar.gz", "has_sig": false, "md5_digest": "0ca5208b00aa2115aecc072d918d94e9", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 249653, "upload_time": "2019-10-19T02:48:56", "url": "https://files.pythonhosted.org/packages/0a/62/60d6ccf6a155833fb9f73ec5ef373c1afdc1dcd0137242fc8ab3423455be/janomecabdic-0.0.1.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "0ca5208b00aa2115aecc072d918d94e9", "sha256": "a37045d0b1e25d7d58618372bf3abab7082af1a5bd164e86fd7decb385f09a65" }, "downloads": -1, "filename": "janomecabdic-0.0.1.tar.gz", "has_sig": false, "md5_digest": "0ca5208b00aa2115aecc072d918d94e9", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 249653, "upload_time": "2019-10-19T02:48:56", "url": "https://files.pythonhosted.org/packages/0a/62/60d6ccf6a155833fb9f73ec5ef373c1afdc1dcd0137242fc8ab3423455be/janomecabdic-0.0.1.tar.gz" } ] }