{ "info": { "author": "cookieshake", "author_email": "cookieshake.dev@gmail.com", "bugtrack_url": null, "classifiers": [], "description": "# Korhal\n[![GitHub tag (latest SemVer)](https://img.shields.io/github/tag/ingtranet/korhal.svg?style=flat-square)](https://github.com/ingtranet/korhal)\n[![PyPI](https://img.shields.io/pypi/v/korhal.svg?style=flat-square)](https://pypi.org/project/korhal/)\n[![Travis (.com) branch](https://img.shields.io/travis/com/ingtranet/korhal/master.svg?style=flat-square)](https://travis-ci.com/ingtranet/korhal)\n[![Codacy branch grade](https://img.shields.io/codacy/grade/b8ad24518efb4bec9a68b32eeb994d78/master.svg?style=flat-square)](https://app.codacy.com/project/cookieshake/korhal/dashboard)\n\nKorhal(KOrean Rpc-based Handy Application for Language-processing) is a python wrapper for several korean Part-Of-Speech taggers.\n\n## How to install\n\n``` sh\npip install korhal\n```\n\n## Available taggers\n\n- KOMORAN with `korhal.komoran`\n- Hannanum with `korhal.hannanum`\n- Open-source Korean Text Processor with `korhal.openkoreantext`\n\n## How to use\n\n``` python\nfrom korhal.komoran import tokenize\n\nresult = tokenize(\"\uc9d1\uc5d0 \uac00\uc11c \uc7a0\uc744 \uc790\uace0 \uc2f6\ub2e4\")\n# result => Token(text=\uc9d1,pos=NNG), Token(text=\uc5d0,pos=JKB), Token(text=\uac00,pos=VV), Token(text=\uc544\uc11c,pos=EC), Token(text=\uc7a0,pos=NNG), Token(text=\uc744,pos=JKO), Token(text=\uc790,pos=VV), Token(text=\uace0,pos=EC), Token(text=\uc2f6,pos=VX), Token(text=\ub2e4,pos=EC)]\nprint(result.text) # => \uc9d1\nprint(result.pos) # => NNG\n\nnouns = [token.text for token in result if token.pos.startswith('N')]\n```\n\n## Asynchronous methods\n\nWith `korhal.aio`, you can use asynchronous methods. The performance of multi-core systems can be slightly improved when performing extensive processing.\n\n``` python\nfrom korhal.aio.opentextkorean import tokenize\n\u00a0\ntexts = ['\ub2ec\ub514\ub2e8 \ub9db\uc788\ub294 \ucf00\uc774\ud06c\uac00 \uc788\uc5c8\ub2e4', '\uc19c\uc0ac\ud0d5 \uac19\uc774 \uadc0\uc5ec\uc6b4 \uad6c\ub984']\nfutures = [tokenize(text) for text in texts]\nresults = [f.result() for f in futures]\n```\n\n## Thanks to\n\n- [KOMORAN](http://www.shineware.co.kr/products/komoran/)\n- [Hannanum](http://semanticweb.kaist.ac.kr/hannanum/index.html)\n- [Open-source Korean Text Processor](https://github.com/open-korean-text/open-korean-text)\n- [KoalaNLP](https://koalanlp.github.io/KoalaNLP-core/)", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/cookieshake/korhal", "keywords": "korean analysis tagger tokenizer", "license": "", "maintainer": "", "maintainer_email": "", "name": "korhal", "package_url": "https://pypi.org/project/korhal/", "platform": "", "project_url": "https://pypi.org/project/korhal/", "project_urls": { "Homepage": "https://github.com/cookieshake/korhal" }, "release_url": "https://pypi.org/project/korhal/0.1.2/", "requires_dist": null, "requires_python": ">=2.6, !=3.0.*, !=3.1.*, !=3.2.*, !=3.3.*, !=3.4.*", "summary": "KOrean Rpc-based Application for Handy Application for Language-processing", "version": "0.1.2" }, "last_serial": 4436656, "releases": { "0.1.1": [ { "comment_text": "", "digests": { "md5": "5315ed4d601c252f97c695ea0130e485", "sha256": "d38f77a6d42ffd26731e186e275a729f9e76cc28db8bb20382a749d83532fada" }, "downloads": -1, "filename": "korhal-0.1.1.tar.gz", "has_sig": false, "md5_digest": "5315ed4d601c252f97c695ea0130e485", "packagetype": "sdist", "python_version": "source", "requires_python": ">=2.6, !=3.0.*, !=3.1.*, !=3.2.*, !=3.3.*, !=3.4.*", "size": 7960, "upload_time": "2018-10-30T15:11:36", "url": "https://files.pythonhosted.org/packages/5d/62/f2907fbb1b221b1d534f79c507177794835b1923e7989fb08e99b7797544/korhal-0.1.1.tar.gz" } ], "0.1.2": [ { "comment_text": "", "digests": { "md5": "d32a44dbeedd8b4c856380368d8e5784", "sha256": "141c3d0d2cd720778b1468841ce64280d9be5b398ec1115293afe86321de12b4" }, "downloads": -1, "filename": "korhal-0.1.2.tar.gz", "has_sig": false, "md5_digest": "d32a44dbeedd8b4c856380368d8e5784", "packagetype": "sdist", "python_version": "source", "requires_python": ">=2.6, !=3.0.*, !=3.1.*, !=3.2.*, !=3.3.*, !=3.4.*", "size": 8800, "upload_time": "2018-10-31T16:12:04", "url": "https://files.pythonhosted.org/packages/6e/71/07e35f443c62799e6323ff58befc26d11b0b271047858855c56ccd2bf5eb/korhal-0.1.2.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "d32a44dbeedd8b4c856380368d8e5784", "sha256": "141c3d0d2cd720778b1468841ce64280d9be5b398ec1115293afe86321de12b4" }, "downloads": -1, "filename": "korhal-0.1.2.tar.gz", "has_sig": false, "md5_digest": "d32a44dbeedd8b4c856380368d8e5784", "packagetype": "sdist", "python_version": "source", "requires_python": ">=2.6, !=3.0.*, !=3.1.*, !=3.2.*, !=3.3.*, !=3.4.*", "size": 8800, "upload_time": "2018-10-31T16:12:04", "url": "https://files.pythonhosted.org/packages/6e/71/07e35f443c62799e6323ff58befc26d11b0b271047858855c56ccd2bf5eb/korhal-0.1.2.tar.gz" } ] }