{ "info": { "author": "Baeg-il Kim", "author_email": "cedar101@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 3 - Alpha", "Environment :: Console", "Intended Audience :: Developers", "Intended Audience :: Science/Research", "License :: OSI Approved :: Apache Software License", "Natural Language :: Korean", "Programming Language :: Python", "Programming Language :: Python :: 2", "Programming Language :: Python :: 2.7", "Programming Language :: Python :: 3", "Programming Language :: Python :: 3.5", "Topic :: Software Development :: Libraries :: Python Modules", "Topic :: Text Processing", "Topic :: Text Processing :: Linguistic" ], "description": "twitter-korean-py\n=================\n.. image:: https://circleci.com/gh/cedar101/twitter-korean-py.svg?style=svg\n :alt: Circle CI Build Status\n :target: https://circleci.com/gh/cedar101/twitter-korean-py\n.. image:: https://travis-ci.org/cedar101/twitter-korean-py.svg?branch=master\n :alt: Travis CI Build Status\n :target: https://travis-ci.org/cedar101/twitter-korean-py\n\n`twitter-korean-py`_ \ub294 `twitter-korean-text`_ \uc758 \uc2a4\uce7c\ub77c \ucf54\ub4dc\ub97c\n\ucc38\uace0\ud558\uc5ec \ud30c\uc774\uc36c\uc73c\ub85c \uc0c8\ub85c \ucf54\ub529\ud558\uc5ec \ud3ec\ud305\ud55c \ub77c\uc774\ube0c\ub7ec\ub9ac\uc785\ub2c8\ub2e4.\n\n* \ud604\uc7ac\ub294 \uc815\uaddc\ud654(normalizer)\ub9cc \uac00\ub2a5\ud558\uba70, \ub098\uba38\uc9c0 \uae30\ub2a5(\ud1a0\ud070\ud654, \uc5b4\uadfc\ud654, \uc5b4\uad6c \ucd94\ucd9c)\uc740 \uc544\uc9c1 \uad6c\ud604\ud558\uc9c0 \uc54a\uc558\uc2b5\ub2c8\ub2e4.\n* JPype_ \ub97c \uc0ac\uc6a9\ud55c \ub798\ud37c \uc778\ud130\ud398\uc774\uc2a4\uc778 twkorean_ \uacfc\ub294 \ub2ec\ub9ac, twitter\\-korean\\-text\uc758 \uc2a4\uce7c\ub77c/\uc790\ubc14 \ucf54\ub4dc\ub97c \uc0ac\uc6a9\ud558\uc9c0 \uc54a\uc740 \uc21c\uc218 \ud30c\uc774\uc36c(pure\\-python) \ucf54\ub4dc\uc785\ub2c8\ub2e4.\n* \uc124\uce58 \uc2a4\ud06c\ub9bd\ud2b8\ub294 twitter\\-korean\\-text\uc758 maven repository\uc5d0\uc11c JAR \ud30c\uc77c\uc744 \ub2e4\uc6b4\ubc1b\uc740 \ud6c4, \uc0ac\uc804 \ud30c\uc77c\ub9cc\uc744 \uc555\ucd95 \ud574\uc81c\ud558\uc5ec \uc0ac\uc6a9\ud569\ub2c8\ub2e4.\n * \uc774 \uac1c\ub150\uc740 twkorean\uc744 \ucc38\uace0\ud558\uc600\uc2b5\ub2c8\ub2e4.\n * \ud30c\uc774\uc36c 2.7\uc5d0\uc11c\ub294 `maven-artifact`_ \ub77c\ub294 \ud234\uc744 \uc0ac\uc6a9\ud558\uc5ec maven \uc5c6\uc774 \uc124\uce58 \uac00\ub2a5\ud569\ub2c8\ub2e4.\n * \ud30c\uc774\uc36c 3.x\uc5d0\uc11c\ub294 maven(mvn)\uc744 \uc9c1\uc811 \uc2e4\ud589\ud574\uc11c \ub2e4\uc6b4\ub85c\ub4dc\ud569\ub2c8\ub2e4.\n\n.. _`twitter-korean-py`: https://github.com/cedar101/twitter-korean-py\n.. _`twitter-korean-text`: https://github.com/twitter/twitter-korean-text\n.. _twkorean: https://github.com/jaepil/twkorean\n.. _JPype: http://jpype.sourceforge.net\n.. _`maven-artifact`: https://github.com/hamnis/maven-artifact\n\nExamples\n--------\n\n.. code:: python\n\n >>> import twitter_korean\n >>> text = u\"\ud55c\uad6d\uc5b4\ub97c \ucc98\ub9ac\ud558\ub294 \uc608\uc2dc\uc785\ub2c8\ub2fc\u314b\u314b\u314b\u314b\u314b #\ud55c\uad6d\uc5b4\"\n >>> # Normalize\n >>> normalized = twitter_korean.normalize(text)\n >>> print(normalized)\n \ud55c\uad6d\uc5b4\ub97c \ucc98\ub9ac\ud558\ub294 \uc608\uc2dc\uc785\ub2c8\ub2e4\u314b\u314b #\ud55c\uad6d\uc5b4\n >>> # Tokenize\n >>> tokens = twitter_korean.tokenize(normalized)\n Traceback (most recent call last):\n NotImplementedError: ...\n >>> tokens = [(u'\ud55c\uad6d\uc5b4', 'Noun', 0, 3), (u'\ub97c', 'Josa', 3, 1), (u' ', 'Space', 4, 1), (u'\ucc98\ub9ac', 'Noun', 5, 2), (u'\ud558\ub294', 'Verb', 7, 2), (u' ', 'Space', 9, 1), (u'\uc608\uc2dc', 'Noun', 10, 2), (u'\uc785\ub2c8', 'Adjective', 12, 2), (u'\ub2e4', 'Eomi', 14, 1), (u'\u314b\u314b', 'KoreanParticle', 15, 2), (u' ', 'Space', 17, 1), (u'#\ud55c\uad6d\uc5b4', 'Hashtag', 18, 4)]\n >>> # Stemming\n >>> stemmed = twitter_korean.stem(tokens)\n Traceback (most recent call last):\n NotImplementedError: ...\n >>> # Phrase extraction\n >>> phrases = twitter_korean.extract_phrases(tokens)\n Traceback (most recent call last):\n NotImplementedError: ...", "description_content_type": null, "docs_url": null, "download_url": "UNKNOWN", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/cedar101/twitter_korean-py", "keywords": "twkorean\ntwitter-korean\ntwitter-korean-py\ntwitter-korean-text\nmorphological analyzer\nmorphology\nanalyzer\nKorean\nKorean language\ntokenizer\nNLP\nnatural language processing\nnatural language interface to database", "license": "UNKNOWN", "maintainer": null, "maintainer_email": null, "name": "twitter-korean", "package_url": "https://pypi.org/project/twitter-korean/", "platform": "UNKNOWN", "project_url": "https://pypi.org/project/twitter-korean/", "project_urls": { "Download": "UNKNOWN", "Homepage": "https://github.com/cedar101/twitter_korean-py" }, "release_url": "https://pypi.org/project/twitter-korean/0.1.0.dev527/", "requires_dist": null, "requires_python": null, "summary": "Python port to the normalizer in https://github.com/twitter/twitter-korean-text", "version": "0.1.0.dev527" }, "last_serial": 2073556, "releases": { "0.1.0.dev522": [], "0.1.0.dev527": [ { "comment_text": "", "digests": { "md5": "725f80230b1db6c1e6f94c8aecbc5b52", "sha256": "17cb150ab2dece46df8fabaa5d9e8446f632a6f1f3a712aa1cbf6e47d49503ed" }, "downloads": -1, "filename": "twitter-korean-0.1.0.dev527.tar.gz", "has_sig": false, "md5_digest": "725f80230b1db6c1e6f94c8aecbc5b52", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 14795, "upload_time": "2016-04-20T08:16:59", "url": "https://files.pythonhosted.org/packages/45/b2/a25b29c34c1bd260cb15441f0bb047786f0d52a54fe9c0eb0b52062afcdd/twitter-korean-0.1.0.dev527.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "725f80230b1db6c1e6f94c8aecbc5b52", "sha256": "17cb150ab2dece46df8fabaa5d9e8446f632a6f1f3a712aa1cbf6e47d49503ed" }, "downloads": -1, "filename": "twitter-korean-0.1.0.dev527.tar.gz", "has_sig": false, "md5_digest": "725f80230b1db6c1e6f94c8aecbc5b52", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 14795, "upload_time": "2016-04-20T08:16:59", "url": "https://files.pythonhosted.org/packages/45/b2/a25b29c34c1bd260cb15441f0bb047786f0d52a54fe9c0eb0b52062afcdd/twitter-korean-0.1.0.dev527.tar.gz" } ] }