{ "info": { "author": "Nguy\u1ec5n H\u1ed3ng Qu\u00e2n", "author_email": "ng.hong.quan@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 4 - Beta", "Intended Audience :: Developers", "License :: OSI Approved :: MIT License", "Natural Language :: Vietnamese", "Programming Language :: Python :: 3", "Programming Language :: Python :: 3 :: Only", "Programming Language :: Python :: 3.6", "Programming Language :: Python :: 3.7", "Programming Language :: Python :: 3.8", "Topic :: Software Development :: Libraries :: Python Modules", "Topic :: Text Processing :: Linguistic" ], "description": "=============\nViStickedWord\n=============\n\n.. image:: https://badgen.net/pypi/v/vistickedword\n :target: https://pypi.org/project/vistickedword\n\n\nA library to split a string of many Vietnamese words sticked together to single words. It, for example, split \"khuckhuyu\" to \"khuc\" and \"khuyu\".\nThis library is not supposed to split Vietnamese by semantics, so it won't differentiate single or compound words. It will not, for example, split \"bacsitrongbenhvien\" to \"bac si\" + \"trong\" + \"benh vien\".\nIf you want such a feature, please use underthesea_.\nDue to my personal need, this library currently doesn't process fully marked words, like \"h\u1ecdamikh\u00f4ngh\u00f3tn\u1eefa\". However, it is trivial for library user to strip those marks before passing to ``ViStickedWord`` (using Unidecode_).\n\nTo make convenient for programming, some terminologies are not used accurately like it should be in linguistic. Please don't use my code as a source for learning Vietnamese grammar.\n\n----------\n\nTh\u01b0 vi\u1ec7n \u0111\u1ec3 t\u00e1ch m\u1ed9t ch\u00f9m t\u1eeb ti\u1ebfng Vi\u1ec7t vi\u1ebft d\u00ednh li\u1ec1n th\u00e0nh c\u00e1c t\u1eeb \u0111\u01a1n ri\u00eang l\u1ebb, v\u00ed d\u1ee5 t\u00e1ch \"khuckhuyu\" th\u00e0nh \"khuc\", \"khuyu\".\nTh\u01b0 vi\u1ec7n n\u00e0y kh\u00f4ng c\u00f3 \u00fd \u0111\u1ecbnh t\u00e1ch t\u1eeb d\u1ef1a theo ng\u1eef ngh\u0129a, n\u00ean n\u00f3 s\u1ebd kh\u00f4ng ph\u00e2n bi\u1ec7t t\u1eeb \u0111\u01a1n, t\u1eeb gh\u00e9p c\u1ee7a ti\u1ebfng Vi\u1ec7t. V\u00ed d\u1ee5, n\u00f3 s\u1ebd ko t\u00e1ch c\u1ee5m \"bacsitrongbenhvien\" th\u00e0nh \"bac si\" + \"trong\" + \"benh vien\".\nN\u1ebfu b\u1ea1n c\u1ea7n t\u00ednh n\u0103ng \u0111\u00f3, n\u00ean s\u1eed d\u1ee5ng underthesea_.\n\nDo nhu c\u1ea7u c\u00e1 nh\u00e2n n\u00ean hi\u1ec7n t\u1ea1i th\u01b0 vi\u1ec7n kh\u00f4ng x\u1eed l\u00fd t\u1eeb c\u00f3 \u0111\u1ea7y \u0111\u1ee7 d\u1ea5u, v\u00ed d\u1ee5 \"h\u1ecdamikh\u00f4ngh\u00f3tn\u1eefa\". Tuy nhi\u00ean, ng\u01b0\u1eddi d\u00f9ng th\u01b0 vi\u1ec7n c\u00f3 th\u1ec3 lo\u1ea1i b\u1ecf d\u1ea5u tr\u01b0\u1edbc khi truy\u1ec1n v\u00e0o ``ViStickedWord``. Vi\u1ec7c \u0111\u00f3 kh\u00f4ng kh\u00f3 (d\u00f9ng Unidecode_).\n\n\u0110\u1ec3 thu\u1eadn ti\u1ec7n cho vi\u1ec7c l\u1eadp tr\u00ecnh, m\u1ed9t s\u1ed1 thu\u1eadt ng\u1eef kh\u00f4ng \u0111\u01b0\u1ee3c d\u00f9ng ch\u00ednh x\u00e1c nh\u01b0 c\u00e1ch d\u00f9ng b\u00ean ng\u00f4n ng\u1eef h\u1ecdc. Vui l\u00f2ng \u0111\u1eebng xem code c\u1ee7a t\u00f4i l\u00e0 ngu\u1ed3n t\u00e0i li\u1ec7u h\u1ecdc ng\u1eef ph\u00e1p ti\u1ebfng Vi\u1ec7t.\n\nInstall\n-------\n\n.. code-block:: sh\n\n pip install vistickedword\n\n\nUsage\n-----\n\n.. code-block:: python\n\n from vistickedword import split_words\n\n split_words('ngoanngoeo')\n\n # Returns ('ngoan', 'ngoeo')\n\n\nCredit\n------\n\nDeveloped by by `Nguy\u1ec5n H\u1ed3ng Qu\u00e2n `_.\n\n\n.. _underthesea: https://github.com/undertheseanlp/underthesea\n.. _Unidecode: https://pypi.org/project/Unidecode/\n.. _author: https://quan.hoabinh.vn\n", "description_content_type": "text/x-rst", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/hongquan/ViStickedWord", "keywords": "", "license": "MIT", "maintainer": "", "maintainer_email": "", "name": "vistickedword", "package_url": "https://pypi.org/project/vistickedword/", "platform": "", "project_url": "https://pypi.org/project/vistickedword/", "project_urls": { "Homepage": "https://github.com/hongquan/ViStickedWord", "Repository": "https://github.com/hongquan/ViStickedWord.git" }, "release_url": "https://pypi.org/project/vistickedword/0.9.5/", "requires_dist": [ "single-version (>=1.1,<2.0)" ], "requires_python": ">=3.6,<4.0", "summary": "Library to split sticked Vietnamese words", "version": "0.9.5", "yanked": false, "yanked_reason": null }, "last_serial": 7926720, "releases": { "0.8": [ { "comment_text": "", "digests": { "md5": "85cadcd38502fa96cbe179fac7f98d9d", "sha256": "864697126639d3fee279d264b2d1f8f411cd03ef6b2dbf8937a521f7db2ca155" }, "downloads": -1, "filename": "vistickedword-0.8-py3-none-any.whl", "has_sig": false, "md5_digest": "85cadcd38502fa96cbe179fac7f98d9d", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6,<4.0", "size": 3740, "upload_time": "2019-10-30T11:04:26", "upload_time_iso_8601": "2019-10-30T11:04:26.691875Z", "url": "https://files.pythonhosted.org/packages/30/a7/c519b0d2bef16c87e4c2afd70c2d3bb82d987ba08b2e4fd0a095644481c3/vistickedword-0.8-py3-none-any.whl", "yanked": false, "yanked_reason": null }, { "comment_text": "", "digests": { "md5": "cfeb2e7f5e279c89efc64c303cf26189", "sha256": "59c9d30cd253de16b2a28c92b983b253120c91f3e0a0e1551d8a5d6916a34f6f" }, "downloads": -1, "filename": "vistickedword-0.8.tar.gz", "has_sig": false, "md5_digest": "cfeb2e7f5e279c89efc64c303cf26189", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6,<4.0", "size": 3755, "upload_time": "2019-10-30T11:04:28", "upload_time_iso_8601": "2019-10-30T11:04:28.859937Z", "url": "https://files.pythonhosted.org/packages/0f/d4/d6ad58a6147e2f3ed2edce2ee33f28b92ec3a46f2cf7931482098994ed8b/vistickedword-0.8.tar.gz", "yanked": false, "yanked_reason": null } ], "0.9": [ { "comment_text": "", "digests": { "md5": "579aa440a5034addefbf8a614bda6425", "sha256": "2a2200932bcf04a32c46aaf22eb60ae6a5769d2c96d369262b8ef816a926f891" }, "downloads": -1, "filename": "vistickedword-0.9-py3-none-any.whl", "has_sig": false, "md5_digest": "579aa440a5034addefbf8a614bda6425", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6,<4.0", "size": 5605, "upload_time": "2019-11-02T09:10:44", "upload_time_iso_8601": "2019-11-02T09:10:44.294720Z", "url": "https://files.pythonhosted.org/packages/02/c6/2f15207bebe4655689942aa4316a4fa9390121505b6e09cd7c4432bef080/vistickedword-0.9-py3-none-any.whl", "yanked": false, "yanked_reason": null }, { "comment_text": "", "digests": { "md5": "370b7a31a1857f9248a632c325876313", "sha256": "d63823805c0961c99cb7e4d1e47393381f440c24ae8edbff962c229bb86a12b4" }, "downloads": -1, "filename": "vistickedword-0.9.tar.gz", "has_sig": false, "md5_digest": "370b7a31a1857f9248a632c325876313", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6,<4.0", "size": 5632, "upload_time": "2019-11-02T09:10:46", "upload_time_iso_8601": "2019-11-02T09:10:46.032168Z", "url": "https://files.pythonhosted.org/packages/94/21/c477f6cf3c15064585d095ee92c1684d41900d718aedfd0c0f57d2b2349e/vistickedword-0.9.tar.gz", "yanked": false, "yanked_reason": null } ], "0.9.1": [ { "comment_text": "", "digests": { "md5": "5ca152c06ee5847c1d91fb5ac545842b", "sha256": "9abac46d860fb52089efc0f700bd9da866170230f15a2703a87995f26dd0309c" }, "downloads": -1, "filename": "vistickedword-0.9.1-py3-none-any.whl", "has_sig": false, "md5_digest": "5ca152c06ee5847c1d91fb5ac545842b", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6,<4.0", "size": 6158, "upload_time": "2019-11-03T14:40:47", "upload_time_iso_8601": "2019-11-03T14:40:47.978379Z", "url": "https://files.pythonhosted.org/packages/f7/74/44319cac763156b96719efef97f845a69a27a1d01260fe24f924a37f3502/vistickedword-0.9.1-py3-none-any.whl", "yanked": false, "yanked_reason": null }, { "comment_text": "", "digests": { "md5": "ce65dc2500d061b48e76b5f4b82cc829", "sha256": "f0a975be4f09a13d9e714449e690df17a66ee7c90e4f9c943bd47db73611ccf4" }, "downloads": -1, "filename": "vistickedword-0.9.1.tar.gz", "has_sig": false, "md5_digest": "ce65dc2500d061b48e76b5f4b82cc829", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6,<4.0", "size": 6211, "upload_time": "2019-11-03T14:40:49", "upload_time_iso_8601": "2019-11-03T14:40:49.705643Z", "url": "https://files.pythonhosted.org/packages/d7/ec/2ebd87f0f14700534579f273da9e4f38b0a4b2b0715a254da4a169079006/vistickedword-0.9.1.tar.gz", "yanked": false, "yanked_reason": null } ], "0.9.2": [ { "comment_text": "", "digests": { "md5": "4f2b4f60f19858393f46092ebf43bb98", "sha256": "23538976b86744d25c394d96698f79b312c9613dd100117b2f237020f087b54e" }, "downloads": -1, "filename": "vistickedword-0.9.2-py3-none-any.whl", "has_sig": false, "md5_digest": "4f2b4f60f19858393f46092ebf43bb98", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6,<4.0", "size": 6324, "upload_time": "2019-11-06T03:05:16", "upload_time_iso_8601": "2019-11-06T03:05:16.320569Z", "url": "https://files.pythonhosted.org/packages/a4/16/52ad93c93c2dab7f7a4becea159f8ee1e2ee0bbb7fcc7fd09dd18cd122a2/vistickedword-0.9.2-py3-none-any.whl", "yanked": false, "yanked_reason": null }, { "comment_text": "", "digests": { "md5": "62a8d9506bf9d5dc5d706340eeb6d248", "sha256": "0136ad9ae4b354d98c8276704b5b1f2f5e56a2af0f2dd464f224f2f24fbbd785" }, "downloads": -1, "filename": "vistickedword-0.9.2.tar.gz", "has_sig": false, "md5_digest": "62a8d9506bf9d5dc5d706340eeb6d248", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6,<4.0", "size": 6414, "upload_time": "2019-11-06T03:05:18", "upload_time_iso_8601": "2019-11-06T03:05:18.265470Z", "url": "https://files.pythonhosted.org/packages/e6/3f/a80cf341db98cdb9b0d6977f9e5008359da364333b32e50f4edebd6a30be/vistickedword-0.9.2.tar.gz", "yanked": false, "yanked_reason": null } ], "0.9.3": [ { "comment_text": "", "digests": { "md5": "2bc9fabb3013ed9c8f084e52cece2d68", "sha256": "d5fbc54475de60965cf076f087115e3204da84296a12071362fe2cea40c35356" }, "downloads": -1, "filename": "vistickedword-0.9.3-py3-none-any.whl", "has_sig": false, "md5_digest": "2bc9fabb3013ed9c8f084e52cece2d68", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6,<4.0", "size": 6369, "upload_time": "2020-03-15T17:10:28", "upload_time_iso_8601": "2020-03-15T17:10:28.956827Z", "url": "https://files.pythonhosted.org/packages/7d/ff/85cbee40ac6f02adb2822e39095ecbe304f37ca36d580e3bf4ef332ac40a/vistickedword-0.9.3-py3-none-any.whl", "yanked": false, "yanked_reason": null }, { "comment_text": "", "digests": { "md5": "5be3a42ea8b82d75fa8c87d2ee0f4916", "sha256": "624266d22a4fc6cf10f9379ea959c7860b6435ad1b24a24543db50a37f08fb4c" }, "downloads": -1, "filename": "vistickedword-0.9.3.tar.gz", "has_sig": false, "md5_digest": "5be3a42ea8b82d75fa8c87d2ee0f4916", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6,<4.0", "size": 6517, "upload_time": "2020-03-15T17:10:31", "upload_time_iso_8601": "2020-03-15T17:10:31.007497Z", "url": "https://files.pythonhosted.org/packages/98/7a/822fa87ca23794a5f003ee2afcb6fac6f6041197b6da9fb8cc37c640f936/vistickedword-0.9.3.tar.gz", "yanked": false, "yanked_reason": null } ], "0.9.4": [ { "comment_text": "", "digests": { "md5": "7eb41789c668ab4dd8f78c7269b0aca4", "sha256": "0510c4215266340d5d13ea4a4a64b4dafd71c4a4843dee88f6d2a76dd8ec92a1" }, "downloads": -1, "filename": "vistickedword-0.9.4-py3-none-any.whl", "has_sig": false, "md5_digest": "7eb41789c668ab4dd8f78c7269b0aca4", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6,<4.0", "size": 6476, "upload_time": "2020-08-10T04:26:35", "upload_time_iso_8601": "2020-08-10T04:26:35.576250Z", "url": "https://files.pythonhosted.org/packages/87/45/c0f302f3d20fb82db4150d7a8213a883ae9b17ad623985ecab018195ddf7/vistickedword-0.9.4-py3-none-any.whl", "yanked": false, "yanked_reason": null }, { "comment_text": "", "digests": { "md5": "9295d0735739fb3e3a0e3c76ac64c787", "sha256": "0c9bd15fd9fc7cc3afe848513d3e152d1ae524989758dac4157505a2e7734e83" }, "downloads": -1, "filename": "vistickedword-0.9.4.tar.gz", "has_sig": false, "md5_digest": "9295d0735739fb3e3a0e3c76ac64c787", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6,<4.0", "size": 6627, "upload_time": "2020-08-10T04:26:37", "upload_time_iso_8601": "2020-08-10T04:26:37.728925Z", "url": "https://files.pythonhosted.org/packages/5a/d2/684109dabefee260da9631623742fea477cea0d874e67026d34de49fecf1/vistickedword-0.9.4.tar.gz", "yanked": false, "yanked_reason": null } ], "0.9.5": [ { "comment_text": "", "digests": { "md5": "2b92625756d540baac856dd865a3167b", "sha256": "5798ff6641c9f13d243bdd7d81577091436e41dfc66b4ee6ce1abf0037f524a7" }, "downloads": -1, "filename": "vistickedword-0.9.5-py3-none-any.whl", "has_sig": false, "md5_digest": "2b92625756d540baac856dd865a3167b", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6,<4.0", "size": 6485, "upload_time": "2020-08-10T16:40:58", "upload_time_iso_8601": "2020-08-10T16:40:58.640093Z", "url": "https://files.pythonhosted.org/packages/70/08/3d69416bc73f4839bdf51510672971b53a06d8c116b6a2c89ff37c7df425/vistickedword-0.9.5-py3-none-any.whl", "yanked": false, "yanked_reason": null }, { "comment_text": "", "digests": { "md5": "f1a495f622061c5ddbe9d155f742b5ce", "sha256": "222e3de1b0f7e790087db25f3466458a9de461d607663bd19cf3787f92f0a022" }, "downloads": -1, "filename": "vistickedword-0.9.5.tar.gz", "has_sig": false, "md5_digest": "f1a495f622061c5ddbe9d155f742b5ce", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6,<4.0", "size": 6632, "upload_time": "2020-08-10T16:41:00", "upload_time_iso_8601": "2020-08-10T16:41:00.657238Z", "url": "https://files.pythonhosted.org/packages/f8/e3/b1375b78a1d93e5a6e4ba239b76c5f6ac6a5fe1a852e71a5f1ca065db882/vistickedword-0.9.5.tar.gz", "yanked": false, "yanked_reason": null } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "2b92625756d540baac856dd865a3167b", "sha256": "5798ff6641c9f13d243bdd7d81577091436e41dfc66b4ee6ce1abf0037f524a7" }, "downloads": -1, "filename": "vistickedword-0.9.5-py3-none-any.whl", "has_sig": false, "md5_digest": "2b92625756d540baac856dd865a3167b", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6,<4.0", "size": 6485, "upload_time": "2020-08-10T16:40:58", "upload_time_iso_8601": "2020-08-10T16:40:58.640093Z", "url": "https://files.pythonhosted.org/packages/70/08/3d69416bc73f4839bdf51510672971b53a06d8c116b6a2c89ff37c7df425/vistickedword-0.9.5-py3-none-any.whl", "yanked": false, "yanked_reason": null }, { "comment_text": "", "digests": { "md5": "f1a495f622061c5ddbe9d155f742b5ce", "sha256": "222e3de1b0f7e790087db25f3466458a9de461d607663bd19cf3787f92f0a022" }, "downloads": -1, "filename": "vistickedword-0.9.5.tar.gz", "has_sig": false, "md5_digest": "f1a495f622061c5ddbe9d155f742b5ce", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6,<4.0", "size": 6632, "upload_time": "2020-08-10T16:41:00", "upload_time_iso_8601": "2020-08-10T16:41:00.657238Z", "url": "https://files.pythonhosted.org/packages/f8/e3/b1375b78a1d93e5a6e4ba239b76c5f6ac6a5fe1a852e71a5f1ca065db882/vistickedword-0.9.5.tar.gz", "yanked": false, "yanked_reason": null } ], "vulnerabilities": [] }