{ "info": { "author": "Herman Schaaf", "author_email": "herman.schaaf@gengo.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 5 - Production/Stable", "Intended Audience :: Developers", "License :: OSI Approved :: GNU Lesser General Public License v3 (LGPLv3)", "Topic :: Software Development :: Libraries :: Python Modules", "Topic :: Text Processing :: Linguistic" ], "description": "PyThai\n======\n\nSome basic python functions for working with the Thai language. For example:\n\n```python\nimport pythai\n\npythai.split(u\"\u0e01\u0e32\u0e23\u0e17\u0e35\u0e48\u0e44\u0e14\u0e49\u0e15\u0e49\u0e2d\u0e07\u0e41\u0e2a\u0e14\u0e07\u0e27\u0e48\u0e32\u0e07\u0e32\u0e19\u0e14\u0e35\")\n>>> u\"\u0e01\u0e32\u0e23 \u0e17\u0e35\u0e48 \u0e44\u0e14\u0e49 \u0e15\u0e49\u0e2d\u0e07 \u0e41\u0e2a\u0e14\u0e07 \u0e27\u0e48\u0e32 \u0e07\u0e32\u0e19 \u0e14\u0e35\"\n\npythai.word_count(u\"\u0e01\u0e32\u0e23\u0e17\u0e35\u0e48\u0e44\u0e14\u0e49\u0e15\u0e49\u0e2d\u0e07\u0e41\u0e2a\u0e14\u0e07\u0e27\u0e48\u0e32\u0e07\u0e32\u0e19\u0e14\u0e35\")\n>>> 8\n\npythai.contains_thai(u\"hello\")\n>>> False\n\npythai.contains_thai(u\"hello\u0e01\u0e32\u0e23\u0e17\u0e35\u0e48\u0e44\")\n>>> True\n```\n\nIt's meant to be fast and efficient enough to handle large documents without breaking a sweat.\n\nIncludes\n------------\n\nCurrently the library supports these functions:\n\n- Word segmentation (`split`)\n- Word count (`word_count`) (faster than counting the result of `split`)\n- Whether a string contains Thai or not (`contains_thai`)\n\n\nInstallation\n------------\n\nPyThai equires `thailib` to work. You can install it quite easily:\n\n sudo apt-get install thailib\n\nAnd then you can simply install `pythai` through **pip**:\n\n pip install pythai\n\nMore\n------------\n\nSpecial thanks to Vee Satayamas for the original python bindings of libthai from C.\n\nThis library was written for use in [Gengo](http://www.gengo.com). It's free and open-source under the GNU lesser public license. Any contributions are welcome!", "description_content_type": null, "docs_url": null, "download_url": "UNKNOWN", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/hermanschaaf/pythai", "keywords": "thai language linguistics segmentation", "license": "GNU", "maintainer": null, "maintainer_email": null, "name": "pythai", "package_url": "https://pypi.org/project/pythai/", "platform": "UNKNOWN", "project_url": "https://pypi.org/project/pythai/", "project_urls": { "Download": "UNKNOWN", "Homepage": "https://github.com/hermanschaaf/pythai" }, "release_url": "https://pypi.org/project/pythai/0.1.3/", "requires_dist": null, "requires_python": null, "summary": "Python functions for working with the Thai language", "version": "0.1.3" }, "last_serial": 716371, "releases": { "0.1.3": [ { "comment_text": "", "digests": { "md5": "fd4d3e134e8349ee6cdffe3a79a7e60c", "sha256": "5694a793b4295287894bf83b1ceebe8cd137ca53087ce8bf47f02543a46d2911" }, "downloads": -1, "filename": "pythai-0.1.3.tar.gz", "has_sig": false, "md5_digest": "fd4d3e134e8349ee6cdffe3a79a7e60c", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 13644, "upload_time": "2013-06-01T06:26:08", "url": "https://files.pythonhosted.org/packages/f5/61/416585a79955e3f6a61a72ea3c2ef6dc98f2d4e5549740a9d0b3c4abe791/pythai-0.1.3.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "fd4d3e134e8349ee6cdffe3a79a7e60c", "sha256": "5694a793b4295287894bf83b1ceebe8cd137ca53087ce8bf47f02543a46d2911" }, "downloads": -1, "filename": "pythai-0.1.3.tar.gz", "has_sig": false, "md5_digest": "fd4d3e134e8349ee6cdffe3a79a7e60c", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 13644, "upload_time": "2013-06-01T06:26:08", "url": "https://files.pythonhosted.org/packages/f5/61/416585a79955e3f6a61a72ea3c2ef6dc98f2d4e5549740a9d0b3c4abe791/pythai-0.1.3.tar.gz" } ] }