{ "info": { "author": "Jehan", "author_email": "tinysegmenter@zemarmot.net", "bugtrack_url": null, "classifiers": [ "Development Status :: 4 - Beta", "License :: OSI Approved :: BSD License", "Operating System :: POSIX :: Linux", "Programming Language :: Python", "Topic :: Scientific/Engineering :: Artificial Intelligence", "Topic :: Scientific/Engineering :: Information Analysis", "Topic :: Text Processing :: Linguistic" ], "description": "TinySegmenter\n=============\n\n\u201cTinySegmenter in Python\u201d is a Python port_ by Masato Hagiwara of TinySegmenter_, which is an extremely compact Japanese tokenizer originally written in JavaScript by Mr. Taku Kudo.\n\nThe library has been finally packaged by Jehan. It resulted into this fork because Masako Hagiwara did not answer emails, and packaging patches\ncould therefore not be committed upstream. But this is a friendly fork, and Masako Hagiwara is welcome to take back maintainance over his\nproject.\nFor the time being, I (Jehan) took up the maintenance, so please refer to this new website_ as being official, and\ndirect any new patch_ there. I will follow up on patchs and bug reports, but probably won't maintain an active development. Anyone wishing to\nimprove the library is welcome to participate and will be gladly given committer rights.\n\nIt works on Python 2.6 or above (works on Python 3 too).\n\n.. _port: http://lilyx.net/tinysegmenter-in-python/\n.. _TinySegmenter: http://chasen.org/~taku/software/TinySegmenter/\n.. _website: http://tinysegmenter.tuxfamily.org/\n\nAuthors\n-------\n\nSee all authors and contributors in ``AUTHORS`` file.\n\nDownload and Installation\n-------------------------\n\nThis library can be installed the common ways: with a setup.py, as a pip package...\nSee the ``INSTALL`` file in the package for more details.\n\nIf you simply want to download the source package, refer to the pypi repository: http://pypi.python.org/pypi/tinysegmenter\n\nDevelopment version can be downloaded anonymously at the Git repository::\n\n $ git clone git://git.tuxfamily.org/gitroot/tinysegmente/tinysegmenter.git\n\nor browsed online at: http://git.tuxfamily.org/tinysegmente/tinysegmenter/\n\nUsage\n-----\n\nExample code for direct usage::\n\n > import tinysegmenter\n > segmenter = tinysegmenter.TinySegmenter() \n > print(' | '.join(segmenter.tokenize(u\"\u79c1\u306e\u540d\u524d\u306f\u4e2d\u91ce\u3067\u3059\")))\n \u79c1 | \u306e | \u540d\u524d | \u306f | \u4e2d\u91ce | \u3067\u3059 \n\n\nTinySegmenter\u2018s interface is compatible with ``NLTK``\u2019s ``TokenizerI`` class, although the distribution does not directly depend on NLTK.\nHere is one way to use it as a tokenizer in NLTK (order of the multiple base classes matters)::\n\n import nltk.tokenize.api\n\n class myTinySegmenter(tinysegmenter.TinySegmenter, nltk.tokenize.api.TokenizerI):\n pass\n segmenter = myTinySegmenter()\n # This segmenter can be used any place which expects a NLTK's TokenizerI subclass.\n\nFor more about NLTK (*Natural Language Toolkit* module), see: http://nltk.org/api/nltk.tokenize.html#nltk.tokenize.api.TokenizerI\n\n.. _patch:\n\nContact, Bugs and Contributing\n------------------------------\n\nAll bug, patch, question, etc. can be sent to `tinysegmenter` at `zemarmot` dot `net`.\n\nLicense\n-------\n\nThis package is distributed under a New BSD License (see ``COPYING`` file).", "description_content_type": "", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "http://tinysegmenter.tuxfamily.org/", "keywords": "", "license": "New BSD", "maintainer": "", "maintainer_email": "", "name": "tinysegmenter", "package_url": "https://pypi.org/project/tinysegmenter/", "platform": "", "project_url": "https://pypi.org/project/tinysegmenter/", "project_urls": { "Homepage": "http://tinysegmenter.tuxfamily.org/" }, "release_url": "https://pypi.org/project/tinysegmenter/0.4/", "requires_dist": null, "requires_python": "", "summary": "Very compact Japanese tokenizer", "version": "0.4" }, "last_serial": 4276195, "releases": { "0.1": [ { "comment_text": "", "digests": { "md5": "482525bea160b0b16571e5f6bcef4a9f", "sha256": "2f19799e1cbd5877e7e101d74240eac21d4d224e5036fcfa58fc8e82ca642468" }, "downloads": -1, "filename": "tinysegmenter-0.1.tar.gz", "has_sig": false, "md5_digest": "482525bea160b0b16571e5f6bcef4a9f", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 12205, "upload_time": "2012-07-25T08:53:46", "url": "https://files.pythonhosted.org/packages/f0/82/8a71b37e1c1f8c14e6bd95c2c49058bb05d35ca9c28e0efa2a9fb2c3039e/tinysegmenter-0.1.tar.gz" } ], "0.2": [ { "comment_text": "", "digests": { "md5": "e509250e0ff9d8613ba370e998705bde", "sha256": "2a4f05988ea7f3fc635988c513a5dc2b5be7fe1aec579f411aecb08357e7576c" }, "downloads": -1, "filename": "tinysegmenter-0.2.tar.gz", "has_sig": false, "md5_digest": "e509250e0ff9d8613ba370e998705bde", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 15690, "upload_time": "2014-10-11T00:26:41", "url": "https://files.pythonhosted.org/packages/22/34/07e562e956e0bf5562547b642440741d3dba314271fe81fc562cbe75bd48/tinysegmenter-0.2.tar.gz" } ], "0.3": [ { "comment_text": "", "digests": { "md5": "7a6c74afa0816b853f5be5eeed6e9eaa", "sha256": "ed1f6d2e806a4758a73be589754384cbadadc7e1a414c81a166fc9adf2d40c6d" }, "downloads": -1, "filename": "tinysegmenter-0.3.tar.gz", "has_sig": false, "md5_digest": "7a6c74afa0816b853f5be5eeed6e9eaa", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 16893, "upload_time": "2017-07-23T11:18:29", "url": "https://files.pythonhosted.org/packages/17/82/86982e4b6d16e4febc79c2a1d68ee3b707e8a020c5d2bc4af8052d0f136a/tinysegmenter-0.3.tar.gz" } ], "0.4": [ { "comment_text": "", "digests": { "md5": "c6e04dba1216b5a8971f62680f294a3a", "sha256": "64458bdba54ba7482c02c785f960cf3d6cf61594a4dab3d64c325439094fc1b7" }, "downloads": -1, "filename": "tinysegmenter-0.4.tar.gz", "has_sig": false, "md5_digest": "c6e04dba1216b5a8971f62680f294a3a", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 16898, "upload_time": "2018-09-16T09:30:29", "url": "https://files.pythonhosted.org/packages/9c/70/488895cb11e160b548c9ba5847c171b65b86a8ca1e54d206d55b2976bf7b/tinysegmenter-0.4.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "c6e04dba1216b5a8971f62680f294a3a", "sha256": "64458bdba54ba7482c02c785f960cf3d6cf61594a4dab3d64c325439094fc1b7" }, "downloads": -1, "filename": "tinysegmenter-0.4.tar.gz", "has_sig": false, "md5_digest": "c6e04dba1216b5a8971f62680f294a3a", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 16898, "upload_time": "2018-09-16T09:30:29", "url": "https://files.pythonhosted.org/packages/9c/70/488895cb11e160b548c9ba5847c171b65b86a8ca1e54d206d55b2976bf7b/tinysegmenter-0.4.tar.gz" } ] }