{ "info": { "author": "Mikhail Korobov", "author_email": "kmike84@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 3 - Alpha", "Intended Audience :: Developers", "Intended Audience :: Science/Research", "License :: OSI Approved :: MIT License", "Natural Language :: Russian", "Programming Language :: Python", "Programming Language :: Python :: 2", "Programming Language :: Python :: 2.7", "Programming Language :: Python :: 3", "Programming Language :: Python :: 3.4", "Programming Language :: Python :: 3.5", "Programming Language :: Python :: 3.6", "Topic :: Scientific/Engineering :: Information Analysis", "Topic :: Software Development :: Libraries :: Python Modules", "Topic :: Text Processing :: Linguistic" ], "description": "===============\nrussian-tagsets\n===============\n\n.. image:: https://img.shields.io/pypi/v/russian-tagsets.svg\n :target: https://pypi.python.org/pypi/russian-tagsets\n\n.. image:: https://img.shields.io/pypi/l/russian-tagsets.svg\n :target: https://github.com/kmike/russian-tagsets/blob/master/LICENSE\n\n.. image:: https://img.shields.io/travis/kmike/russian-tagsets.svg\n :target: https://travis-ci.org/kmike/russian-tagsets\n\n``russian-tagsets`` - \u0431\u0438\u0431\u043b\u0438\u043e\u0442\u0435\u043a\u0430 \u0434\u043b\u044f \u043f\u0440\u0435\u043e\u0431\u0440\u0430\u0437\u043e\u0432\u0430\u043d\u0438\u044f \u043c\u0435\u0436\u0434\u0443\n\u0440\u0430\u0437\u043b\u0438\u0447\u043d\u044b\u043c\u0438 \u0444\u043e\u0440\u043c\u0430\u0442\u0430\u043c\u0438 \u043e\u0431\u043e\u0437\u043d\u0430\u0447\u0435\u043d\u0438\u044f \u0433\u0440\u0430\u043c\u043c\u0430\u0442\u0438\u0447\u0435\u0441\u043a\u043e\u0439 \u0438\u043d\u0444\u043e\u0440\u043c\u0430\u0446\u0438\u0438\n\u0434\u043b\u044f \u0441\u043b\u043e\u0432 \u0440\u0443\u0441\u0441\u043a\u043e\u0433\u043e \u044f\u0437\u044b\u043a\u0430. \u041b\u0438\u0446\u0435\u043d\u0437\u0438\u044f - MIT.\n\n\u041d\u0430 \u0434\u0430\u043d\u043d\u044b\u0439 \u043c\u043e\u043c\u0435\u043d\u0442 \u043f\u043e\u0434\u0434\u0435\u0440\u0436\u0438\u0432\u0430\u0435\u0442\u0441\u044f (\u0441 \u0440\u0430\u0437\u043d\u043e\u0439 \u0441\u0442\u0435\u043f\u0435\u043d\u044c\u044e \u043a\u043e\u0440\u0440\u0435\u043a\u0442\u043d\u043e\u0441\u0442\u0438)\n\u043f\u0440\u0435\u043e\u0431\u0440\u0430\u0437\u043e\u0432\u0430\u043d\u0438\u0435 \u043c\u0435\u0436\u0434\u0443 \u0441\u043b\u0435\u0434\u0443\u044e\u0449\u0438\u043c\u0438 \u0444\u043e\u0440\u043c\u0430\u0442\u0430\u043c\u0438:\n\n* `OpenCorpora`_ (\u0432.\u0442.\u0447. \u0440\u0443\u0441\u0441\u043a\u0438\u0435 \u0441\u043b\u043e\u0432\u0430\u0440\u0438 pymorphy2_);\n* `aot.ru`_ (\u0432.\u0442.\u0447. pymorphy_ 0.5.6);\n* `\u0414\u0438\u0430\u043b\u043e\u0433-2010`_;\n* `A Positional Tagset for Russian`_ (Jirka Hana and Anna Feldman, 2010);\n* `\u041d\u041a\u0420\u042f`_;\n* `Universal Dependencies`_ (v1.4, v2.0, Dialog-2017_);\n\n.. _OpenCorpora: http://opencorpora.org/dict.php?act=gram\n.. _aot.ru: http://aot.ru/docs/rusmorph.html\n.. _pymorphy: https://pymorphy.readthedocs.io/en/v0.5.6/ref/gram_info_ru.html\n.. _pymorphy2: https://github.com/kmike/pymorphy2\n.. _\u0414\u0438\u0430\u043b\u043e\u0433-2010: http://ru-eval.ru/\n.. _A Positional Tagset for Russian: http://ufal.mff.cuni.cz/~hana/morph/rutags.html\n.. _\u041d\u041a\u0420\u042f: http://www.ruscorpora.ru/corpora-morph.html\n.. _Universal Dependencies: http://universaldependencies.org/ru/pos/all.html\n.. _Dialog-2017: http://www.dialog-21.ru/evaluation/2017/morphorueval/\n\n\u0418\u0434\u0435\u044f \u0442\u0430\u043a\u0430\u044f: \u0434\u043b\u044f \u043a\u0430\u0436\u0434\u043e\u0433\u043e \u0444\u043e\u0440\u043c\u0430\u0442\u0430 \u0432 \u0431\u0438\u0431\u043b\u0438\u043e\u0442\u0435\u043a\u0435 \u043e\u043f\u0440\u0435\u0434\u0435\u043b\u044f\u0435\u0442\u0441\u044f\n\u043d\u0435\u0441\u043a\u043e\u043b\u044c\u043a\u043e \u0444\u0443\u043d\u043a\u0446\u0438\u0439, \u043a\u043e\u0442\u043e\u0440\u044b\u0435 \u043f\u0440\u0435\u043e\u0431\u0440\u0430\u0437\u0443\u044e\u0442 \u0442\u0435\u0433 \u0432 \u0434\u0440\u0443\u0433\u0438\u0435 \u0444\u043e\u0440\u043c\u0430\u0442\u044b;\n\u0435\u0441\u043b\u0438 \u043a\u0430\u043a\u043e\u0435-\u0442\u043e \u043f\u0440\u0435\u043e\u0431\u0440\u0430\u0437\u043e\u0432\u0430\u043d\u0438\u0435 \u044f\u0432\u043d\u044b\u043c \u043e\u0431\u0440\u0430\u0437\u043e\u043c \u043d\u0435 \u043e\u043f\u0440\u0435\u0434\u0435\u043b\u0435\u043d\u043e,\n``russian-tagsets`` \u043f\u044b\u0442\u0430\u0435\u0442\u0441\u044f \u043f\u043e\u0441\u0442\u0440\u043e\u0438\u0442\u044c \u0446\u0435\u043f\u043e\u0447\u043a\u0443 \u043f\u0440\u0435\u043e\u0431\u0440\u0430\u0437\u043e\u0432\u0430\u043d\u0438\u0439,\n\u043f\u0440\u0438\u0432\u043e\u0434\u044f\u0449\u0443\u044e \u043a \u043d\u0443\u0436\u043d\u043e\u043c\u0443 \u0440\u0435\u0437\u0443\u043b\u044c\u0442\u0430\u0442\u0443.\n\n.. warning::\n\n \u041f\u0440\u0435\u043e\u0431\u0440\u0430\u0437\u043e\u0432\u0430\u043d\u0438\u0435 \u043f\u043e\u0447\u0442\u0438 \u043d\u0438\u043a\u043e\u0433\u0434\u0430 \u043d\u0435 \u0432\u044b\u0445\u043e\u0434\u0438\u0442 \u043e\u0441\u0443\u0449\u0435\u0441\u0442\u0432\u0438\u0442\u044c \u0431\u0435\u0437 \u043f\u043e\u0442\u0435\u0440\u044c;\n \u0435\u0441\u043b\u0438 \u043c\u043e\u0436\u043d\u043e \u044d\u0442\u0443 \u0431\u0438\u0431\u043b\u0438\u043e\u0442\u0435\u043a\u0443 \u043d\u0435 \u0438\u0441\u043f\u043e\u043b\u044c\u0437\u043e\u0432\u0430\u0442\u044c, \u0442\u043e \u043b\u0443\u0447\u0448\u0435 \u0438 \u043d\u0435 \u0438\u0441\u043f\u043e\u043b\u044c\u0437\u043e\u0432\u0430\u0442\u044c.\n\n\u0423\u0441\u0442\u0430\u043d\u043e\u0432\u043a\u0430\n=========\n\npip install russian-tagsets\n\n\u0418\u0441\u043f\u043e\u043b\u044c\u0437\u043e\u0432\u0430\u043d\u0438\u0435\n=============\n\n\"\u0410\u0432\u0442\u043e\u043c\u0430\u0442\u0438\u0447\u0435\u0441\u043a\u0438\u0439\" \u0440\u0435\u0436\u0438\u043c (\u0431\u0438\u0431\u043b\u0438\u043e\u0442\u0435\u043a\u0430 \u043f\u044b\u0442\u0430\u0435\u0442\u0441\u044f \u043d\u0430\u0439\u0442\u0438 \u043f\u0443\u0442\u0438 \u043f\u0440\u0435\u043e\u0431\u0440\u0430\u0437\u043e\u0432\u0430\u043d\u0438\u044f)::\n\n >>> from russian_tagsets import converters\n >>> to_aot = converters.converter('opencorpora-int', 'aot')\n >>> to_aot(\"NOUN,anim,masc plur,nomn\")\n \u0421,\u043e\u0434,\u043c\u0440,\u043c\u043d,\u0438\u043c\n\n\u041f\u043e\u0434\u0434\u0435\u0440\u0436\u0438\u0432\u0430\u0435\u043c\u044b\u0435 \u043d\u0430\u043f\u0440\u044f\u043c\u0443\u044e \u0444\u043e\u0440\u043c\u0430\u0442\u044b::\n\n >>> converters.get_supported()\n [('positional', 'aot'),\n ('aot', 'positional'),\n ('aot', 'dialog2010'),\n ('dialog2010', 'aot'),\n ('opencorpora-int', 'opencorpora-ext'),\n ('opencorpora-int', 'ruscorpora'),\n ('opencorpora-int', 'ud14'),\n ('opencorpora-int', 'ud20'),\n ('opencorpora-int', 'dialog2017'),\n ('opencorpora-ext', 'opencorpora-int'),\n ('opencorpora-ext', 'aot')]\n\n\"\u0420\u0443\u0447\u043d\u043e\u0439\" \u0440\u0435\u0436\u0438\u043c (\u0431\u044b\u0441\u0442\u0440\u0435\u0435, \u043f\u0440\u0435\u043e\u0431\u0440\u0430\u0437\u043e\u0432\u0430\u043d\u0438\u0435 \u0442\u043e\u043b\u044c\u043a\u043e \u043d\u0430\u043f\u0440\u044f\u043c\u0443\u044e)::\n\n >>> import russian_tagsets\n >>> russian_tagsets.aot.to_positional(u\"\u0421,\u043e\u0434,\u043c\u0440,\u043c\u043d,\u0438\u043c\")\n Tag(\"NNMAP1-------A--\")\n\n\u0420\u0430\u0437\u0440\u0430\u0431\u043e\u0442\u043a\u0430\n==========\n\n* \u0418\u0441\u0445\u043e\u0434\u043d\u044b\u0439 \u043a\u043e\u0434: https://github.com/kmike/russian-tagsets\n* \u0411\u0430\u0433-\u0442\u0440\u0435\u043a\u0435\u0440: https://github.com/kmike/russian-tagsets/issues\n\n\u0414\u043b\u044f \u0437\u0430\u043f\u0443\u0441\u043a\u0430 \u0442\u0435\u0441\u0442\u043e\u0432 \u0432\u044b\u043f\u043e\u043b\u043d\u0438\u0442\u0435\n\n tox\n\n\u0438\u0437 \u043f\u0430\u043f\u043a\u0438 \u0441 \u0440\u0435\u043f\u043e\u0437\u0438\u0442\u043e\u0440\u0438\u0435\u043c.\n\n\nChanges\n=======\n\n0.6 (2017-06-18)\n----------------\n\n- OpenCorpora -> Universal Dependencies conversion support (UD 1.4, UD 2.0,\n Dialog 2017 tagset).\n- ``convert`` function accepts optional ``word`` argument - conversion rules\n may now depend on a word itself, not only on its tag;\n- Python 3.2, 3.3 and 2.6 support is dropped;\n- packaging improvements.\n\n0.5.2 (2014-08-19)\n------------------\n\n- Handle \"abbr\" grammeme in OpenCorpora -> RusCorpora conversion.\n\n0.5.1 (2014-04-23)\n------------------\n\n- Added support for Init OpenCorpora grammeme.\n\n0.5 (2014-04-22)\n----------------\n\n- Added support for initials in OpenCorpora -> RusCorpora conversion rules;\n- small code cleanup.\n\n0.4 (2014-03-14)\n----------------\n\n- Improved OpenCorpora -> RusCorpora rules;\n- Anph grammeme is added to OpenCorpora rules.\n\n0.3.1 (2013-04-12)\n------------------\n\n- Improved OpenCorpora -> RusCorpora rules.\n\n0.3 (2013-03-21)\n----------------\n\n- Improved OpenCorpora -> RusCorpora rules.\n\n0.2 (2013-02-07)\n----------------\n\n- OpenCorpora -> RusCorpora conversion;\n- DSL for writing conversion routines.\n\n0.1 (2012-10-07)\n----------------\n\nInitial release\n", "description_content_type": null, "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/kmike/russian-tagsets/", "keywords": "", "license": "MIT license", "maintainer": "", "maintainer_email": "", "name": "russian-tagsets", "package_url": "https://pypi.org/project/russian-tagsets/", "platform": "", "project_url": "https://pypi.org/project/russian-tagsets/", "project_urls": { "Homepage": "https://github.com/kmike/russian-tagsets/" }, "release_url": "https://pypi.org/project/russian-tagsets/0.6/", "requires_dist": null, "requires_python": "", "summary": "Russian tagset converters library", "version": "0.6" }, "last_serial": 2956989, "releases": { "0.1": [ { "comment_text": "", "digests": { "md5": "e3a9e07b0f59d635b9c950b6e8fb74f2", "sha256": "9df36165aee03e702c7d4201d46c3e4c596f0e4f458b5330ff13a5eb2a544912" }, "downloads": -1, "filename": "russian-tagsets-0.1.tar.gz", "has_sig": false, "md5_digest": "e3a9e07b0f59d635b9c950b6e8fb74f2", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 15588, "upload_time": "2012-10-06T22:22:05", "url": "https://files.pythonhosted.org/packages/ed/95/80b9c4b903d2092810c585825d419fbf8383af2fcca3be2c39d151106f0a/russian-tagsets-0.1.tar.gz" } ], "0.2": [ { "comment_text": "", "digests": { "md5": "192c6410551bbe04d1e7d0f8b8de0b9f", "sha256": "0865c9b9f7d0e21ed62c0c66c7d1625a7cc3957ebf427fc035a8d7cbc1d8d1d8" }, "downloads": -1, "filename": "russian-tagsets-0.2.tar.gz", "has_sig": false, "md5_digest": "192c6410551bbe04d1e7d0f8b8de0b9f", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 17356, "upload_time": "2013-02-06T22:36:16", "url": "https://files.pythonhosted.org/packages/45/11/e1d453b9740c7681fec211fa0f5d2adefaaac8b85f0429b0ab976054c018/russian-tagsets-0.2.tar.gz" } ], "0.3": [ { "comment_text": "", "digests": { "md5": "970a18240b2a04f353951568d5fa910b", "sha256": "6d079d0fcee9bbca8d8d552cb6625db66afc40a22a5f32c481cdd30e8b3505ab" }, "downloads": -1, "filename": "russian-tagsets-0.3.tar.gz", "has_sig": false, "md5_digest": "970a18240b2a04f353951568d5fa910b", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 17484, "upload_time": "2013-03-21T00:11:06", "url": "https://files.pythonhosted.org/packages/62/7f/1eff49c3bc7458cc1f08859413a010acdc348733826ab29486d4af064149/russian-tagsets-0.3.tar.gz" } ], "0.3.1": [ { "comment_text": "", "digests": { "md5": "febf6ec33cc9e4909c3e2019116103a3", "sha256": "590e1d3974a386d9342012318c70768cc4b1655c5002af09fa583eaba77e2e93" }, "downloads": -1, "filename": "russian-tagsets-0.3.1.tar.gz", "has_sig": false, "md5_digest": "febf6ec33cc9e4909c3e2019116103a3", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 17495, "upload_time": "2013-04-12T12:19:51", "url": "https://files.pythonhosted.org/packages/cd/9b/780a05122f830a174adcabae3837afdfbc105a40f32159d6f35294f42569/russian-tagsets-0.3.1.tar.gz" } ], "0.4": [ { "comment_text": "", "digests": { "md5": "577c29b4e4ca2a43a7b52c692c9ab196", "sha256": "7d221430426664b41811a4f53513dc2b7f3f98caf9d3088dc8c59dfc1478126a" }, "downloads": -1, "filename": "russian-tagsets-0.4.tar.gz", "has_sig": false, "md5_digest": "577c29b4e4ca2a43a7b52c692c9ab196", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 17881, "upload_time": "2014-03-29T23:21:06", "url": "https://files.pythonhosted.org/packages/a7/1e/e5a9fd5a741c46763eae052d75fcd6cdf8024d2580e3aad22488ddbf41c0/russian-tagsets-0.4.tar.gz" } ], "0.5": [ { "comment_text": "", "digests": { "md5": "b3e1e37a2b4f0e4e986fbf2062b6a9e6", "sha256": "280eef0de3d2d3467f261adf51ed8493dd33556d9c0fe84b574876e0c000a940" }, "downloads": -1, "filename": "russian-tagsets-0.5.tar.gz", "has_sig": false, "md5_digest": "b3e1e37a2b4f0e4e986fbf2062b6a9e6", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 18034, "upload_time": "2014-04-21T22:44:42", "url": "https://files.pythonhosted.org/packages/32/5f/1c380810627debb2170c86a0d96d56e68312d90a576f6be049d3ef242fbe/russian-tagsets-0.5.tar.gz" } ], "0.5.1": [ { "comment_text": "", "digests": { "md5": "eb9383feab1b31c059fd7d5f225f29e8", "sha256": "e60dd813a89dbd949545e08c60353c28bb0aff42e00786393b6207b299f9adac" }, "downloads": -1, "filename": "russian-tagsets-0.5.1.tar.gz", "has_sig": false, "md5_digest": "eb9383feab1b31c059fd7d5f225f29e8", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 18094, "upload_time": "2014-04-23T09:39:25", "url": "https://files.pythonhosted.org/packages/f3/66/0c0fbde56c8a9ce207b6b854306df522da84ca6362eddcd15236415b08cc/russian-tagsets-0.5.1.tar.gz" } ], "0.5.2": [ { "comment_text": "", "digests": { "md5": "2a8401aa53a3d810885ccf94fcb479b3", "sha256": "1c8c1a6dbf0477ff9164782f8340692057a3b3f3afe73f779f2addded8ad2fb9" }, "downloads": -1, "filename": "russian-tagsets-0.5.2.tar.gz", "has_sig": false, "md5_digest": "2a8401aa53a3d810885ccf94fcb479b3", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 18165, "upload_time": "2014-08-19T17:04:47", "url": "https://files.pythonhosted.org/packages/8c/71/15efe68060597278296bd1410eb1492dd7991140472a5693765cfd5a529e/russian-tagsets-0.5.2.tar.gz" } ], "0.6": [ { "comment_text": "", "digests": { "md5": "fba1ccad6845e289614aea34265f1350", "sha256": "127f1dd25d569d96359904586587a8bc1b6237342318f38e02df025f076b31fd" }, "downloads": -1, "filename": "russian-tagsets-0.6.tar.gz", "has_sig": false, "md5_digest": "fba1ccad6845e289614aea34265f1350", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 23256, "upload_time": "2017-06-18T00:25:50", "url": "https://files.pythonhosted.org/packages/2d/b1/c9377d472a04fb9b84f59365560d68b5d868b589691f32545eb606b3be48/russian-tagsets-0.6.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "fba1ccad6845e289614aea34265f1350", "sha256": "127f1dd25d569d96359904586587a8bc1b6237342318f38e02df025f076b31fd" }, "downloads": -1, "filename": "russian-tagsets-0.6.tar.gz", "has_sig": false, "md5_digest": "fba1ccad6845e289614aea34265f1350", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 23256, "upload_time": "2017-06-18T00:25:50", "url": "https://files.pythonhosted.org/packages/2d/b1/c9377d472a04fb9b84f59365560d68b5d868b589691f32545eb606b3be48/russian-tagsets-0.6.tar.gz" } ] }