{ "info": { "author": "TakeLab", "author_email": "takelab@fer.hr", "bugtrack_url": null, "classifiers": [ "Development Status :: 4 - Beta", "Intended Audience :: Developers", "License :: OSI Approved :: MIT License", "Operating System :: OS Independent", "Programming Language :: Python :: 3" ], "description": "# spaCy + UDPipe\n\nThis package wraps the fast and efficient [UDPipe](http://ufal.mff.cuni.cz/udpipe) language-agnostic NLP pipeline\n(via its [Python bindings](https://github.com/ufal/udpipe/tree/master/bindings/python)), so you can use\n[UDPipe pre-trained models](https://lindat.mff.cuni.cz/repository/xmlui/handle/11234/1-2998) as a [spaCy](https://spacy.io/) pipeline for 50+ languages out-of-the-box.\nInspired by [spacy-stanfordnlp](https://github.com/explosion/spacy-stanfordnlp), this package offers slightly less accurate\nmodels that are in turn much faster (see benchmarks for [UDPipe](https://ufal.mff.cuni.cz/udpipe/models#universal_dependencies_24_models_performance) and [StanfordNLP](https://stanfordnlp.github.io/stanfordnlp/performance.html)).\n\n## Installation\n\nUse the package manager [pip](https://pip.pypa.io/en/stable/) to install spacy-udpipe.\n\n```bash\npip install spacy-udpipe\n```\n\nAfter installation, use `spacy_udpipe.download(lang)` to download the pre-trained model for the desired language.\n\n## Usage\nThe loaded UDPipeLanguage class returns a spaCy [`Language` object](https://spacy.io/api/language), i.e., the nlp object you can use to process text and create a [`Doc` object](https://spacy.io/api/doc).\n\n```python\nimport spacy_udpipe\n\nspacy_udpipe.download(\"en\") # download English model\n\ntext = \"Wikipedia is a free online encyclopedia, created and edited by volunteers around the world.\"\nnlp = spacy_udpipe.load(\"en\")\n\ndoc = nlp(text)\nfor token in doc:\n print(token.text, token.lemma_, token.pos_, token.dep_)\n\n```\nAs all attributes are computed once and set in the custom 
[`Tokenizer`](https://spacy.io/api/tokenizer), the `nlp.pipeline` is empty.\n\n## Authors and acknowledgment\nCreated by [Antonio \u0160ajatovi\u0107](http://github.com/asajatovic)\nduring an internship at [Text Analysis and Knowledge Engineering Lab (TakeLab)](http://takelab.fer.hr/).\n\n## Contributing\nPull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.\n\nPlease make sure to update tests as appropriate.\n\nTo start the tests, just run [`pytest`](https://docs.pytest.org/en/latest/contents.html) in the root source directory.\n\n## License\n[MIT](https://choosealicense.com/licenses/mit/) \u00a9 TakeLab\n\n## Project status\nMaintained by [Text Analysis and Knowledge Engineering Lab (TakeLab)](http://takelab.fer.hr/).\n\n## Notes\n* All available pre-trained models are licensed under [CC BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/).\n\n* All annotations match spaCy's, except for `token.tag_`, which maps from the [CoNLL](https://universaldependencies.org/format.html) XPOS tag (language-specific part-of-speech tag), defined for each language separately by the corresponding [Universal Dependencies](https://universaldependencies.org/) treebank.\n\n* The full list of supported languages and models is available in [`languages.json`](https://github.com/TakeLab/spacy-udpipe/blob/master/spacy_udpipe/languages.json).\n\n* This package exposes a `spacy_languages` entry point in its [`setup.py`](https://github.com/TakeLab/spacy-udpipe/blob/master/setup.py), so full support for serialization is enabled:\n ```python\n nlp = spacy_udpipe.load(\"en\")\n nlp.to_disk(\"./udpipe-spacy-model\")\n\n ```\n To properly load a saved model, you must pass the `udpipe_model` argument when loading it:\n ```python\n import spacy\n import spacy_udpipe\n\n udpipe_model = spacy_udpipe.UDPipeModel(\"en\")\n nlp = spacy.load(\"./udpipe-spacy-model\", udpipe_model=udpipe_model)\n\n ```\n\n\n", "description_content_type": "text/markdown", "docs_url": null, 
"download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/TakeLab/spacy-udpipe", "keywords": "udpipe spacy nlp", "license": "MIT", "maintainer": "", "maintainer_email": "", "name": "spacy-udpipe", "package_url": "https://pypi.org/project/spacy-udpipe/", "platform": "", "project_url": "https://pypi.org/project/spacy-udpipe/", "project_urls": { "Homepage": "https://github.com/TakeLab/spacy-udpipe", "Source": "https://github.com/TakeLab/spacy-udpipe", "SpaCy": "https://spacy.io/", "TakeLab": "http://takelab.fer.hr/", "Tracker": "https://github.com/TakeLab/spacy-udpipe/issues", "UDPipe": "http://ufal.mff.cuni.cz/udpipe" }, "release_url": "https://pypi.org/project/spacy-udpipe/0.0.4/", "requires_dist": [ "spacy (>=2.1.0)", "ufal.udpipe (>=1.2.0)" ], "requires_python": ">=3.6", "summary": "Use fast UDPipe models directly in spaCy", "version": "0.0.4" }, "last_serial": 5979055, "releases": { "0.0.1": [ { "comment_text": "", "digests": { "md5": "e7d1a26ddc0a8ad7519b716dc2455fbc", "sha256": "179d9eb242800491729f655729fb8dda2ea5e4c5d8a648479f3063d6be70729a" }, "downloads": -1, "filename": "spacy_udpipe-0.0.1-py3-none-any.whl", "has_sig": false, "md5_digest": "e7d1a26ddc0a8ad7519b716dc2455fbc", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6", "size": 10260, "upload_time": "2019-08-08T11:15:09", "url": "https://files.pythonhosted.org/packages/cd/84/36fe1eb95dcd96b8132dc379bdf738102a70e73f3adf786309b7b11d5076/spacy_udpipe-0.0.1-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "32f98832a8a57344841e219d92b5dd98", "sha256": "7ae5396af564619d4af21c8c1a1edd6df91d72e70f5ea835a228249ab34b4a18" }, "downloads": -1, "filename": "spacy-udpipe-0.0.1.tar.gz", "has_sig": false, "md5_digest": "32f98832a8a57344841e219d92b5dd98", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6", "size": 9683, "upload_time": "2019-08-08T11:15:11", "url": 
"https://files.pythonhosted.org/packages/4d/ad/2fd97b058b19f051741aa934a6c3307fe2ab2e10240d690f3a54073156d3/spacy-udpipe-0.0.1.tar.gz" } ], "0.0.2": [ { "comment_text": "", "digests": { "md5": "85939b2ec78510e2644e75499980a71a", "sha256": "e1aee20eb08dd82cd0471a6728295b9a247fdf439419bb7e3518dfc9e7704ad9" }, "downloads": -1, "filename": "spacy_udpipe-0.0.2-py3-none-any.whl", "has_sig": false, "md5_digest": "85939b2ec78510e2644e75499980a71a", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6", "size": 10450, "upload_time": "2019-08-08T15:01:45", "url": "https://files.pythonhosted.org/packages/98/da/857b64ff66fcf72c687076f8f574dd0073734ea3157e6f8e5578a5681dcc/spacy_udpipe-0.0.2-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "ec59802efd22a51daa10d0ad93c935cc", "sha256": "1208e0871e502b1898266b5de078086d63baa10a7fb0383a0bc27b51be82674b" }, "downloads": -1, "filename": "spacy-udpipe-0.0.2.tar.gz", "has_sig": false, "md5_digest": "ec59802efd22a51daa10d0ad93c935cc", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6", "size": 10785, "upload_time": "2019-08-08T15:01:47", "url": "https://files.pythonhosted.org/packages/24/99/b126fc43a2765b1d3a39a6317b6d6422df60396b357dd5e8ef728726f129/spacy-udpipe-0.0.2.tar.gz" } ], "0.0.3": [ { "comment_text": "", "digests": { "md5": "94842bbb21af0abe996fe6b013142c4f", "sha256": "55e79ba16672cfabd39df3b5b8f91077791838be6d77ce19e6efc18bdc7b6335" }, "downloads": -1, "filename": "spacy_udpipe-0.0.3-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "94842bbb21af0abe996fe6b013142c4f", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": ">=3.6", "size": 10398, "upload_time": "2019-08-21T09:44:55", "url": "https://files.pythonhosted.org/packages/70/4c/ef2175b18434ab0ca530fe38dc12fb4ba86f4047fa2e9d9925a2200cb715/spacy_udpipe-0.0.3-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "89b48596b4acd4fafa00ab8cb27ab369", "sha256": 
"87fd855a2c7cd5b578b88c5f5fc488aa026837130df10e5c46b42a46ec520ef3" }, "downloads": -1, "filename": "spacy_udpipe-0.0.3-py3-none-any.whl", "has_sig": false, "md5_digest": "89b48596b4acd4fafa00ab8cb27ab369", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6", "size": 10392, "upload_time": "2019-08-21T09:44:57", "url": "https://files.pythonhosted.org/packages/4c/68/6bccfb048bcf28147a2a261c986dddd988ad45325ce0a73278a77a55d3fc/spacy_udpipe-0.0.3-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "6f69ab7947018714592f539b8ca82c33", "sha256": "90774a79c50819b71bf678760cecb6ed35e0ce79e0529d43c7384dc844b05f7d" }, "downloads": -1, "filename": "spacy-udpipe-0.0.3.tar.gz", "has_sig": false, "md5_digest": "6f69ab7947018714592f539b8ca82c33", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6", "size": 10750, "upload_time": "2019-08-21T09:44:59", "url": "https://files.pythonhosted.org/packages/a8/01/88dc8fefac862910e51fb19a9cad43351e94bf5de2df0dc97393531958d3/spacy-udpipe-0.0.3.tar.gz" } ], "0.0.4": [ { "comment_text": "", "digests": { "md5": "ae1bca33a809cafafe9cab72deda4f5a", "sha256": "e5937b068d974958e2f5fe25829ff82ce9761848823fe3e63cfbb8d8bde3012f" }, "downloads": -1, "filename": "spacy_udpipe-0.0.4-py3-none-any.whl", "has_sig": false, "md5_digest": "ae1bca33a809cafafe9cab72deda4f5a", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6", "size": 11617, "upload_time": "2019-10-15T19:31:43", "url": "https://files.pythonhosted.org/packages/32/ec/92c72d31760876771f4feeb28b368b466175c4a5a2e5c71ce4438938a025/spacy_udpipe-0.0.4-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "6db98b9bb103d50cda1f9ff7bb68f9d2", "sha256": "5ed9684991a9c5c0a5933472dfc156f06c1c0dcfa1519d79346a9526ef94d75e" }, "downloads": -1, "filename": "spacy-udpipe-0.0.4.tar.gz", "has_sig": false, "md5_digest": "6db98b9bb103d50cda1f9ff7bb68f9d2", "packagetype": "sdist", "python_version": "source", 
"requires_python": ">=3.6", "size": 11216, "upload_time": "2019-10-15T19:31:45", "url": "https://files.pythonhosted.org/packages/a2/24/dec8e57afb68fab7f4f0c7acef9b498fbebaf1e2554457c30cf1cb7ffe47/spacy-udpipe-0.0.4.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "ae1bca33a809cafafe9cab72deda4f5a", "sha256": "e5937b068d974958e2f5fe25829ff82ce9761848823fe3e63cfbb8d8bde3012f" }, "downloads": -1, "filename": "spacy_udpipe-0.0.4-py3-none-any.whl", "has_sig": false, "md5_digest": "ae1bca33a809cafafe9cab72deda4f5a", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6", "size": 11617, "upload_time": "2019-10-15T19:31:43", "url": "https://files.pythonhosted.org/packages/32/ec/92c72d31760876771f4feeb28b368b466175c4a5a2e5c71ce4438938a025/spacy_udpipe-0.0.4-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "6db98b9bb103d50cda1f9ff7bb68f9d2", "sha256": "5ed9684991a9c5c0a5933472dfc156f06c1c0dcfa1519d79346a9526ef94d75e" }, "downloads": -1, "filename": "spacy-udpipe-0.0.4.tar.gz", "has_sig": false, "md5_digest": "6db98b9bb103d50cda1f9ff7bb68f9d2", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6", "size": 11216, "upload_time": "2019-10-15T19:31:45", "url": "https://files.pythonhosted.org/packages/a2/24/dec8e57afb68fab7f4f0c7acef9b498fbebaf1e2554457c30cf1cb7ffe47/spacy-udpipe-0.0.4.tar.gz" } ] }