{ "info": { "author": "Explosion AI", "author_email": "contact@explosion.ai", "bugtrack_url": null, "classifiers": [], "description": "# library.qai.spacy\n\nCustomized SpaCy pipeline\n\n## Installing\n\nThis is available on [PyPi](https://pypi.org/project/en-qai-sm/)\n\n```sh\n$ pip install en-qai-sm\n> installs the package and deps including spacy\n```\n\n## Usage\n\n```python\nimport spacy\nnlp = spacy.load('en_qai_sm')\n>>> doc = nlp(\"I ain't got no hands!\")\n>>> for token in doc: print(token, token.pos_)\n...\nI PRON\nain't VERB\ngot VERB\nno DET\nhands NOUN\n! PUNCT\n```\n\n## About SpaCy pipelines\n\nDefault spaCy pipeline consists of 4 steps (components):\n\n![spaCy pipeline](img/pipeline.png)\n\n- `tokenizer` - splits text into tokens\n- `tagger` - assigns part-of-speech tags\n- `parser` - assigns dependency labels\n- `ner` - detects and label named entities\n\nCustom components (ex. any functions on `doc`) can be inserted into the pipeline (at any place after the `tokenizer`. For simplicity, `tokenizer` is not listed in pipelines descriptions.)\n\nReference: [spaCy docs](https://spacy.io/usage/processing-pipelines).\n\n## Pipeline components\n\n### v1.0.0\n\nThe pipeline consists of:\n\n```python\npipeline = [\n \"merge_matcher\",\n \"tagger\",\n \"parser\",\n \"ner\"\n ]\n```\n\nwhere ```merge_matcher``` matches and merges into 1 token spans of type:\n\n- connected by hyphens ex. ```rock-hard```\n- contractions ex. ```don't```\n- special (informal) short forms ex. ```gonna```\n\n\n## License\n\nAs this is just a small extension of spaCy's `en_core_web_sm`, we include the same license - MIT.", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://explosion.ai", "keywords": "", "license": "MIT", "maintainer": "", "maintainer_email": "", "name": "en-qai-sm", "package_url": "https://pypi.org/project/en-qai-sm/", "platform": "", "project_url": "https://pypi.org/project/en-qai-sm/", "project_urls": { "Homepage": "https://explosion.ai" }, "release_url": "https://pypi.org/project/en-qai-sm/1.2.1/", "requires_dist": null, "requires_python": "", "summary": "English multi-task CNN trained on OntoNotes. Assigns context-specific token vectors, POS tags, dependency parse and named entities.", "version": "1.2.1" }, "last_serial": 5739050, "releases": { "0.0.0": [ { "comment_text": "", "digests": { "md5": "11b65ebc74e83a790d46ab14583f6848", "sha256": "61978baae8e053ad42bf0be72650586991a5b5c42e716a13f5788efa4bcd5de8" }, "downloads": -1, "filename": "en_qai_sm-0.0.0.tar.gz", "has_sig": false, "md5_digest": "11b65ebc74e83a790d46ab14583f6848", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 11077249, "upload_time": "2019-08-21T21:34:27", "url": "https://files.pythonhosted.org/packages/8f/4b/87d4951daa2f41c4e50d15b2178269f3dfc86d5be2c2e3d04babda75c327/en_qai_sm-0.0.0.tar.gz" } ], "1.2.0": [ { "comment_text": "", "digests": { "md5": "453b6e90fae6ac363919197974985257", "sha256": "7b85ac3729a80b3989038fd4fba01422785c11dcc273ef04ef7fa6355e5299b8" }, "downloads": -1, "filename": "en_qai_sm-1.2.0.tar.gz", "has_sig": false, "md5_digest": "453b6e90fae6ac363919197974985257", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 51233241, "upload_time": "2019-08-27T18:06:56", "url": "https://files.pythonhosted.org/packages/bf/13/d5dc2b7321931beec9bcb50e5a95886f650a0741e1f487d1921c776b4a25/en_qai_sm-1.2.0.tar.gz" } ], "1.2.1": [ { "comment_text": "", "digests": { "md5": "7e1ed5c53ba4721245c33bfb1c950d03", "sha256": "e8c96f6ae94cc7eb0c16e855de2b25f59c71e9121cef95234c19d51a99d52255" }, "downloads": -1, "filename": "en_qai_sm-1.2.1.tar.gz", "has_sig": false, "md5_digest": "7e1ed5c53ba4721245c33bfb1c950d03", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 51233216, "upload_time": "2019-08-27T19:03:44", "url": "https://files.pythonhosted.org/packages/e5/0d/3df9531b4c60d1ad99009822baf80aef85b5850535375b99ab931d6d07b4/en_qai_sm-1.2.1.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "7e1ed5c53ba4721245c33bfb1c950d03", "sha256": "e8c96f6ae94cc7eb0c16e855de2b25f59c71e9121cef95234c19d51a99d52255" }, "downloads": -1, "filename": "en_qai_sm-1.2.1.tar.gz", "has_sig": false, "md5_digest": "7e1ed5c53ba4721245c33bfb1c950d03", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 51233216, "upload_time": "2019-08-27T19:03:44", "url": "https://files.pythonhosted.org/packages/e5/0d/3df9531b4c60d1ad99009822baf80aef85b5850535375b99ab931d6d07b4/en_qai_sm-1.2.1.tar.gz" } ] }