{ "info": { "author": "Dominika Basaj, Barbara Rychalska, Alicja Gosiewska, Adam S\u0142ucki", "author_email": "adam.slucki@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 4 - Beta", "License :: OSI Approved :: BSD License", "Topic :: Text Processing" ], "description": "[![Documentation Status](https://readthedocs.org/projects/nlp-demo/badge/?version=latest)](https://nlp-demo.readthedocs.io/en/latest/?badge=latest)\n[![Downloads](https://pepy.tech/badge/wild-nlp)](https://pepy.tech/project/wild-nlp)\n[![PyPI version](https://badge.fury.io/py/wild-nlp.svg)](https://badge.fury.io/py/wild-nlp)\n\n![alt wildnlp-logo](logo.png) \n\nCorrupt an input text to test NLP models' robustness. \nFor details refer to https://nlp-demo.readthedocs.io\n\n## Installation\n`pip install wild-nlp`\n\n## Supported aspects\nAll together we defined and implemented 11 aspects of text corruption.\n\n1. **Articles**\n\n Randomly removes or swaps articles into wrong ones.\n\n2. **Digits2Words**\n\n Converts numbers into words. Handles floating numbers as well.\n\n3. **Misspellings**\n\n Misspells words appearing in the Wikipedia list of: \n * **commonly misspelled English words** \n * **homophones**\n\n4. **Punctuation**\n\n Randomly adds or removes specified punctuation marks.\n\n5. **QWERTY**\n\n Simulates errors made while writing on a QWERTY-type keyboard.\n\n6. **RemoveChar**\n\n Randomly removes: \n * **characters** from words or \n * **white spaces** from sentences\n\n7. **SentimentMasking**\n\n Replaces random, single character with for example an asterisk in: \n * **negative** or \n * **positive** words from Opinion Lexicon: \n http://www.cs.uic.edu/~liub/FBS/sentiment-analysis.html\n\n8. **Swap**\n\n Randomly swaps two characters within a word, excluding punctuations.\n\n9. **Change char**\n\n Randomly change characters according to chosen dictionary, default is 'ocr' to simulate simple OCR errors.\n\n10. **White spaces**\n\n Randomly add or remove white spaces (listed as a parameter).\n\n11. **Sub string**\n\n Randomly add a substring to simulate more comples signs.\n\n\n```diff\n- All aspects can be chained together with the wildnlp.aspects.utils.compose function.\n```\n\n## Supported datasets\nAspects can be applied to any text. Below is the list of datasets for which we already implemented processing pipelines. \n\n1. **CoNLL**\n\n The CoNLL-2003 shared task data for language-independent named entity recognition.\n\n2. **IMDB**\n\n The IMDB dataset containing movie reviews for a sentiment analysis. The dataset consists of 50 000 reviews of two classes, negative and positive.\n\n3. **SNLI**\n\n The SNLI dataset supporting the task of natural language inference.\n\n4. **SQuAD**\n\n The SQuAD dataset for the Machine Comprehension problem.\n\n## Usage\n```python\nfrom wildnlp.aspects.dummy import Reverser, PigLatin\nfrom wildnlp.aspects.utils import compose\nfrom wildnlp.datasets import SampleDataset\n\n# Create a dataset object and load the dataset\ndataset = SampleDataset()\ndataset.load()\n\n# Crate a composed corruptor function.\n# Functions will be applied in the same order they appear.\ncomposed = compose(Reverser(), PigLatin())\n\n# Apply the function to the dataset\nmodified = dataset.apply(composed)\n```\n\n\n", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/MI2DataLab/WildNLP", "keywords": "", "license": "BSD", "maintainer": "", "maintainer_email": "", "name": "wild-nlp", "package_url": "https://pypi.org/project/wild-nlp/", "platform": "", "project_url": "https://pypi.org/project/wild-nlp/", "project_urls": { "Homepage": "https://github.com/MI2DataLab/WildNLP" }, "release_url": "https://pypi.org/project/wild-nlp/1.0.2/", "requires_dist": [ "numpy", "num2words" ], "requires_python": "", "summary": "Text aspects for nlp models", "version": "1.0.2" }, "last_serial": 5805599, "releases": { "0.0.1": [ { "comment_text": "", "digests": { "md5": "39e46d6adf0e6214bb3db48fd7b570a7", "sha256": "5501bb353695cd24622a523a72d13d9ca24f3c0302351690e963ff84ea6f94dd" }, "downloads": -1, "filename": "wild_nlp-0.0.1-py3-none-any.whl", "has_sig": false, "md5_digest": "39e46d6adf0e6214bb3db48fd7b570a7", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 2730, "upload_time": "2019-03-09T16:13:05", "url": "https://files.pythonhosted.org/packages/43/3b/1827132036d9f9d7804532afa8fa38d1267f15fe91586ed1620e86edb8f4/wild_nlp-0.0.1-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "84fbb033a6f63dbc226523e85be14ebc", "sha256": "7eb9544db7ca8fc91a7da7c7c454ff7fc6e427c44132065458ddfebf823e2738" }, "downloads": -1, "filename": "wild-nlp-0.0.1.tar.gz", "has_sig": false, "md5_digest": "84fbb033a6f63dbc226523e85be14ebc", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1453, "upload_time": "2019-03-09T16:13:07", "url": "https://files.pythonhosted.org/packages/a0/c7/63f9c0d78ed8962220a14aadf78aa75889d9916a3ac8f01e2689736153bd/wild-nlp-0.0.1.tar.gz" } ], "0.0.2": [ { "comment_text": "", "digests": { "md5": "cbc3716728f6d580fb86c3f5d17bc4d6", "sha256": "159cd68c2db4adc32bffac43deb00708a858a679e68765a3317b6ed23ad95d6d" }, "downloads": -1, "filename": "wild_nlp-0.0.2-py3-none-any.whl", "has_sig": false, "md5_digest": "cbc3716728f6d580fb86c3f5d17bc4d6", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 2777, "upload_time": "2019-03-09T16:20:17", "url": "https://files.pythonhosted.org/packages/90/5a/a7bfd0baa99b8dc8ff90c4b1e1b8e87fc1bcf11dbb40fb620f21083b41f7/wild_nlp-0.0.2-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "c9a1996a2714185c6ec3a44d5b3f6f80", "sha256": "1067da9503a486aae58b64343d62c4a6d4d65fb7c1add003ed5d3042fbf4f5b6" }, "downloads": -1, "filename": "wild-nlp-0.0.2.tar.gz", "has_sig": false, "md5_digest": "c9a1996a2714185c6ec3a44d5b3f6f80", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1491, "upload_time": "2019-03-09T16:20:19", "url": "https://files.pythonhosted.org/packages/f3/f0/b74c167d04f37ce58a028c42562158ef93e8c212322eb74774c9eecbf11e/wild-nlp-0.0.2.tar.gz" } ], "0.0.3": [ { "comment_text": "", "digests": { "md5": "29c74c59a2b5dc915c01301dc559c3eb", "sha256": "7e701f84e5fd66d59635409bf69c6d5b13e8dff53bc187d10516a4e93a5c365b" }, "downloads": -1, "filename": "wild_nlp-0.0.3-py3-none-any.whl", "has_sig": false, "md5_digest": "29c74c59a2b5dc915c01301dc559c3eb", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 4562, "upload_time": "2019-03-09T16:47:46", "url": "https://files.pythonhosted.org/packages/b6/36/fff1ea60ab87da0f535c253f96e224209982e121a9c9ffeda662d080aeb0/wild_nlp-0.0.3-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "5ff7b21e43bb6b8445677ccc37385610", "sha256": "367e0b1cd570ef404dd1d6382cf4482cd06d286cf37731e79d12c74e07045af5" }, "downloads": -1, "filename": "wild-nlp-0.0.3.tar.gz", "has_sig": false, "md5_digest": "5ff7b21e43bb6b8445677ccc37385610", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2717, "upload_time": "2019-03-09T16:47:47", "url": "https://files.pythonhosted.org/packages/7e/ba/f7457ceab9e3ef2f580ff5217515725fb336674797d62905f0968ec26207/wild-nlp-0.0.3.tar.gz" } ], "0.0.4": [ { "comment_text": "", "digests": { "md5": "e01c3eb9112ef1f0b8222ac667fe9fd8", "sha256": "fd7e8b0f6ce66695c137a97417cf4b6465d1eba99ded5995d98e5b9ec6d93e53" }, "downloads": -1, "filename": "wild_nlp-0.0.4-py3-none-any.whl", "has_sig": false, "md5_digest": "e01c3eb9112ef1f0b8222ac667fe9fd8", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 17832, "upload_time": "2019-03-29T19:55:29", "url": "https://files.pythonhosted.org/packages/b1/2e/58b73b7a45d757574ef64cf437ba176b4691ef80977c92b53f661f1d7086/wild_nlp-0.0.4-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "102f3571965caecb5c01e111af9ee48f", "sha256": "958bbe5cddaa7e4e26d2a2b0fa16920b0664317df30729e1ebb79ed47deeda2c" }, "downloads": -1, "filename": "wild-nlp-0.0.4.tar.gz", "has_sig": false, "md5_digest": "102f3571965caecb5c01e111af9ee48f", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 41416, "upload_time": "2019-03-29T19:55:31", "url": "https://files.pythonhosted.org/packages/f9/50/b1a9065795f1d9e1acd6a8ce2e9878120c4acdedc0d853261aedcf01639b/wild-nlp-0.0.4.tar.gz" } ], "0.0.5": [ { "comment_text": "", "digests": { "md5": "6f97a3bd116aec3f31a4e139dd3b2da8", "sha256": "697a4c9d994c694772030611510a2785adff6cc17d6ea2bf8775ce51e5f864b6" }, "downloads": -1, "filename": "wild_nlp-0.0.5-py3-none-any.whl", "has_sig": false, "md5_digest": "6f97a3bd116aec3f31a4e139dd3b2da8", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 47900, "upload_time": "2019-03-29T20:03:24", "url": "https://files.pythonhosted.org/packages/83/fc/9f882ea30ca7305f6cfb8b2faefa4f1416770e8a413fa67649d4960f1836/wild_nlp-0.0.5-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "cf7e7b76e8ff817b310e9ed2be428bc2", "sha256": "3c9ebd9feb372383ecf0ef2d27364e99bbaf2217d17c52446e9043f9608480fb" }, "downloads": -1, "filename": "wild-nlp-0.0.5.tar.gz", "has_sig": false, "md5_digest": "cf7e7b76e8ff817b310e9ed2be428bc2", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 41416, "upload_time": "2019-03-29T20:03:25", "url": "https://files.pythonhosted.org/packages/71/3a/b6303d5e41262c409a24580db573f4b0375c2f3a5361e811028885c41043/wild-nlp-0.0.5.tar.gz" } ], "0.0.6": [ { "comment_text": "", "digests": { "md5": "309f830ada253ca61149bc936369b004", "sha256": "8b7a2eed8a081f3ead18e240536775940f2fb83e4b45b7f569ce111923022c11" }, "downloads": -1, "filename": "wild_nlp-0.0.6-py3-none-any.whl", "has_sig": false, "md5_digest": "309f830ada253ca61149bc936369b004", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 47900, "upload_time": "2019-03-29T20:14:10", "url": "https://files.pythonhosted.org/packages/5c/c5/b8747dc5fcbb44222114a1c3cbf7a6651bb2496258303fe1c1e4ed79f901/wild_nlp-0.0.6-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "231845b4b1e265a1b68f72322e4dcd79", "sha256": "984715f8bd3e8ef08a065978b634f4c59dbb2531f78a506a49459d0f051e3878" }, "downloads": -1, "filename": "wild-nlp-0.0.6.tar.gz", "has_sig": false, "md5_digest": "231845b4b1e265a1b68f72322e4dcd79", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 41422, "upload_time": "2019-03-29T20:14:12", "url": "https://files.pythonhosted.org/packages/3e/ec/71b920af572e70bbe9a73c69847903ca6549f669f74dbafd778fc33afc01/wild-nlp-0.0.6.tar.gz" } ], "1.0.0": [ { "comment_text": "", "digests": { "md5": "c92938042af952793118ad61386a7b99", "sha256": "80dabe258cbc35c1155df13ebae082cf507c40cda846ee8684996ef35de6cc53" }, "downloads": -1, "filename": "wild_nlp-1.0.0-py3-none-any.whl", "has_sig": false, "md5_digest": "c92938042af952793118ad61386a7b99", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 47988, "upload_time": "2019-03-29T20:28:17", "url": "https://files.pythonhosted.org/packages/d7/1a/4cd54b416302100047abeda28384644abcf64d6824005feeddc42a83b813/wild_nlp-1.0.0-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "a1e262f9a9b6bbadea324969dcd92a5e", "sha256": "f5663e26a2108c9f3fe83ca47c659f9bf7a3ffa03ab95c5600f2d6d76591a157" }, "downloads": -1, "filename": "wild-nlp-1.0.0.tar.gz", "has_sig": false, "md5_digest": "a1e262f9a9b6bbadea324969dcd92a5e", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 41464, "upload_time": "2019-03-29T20:28:18", "url": "https://files.pythonhosted.org/packages/69/16/be9b21040404284537a82303ba31136e85304816bf1da7a08eaf8686cde2/wild-nlp-1.0.0.tar.gz" } ], "1.0.1": [ { "comment_text": "", "digests": { "md5": "5184c08d7c17f13f1e1489e08dccd68a", "sha256": "56136c29af6e5cb90cf7f444806f61990955774ebdfab70b66e4e9776eac3402" }, "downloads": -1, "filename": "wild_nlp-1.0.1-py3-none-any.whl", "has_sig": false, "md5_digest": "5184c08d7c17f13f1e1489e08dccd68a", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 47983, "upload_time": "2019-03-31T08:23:19", "url": "https://files.pythonhosted.org/packages/47/0a/3b9ab5efe6e4a870716508533f680644d8cb4d5bea9908d094940425f035/wild_nlp-1.0.1-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "e665a35f3cda32f71e42c1d0c6e2cf58", "sha256": "af281610664ff7f2dfb6c6f0d6a51998052a515986fb8fa4e04a32d37ba308f8" }, "downloads": -1, "filename": "wild-nlp-1.0.1.tar.gz", "has_sig": false, "md5_digest": "e665a35f3cda32f71e42c1d0c6e2cf58", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 41443, "upload_time": "2019-03-31T08:23:21", "url": "https://files.pythonhosted.org/packages/8e/4e/0056fa29a23f6abe3a93adc749a8bc5b8b3cbf19d7713edb45d906dd608c/wild-nlp-1.0.1.tar.gz" } ], "1.0.2": [ { "comment_text": "", "digests": { "md5": "546c18a0bc18bbff626901772ab23a91", "sha256": "a7105880ba002f3bb0a02340a945230fe117863316bf0f80ad1218b07a98099d" }, "downloads": -1, "filename": "wild_nlp-1.0.2-py3-none-any.whl", "has_sig": false, "md5_digest": "546c18a0bc18bbff626901772ab23a91", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 53332, "upload_time": "2019-09-09T21:09:28", "url": "https://files.pythonhosted.org/packages/73/55/16cac5d14cb71bfc31297e3d12662ab7b11bf2dd8ec4e79c648255cb1bdc/wild_nlp-1.0.2-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "58a3b292d0824ed743fba421013c6b5c", "sha256": "def51dce4d5be1644b1109798631e75e780741a7effb99ea9ecb1a1b4a860031" }, "downloads": -1, "filename": "wild-nlp-1.0.2.tar.gz", "has_sig": false, "md5_digest": "58a3b292d0824ed743fba421013c6b5c", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 44918, "upload_time": "2019-09-09T21:09:30", "url": "https://files.pythonhosted.org/packages/82/00/a656ff3a918c6b83bff6966f99a88e523a07685f1a0001dddd93f3c7bcbb/wild-nlp-1.0.2.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "546c18a0bc18bbff626901772ab23a91", "sha256": "a7105880ba002f3bb0a02340a945230fe117863316bf0f80ad1218b07a98099d" }, "downloads": -1, "filename": "wild_nlp-1.0.2-py3-none-any.whl", "has_sig": false, "md5_digest": "546c18a0bc18bbff626901772ab23a91", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 53332, "upload_time": "2019-09-09T21:09:28", "url": "https://files.pythonhosted.org/packages/73/55/16cac5d14cb71bfc31297e3d12662ab7b11bf2dd8ec4e79c648255cb1bdc/wild_nlp-1.0.2-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "58a3b292d0824ed743fba421013c6b5c", "sha256": "def51dce4d5be1644b1109798631e75e780741a7effb99ea9ecb1a1b4a860031" }, "downloads": -1, "filename": "wild-nlp-1.0.2.tar.gz", "has_sig": false, "md5_digest": "58a3b292d0824ed743fba421013c6b5c", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 44918, "upload_time": "2019-09-09T21:09:30", "url": "https://files.pythonhosted.org/packages/82/00/a656ff3a918c6b83bff6966f99a88e523a07685f1a0001dddd93f3c7bcbb/wild-nlp-1.0.2.tar.gz" } ] }