{ "info": { "author": "Greg Clarke", "author_email": "greg@gho.st", "bugtrack_url": null, "classifiers": [ "Development Status :: 4 - Beta", "Intended Audience :: Developers", "License :: OSI Approved :: MIT License", "Programming Language :: Python", "Topic :: Software Development :: Build Tools" ], "description": "========================\nPython Biased Stop Words\n========================\n\n.. image:: https://badge.fury.io/py/biased-stop-words.svg\n :target: https://badge.fury.io/py/biased-stop-words\n\n.. image:: http://img.shields.io/badge/license-MIT-yellow.svg?style=flat\n :target: https://github.com/gregology/biased-stop-words/blob/master/LICENSE\n\n.. image:: https://img.shields.io/badge/contact-Gregology-blue.svg?style=flat\n :target: http://gregology.net/contact/\n\n.. contents::\n\nOverview\n--------\n\n*Biases are bugs*\n\nStop words are words that are filtered out before natural language data is processed. Text analysis often surfaces non-causal correlations. Consider the following documents:\n\n- He is an astronaut, he is on Venus\n- He is an accountant, he is on Earth\n- She is an astronaut, she is on Mars\n\nProcessing these documents into two topics will result in gendered clustering. If we remove the gendered terms:\n\n- is an astronaut, is on Venus\n- is an accountant, is on Earth\n- is an astronaut, is on Mars\n\nProcessing will result in job clustering. Both clusterings are valid; however, if you are interested in employing an astronaut, you don't want male accountants showing up. There are many other examples of non-causal relationships occurring in natural language: religion, ethnicity, and age, to name but a few.\n\nAvailable genres\n----------------\n\n* Gendered Terms\n* US Names\n* Religious Terms (Partial)\n\nMore will be available soon. Contribute at https://github.com/gregology/biased-words\n\nInteractive Notebook\n--------------------\n\nExplore this package in an `Interactive Notebook `__\n\n.. 
image:: https://user-images.githubusercontent.com/1595448/48975588-00661d00-f042-11e8-97c6-ded19ad45f51.png\n :width: 80%\n\nHosted by `binder `__\n\nInstallation\n------------\n\n``biased-stop-words`` is available on PyPI\n\nhttp://pypi.python.org/pypi/biased-stop-words\n\nInstall via ``pip``\n::\n\n $ pip install biased-stop-words\n\nOr via ``easy_install``\n::\n\n $ easy_install biased-stop-words\n\nOr directly from ``biased-stop-words``'s `git repo `\n::\n\n $ git clone --recursive git://github.com/gregology/biased-stop-words.git\n $ cd biased-stop-words\n $ python setup.py install\n\nBasic usage\n-----------\n::\n\n >>> from biased_stop_words import genres, get_stop_words\n >>> genres()\n 'religious, gendered, us-common-names, us-names, us-male-names, us-female-names, gendered-nouns'\n >>> get_stop_words('gendered', 'us-common-names')\n [u'trenton', u'augustine', u'khalil', u'aiden', u'elisabeth', u'andre', u'khanum', u'elva', u'fran...\n\n\nRunning Test\n------------\n::\n\n $ python biased_stop_words/tests.py\n\nPython compatibility\n--------------------\n\nDeveloped for Python 2 & 3.", "description_content_type": "", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/gregology/biased-stop-words", "keywords": "stop words machine learning ml bias biased natural language processing nlp", "license": "MIT", "maintainer": "", "maintainer_email": "", "name": "biased-stop-words", "package_url": "https://pypi.org/project/biased-stop-words/", "platform": "", "project_url": "https://pypi.org/project/biased-stop-words/", "project_urls": { "Homepage": "https://github.com/gregology/biased-stop-words" }, "release_url": "https://pypi.org/project/biased-stop-words/2018.11.29.0/", "requires_dist": null, "requires_python": "", "summary": "Generates biased stop word lists for various genres", "version": "2018.11.29.0" }, "last_serial": 4543889, "releases": { "2017.5.10.2": [ { 
"comment_text": "", "digests": { "md5": "5e8f34eba636878eaf35c24d3d663222", "sha256": "ba59ae87895837fcc3a9ab3f5f71bf269a183ed876527ba9433345ee2fd7c896" }, "downloads": -1, "filename": "biased_stop_words-2017.5.10.2-py2.7.egg", "has_sig": false, "md5_digest": "5e8f34eba636878eaf35c24d3d663222", "packagetype": "bdist_egg", "python_version": "2.7", "requires_python": null, "size": 40693, "upload_time": "2017-05-10T22:39:10", "url": "https://files.pythonhosted.org/packages/dc/d4/8b5b1871108aaf81497c7f77cd2d8c0dd0db058d9fe977773c79cc3f489a/biased_stop_words-2017.5.10.2-py2.7.egg" } ], "2017.5.10.3": [ { "comment_text": "", "digests": { "md5": "c22381fc78f9f52ab11dc6220ab2fd84", "sha256": "314dcea6dae1dd9f1d4c32487ca8de1abbeb75ab1f9cad12e9249fe5f368ad2e" }, "downloads": -1, "filename": "biased_stop_words-2017.5.10.3-py2.7.egg", "has_sig": false, "md5_digest": "c22381fc78f9f52ab11dc6220ab2fd84", "packagetype": "bdist_egg", "python_version": "2.7", "requires_python": null, "size": 40665, "upload_time": "2017-05-11T04:04:58", "url": "https://files.pythonhosted.org/packages/7f/b3/9c285a689aed269c72a1548924595c58c5b4aa9227f736563e9d566820c8/biased_stop_words-2017.5.10.3-py2.7.egg" } ], "2017.5.11.1": [ { "comment_text": "", "digests": { "md5": "e07ddb6972fb2f67e97163499c0ee119", "sha256": "3de2c6572dd027e4be35acfc06684be3ef2d32485cf3f8b751d5de494987ce17" }, "downloads": -1, "filename": "biased_stop_words-2017.5.11.1-py2.7.egg", "has_sig": false, "md5_digest": "e07ddb6972fb2f67e97163499c0ee119", "packagetype": "bdist_egg", "python_version": "2.7", "requires_python": null, "size": 40346, "upload_time": "2017-05-11T19:11:57", "url": "https://files.pythonhosted.org/packages/6b/03/a51d2d08fbac043bccee894a58c9fe01eba0488f2b81e582e818b4dc32c3/biased_stop_words-2017.5.11.1-py2.7.egg" } ], "2017.5.11.2": [ { "comment_text": "", "digests": { "md5": "00698d44ba79e55870320c6fbcf9db22", "sha256": "9e3a77f5324c384e8523bd0ce1783d2c9249f444a8c264d82e7b07625b1b0929" }, "downloads": -1, 
"filename": "biased_stop_words-2017.5.11.2-py2.7.egg", "has_sig": false, "md5_digest": "00698d44ba79e55870320c6fbcf9db22", "packagetype": "bdist_egg", "python_version": "2.7", "requires_python": null, "size": 40346, "upload_time": "2017-05-11T20:32:43", "url": "https://files.pythonhosted.org/packages/6c/90/3392d1a6f86be9697daed9225d2492e75d529edb22e2792fa38f07ec57f9/biased_stop_words-2017.5.11.2-py2.7.egg" } ], "2017.5.12.1": [ { "comment_text": "", "digests": { "md5": "a04b2d7692c9acf9a59905230dde73c3", "sha256": "cbfd73cb06bf1d41149746421a3d021397c383b7062cea163f53565117ba5e5b" }, "downloads": -1, "filename": "biased_stop_words-2017.5.12.1-py2.7.egg", "has_sig": false, "md5_digest": "a04b2d7692c9acf9a59905230dde73c3", "packagetype": "bdist_egg", "python_version": "2.7", "requires_python": null, "size": 40018, "upload_time": "2017-05-12T02:36:20", "url": "https://files.pythonhosted.org/packages/19/9b/162dc36cdfff4de7f9931dbe7863853c0cc353f31bcc94638067a5abbab0/biased_stop_words-2017.5.12.1-py2.7.egg" }, { "comment_text": "", "digests": { "md5": "bea712f1b9beed3e2e0ec327c59bbf0b", "sha256": "08f6ecf32f6567583407644f647dc75fde63cfef9714798af5cb836ae87d9cb4" }, "downloads": -1, "filename": "biased_stop_words-2017.5.12.1-py2-none-any.whl", "has_sig": false, "md5_digest": "bea712f1b9beed3e2e0ec327c59bbf0b", "packagetype": "bdist_wheel", "python_version": "py2", "requires_python": null, "size": 39986, "upload_time": "2017-05-12T14:57:00", "url": "https://files.pythonhosted.org/packages/e6/13/6925949df188508233783ddc3965fc0ed8e8fad5a12283dd85f1de5c2091/biased_stop_words-2017.5.12.1-py2-none-any.whl" }, { "comment_text": "", "digests": { "md5": "e8f9488967910911c01bf45487dada24", "sha256": "06beb775521c7b4e653cc9fe2087c760cf8285e90530119e6063d03a540458a6" }, "downloads": -1, "filename": "biased-stop-words-2017.5.12.1.tar.gz", "has_sig": false, "md5_digest": "e8f9488967910911c01bf45487dada24", "packagetype": "sdist", "python_version": "source", "requires_python": null, 
"size": 37128, "upload_time": "2017-05-12T15:03:10", "url": "https://files.pythonhosted.org/packages/6a/c1/a97c5910dab3791c965b74983e962b45e4f936a99f71b3c7aadec8e25611/biased-stop-words-2017.5.12.1.tar.gz" } ], "2017.5.13.1": [ { "comment_text": "", "digests": { "md5": "88cc02be4d84e846f12648b6381fbb92", "sha256": "ea05693fc92ccfa12a783ce70267ec91b5f1f53602b84c90dea108a646903669" }, "downloads": -1, "filename": "biased-stop-words-2017.5.13.1.tar.gz", "has_sig": false, "md5_digest": "88cc02be4d84e846f12648b6381fbb92", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 37141, "upload_time": "2017-05-12T15:25:10", "url": "https://files.pythonhosted.org/packages/31/f8/65ce1633ff7be9e89886e6c83bc4de84b101e364836e55bb241ebbc6b16a/biased-stop-words-2017.5.13.1.tar.gz" } ], "2017.5.13.2": [ { "comment_text": "", "digests": { "md5": "72eef21f1a111fc7f25b8fbd32ee2338", "sha256": "65f808255705b57da07a09ce9605a209e23f7cbd4b9c387055f0b4fbbe840fed" }, "downloads": -1, "filename": "biased-stop-words-2017.5.13.2.tar.gz", "has_sig": false, "md5_digest": "72eef21f1a111fc7f25b8fbd32ee2338", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 39410, "upload_time": "2017-05-12T21:29:50", "url": "https://files.pythonhosted.org/packages/46/5e/1a90b38efd1c68b8c9178f349004f84257a9a43db453de4cbdf451925047/biased-stop-words-2017.5.13.2.tar.gz" } ], "2017.5.13.3": [ { "comment_text": "", "digests": { "md5": "ded915efc9e822b639c81ad6235c54cf", "sha256": "1e6d055c1980ed553d775ab43ea18ea7b6d20a86870efb0d54bc033ef82589ac" }, "downloads": -1, "filename": "biased-stop-words-2017.5.13.3.tar.gz", "has_sig": false, "md5_digest": "ded915efc9e822b639c81ad6235c54cf", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 39748, "upload_time": "2017-05-12T22:05:51", "url": 
"https://files.pythonhosted.org/packages/dd/05/3e0fff71e44f9b66a9aae8f8294855ac275a514f9eb4e45f745f6ec2c980/biased-stop-words-2017.5.13.3.tar.gz" } ], "2017.5.13.4": [ { "comment_text": "", "digests": { "md5": "374ef5e9cbb3d6ff20b4769e17575d35", "sha256": "168f89a785bb27f36888145ae967fa0df9585a23bad6f9021ef7ee0e56b2dac2" }, "downloads": -1, "filename": "biased-stop-words-2017.5.13.4.tar.gz", "has_sig": false, "md5_digest": "374ef5e9cbb3d6ff20b4769e17575d35", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 39752, "upload_time": "2017-05-12T22:13:51", "url": "https://files.pythonhosted.org/packages/b1/b4/7ecc79364fda0818801156a7b7227062f8c2e4f175b78494306f1ee85ec4/biased-stop-words-2017.5.13.4.tar.gz" } ], "2017.5.9.1": [ { "comment_text": "", "digests": { "md5": "f72c11bf4e99e9d3ded135fac0d3cf5e", "sha256": "fdea111102d8fc4597e8b5b4db59c073f5930a7674cfb9e5dbeed9f44d31f24c" }, "downloads": -1, "filename": "biased_stop_words-2017.5.9.1-py2.7.egg", "has_sig": false, "md5_digest": "f72c11bf4e99e9d3ded135fac0d3cf5e", "packagetype": "bdist_egg", "python_version": "2.7", "requires_python": null, "size": 4719, "upload_time": "2017-05-09T23:39:12", "url": "https://files.pythonhosted.org/packages/72/f3/1a9d171116c3c6bb0df3b2dbaae72671c689ad1de091750eda847f5c91c3/biased_stop_words-2017.5.9.1-py2.7.egg" } ], "2017.7.13.0": [ { "comment_text": "", "digests": { "md5": "7a2da618b68ccef98e341676437a47f0", "sha256": "4aa98f4f66ddb0477c174bc5ec3e3eac0f9b4a9287c1a2cfc02a0a5e3bde367e" }, "downloads": -1, "filename": "biased-stop-words-2017.7.13.0.tar.gz", "has_sig": false, "md5_digest": "7a2da618b68ccef98e341676437a47f0", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 39767, "upload_time": "2017-07-13T18:35:56", "url": "https://files.pythonhosted.org/packages/d6/ac/eeee0914cb44e782bbee88f10de9a54d16c69de5517a48a7e6515984798e/biased-stop-words-2017.7.13.0.tar.gz" } ], "2018.11.25.0": [ { "comment_text": "", 
"digests": { "md5": "50ef74bbb434ebcc882ffbdfed8e16ed", "sha256": "8594ae9a6418fc78e006f6a319b51e676c9a3fdbe53c6206fb57ece70cdda5d2" }, "downloads": -1, "filename": "biased-stop-words-2018.11.25.0.tar.gz", "has_sig": false, "md5_digest": "50ef74bbb434ebcc882ffbdfed8e16ed", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 41533, "upload_time": "2018-11-25T04:40:17", "url": "https://files.pythonhosted.org/packages/48/5a/0d8a1d92bff401963d3cf190a39eba6147425eb207cb7b5ceaaac18eec84/biased-stop-words-2018.11.25.0.tar.gz" } ], "2018.11.29.0": [ { "comment_text": "", "digests": { "md5": "35604875f423d341807c749a73ebded9", "sha256": "401e4e53004be07395d07a00f738d68711a121e811106cbd9a3cebf570fa70dc" }, "downloads": -1, "filename": "biased-stop-words-2018.11.29.0.tar.gz", "has_sig": false, "md5_digest": "35604875f423d341807c749a73ebded9", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 41483, "upload_time": "2018-11-29T18:07:03", "url": "https://files.pythonhosted.org/packages/c7/6a/35e8bc81cca66806c47e5f62e9151ca72ec8e58b6b065428472b35a3dc11/biased-stop-words-2018.11.29.0.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "35604875f423d341807c749a73ebded9", "sha256": "401e4e53004be07395d07a00f738d68711a121e811106cbd9a3cebf570fa70dc" }, "downloads": -1, "filename": "biased-stop-words-2018.11.29.0.tar.gz", "has_sig": false, "md5_digest": "35604875f423d341807c749a73ebded9", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 41483, "upload_time": "2018-11-29T18:07:03", "url": "https://files.pythonhosted.org/packages/c7/6a/35e8bc81cca66806c47e5f62e9151ca72ec8e58b6b065428472b35a3dc11/biased-stop-words-2018.11.29.0.tar.gz" } ] }