{ "info": { "author": "Igor Gorschal", "author_email": "gorschal@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 5 - Production/Stable", "Intended Audience :: Developers", "License :: OSI Approved :: MIT License", "Operating System :: OS Independent", "Programming Language :: Python", "Programming Language :: Python :: 3.3", "Programming Language :: Python :: 3.4", "Programming Language :: Python :: 3.5", "Topic :: Internet :: WWW/HTTP", "Topic :: Software Development :: Pre-processors", "Topic :: Text Processing :: Filters", "Topic :: Text Processing :: Markup :: HTML" ], "description": ".. image:: https://codeclimate.com/github/reefeed/wanish/badges/gpa.svg\n :target: https://codeclimate.com/github/reefeed/wanish\n :alt: Code Climate\n\nAbout\n-----\n\nThis package allows you to summarize text by reducing an article in size\nto several sentences retaining the idea of the text.\n\nBesides of that the package extracts the following from the document:\n\n1. Canonical URL of the article\n2. Title of the article\n3. URL of the image characterizing this article\n4. Strips the document of excessive information (headers, footers,\n navigation, advertisement, etc.) and forms a clean HTML based on\n structured data of schema.org\n\n`DEMO`_\n\nInstallation\n------------\n\n::\n\n easy_install wanish\n or\n pip install wanish\n\nUsage\n-----\n\n.. code:: python\n\n from wanish import Wanish\n wanish = Wanish()\n wanish.perform_url(document_url)\n\n # getting doc's source canonical url\n url = wanish.url\n # getting document's title\n title = wanish.title\n # getting url of related image if document has it\n image_url = wanish.image_url\n # getting two-letter code of the document's language (en, de, es...)\n language_code = wanish.language\n # getting a clean html page of a document with article\n clean_html = wanish.clean_html\n # getting a short summarized description of the article reduced to several sentences (5 by default)\n description = wanish.description\n\nAvailable kwarg options for *Wanish()* class (all are optional):\n\n.. code:: python\n\n wanish = Wanish(url=document_url,\n positive_keywords=[\"main\", \"story\"],\n negative_keywords=[\"banner\", \"adv\", \"similar\", \"top-ad\"],\n summary_sentences_qty=5,\n headers={'user-agent': 'test-purposes/0.0.1'})\n\n- **url:** Allows to pass an url of a document in constructor. If set,\n then it will automatically launch *self.perform\\_url(url)* after\n initialization. Default is None.\n- **positive\\_keywords:** A list of positive search patterns in classes\n and ids, for example: *[\u201cmain\u201d, \u201cstory\u201d]* . Default is None.\n- **negative\\_keywords:** A list of negative search patterns in classes\n and ids, for example: *[\u201cbanner\u201d, \u201cadv\u201d, \u201csimilar\u201d, \u201ctop-ad\u201d]* .\n Default is None.\n- **summary\\_sentences\\_qty:** Maximum quantity of sentences in\n summarized text of the document. Set to 5 by default.\n- **headers:** Dict of additional custom headers for GET request to\n obtain web page of the article. Default is None.\n\nSpecial Thanks\n--------------\n\n- https://github.com/nltk/nltk\n- https://github.com/buriy/python-readability\n- https://github.com/saffsd/langid.py\n\n.. _DEMO: http://reefeed.com", "description_content_type": null, "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/reefeed/wanish", "keywords": "summly", "license": "MIT", "maintainer": "", "maintainer_email": "", "name": "wanish", "package_url": "https://pypi.org/project/wanish/", "platform": "", "project_url": "https://pypi.org/project/wanish/", "project_urls": { "Homepage": "https://github.com/reefeed/wanish" }, "release_url": "https://pypi.org/project/wanish/0.6.3/", "requires_dist": null, "requires_python": "", "summary": "open source implementation of summly", "version": "0.6.3" }, "last_serial": 2511713, "releases": { "0.3": [ { "comment_text": "", "digests": { "md5": "47b148001d59ded485a5feeb6ec5d024", "sha256": "4c899413297de8dbb03cd786e8603dd871c2853ad8ce1ee78c096106b6fccf11" }, "downloads": -1, "filename": "wanish-0.3.tar.gz", "has_sig": false, "md5_digest": "47b148001d59ded485a5feeb6ec5d024", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1922115, "upload_time": "2015-03-25T13:53:10", "url": "https://files.pythonhosted.org/packages/18/91/e968a5bdad89dc79d7f0f2c382aa76f41815df2a04c55d019be8fdc69e69/wanish-0.3.tar.gz" } ], "0.3.1": [ { "comment_text": "", "digests": { "md5": "a1ae91e4f2e6c762636802733d4b41d7", "sha256": "b2bf6b7cb468fbec1cce61418cd4034a8f388254289533e830e9f96b7884f92a" }, "downloads": -1, "filename": "wanish-0.3.1.tar.gz", "has_sig": false, "md5_digest": "a1ae91e4f2e6c762636802733d4b41d7", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1926945, "upload_time": "2015-03-25T16:03:18", "url": "https://files.pythonhosted.org/packages/da/0d/5d2abafa8eb625f596c3ca8c0991f41d6014347987e6c5412eb0e3a06ea1/wanish-0.3.1.tar.gz" } ], "0.4.0": [ { "comment_text": "", "digests": { "md5": "4e08278ec270e51f45312b9636c68f99", "sha256": "17da3ecfcddd3957c7c74fb81142606c883e58da25969ac427d056047b461e35" }, "downloads": -1, "filename": "wanish-0.4.0.tar.gz", "has_sig": false, "md5_digest": "4e08278ec270e51f45312b9636c68f99", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1928390, "upload_time": "2015-03-27T12:31:04", "url": "https://files.pythonhosted.org/packages/f4/84/1e48a427d88d8b14b85b805e1176aac9ec231421ae3a5a006061411678be/wanish-0.4.0.tar.gz" } ], "0.4.1": [ { "comment_text": "", "digests": { "md5": "9a6d36461c3f27aa5b07c0f4303ff186", "sha256": "71e607ca9907206b62f3ba1585b4adfbdcb761770d3a54631f014198d4b64afb" }, "downloads": -1, "filename": "wanish-0.4.1.tar.gz", "has_sig": false, "md5_digest": "9a6d36461c3f27aa5b07c0f4303ff186", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1928370, "upload_time": "2015-03-28T07:45:35", "url": "https://files.pythonhosted.org/packages/6f/7d/018d7ed17bef086ebbccacc53b5b1151bc383a3050fc8184b5698cdccb46/wanish-0.4.1.tar.gz" } ], "0.4.2": [ { "comment_text": "", "digests": { "md5": "6cffbbced32c5fd4a65f21ebc68a5da6", "sha256": "2e6db67194d2a2f3a98aad5a0bcb7f35155f2a9c65ec8e114b879c6f6299de35" }, "downloads": -1, "filename": "wanish-0.4.2.tar.gz", "has_sig": false, "md5_digest": "6cffbbced32c5fd4a65f21ebc68a5da6", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1928608, "upload_time": "2015-03-30T13:16:16", "url": "https://files.pythonhosted.org/packages/41/f5/1ccc4991685a14ebd3b34949290b0e2b6fe0afe81a56cc471c9145859052/wanish-0.4.2.tar.gz" } ], "0.4.3": [ { "comment_text": "", "digests": { "md5": "2286715423eae9c70a2765adca074b63", "sha256": "88d9b92f2c82f1035b852aa4dfd608ca9013f5e825cf9d86d06c2e0f80c4e45d" }, "downloads": -1, "filename": "wanish-0.4.3.tar.gz", "has_sig": false, "md5_digest": "2286715423eae9c70a2765adca074b63", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1928683, "upload_time": "2015-05-06T07:05:26", "url": "https://files.pythonhosted.org/packages/13/2d/0de845b37fa8e2ea1c15b60d38187a611c401bae9db16ae55e3cb55b0406/wanish-0.4.3.tar.gz" } ], "0.4.4": [ { "comment_text": "", "digests": { "md5": "5ce8e977f4a07c1d0c29670cd30397ff", "sha256": "68a247ccc20908e5c004f79efce4fdfc4e81108c4501e948b7320b2de74ac469" }, "downloads": -1, "filename": "wanish-0.4.4.tar.gz", "has_sig": false, "md5_digest": "5ce8e977f4a07c1d0c29670cd30397ff", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1928851, "upload_time": "2015-06-19T13:19:04", "url": "https://files.pythonhosted.org/packages/a0/36/4192d898ed7eace17dd0d5ed0a4552b2adc804ba5ac0c9f16e1834818afe/wanish-0.4.4.tar.gz" } ], "0.4.5": [ { "comment_text": "", "digests": { "md5": "d70167d62828eb278e32837f8780cedf", "sha256": "45e5d1ba8a0523a02714c3e193267a072e19a6485f5ba3cc5859c8a015f60809" }, "downloads": -1, "filename": "wanish-0.4.5.tar.gz", "has_sig": false, "md5_digest": "d70167d62828eb278e32837f8780cedf", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1928845, "upload_time": "2015-06-22T07:54:21", "url": "https://files.pythonhosted.org/packages/ed/ae/4b25802d9cad1284c4439f57fd485c4efc1bafe80498d33e890a305357ff/wanish-0.4.5.tar.gz" } ], "0.4.6": [ { "comment_text": "", "digests": { "md5": "e8e4b33c4e88ca473737200f98b4ede0", "sha256": "51ffac773cb4ebf91f8573d79ceea365636566211d1b573b4bb319deffe3249c" }, "downloads": -1, "filename": "wanish-0.4.6.tar.gz", "has_sig": false, "md5_digest": "e8e4b33c4e88ca473737200f98b4ede0", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1929178, "upload_time": "2015-06-23T09:32:28", "url": "https://files.pythonhosted.org/packages/44/31/c541384685b102b6636bd571584e92cfa77ff042c5b7f935f630cb604afd/wanish-0.4.6.tar.gz" } ], "0.4.7": [ { "comment_text": "", "digests": { "md5": "8cb95eedeea9bfcd89216367034618cc", "sha256": "76d2c6b357364d08bd841015886159f0dc81ab22331b35edf48cb9fa0e01e632" }, "downloads": -1, "filename": "wanish-0.4.7.tar.gz", "has_sig": false, "md5_digest": "8cb95eedeea9bfcd89216367034618cc", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1929178, "upload_time": "2015-06-24T08:54:24", "url": "https://files.pythonhosted.org/packages/be/ba/d3a5ca8052952147ec34470ab0291bad2d339031ccd7c7c08368b6fdadd1/wanish-0.4.7.tar.gz" } ], "0.4.8": [ { "comment_text": "", "digests": { "md5": "1f50910c2b89d8916878470605d8af19", "sha256": "235a53757a2f219a3fcdcd6c91d711912c4f23b7017eaad3d23aa7b7c89be17f" }, "downloads": -1, "filename": "wanish-0.4.8.tar.gz", "has_sig": false, "md5_digest": "1f50910c2b89d8916878470605d8af19", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1928961, "upload_time": "2015-06-28T17:56:02", "url": "https://files.pythonhosted.org/packages/91/a8/c556eddf663fdc5feeeb4d7243d95ab57e309122332647d82d1799d9db5c/wanish-0.4.8.tar.gz" } ], "0.4.9": [ { "comment_text": "", "digests": { "md5": "a14620d9edb9135436a11fae81c26a26", "sha256": "5e2214f36f1dab543e2b3337be1c0dfc8e0fde02f9e22ff41658debaf74fc18c" }, "downloads": -1, "filename": "wanish-0.4.9.tar.gz", "has_sig": false, "md5_digest": "a14620d9edb9135436a11fae81c26a26", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1929054, "upload_time": "2015-07-22T15:38:04", "url": "https://files.pythonhosted.org/packages/64/28/ed720a075b20f1f2906e0f97f3fe5f88445f1f39ba3a4371a7c7ea1f24a3/wanish-0.4.9.tar.gz" } ], "0.5.0": [ { "comment_text": "", "digests": { "md5": "c05cdaca44119fb434632bdbc93ff5c9", "sha256": "1df9bc49d47d2e81efdd55b8d26f06b8583b347e44badc9eb949b2fbfb304011" }, "downloads": -1, "filename": "wanish-0.5.0.tar.gz", "has_sig": false, "md5_digest": "c05cdaca44119fb434632bdbc93ff5c9", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1929700, "upload_time": "2015-09-09T17:58:48", "url": "https://files.pythonhosted.org/packages/e1/f9/148ca3e82a1673e099ca9c999f8a3413783980bc20065d2919ff55b7f2a3/wanish-0.5.0.tar.gz" } ], "0.5.1": [ { "comment_text": "", "digests": { "md5": "51eca92bb6fd2652734cd050677a0af6", "sha256": "53785c94034e078f4cb26ed12df2cf7ece8fd3d7cfce3a6450c5af0e5b7be799" }, "downloads": -1, "filename": "wanish-0.5.1.tar.gz", "has_sig": false, "md5_digest": "51eca92bb6fd2652734cd050677a0af6", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1929554, "upload_time": "2015-09-10T10:10:57", "url": "https://files.pythonhosted.org/packages/90/e0/7d84b86a3bb6230df7978083a5da48818a0804d34445224145bbff3f6556/wanish-0.5.1.tar.gz" } ], "0.5.2": [ { "comment_text": "", "digests": { "md5": "aaa9dcb18024809249d975ac289ffdc1", "sha256": "8b5d86719e060c23000f8f0490db23c0590e938712dabd131bdad365ccd1255d" }, "downloads": -1, "filename": "wanish-0.5.2.tar.gz", "has_sig": false, "md5_digest": "aaa9dcb18024809249d975ac289ffdc1", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1930075, "upload_time": "2015-10-09T10:57:39", "url": "https://files.pythonhosted.org/packages/c1/a2/b0f238b06aa2a932dd86743aad95fdccb653f1ad7c3839c3b19efd11c1c7/wanish-0.5.2.tar.gz" } ], "0.5.3": [ { "comment_text": "", "digests": { "md5": "385c96d609e7c8f4aa06ea83b4884d0c", "sha256": "207676f8699165bb9e7898e5bcf23f5a094f764b77d8d8756ce430edea2eba60" }, "downloads": -1, "filename": "wanish-0.5.3.tar.gz", "has_sig": false, "md5_digest": "385c96d609e7c8f4aa06ea83b4884d0c", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1930094, "upload_time": "2015-10-23T07:47:58", "url": "https://files.pythonhosted.org/packages/89/c9/f60e9f054c1e55cca10fdd5dd305728283e9161cf42f1fcdc0d8d130a04e/wanish-0.5.3.tar.gz" } ], "0.5.4": [ { "comment_text": "", "digests": { "md5": "c865f7446bba2f8a4b4d43803f4398bc", "sha256": "42f69e845a2d52a979f6cf8d2ddaff6870148fa0d414dd6d7c31fcf195556dd7" }, "downloads": -1, "filename": "wanish-0.5.4.tar.gz", "has_sig": false, "md5_digest": "c865f7446bba2f8a4b4d43803f4398bc", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1930337, "upload_time": "2016-04-11T11:14:47", "url": "https://files.pythonhosted.org/packages/c2/9d/fa4843757ff63df07df9e1b9214cdb9187d479586459bfde82bc7b6fc95a/wanish-0.5.4.tar.gz" } ], "0.6.0": [ { "comment_text": "", "digests": { "md5": "791f7e7bc390d40c7be2c4b037c6e435", "sha256": "973896a39058b53155b1d02f331f28f30f46aadc8e92396f012910bf7792c323" }, "downloads": -1, "filename": "wanish-0.6.0.tar.gz", "has_sig": false, "md5_digest": "791f7e7bc390d40c7be2c4b037c6e435", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1918407, "upload_time": "2016-09-16T15:07:57", "url": "https://files.pythonhosted.org/packages/5f/55/2438c2ebc90755d5d8b0698d6702e32688f2615c6ebdc3e5109963c2e464/wanish-0.6.0.tar.gz" } ], "0.6.1": [ { "comment_text": "", "digests": { "md5": "2c56cfbc27a76223b1c662add5f36251", "sha256": "483cabad69260c8017481721105fe670d11cad2d8cc43b09b578544968dda603" }, "downloads": -1, "filename": "wanish-0.6.1.tar.gz", "has_sig": false, "md5_digest": "2c56cfbc27a76223b1c662add5f36251", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1919842, "upload_time": "2016-10-18T18:04:13", "url": "https://files.pythonhosted.org/packages/a9/91/5704d8041aba920ee9c54adb8644c9c7edc4c5e08d40eaed896c48b03f92/wanish-0.6.1.tar.gz" } ], "0.6.2": [ { "comment_text": "", "digests": { "md5": "826679d6460a9fae789fe3fd2fdbdbcd", "sha256": "5135ba52ed0cd3e296ae708b71b474ca70cbff78279fc2425f4082ff2b875eed" }, "downloads": -1, "filename": "wanish-0.6.2.tar.gz", "has_sig": false, "md5_digest": "826679d6460a9fae789fe3fd2fdbdbcd", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1919818, "upload_time": "2016-10-26T18:49:25", "url": "https://files.pythonhosted.org/packages/fd/64/61fb3646adfd44925df03a0362c240a68693990d262f1ff0426625fa4d83/wanish-0.6.2.tar.gz" } ], "0.6.3": [ { "comment_text": "", "digests": { "md5": "504435bea4e6165b2b988ae5049575c6", "sha256": "d2baed7f68873fe8001a02afd6350e6feae38321013cbef2960a2c21a52155d5" }, "downloads": -1, "filename": "wanish-0.6.3.tar.gz", "has_sig": false, "md5_digest": "504435bea4e6165b2b988ae5049575c6", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1919834, "upload_time": "2016-12-11T12:10:55", "url": "https://files.pythonhosted.org/packages/de/f6/f3f18386c0275db1e88092235b8af3a22ac1f7b1f9f7fef8d3b48314bf30/wanish-0.6.3.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "504435bea4e6165b2b988ae5049575c6", "sha256": "d2baed7f68873fe8001a02afd6350e6feae38321013cbef2960a2c21a52155d5" }, "downloads": -1, "filename": "wanish-0.6.3.tar.gz", "has_sig": false, "md5_digest": "504435bea4e6165b2b988ae5049575c6", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 1919834, "upload_time": "2016-12-11T12:10:55", "url": "https://files.pythonhosted.org/packages/de/f6/f3f18386c0275db1e88092235b8af3a22ac1f7b1f9f7fef8d3b48314bf30/wanish-0.6.3.tar.gz" } ] }