{ "info": { "author": "M Santhosh Kumar", "author_email": "santhoshse7en@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 4 - Beta", "Intended Audience :: Developers", "Intended Audience :: End Users/Desktop", "Intended Audience :: System Administrators", "License :: OSI Approved :: MIT License", "Operating System :: OS Independent", "Programming Language :: Python", "Topic :: Communications :: Email", "Topic :: Office/Business", "Topic :: Software Development :: Bug Tracking" ], "description": "[![PyPI Version](https://img.shields.io/pypi/v/news-fetch.svg)](https://pypi.org/project/news-fetch)\n[![Coverage Status](https://coveralls.io/repos/github/santhoshse7en/news-fetch/badge.svg?branch=master)](https://coveralls.io/github/santhoshse7en/news-fetch?branch=master)\n[![License](https://img.shields.io/pypi/l/news-fetch.svg)](https://pypi.python.org/pypi/news-fetch/)\n[![Documentation Status](https://readthedocs.org/projects/pip/badge/?version=latest&style=flat)](https://santhoshse7en.github.io/news-fetch_doc)\n\n# news-fetch\n\nnews-fetch was built on top of [news-please](https://pypi.org/project/news-please/) - [Felix Hamborg](https://www.linkedin.com/in/felixhamborg/) and [Newspaper3K](https://pypi.org/project/newspaper3k/) - [Lucas (\u00e6\u00ac\u00a7\u00e9\u02dc\u00b3\u00e8\u00b1\u00a1) Ou-Yang](https://www.linkedin.com/in/lucasouyang/) 'Thank You' both you without them it will be very hard to extract online newspaper. This package consist of both features provided my Felix's work and Lucas' work\n\nnews-fetch is an open source, easy-to-use news crawler that extracts structured information from almost any news website. . I built this to reduce most of NaN or '' or [] or 'None' values while scraping for some newspapers. Platform-independent and written in Python 3. This package can be very easily used by programmers and developers to provide access to the news data to their programs.\n\n| Source | Link |\n| --- | --- |\n| PyPI: | https://pypi.org/project/news-fetch/ |\n| Repository: | https://santhoshse7en.github.io/news-fetch/ |\n| Documentation: | https://santhoshse7en.github.io/news-fetch_doc/ |\n\n## Dependencies\n\n- news-please\n- newspaper3k\n- beautifulsoup4\n- fake_useragent\n- selenium\n- chromedriver-binary\n- fake_useragent\n- spacy\n- pandas\n\n## Dependencies Installation\n\nUse the package manager [pip](https://pip.pypa.io/en/stable/) to install following\n```bash\npip install -r requirements.txt\n```\n\n## Usage\n\nDownload it by clicking the green download button here on [Github](https://github.com/santhoshse7en/news-fetch/archive/master.zip). To extract URLs from targeted website call google_search function, you only need to parse argument of keyword and newspaper link.\n\n```python\n>>> from newsfetch.news import google_search\n>>> google = google_search('Alcoholics Anonymous', 'https://timesofindia.indiatimes.com/')\n```\n\n**Directory of google search results urls**\n\n![google](https://user-images.githubusercontent.com/47944792/60381562-67363380-9a74-11e9-99ea-51c27bf08abc.PNG)\n\nTo scrape the all news details call newspaper function\n\n```python\n>>> from newsfetch.news import newspaper\n>>> news = newspaper('https://www.bbc.co.uk/news/world-48810070')\n```\n\n**Directory of news**\n\n![newsdir](https://user-images.githubusercontent.com/47944792/60564817-c058dc80-9d7e-11e9-9b3e-d0b5a903d972.PNG)\n\n```python\n>>> news.headline\n\n'g20 summit: trump and xi agree to restart us china trade talks'\n```\n\n## Contributing\n\nPull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.\n\nPlease make sure to update tests as appropriate.\n\n## License\n[MIT](https://choosealicense.com/licenses/mit/)", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://santhoshse7en.github.io/news-fetch/", "keywords": "Newspaper,news-fetch,without-api,google_scraper,news_scraper,bs4,lxml,news-crawler,news-extractor,crawler,extractor,news,news-websites,elasticsearch,json,python,nlp,data-gathering,news-archive,news-articles,commoncrawl,extract-articles,extract-information,news-scraper,spacy", "license": "", "maintainer": "", "maintainer_email": "", "name": "news-fetch", "package_url": "https://pypi.org/project/news-fetch/", "platform": "", "project_url": "https://pypi.org/project/news-fetch/", "project_urls": { "Homepage": "https://santhoshse7en.github.io/news-fetch/" }, "release_url": "https://pypi.org/project/news-fetch/0.2.2/", "requires_dist": null, "requires_python": "", "summary": "news-fetch is an open source easy-to-use news extractor and basic nlp (cleaning_text, keywords, summary) comes handy that just works", "version": "0.2.2" }, "last_serial": 5572885, "releases": { "0.0.1": [ { "comment_text": "", "digests": { "md5": "caf24ba611f0ebf6a9d89d148ae75091", "sha256": "40da90ddb1af41c83305f0b4cc86847041e2af69db90462e38f17092389e71da" }, "downloads": -1, "filename": "news-fetch-0.0.1.tar.gz", "has_sig": false, "md5_digest": "caf24ba611f0ebf6a9d89d148ae75091", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 4572, "upload_time": "2019-06-29T09:12:36", "url": "https://files.pythonhosted.org/packages/e0/27/a1efac643579a29dccd60a46c1890a70e57b89c989922ff5d15c55ecd533/news-fetch-0.0.1.tar.gz" } ], "0.0.2": [ { "comment_text": "", "digests": { "md5": "09d9e70cfbeb23cf7b130ddb41e74d17", "sha256": "e6b44887cb915d704126e888642b1ef290c9d8b57d2185b81500db44f1529785" }, "downloads": -1, "filename": "news_fetch-0.0.2.tar.gz", "has_sig": false, "md5_digest": "09d9e70cfbeb23cf7b130ddb41e74d17", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 4568, "upload_time": "2019-06-29T09:19:40", "url": "https://files.pythonhosted.org/packages/2e/15/50e65ff3d95415fd113048ecfef9fb0cdadb9b83723ab2af7b5e72a2c894/news_fetch-0.0.2.tar.gz" } ], "0.0.3": [ { "comment_text": "", "digests": { "md5": "bd1445331ff39f6909e4c09a86d91aeb", "sha256": "0df638b23a149becb87a237c6704aa6b61f2fa9a247a94f64cc6fb8df863eb81" }, "downloads": -1, "filename": "news_fetch-0.0.3.tar.gz", "has_sig": false, "md5_digest": "bd1445331ff39f6909e4c09a86d91aeb", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 4572, "upload_time": "2019-06-29T11:04:35", "url": "https://files.pythonhosted.org/packages/3c/19/4488239b7258a3f924b0f8bceaeac80d6e01405b8075c0029e23773b2b97/news_fetch-0.0.3.tar.gz" } ], "0.0.4": [ { "comment_text": "", "digests": { "md5": "a833dca9a66f9255d984b8c8823b571a", "sha256": "00572711d58f595a46b5ca7622c7f71fda7252da7083160d1f7b923d285e9b75" }, "downloads": -1, "filename": "news_fetch-0.0.4.tar.gz", "has_sig": false, "md5_digest": "a833dca9a66f9255d984b8c8823b571a", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 4612, "upload_time": "2019-06-29T11:23:36", "url": "https://files.pythonhosted.org/packages/e2/bd/e2f976c255c76bc8074a5054e24b26c943c728e52e8df67585688dd71024/news_fetch-0.0.4.tar.gz" } ], "0.0.5": [ { "comment_text": "", "digests": { "md5": "57ee3fd69363e7e3816d215dcf799c3c", "sha256": "496a5529783b49f029eb42d731fa4cc43fcb2dc2bca5c41ae5911057997e9672" }, "downloads": -1, "filename": "news_fetch-0.0.5.tar.gz", "has_sig": false, "md5_digest": "57ee3fd69363e7e3816d215dcf799c3c", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 4649, "upload_time": "2019-06-29T13:28:27", "url": "https://files.pythonhosted.org/packages/c9/52/58624b1bcf512b2563c81c3fe9cc0448ae11d9024e744655dc019f975988/news_fetch-0.0.5.tar.gz" } ], "0.0.6": [ { "comment_text": "", "digests": { "md5": "aba1de930ae50affb1591cc94cdd2c84", "sha256": "4bf05588ec50f4821df3dfe31e3d1436efdf729288df422af71077226f62ecc4" }, "downloads": -1, "filename": "news_fetch-0.0.6.tar.gz", "has_sig": false, "md5_digest": "aba1de930ae50affb1591cc94cdd2c84", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 4735, "upload_time": "2019-06-29T13:38:11", "url": "https://files.pythonhosted.org/packages/42/2b/a970af57b0d677b392097134492ad25c9a0ee6f94aea63f80f01bf8ab32a/news_fetch-0.0.6.tar.gz" } ], "0.0.7": [ { "comment_text": "", "digests": { "md5": "0a3e9f098e57a9180f480904d0e07321", "sha256": "65af978a9ea947322d78bdc9f002cb27904e0f7512df391a72843f0626c45d68" }, "downloads": -1, "filename": "news_fetch-0.0.7.tar.gz", "has_sig": false, "md5_digest": "0a3e9f098e57a9180f480904d0e07321", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5013, "upload_time": "2019-06-29T17:23:05", "url": "https://files.pythonhosted.org/packages/bc/d8/bac46cfc172beeadea46a9ab11cd7d4e6e99583d42e7c088f28e89bd1856/news_fetch-0.0.7.tar.gz" } ], "0.0.8": [ { "comment_text": "", "digests": { "md5": "9ca86bca0a538279d1bce6aad6e091c2", "sha256": "02c012c265c33f2500b938751f025980c8911621420ed4df7c9748e1a3d52236" }, "downloads": -1, "filename": "news_fetch-0.0.8.tar.gz", "has_sig": false, "md5_digest": "9ca86bca0a538279d1bce6aad6e091c2", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5036, "upload_time": "2019-06-29T17:29:12", "url": "https://files.pythonhosted.org/packages/a5/46/a4e4c9590ce167a5501b022c27cbe30cdcb91dfdd6afed76ab3ee711274b/news_fetch-0.0.8.tar.gz" } ], "0.0.9": [ { "comment_text": "", "digests": { "md5": "8bd4425c2152c03c7ba20ed8874576e7", "sha256": "13da3d5cb0c7274de5c8ee75cb85820ac29d01c129b161a4c7279cf4b1c35d88" }, "downloads": -1, "filename": "news_fetch-0.0.9.tar.gz", "has_sig": false, "md5_digest": "8bd4425c2152c03c7ba20ed8874576e7", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5044, "upload_time": "2019-06-29T18:00:11", "url": "https://files.pythonhosted.org/packages/99/2e/36dc9858acef50e815888d9b4b8b3d5e4943a0852bdab7acf23bba64e488/news_fetch-0.0.9.tar.gz" } ], "0.1.0": [ { "comment_text": "", "digests": { "md5": "0cfa381de0220eb507e45b2bd7fa91b2", "sha256": "40286837b48f728e644baf6c87bd7bed6bb22536a1016f30b97ab62989fd00cc" }, "downloads": -1, "filename": "news_fetch-0.1.0.tar.gz", "has_sig": false, "md5_digest": "0cfa381de0220eb507e45b2bd7fa91b2", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5064, "upload_time": "2019-06-29T18:05:19", "url": "https://files.pythonhosted.org/packages/fd/23/bbd712f48d457a7678bcf07a7906c14985349997c3d9cac95e33b6f17972/news_fetch-0.1.0.tar.gz" } ], "0.1.1": [ { "comment_text": "", "digests": { "md5": "fa432994d2ecf42f177625537b8445cc", "sha256": "cf3bcfc4484fabfbd4054c5014eee7ee7079f3580469fdbe0b095ca337c59bf3" }, "downloads": -1, "filename": "news_fetch-0.1.1.tar.gz", "has_sig": false, "md5_digest": "fa432994d2ecf42f177625537b8445cc", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5199, "upload_time": "2019-06-30T04:19:58", "url": "https://files.pythonhosted.org/packages/a3/cf/d448139d37c12d1ec5cbb8f3b8404272b391721b2889171939b4905a3da7/news_fetch-0.1.1.tar.gz" } ], "0.1.2": [ { "comment_text": "", "digests": { "md5": "a079c71052fcc8d82537b25adf1c6d05", "sha256": "0b9b7dcc3cd4dd76c3a29022bfb1facfe4ec5625fe73421e9d23f84bd7cac990" }, "downloads": -1, "filename": "news_fetch-0.1.2.tar.gz", "has_sig": false, "md5_digest": "a079c71052fcc8d82537b25adf1c6d05", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5218, "upload_time": "2019-06-30T06:37:06", "url": "https://files.pythonhosted.org/packages/e5/55/b7ea8cf2f4a1e5e8c23f8f53b958ce0355853b58cffb54dab7ecb8d96abe/news_fetch-0.1.2.tar.gz" } ], "0.1.3": [ { "comment_text": "", "digests": { "md5": "95631754243375082c55dcc26b9f8e0c", "sha256": "3ae916ebafaeaccb5a768b5add2e9c0b61ec6b87046e00f2716407f44b7a7da8" }, "downloads": -1, "filename": "news_fetch-0.1.3.tar.gz", "has_sig": false, "md5_digest": "95631754243375082c55dcc26b9f8e0c", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5142, "upload_time": "2019-06-30T08:15:05", "url": "https://files.pythonhosted.org/packages/ab/62/12990baf454356549ec477bc5f68568c505047d0b48128af24a9a0279b0c/news_fetch-0.1.3.tar.gz" } ], "0.1.4": [ { "comment_text": "", "digests": { "md5": "2c27a4e34c0e07dedd2def6b9edff3b4", "sha256": "2811e22f476d088324282fe52869b5aada99638cdc9f921f9ab8ab66f41db6ad" }, "downloads": -1, "filename": "news_fetch-0.1.4.tar.gz", "has_sig": false, "md5_digest": "2c27a4e34c0e07dedd2def6b9edff3b4", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5113, "upload_time": "2019-06-30T08:29:32", "url": "https://files.pythonhosted.org/packages/54/70/c073a564d61f36aed4b04cf382de20b54dc19f5ef66a88aa39c1b6a2ce44/news_fetch-0.1.4.tar.gz" } ], "0.1.5": [ { "comment_text": "", "digests": { "md5": "650bc2ac9c4d8fd0c09dbd6d1b0c4fb1", "sha256": "c33e6e420910cbf6c003cbefed38c2aaa7d41fa57fcd910f88d9e1ccfe2f786a" }, "downloads": -1, "filename": "news_fetch-0.1.5.tar.gz", "has_sig": false, "md5_digest": "650bc2ac9c4d8fd0c09dbd6d1b0c4fb1", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5182, "upload_time": "2019-06-30T10:43:28", "url": "https://files.pythonhosted.org/packages/2f/ca/8270fa0a648189b09e0659f1fe43d665aef3ca1f02fdf152a092080fd08d/news_fetch-0.1.5.tar.gz" } ], "0.1.6": [ { "comment_text": "", "digests": { "md5": "518ad8d4ce32c9a9427a24cb0ffb0486", "sha256": "8d879e7872a4e5d62bd197540b4a0e5fc3f64031ab9a5892714cfe3e2ef754b9" }, "downloads": -1, "filename": "news-fetch-0.1.6.tar.gz", "has_sig": false, "md5_digest": "518ad8d4ce32c9a9427a24cb0ffb0486", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5451, "upload_time": "2019-07-03T04:54:27", "url": "https://files.pythonhosted.org/packages/49/8a/853f85299b836647ecfa360d3360a9f5441f93e65a0488255ee7ca1ddf5b/news-fetch-0.1.6.tar.gz" } ], "0.1.7": [ { "comment_text": "", "digests": { "md5": "45fb76dad14d18a2c8fdb6a58b5b0ce6", "sha256": "5b37eea57a408e6f02e32efe52ba2a7774fa108777ef04441b17c47942f2d3e4" }, "downloads": -1, "filename": "news-fetch-0.1.7.tar.gz", "has_sig": false, "md5_digest": "45fb76dad14d18a2c8fdb6a58b5b0ce6", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5450, "upload_time": "2019-07-03T05:11:55", "url": "https://files.pythonhosted.org/packages/64/06/2258123d015894024b5cb7dfa41c405b63427474092beedf1d2809e37c57/news-fetch-0.1.7.tar.gz" } ], "0.1.8": [ { "comment_text": "", "digests": { "md5": "d898c7c3473092f0fb9fcd70932ecdce", "sha256": "5076d90ef0789bfee448167729d3f73d2a330086668cf05e76e4bcd09d70885a" }, "downloads": -1, "filename": "news-fetch-0.1.8.tar.gz", "has_sig": false, "md5_digest": "d898c7c3473092f0fb9fcd70932ecdce", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 6503, "upload_time": "2019-07-14T11:19:44", "url": "https://files.pythonhosted.org/packages/79/0e/203e6c493498c7942db7ffb5a111e5d9a3dacc7801ac0dd46909e5b1cdab/news-fetch-0.1.8.tar.gz" } ], "0.1.9": [ { "comment_text": "", "digests": { "md5": "586b57fa99352960235b360244520328", "sha256": "eb5deef380abcd00744858ed6a87c7f9583db3b082cf948e99b55b52f1d26d98" }, "downloads": -1, "filename": "news-fetch-0.1.9.tar.gz", "has_sig": false, "md5_digest": "586b57fa99352960235b360244520328", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 6667, "upload_time": "2019-07-23T13:08:15", "url": "https://files.pythonhosted.org/packages/25/99/2afc28b9cc65def79551a5d1cac952fcfe57be106293675f09e1572b869b/news-fetch-0.1.9.tar.gz" } ], "0.2.0": [ { "comment_text": "", "digests": { "md5": "1dec219ec6e13735efeca2724765583f", "sha256": "eaa8ce0d5d15ddda95fb8999568fece4eaabd884ab715adf25efa80641014f9b" }, "downloads": -1, "filename": "news-fetch-0.2.0.tar.gz", "has_sig": false, "md5_digest": "1dec219ec6e13735efeca2724765583f", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 6663, "upload_time": "2019-07-23T13:29:54", "url": "https://files.pythonhosted.org/packages/8d/60/a4eec46627c31a0884b11d734d273dfdba7c4692e8313217d2c4ea74e64d/news-fetch-0.2.0.tar.gz" } ], "0.2.1": [ { "comment_text": "", "digests": { "md5": "fbb053fdb07592f986c1144963e5da62", "sha256": "24c36f6145bf1d7ec694c9af5b1c4e23331a853b8912322df558c6dcae6d7df0" }, "downloads": -1, "filename": "news-fetch-0.2.1.tar.gz", "has_sig": false, "md5_digest": "fbb053fdb07592f986c1144963e5da62", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 6657, "upload_time": "2019-07-23T14:34:41", "url": "https://files.pythonhosted.org/packages/57/72/4b3b80f71a5dcb7fd3d36bbddb5fd94382db6334814a682dd90295f411b3/news-fetch-0.2.1.tar.gz" } ], "0.2.2": [ { "comment_text": "", "digests": { "md5": "a7bd4759326f5fb48c984f1193299ba7", "sha256": "280923ce27be10bc0ff6e252b1165bf21de9432bb9a30ccd84e54680f02e8d6b" }, "downloads": -1, "filename": "news-fetch-0.2.2.tar.gz", "has_sig": false, "md5_digest": "a7bd4759326f5fb48c984f1193299ba7", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 6660, "upload_time": "2019-07-23T14:57:27", "url": "https://files.pythonhosted.org/packages/12/54/6704f0025cddcf7a5e6462edd8d0db486eadc5f43cb18f45ec6eb7fa5c28/news-fetch-0.2.2.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "a7bd4759326f5fb48c984f1193299ba7", "sha256": "280923ce27be10bc0ff6e252b1165bf21de9432bb9a30ccd84e54680f02e8d6b" }, "downloads": -1, "filename": "news-fetch-0.2.2.tar.gz", "has_sig": false, "md5_digest": "a7bd4759326f5fb48c984f1193299ba7", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 6660, "upload_time": "2019-07-23T14:57:27", "url": "https://files.pythonhosted.org/packages/12/54/6704f0025cddcf7a5e6462edd8d0db486eadc5f43cb18f45ec6eb7fa5c28/news-fetch-0.2.2.tar.gz" } ] }