{ "info": { "author": "Tyler Brown", "author_email": "tylers.pile@gmail.com", "bugtrack_url": null, "classifiers": [], "description": "===========\nGatherNews\n===========\n\nGathers unstructured news data and commits it to a SQLite3 database.The goal\nfor GatherNews is to quickly and simply capture news data.\n\nGatherNews allows you to specify which News sites you want to capture by\nproviding the RSS link in \"feeds_list.txt\" like this:\n\n.. code-block:: pycon\n\t\t\n http://feeds.reuters.com/Reuters/worldNews\n http://rss.cnn.com/rss/money_latest.rss\n\nYou can then gather the news using the load_db() method:\n\n.. code-block:: pycon\n\t\t\n >>> # Create new tables if any new RSS feed addresses have been added\n >>> # Populate all tables with RSS news feeds\n >>> # Remove duplicate entries\n >>> capture_feeds.load_db()\n\nThe examples folder contains working code for each module.\n\nFeatures\n--------\n\n- Creates tables with a predefined schema\n- Populates each table in the SQLite3 database with articles\n- Removes duplicate articles from each table\n- Includes bug fixes for issues raised about version 0.1.0\n- Faster than version 0.1.0, see the benchmarks!\n\nInstallation\n------------\n\nTo install GatherNews use pip:\n\n.. code-block:: bash\n\t\t\n $ pip install gathernews\n\nTesting\n-------\n\nAfter installation, you can launch the test suite from outside the source\ndirectory (you will need to have nosetests installed):\n\n.. code-block:: bash\n\n $ nosetests --exe GatherNews\n\nBug Resolution\n--------------\n\nIn GatherNews 0.1.0, a bug was introduced that does not allow you to add new\nRSS feeds to the \u2018feeds_list.txt\u2019 after your initial call of the\ncreate_tables() method. Before using version 0.2.0 on a preexisting database\ncall the following method.\n\n.. code-block:: pycon\n\n >>> path = \"path/to_your/existing_sqlite3_db/\n >>> from gathernews.gRSS import CaptureFeeds\n >>> capture_feeds = CaptureFeeds(path)\n >>> capture_feeds.fix_create_table_bug()\n\nIf this gives you any problems, email me directly (tylers.pile@gmail.com).\nThanks.\n \nDocumentation\n-------------\n\nDocumentation is available at http://gathernews.readthedocs.org/en/latest/\n\nContribute\n----------\n\n#. Issue tracker is here: https://github.com/Bonza-Times/GatherNews/issues\n#. Fork it!", "description_content_type": null, "docs_url": null, "download_url": "UNKNOWN", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "http://pypi.python.org/pypi/GatherNews/", "keywords": null, "license": "doc/LICENSE.txt", "maintainer": null, "maintainer_email": null, "name": "GatherNews", "package_url": "https://pypi.org/project/GatherNews/", "platform": "UNKNOWN", "project_url": "https://pypi.org/project/GatherNews/", "project_urls": { "Download": "UNKNOWN", "Homepage": "http://pypi.python.org/pypi/GatherNews/" }, "release_url": "https://pypi.org/project/GatherNews/0.2.1/", "requires_dist": null, "requires_python": null, "summary": "Gathers unstructured News data into a SQLite3 db", "version": "0.2.1" }, "last_serial": 1001513, "releases": { "0.1.0": [ { "comment_text": "", "digests": { "md5": "b26fb0784712bf53f2fc4949d4e3a3a9", "sha256": "5404de3a9d0fe9dd8980b5a6b9d8d18632b5d54ecdfd6dc3ccd450ded1db72c3" }, "downloads": -1, "filename": "GatherNews-0.1.0.tar.gz", "has_sig": false, "md5_digest": "b26fb0784712bf53f2fc4949d4e3a3a9", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 4207, "upload_time": "2013-11-10T19:35:51", "url": "https://files.pythonhosted.org/packages/8b/dd/9bf1091266048488c1e48a1797ce25500593f73e0928dec57ea12e0d4217/GatherNews-0.1.0.tar.gz" } ], "0.1.1": [ { "comment_text": "", "digests": { "md5": "d0cb389ead57b9ab13eec0ab5271ab4a", "sha256": "6bbbe0a49048c7d0edf75c4f7ec3dd961a2b94e9cef52edd2f30fef667e18896" }, "downloads": -1, "filename": "GatherNews-0.1.1.tar.gz", "has_sig": false, "md5_digest": "d0cb389ead57b9ab13eec0ab5271ab4a", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 9776, "upload_time": "2014-02-16T20:27:56", "url": "https://files.pythonhosted.org/packages/53/78/52f540088262241b982045873f96ba2cd5d7c24d6fcffc9d952d94a175d5/GatherNews-0.1.1.tar.gz" } ], "0.2.0": [ { "comment_text": "", "digests": { "md5": "079ad66d34313b1820589f2a0e69fff6", "sha256": "becdc71e8c098f2796039ca9512b3bb539ca644b8ee1451b9c152c9aec11fde9" }, "downloads": -1, "filename": "GatherNews-0.2.0.tar.gz", "has_sig": false, "md5_digest": "079ad66d34313b1820589f2a0e69fff6", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 9774, "upload_time": "2014-02-16T20:28:25", "url": "https://files.pythonhosted.org/packages/57/1e/48a22fd760f5fd16510c8328c35e0c1b86820e1fe8894667ee7d7dc8d76c/GatherNews-0.2.0.tar.gz" } ], "0.2.1": [ { "comment_text": "", "digests": { "md5": "089efce1657d3d99f54f0f2c1ff86e3b", "sha256": "aa9fefda00ab2b4a028320e7cadf9c3c83be5961e831a3b9a93383206ae3b3a2" }, "downloads": -1, "filename": "GatherNews-0.2.1.tar.gz", "has_sig": false, "md5_digest": "089efce1657d3d99f54f0f2c1ff86e3b", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 10060, "upload_time": "2014-02-16T20:34:11", "url": "https://files.pythonhosted.org/packages/11/f0/ddb4af6be78e9295c723251bf58cd102a9590f03a256ea5af099680d4567/GatherNews-0.2.1.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "089efce1657d3d99f54f0f2c1ff86e3b", "sha256": "aa9fefda00ab2b4a028320e7cadf9c3c83be5961e831a3b9a93383206ae3b3a2" }, "downloads": -1, "filename": "GatherNews-0.2.1.tar.gz", "has_sig": false, "md5_digest": "089efce1657d3d99f54f0f2c1ff86e3b", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 10060, "upload_time": "2014-02-16T20:34:11", "url": "https://files.pythonhosted.org/packages/11/f0/ddb4af6be78e9295c723251bf58cd102a9590f03a256ea5af099680d4567/GatherNews-0.2.1.tar.gz" } ] }