{ "info": { "author": "Sebastian Brass (sbabrass)", "author_email": "", "bugtrack_url": null, "classifiers": [ "License :: Repoze Public License", "Programming Language :: Python", "Programming Language :: Python :: 2.7", "Programming Language :: Python :: 3", "Topic :: Internet :: WWW/HTTP :: Indexing/Search" ], "description": "============\nSiteSearcher\n============\n\nAbout SiteSearcher\n==================\n\n**SiteSearcher** is a command line tool that creates fulltext search indexes of your favourite websites on your machine, and allows you to search them locally.\n\nUsage\n-----\n\n:code:`sitesearcher indexer ` - Create a local search index for :code:``\n\n:code:`sitesearcher search ` - Open search prompt for :code:``\n\nIndexing of large sites can take quite long, but you can stop the indexer at any time and continue later at the point where you left off. To halt the indexer, simply type :code:`+C` once and wait for graceful exit. To restart run the index command again with the :code:`--continue` flag, i.e. :code:`sitesearcher indexer --continue`.\n\nWeb Server Friendly\n-------------------\n\n**SiteSearcher** tries to be web server friendly, while crawling. It obeys :code:`robot.txt`, identifies itself with the :code:`\"SiteSearcher\"` UserAgent and uses the `Scrapy Autothrottle Extension `_ to reduce the load on the server.\n\nInstalling SiteSearcher\n=======================\n\nIf you have :code:`pip` installed, you can use :code:`pip` to download and install **SiteSearcher**.\n\n.. code:: bash\n\n\tpip install sitesearcher\n\n**SiteSearcher** uses the `Scrapy `_ bot framework and therefore inherits its `dependencies `_.\n\n\nGetting the source\n==================\n\nDownload source releases from PyPI at http://pypi.python.org/pypi/sitesearcher\n\nYou can check out the latest version of source code from GitHub.\n\n.. code::\n\n\tgit clone https://github.com/sbabrass/sitesearcher\n\nPython Version Support\n======================\n\n**SiteSearcher** supports Python Versions 2.7 and 3.3+.\n\nHowever switching between Python versions may require a rebuild of your indexes, as there is currently no support for SiteSearcher/Python 2 to read and write indexes created with SiteSearcher/Python 3 and vice versa.\n\n\nHistory\n=======\n\n0.1a1\n-----\n\n- Initial version of the SiteSearcher tool\n- Create ``Scrapy`` crawler to extract full text content of sites\n- Create ``Whoosh`` indexer to index stored sites\n- Create CLI for indexing and searching\n\n0.1a2\n-----\n\n- Minor code cleanups", "description_content_type": null, "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "", "keywords": "scrapy whoosh searching indexing websearch", "license": "BSD-derived (http://www.repoze.org/LICENSE.txt)", "maintainer": "", "maintainer_email": "", "name": "sitesearcher", "package_url": "https://pypi.org/project/sitesearcher/", "platform": "", "project_url": "https://pypi.org/project/sitesearcher/", "project_urls": null, "release_url": "https://pypi.org/project/sitesearcher/0.1a2/", "requires_dist": null, "requires_python": "", "summary": "A command line tool that creates fulltext search indexes of your favourite websites on your machine, and allows you to search them locally", "version": "0.1a2" }, "last_serial": 2326229, "releases": { "0.1a1": [ { "comment_text": "", "digests": { "md5": "2cd8fe43bbd1a6f2f0b1590a9edb947f", "sha256": "59db8117cdfc42b984fec59d33c8317d5a9d5f63894a9f38d695f336ea8fd5c3" }, "downloads": -1, "filename": "sitesearcher-0.1a1.tar.gz", "has_sig": false, "md5_digest": "2cd8fe43bbd1a6f2f0b1590a9edb947f", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 7386, "upload_time": "2016-09-05T05:46:36", "url": "https://files.pythonhosted.org/packages/c7/7f/7037bcbcfc8699cd05d98dad9ad061b1dcad5e2d5c6c48da3339fb1644ca/sitesearcher-0.1a1.tar.gz" } ], "0.1a2": [ { "comment_text": "", "digests": { "md5": "5476516cf00f92e5da42ba8c6af02a8b", "sha256": "56d6aa106746281deb53e823469cb3451012b45af798ef5e24df726e6d00bbfe" }, "downloads": -1, "filename": "sitesearcher-0.1a2.tar.gz", "has_sig": false, "md5_digest": "5476516cf00f92e5da42ba8c6af02a8b", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 7152, "upload_time": "2016-09-05T21:31:07", "url": "https://files.pythonhosted.org/packages/96/c5/6ae1b2a6290449cffb9ffe8544df564aa20be5b108078200a7ba0fcde2c9/sitesearcher-0.1a2.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "5476516cf00f92e5da42ba8c6af02a8b", "sha256": "56d6aa106746281deb53e823469cb3451012b45af798ef5e24df726e6d00bbfe" }, "downloads": -1, "filename": "sitesearcher-0.1a2.tar.gz", "has_sig": false, "md5_digest": "5476516cf00f92e5da42ba8c6af02a8b", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 7152, "upload_time": "2016-09-05T21:31:07", "url": "https://files.pythonhosted.org/packages/96/c5/6ae1b2a6290449cffb9ffe8544df564aa20be5b108078200a7ba0fcde2c9/sitesearcher-0.1a2.tar.gz" } ] }