{ "info": { "author": "Mateo Gianolio", "author_email": "gianoliomateo@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 3 - Alpha", "Intended Audience :: Developers", "License :: OSI Approved :: MIT License", "Programming Language :: Python :: 2", "Programming Language :: Python :: 2.6", "Programming Language :: Python :: 2.7", "Programming Language :: Python :: 3", "Programming Language :: Python :: 3.2", "Programming Language :: Python :: 3.3", "Programming Language :: Python :: 3.4", "Topic :: Software Development :: Libraries" ], "description": "lurk\n====\n\nA script that extracts the HTML elements matching a CSS selector from web pages.\n::\n\n    $ pip install lurk\n\n=====\nusage\n=====\n\n**in python**\n\nIn Python, lurk returns an iterable of dictionaries, one per matched element:\n\n::\n\n    from lurk import lurk\n\n    for link in lurk('http://en.wikipedia.org/wiki/en', 'a'):\n        if 'href' in link:\n            print(link['href'])\n\n**in bash**\n\nIn bash, lurk returns JSON.\n\nFamiliarize yourself with CSS attribute selectors.\n\n::\n\n    $ lurk \\\n        http://www.gnu.org/software/libc/manual/html_node/Function-Index.html \\\n        'a[href*=\"#index-\"]' \\\n        > links.json\n\nThis command saves a JSON array of links to all GNU C library functions into **links.json**:\n\n::\n\n    [\n      {\n        \"code\": \"*pthread_getspecific\",\n        \"href\": \"Thread_002dspecific-Data.html#index-_002apthread_005fgetspecific\"\n      },\n\n      {\n        \"code\": \"*sbrk\",\n        \"href\": \"Resizing-the-Data-Segment.html#index-_002asbrk\"\n      },\n\n      // ...\n    ]", "description_content_type": null, "docs_url": null, "download_url": "UNKNOWN", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/mateogianolio/lurk", "keywords": "lurk lurker scrape scraper scraping webscrape crawl crawler crawling", "license": "MIT", "maintainer": null, "maintainer_email": null, "name": "lurk", "package_url": "https://pypi.org/project/lurk/", "platform": "UNKNOWN", "project_url": 
"https://pypi.org/project/lurk/", "project_urls": { "Download": "UNKNOWN", "Homepage": "https://github.com/mateogianolio/lurk" }, "release_url": "https://pypi.org/project/lurk/0.1.3/", "requires_dist": null, "requires_python": null, "summary": "Extract html from one or multiple urls", "version": "0.1.3" }, "last_serial": 1784120, "releases": { "0.1.0": [ { "comment_text": "", "digests": { "md5": "cf716e4c65f25586c0960bfd47b1171f", "sha256": "999858051d0db228f7b94634e119b30af29bc23ad43e6d3843c78fcd2fef6d65" }, "downloads": -1, "filename": "lurk-0.1.0-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "cf716e4c65f25586c0960bfd47b1171f", "packagetype": "bdist_wheel", "python_version": "2.7", "requires_python": null, "size": 4108, "upload_time": "2015-10-24T12:04:30", "url": "https://files.pythonhosted.org/packages/e8/37/7086cbd0e15131a75c9c6b69c00273b16c8bdb4155a88177aa7ac65bda73/lurk-0.1.0-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "641192497af18aebc43e104edf3d22b9", "sha256": "4a5dbb3b47a0f1c3eecd635de4c7d532d4ebbdeddf9bff5331c1c95c9bb4741a" }, "downloads": -1, "filename": "lurk-0.1.0.tar.gz", "has_sig": false, "md5_digest": "641192497af18aebc43e104edf3d22b9", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2460, "upload_time": "2015-10-24T12:04:25", "url": "https://files.pythonhosted.org/packages/b3/4d/feea6076eaad7d5f1b03ca1c227eb797ce3846170ad58daa8efc81f17cad/lurk-0.1.0.tar.gz" } ], "0.1.1": [ { "comment_text": "", "digests": { "md5": "8c499d89af76fe58b9a113c06e1a0d54", "sha256": "14e9125bb4df9722a28cb0153f262357607dcaa41fd9ae59671e863be6ca5906" }, "downloads": -1, "filename": "lurk-0.1.1-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "8c499d89af76fe58b9a113c06e1a0d54", "packagetype": "bdist_wheel", "python_version": "2.7", "requires_python": null, "size": 4140, "upload_time": "2015-10-24T12:09:20", "url": 
"https://files.pythonhosted.org/packages/8c/a4/783ee0cda75d40fca104ea8bc3840eecd4e7ac7c5505a3d3ca1b8b0b5121/lurk-0.1.1-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "62d5035f8bed3b2389de4aa49f296896", "sha256": "0d3585581b9b693536561fe6be2946a663fdead12c1e87163ab2a3679dcab270" }, "downloads": -1, "filename": "lurk-0.1.1.tar.gz", "has_sig": false, "md5_digest": "62d5035f8bed3b2389de4aa49f296896", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2487, "upload_time": "2015-10-24T12:09:16", "url": "https://files.pythonhosted.org/packages/81/a5/834664896e7e16bd7238eb22dac7e3c76d54d35bfb2d223b3aa27ec66a47/lurk-0.1.1.tar.gz" } ], "0.1.2": [ { "comment_text": "", "digests": { "md5": "da737c25557701748753baf369620c26", "sha256": "ad0da97fc40d9334d2e318ef23281e6b15e102fe08d8a8016c7aef25b9c56c0e" }, "downloads": -1, "filename": "lurk-0.1.2-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "da737c25557701748753baf369620c26", "packagetype": "bdist_wheel", "python_version": "2.7", "requires_python": null, "size": 4096, "upload_time": "2015-10-24T12:28:32", "url": "https://files.pythonhosted.org/packages/d8/b3/85a026aa6baee45e989816271d590b132b534082284b8e9050924203bc89/lurk-0.1.2-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "ad7e486bdc29be1978eafd8961a99515", "sha256": "d248b5ce9de38b6a989535cec2bb5a630d46a3f4040acad6b896e368e42ca412" }, "downloads": -1, "filename": "lurk-0.1.2.tar.gz", "has_sig": false, "md5_digest": "ad7e486bdc29be1978eafd8961a99515", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2440, "upload_time": "2015-10-24T12:28:28", "url": "https://files.pythonhosted.org/packages/98/52/b47bd6a69274dc40ab9055965e5ef8f8f6b4d868aa71d42dff9428b18e8e/lurk-0.1.2.tar.gz" } ], "0.1.3": [ { "comment_text": "", "digests": { "md5": "a1dd5d12f5d5aa56792d66ac4d90f888", "sha256": "8df1928e56c4985225877202d59f20c3f685c3f3092daf04db309c69841a2dcb" }, "downloads": 
-1, "filename": "lurk-0.1.3-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "a1dd5d12f5d5aa56792d66ac4d90f888", "packagetype": "bdist_wheel", "python_version": "2.7", "requires_python": null, "size": 4093, "upload_time": "2015-10-24T12:34:16", "url": "https://files.pythonhosted.org/packages/0d/5a/6cb368063cb8409b0213e256c3ba611e931ddc14e8c277eaa32384d7fb5c/lurk-0.1.3-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "0fea7cd64e2bb83faa21b23e31ad32bf", "sha256": "5c93d12d655d65cb7de0457522c99f050f2ab96fa2f34e2e348ef6d315ab1469" }, "downloads": -1, "filename": "lurk-0.1.3.tar.gz", "has_sig": false, "md5_digest": "0fea7cd64e2bb83faa21b23e31ad32bf", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2435, "upload_time": "2015-10-24T12:34:03", "url": "https://files.pythonhosted.org/packages/58/89/d29d51c32ed231abe81b1b1731306af1df2c8e70bc64cca0c874c5255090/lurk-0.1.3.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "a1dd5d12f5d5aa56792d66ac4d90f888", "sha256": "8df1928e56c4985225877202d59f20c3f685c3f3092daf04db309c69841a2dcb" }, "downloads": -1, "filename": "lurk-0.1.3-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "a1dd5d12f5d5aa56792d66ac4d90f888", "packagetype": "bdist_wheel", "python_version": "2.7", "requires_python": null, "size": 4093, "upload_time": "2015-10-24T12:34:16", "url": "https://files.pythonhosted.org/packages/0d/5a/6cb368063cb8409b0213e256c3ba611e931ddc14e8c277eaa32384d7fb5c/lurk-0.1.3-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "0fea7cd64e2bb83faa21b23e31ad32bf", "sha256": "5c93d12d655d65cb7de0457522c99f050f2ab96fa2f34e2e348ef6d315ab1469" }, "downloads": -1, "filename": "lurk-0.1.3.tar.gz", "has_sig": false, "md5_digest": "0fea7cd64e2bb83faa21b23e31ad32bf", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2435, "upload_time": "2015-10-24T12:34:03", "url": 
"https://files.pythonhosted.org/packages/58/89/d29d51c32ed231abe81b1b1731306af1df2c8e70bc64cca0c874c5255090/lurk-0.1.3.tar.gz" } ] }