{ "info": { "author": "Ryan Stevens", "author_email": "ryan.louis.stevens@gmail.com", "bugtrack_url": null, "classifiers": [ "License :: OSI Approved :: MIT License", "Operating System :: OS Independent", "Programming Language :: Python :: 3" ], "description": "# Patent Scraper\nA Python package to scrape patents from 'https://patents.google.com/'. The package is made up of a single Python class, google_scraper(). The scraper can be used to retrieve the parsed HTML of a single patent's page or of a list of patents.\n\n## Main Use Cases\n\nThere are two primary ways to use this package:\n1. Scrape a single patent\n\n```python\n# ~ Import packages ~ #\nfrom patent_scraper import google_scraper\nimport json\n\n# ~ Initialize scraper class ~ #\nscraper = google_scraper()\n\n# ~ Scrape patents individually ~ #\n#\n# request_single_patent returns whether the scrape\n# was successful and the HTML parsed with bs4\nerr_1, soup_1 = scraper.request_single_patent('US2668287A')\nerr_2, soup_2 = scraper.request_single_patent('US266827A')\n\n# ~ Parse results of scrape ~ #\npatent_1_parsed = scraper.process_patent_html(soup_1)\npatent_2_parsed = scraper.process_patent_html(soup_2)\n```\n\n2. Scrape a list of patents\n\n```python\n# ~ Import packages ~ #\nfrom patent_scraper import google_scraper\nimport json\n\n# ~ Initialize scraper class ~ #\nscraper = google_scraper()\n\n# ~ Add patents to list ~ #\nscraper.add_patents('2668287A')\nscraper.add_patents('266827A')\n\n# ~ Scrape all patents ~ #\nscraper.scrape_all_patents()\n\n# ~ Get results of scrape ~ #\npatent_1_parsed = scraper.parsed_patents['US2668287A']\npatent_2_parsed = scraper.parsed_patents['US266827A']\n\n# ~ Print inventors of patent US2668287A ~ #\nfor inventor in json.loads(patent_1_parsed['inventor_name']):\n    print('Patent inventor : {0}'.format(inventor['inventor_name']))\n```\n\n\n### Example Files\n\nI have provided two separate example scripts that show how to use this package:\n 1. 
Scrape a patent\n 2. Scrape many patents using the multiprocessing module\n\n\n\n", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/rls542/patent_scraper/", "keywords": "", "license": "", "maintainer": "", "maintainer_email": "", "name": "google-patent-scraper", "package_url": "https://pypi.org/project/google-patent-scraper/", "platform": "", "project_url": "https://pypi.org/project/google-patent-scraper/", "project_urls": { "Homepage": "https://github.com/rls542/patent_scraper/" }, "release_url": "https://pypi.org/project/google-patent-scraper/1.0.5/", "requires_dist": null, "requires_python": ">=3.6", "summary": "A package to scrape patents from 'https://patents.google.com/'", "version": "1.0.5" }, "last_serial": 5787828, "releases": { "1.0.5": [ { "comment_text": "", "digests": { "md5": "ff3c511374840a29e7fbd0ae3650ece5", "sha256": "5f6189674c9416add7154438c4535b21b9fba2d03b98558fc3cf7a852f5f55a5" }, "downloads": -1, "filename": "google_patent_scraper-1.0.5-py3-none-any.whl", "has_sig": false, "md5_digest": "ff3c511374840a29e7fbd0ae3650ece5", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6", "size": 5563, "upload_time": "2019-09-05T18:25:43", "url": "https://files.pythonhosted.org/packages/52/42/c831a84680f08964f56d35e9f6ff214791587d5b8fe0337d3fb655526b11/google_patent_scraper-1.0.5-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "1fdf57bff03f2671ebf0b333962b0499", "sha256": "ddc8dabe04d05cd6e144e86d242857648e2c023607d0b029bae329ea110abffd" }, "downloads": -1, "filename": "google_patent_scraper-1.0.5.tar.gz", "has_sig": false, "md5_digest": "1fdf57bff03f2671ebf0b333962b0499", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6", "size": 4673, "upload_time": "2019-09-05T18:25:46", "url": 
"https://files.pythonhosted.org/packages/b7/43/f0af085129718d607eb73b2ac9fb54498d924493b80b4a985c37d72efe7c/google_patent_scraper-1.0.5.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "ff3c511374840a29e7fbd0ae3650ece5", "sha256": "5f6189674c9416add7154438c4535b21b9fba2d03b98558fc3cf7a852f5f55a5" }, "downloads": -1, "filename": "google_patent_scraper-1.0.5-py3-none-any.whl", "has_sig": false, "md5_digest": "ff3c511374840a29e7fbd0ae3650ece5", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3.6", "size": 5563, "upload_time": "2019-09-05T18:25:43", "url": "https://files.pythonhosted.org/packages/52/42/c831a84680f08964f56d35e9f6ff214791587d5b8fe0337d3fb655526b11/google_patent_scraper-1.0.5-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "1fdf57bff03f2671ebf0b333962b0499", "sha256": "ddc8dabe04d05cd6e144e86d242857648e2c023607d0b029bae329ea110abffd" }, "downloads": -1, "filename": "google_patent_scraper-1.0.5.tar.gz", "has_sig": false, "md5_digest": "1fdf57bff03f2671ebf0b333962b0499", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3.6", "size": 4673, "upload_time": "2019-09-05T18:25:46", "url": "https://files.pythonhosted.org/packages/b7/43/f0af085129718d607eb73b2ac9fb54498d924493b80b4a985c37d72efe7c/google_patent_scraper-1.0.5.tar.gz" } ] }