{ "info": { "author": "Jason Alan Palmer", "author_email": "jalanpalmer@gmail.com", "bugtrack_url": null, "classifiers": [ "Programming Language :: Python :: 2", "Programming Language :: Python :: 3" ], "description": "# pdftotext\n\n[![PyPI Status](https://img.shields.io/pypi/v/pdftotext.svg)](https://pypi.python.org/pypi/pdftotext)\n[![Build Status](https://travis-ci.org/jalan/pdftotext.svg?branch=master)](https://travis-ci.org/jalan/pdftotext)\n[![Coverage Status](https://coveralls.io/repos/github/jalan/pdftotext/badge.svg?branch=master)](https://coveralls.io/github/jalan/pdftotext?branch=master)\n[![Downloads](https://img.shields.io/pypi/dm/pdftotext.svg)](https://pypistats.org/packages/pdftotext)\n\nSimple PDF text extraction\n\n```python\nimport pdftotext\n\n# Load your PDF\nwith open(\"lorem_ipsum.pdf\", \"rb\") as f:\n    pdf = pdftotext.PDF(f)\n\n# If it's password-protected\nwith open(\"secure.pdf\", \"rb\") as f:\n    pdf = pdftotext.PDF(f, \"secret\")\n\n# How many pages?\nprint(len(pdf))\n\n# Iterate over all the pages\nfor page in pdf:\n    print(page)\n\n# Read some individual pages\nprint(pdf[0])\nprint(pdf[1])\n\n# Read all the text into one string\nprint(\"\\n\\n\".join(pdf))\n```\n\n\n## OS Dependencies\n\nDebian, Ubuntu, and friends:\n\n```\nsudo apt-get update\nsudo apt-get install build-essential libpoppler-cpp-dev pkg-config python-dev\n```\n\nFedora, Red Hat, and friends:\n\n```\nsudo yum install gcc-c++ pkgconfig poppler-cpp-devel python-devel redhat-rpm-config\n```\n\nmacOS:\n\n```\nbrew install pkg-config poppler\n```\n\nConda users may also need `libgcc`:\n\n```\nconda install libgcc\n```\n\n## Install\n\n```\npip install pdftotext\n```", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/jalan/pdftotext", "keywords": "", "license": "MIT", "maintainer": "", "maintainer_email": "", "name": "pdftotext", "package_url":
"https://pypi.org/project/pdftotext/", "platform": "", "project_url": "https://pypi.org/project/pdftotext/", "project_urls": { "Homepage": "https://github.com/jalan/pdftotext" }, "release_url": "https://pypi.org/project/pdftotext/2.1.2/", "requires_dist": null, "requires_python": "", "summary": "Simple PDF text extraction", "version": "2.1.2" }, "last_serial": 5642812, "releases": { "1.0.0": [ { "comment_text": "", "digests": { "md5": "915ecc611cfdfa188042afb718eb45a8", "sha256": "dc645669e88f138ca4a3f7d1e32e0974d4f051da5dffbbabad7cad62dba7bfc4" }, "downloads": -1, "filename": "pdftotext-1.0.0.tar.gz", "has_sig": false, "md5_digest": "915ecc611cfdfa188042afb718eb45a8", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 49226, "upload_time": "2017-06-10T20:56:16", "url": "https://files.pythonhosted.org/packages/d2/cf/b65a7431b1c0eaa54063823d6f083ecd9a2a879646af3d243eb09bc8d0b6/pdftotext-1.0.0.tar.gz" } ], "1.1.0": [ { "comment_text": "", "digests": { "md5": "4c65845ef082fa2b12029a78876e53f6", "sha256": "11536c3b91ea3fcdf9466109f304b897a4c49d25390c3ab231ec70acf6d5786d" }, "downloads": -1, "filename": "pdftotext-1.1.0.tar.gz", "has_sig": false, "md5_digest": "4c65845ef082fa2b12029a78876e53f6", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 49566, "upload_time": "2017-07-18T01:04:14", "url": "https://files.pythonhosted.org/packages/68/62/588c9a70a6c4acc6ec4fbc3c096a99032920c4965c4b231745bbd6e7d9f9/pdftotext-1.1.0.tar.gz" } ], "2.0.0": [ { "comment_text": "", "digests": { "md5": "cb52180a0636f38bce7b5bb45d9b2472", "sha256": "65b78d9c8858b1e98d6b89a6c83e6dfcdaf0bde014dca89cfe10ced7dc38fc39" }, "downloads": -1, "filename": "pdftotext-2.0.0.tar.gz", "has_sig": false, "md5_digest": "cb52180a0636f38bce7b5bb45d9b2472", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 79866, "upload_time": "2017-07-23T19:55:26", "url": 
"https://files.pythonhosted.org/packages/16/e3/3b432ac931a7b360899686312145ddc9d0ce7b5ac610eba43c3920937ced/pdftotext-2.0.0.tar.gz" } ], "2.0.1": [ { "comment_text": "", "digests": { "md5": "b3e45deee054dcc42ac2e6c1cec2dc15", "sha256": "947359bde2f1d158d685414dcfb604a369f3fcee49012baf717a71816045d8ef" }, "downloads": -1, "filename": "pdftotext-2.0.1.tar.gz", "has_sig": false, "md5_digest": "b3e45deee054dcc42ac2e6c1cec2dc15", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 109828, "upload_time": "2017-08-10T01:17:40", "url": "https://files.pythonhosted.org/packages/38/99/f247b4b27276f2789744900931d40e312564198a9412e653f3f1bb17def6/pdftotext-2.0.1.tar.gz" } ], "2.0.2": [ { "comment_text": "", "digests": { "md5": "2658a8827afc08e44eba314759ac2d35", "sha256": "0b82a9fd255a3f2bf5c861cf9e3174d3c4223e1e441bb060c611dcb4e65c6cb8" }, "downloads": -1, "filename": "pdftotext-2.0.2.tar.gz", "has_sig": false, "md5_digest": "2658a8827afc08e44eba314759ac2d35", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 110091, "upload_time": "2018-02-20T02:45:46", "url": "https://files.pythonhosted.org/packages/05/7c/67643e2c10e99f1ac27eadfd9a3d4e13b920c2e922906aad87152cdc154d/pdftotext-2.0.2.tar.gz" } ], "2.1.0": [ { "comment_text": "", "digests": { "md5": "8fd132b4b2873a3e41578c1036280bb2", "sha256": "b7312302007e19fc784263a321b41682f01a582af84e14200cef53b3f4e69a50" }, "downloads": -1, "filename": "pdftotext-2.1.0.tar.gz", "has_sig": false, "md5_digest": "8fd132b4b2873a3e41578c1036280bb2", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 111900, "upload_time": "2018-05-31T04:01:53", "url": "https://files.pythonhosted.org/packages/96/41/aa31f4a6809eb0574674d6c0cf6bc0e00aaf0ea53c62db8a2d9af50b7cc6/pdftotext-2.1.0.tar.gz" } ], "2.1.1": [ { "comment_text": "", "digests": { "md5": "be525c7a29ce6b1fad1bd8285ba906b2", "sha256": 
"e3ad11efe0aa22cbfc46aa1296b2ea5a52ad208b778288311f2801adef178ccb" }, "downloads": -1, "filename": "pdftotext-2.1.1.tar.gz", "has_sig": false, "md5_digest": "be525c7a29ce6b1fad1bd8285ba906b2", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 112921, "upload_time": "2018-10-07T19:51:39", "url": "https://files.pythonhosted.org/packages/21/35/60094dbadd9de2035873390b1cac25e01da605844eba6a07a53a82fa4adc/pdftotext-2.1.1.tar.gz" } ], "2.1.2": [ { "comment_text": "", "digests": { "md5": "8dfdefaafd94b7f4a3073bb35fdc5c4f", "sha256": "c8bdc47b08baa17b8e03ba1f960fc6335b183d2644eaf7300e088516758a6090" }, "downloads": -1, "filename": "pdftotext-2.1.2.tar.gz", "has_sig": false, "md5_digest": "8dfdefaafd94b7f4a3073bb35fdc5c4f", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 113324, "upload_time": "2019-08-07T02:42:12", "url": "https://files.pythonhosted.org/packages/a6/a7/c202adb0bcd3adc3030b0c5f7f0e21f62a721913e93296e6c4ddc305cbd3/pdftotext-2.1.2.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "8dfdefaafd94b7f4a3073bb35fdc5c4f", "sha256": "c8bdc47b08baa17b8e03ba1f960fc6335b183d2644eaf7300e088516758a6090" }, "downloads": -1, "filename": "pdftotext-2.1.2.tar.gz", "has_sig": false, "md5_digest": "8dfdefaafd94b7f4a3073bb35fdc5c4f", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 113324, "upload_time": "2019-08-07T02:42:12", "url": "https://files.pythonhosted.org/packages/a6/a7/c202adb0bcd3adc3030b0c5f7f0e21f62a721913e93296e6c4ddc305cbd3/pdftotext-2.1.2.tar.gz" } ] }