{ "info": { "author": "Turulomio", "author_email": "turulomio@yahoo.es", "bugtrack_url": null, "classifiers": [ "Development Status :: 4 - Beta", "Intended Audience :: System Administrators", "License :: OSI Approved :: GNU General Public License v3 (GPLv3)", "Programming Language :: Python :: 3", "Topic :: System :: Systems Administration" ], "description": "What is pdf2odt\n===============\n\nIt's a script to convert pdf to LibreOffice Writer document. Pdf pages are converted as images. It uses pdftoppm from poppler to make conversion\n\nLinks\n=====\n\nProject main page\n https://github.com/turulomio/pdf2odt/\n\nDoxygen documentation:\n http://turulomio.users.sourceforge.net/doxygen/pdf2odt/\n\nPypi web page:\n https://pypi.org/project/pdf2odt/\n\nInstallation and use in Linux\n=============================\n\nIf you use Gentoo you can find a ebuild in https://github.com/Turulomio/myportage/tree/master/dev-python/pdf2odt\n\nTo install in other distributions, you must have poppler installed to use pdftoppm command. You can use your distribution package manager\n\nThen just type:\n\n`pip install pdf2odt`\n\nOnce installed you can use it typing:\n\n`pdf2odt --pdf doc.pdf doc.odt`\n\nIf you want OCR, you have to install tesseract application then you have to run \n\n`pdf2odt --pdf doc.pdf --tesseract doc.odt`\n\nInstallation and use in Windows\n===============================\n\nYou need python installed. It works with the latest version. Don't forget to add python executables to PATH, marking it in the installation process.\n\nThen just type:\n\n`pip install pdf2odt`\n\nNow you have to download poppler for windows from https://blog.alivate.com.au/poppler-windows/. Uncompress the downloaded file and add its installation directory to Windows environment path. Here you have how to do it https://www.architectryan.com/2018/03/17/add-to-the-path-on-windows-10/ \n\n\nNow you can use it typing in windows shell:\n\n`pdf2odt --pdf doc.pdf doc.odt`\n\nIf you want OCR, ou have to download tesseract for windows fromm https://github.com/UB-Mannheim/tesseract/wiki. Then you have to add its installation directory to Windows environment path too.\n\n`pdf2odt --pdf doc.pdf --tesseract doc.odt`\n\n\nDependencies\n============\n* https://www.python.org/, as the main programming language.\n* https://pypi.org/project/colorama/, to give console colors.\n* https://pypi.org/project/pillow/, to manage png images.\n* https://github.com/turulomio/officegenerator/, to generate odt file.\n* https://poppler.freedesktop.org/, to convert pdf to images using pdftoppm.\n* https://blog.alivate.com.au/poppler-windows/ to install poppler in windows.\n* https://pypi.org/project/tqdm, to show beautyful progress bars.\n* https://github.com/tesseract-ocr/, for OCR support.\n\nChangelog\n=========\n0.6.0\n-----\n * Tesseract language is now showed in output\n * Now pdf2odt validates PDF document\n\n0.5.0\n-----\n * Now pdf2odt detects if tesseract language selected is supported.\n\n0.4.0\n-----\n * Added OCR support with tesseract\n * Now uses process concurrency and shows a progress bar\n\n0.3.0\n-----\n * Fixed problem with white spaces paths in windows.\n * Improved metadata information.\n\n0.2.0\n-----\n * Now works on Windows with popper for windows installation\n\n0.1.0\n-----\n * Basic functionality\n", "description_content_type": "", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/Turulomio/pdf2odt", "keywords": "change permissions ownner files directories", "license": "GPL-3", "maintainer": "", "maintainer_email": "", "name": "pdf2odt", "package_url": "https://pypi.org/project/pdf2odt/", "platform": "", "project_url": "https://pypi.org/project/pdf2odt/", "project_urls": { "Homepage": "https://github.com/Turulomio/pdf2odt" }, "release_url": "https://pypi.org/project/pdf2odt/0.6/", "requires_dist": null, "requires_python": "", "summary": "Change files and directories permisions and owner recursivily from current directory", "version": "0.6" }, "last_serial": 5042249, "releases": { "0.1.0": [ { "comment_text": "", "digests": { "md5": "323e1631a9b7c66b4b97572886fba44c", "sha256": "81f8436d3b9abaef28e81c316098d5268fc000d3485f099eef2474615d2d53be" }, "downloads": -1, "filename": "pdf2odt-0.1.0.tar.gz", "has_sig": false, "md5_digest": "323e1631a9b7c66b4b97572886fba44c", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 56215, "upload_time": "2019-03-25T22:38:09", "url": "https://files.pythonhosted.org/packages/71/9a/aecc875d821b549249d4c2914772ad98bb6a7cd5a1264a6ae5b05649ced5/pdf2odt-0.1.0.tar.gz" } ], "0.2.0": [ { "comment_text": "", "digests": { "md5": "ec9c5675852448299422b9d22d5227ba", "sha256": "b0d27d4234fead2df8332a394b04e140c193c6a556a83875f9dff2a97656d05f" }, "downloads": -1, "filename": "pdf2odt-0.2.0.tar.gz", "has_sig": false, "md5_digest": "ec9c5675852448299422b9d22d5227ba", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 56877, "upload_time": "2019-03-26T17:05:22", "url": "https://files.pythonhosted.org/packages/7c/20/a71052df72038ff533c8ad1b9d163ac99b0576059b2ac8ee879835ff7642/pdf2odt-0.2.0.tar.gz" } ], "0.3.0": [ { "comment_text": "", "digests": { "md5": "469c0d122817a426ba5e3f639930a461", "sha256": "7cb6bf7e08d32454770bbe46314346ceb50d4d45437ed97f9b01359bc95cd3eb" }, "downloads": -1, "filename": "pdf2odt-0.3.0.tar.gz", "has_sig": false, "md5_digest": "469c0d122817a426ba5e3f639930a461", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 57190, "upload_time": "2019-03-28T07:04:51", "url": "https://files.pythonhosted.org/packages/fd/72/e9ee761fd59eaa35c8a6aa6f6f6ba8cbd53b87a8919a9b69175cf0344fe3/pdf2odt-0.3.0.tar.gz" } ], "0.4.0": [ { "comment_text": "", "digests": { "md5": "4275492f26d5fcbd7fdb3e2fb47d6277", "sha256": "6b04876f80c52a03a47f30d62c235260ad5f5a24de111c62743577ef3e75ea19" }, "downloads": -1, "filename": "pdf2odt-0.4.0.tar.gz", "has_sig": false, "md5_digest": "4275492f26d5fcbd7fdb3e2fb47d6277", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 58366, "upload_time": "2019-03-31T21:57:15", "url": "https://files.pythonhosted.org/packages/d0/55/4224f1e469047ea7f79b4fe6f7762adc660a33b7f086516c639e8f15de93/pdf2odt-0.4.0.tar.gz" } ], "0.5": [ { "comment_text": "", "digests": { "md5": "af419f5a6f5bd6fd5a525643822547fd", "sha256": "737117f0e0a994d13e49dfa0d7cc778cd579de0364a3d7cf09947db0643b92f5" }, "downloads": -1, "filename": "pdf2odt-0.5.tar.gz", "has_sig": false, "md5_digest": "af419f5a6f5bd6fd5a525643822547fd", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 59217, "upload_time": "2019-04-01T20:01:33", "url": "https://files.pythonhosted.org/packages/38/f7/be898cb9b22fe90af2966a4db443ec4d4d5d2464662845662be2f92e1976/pdf2odt-0.5.tar.gz" } ], "0.6": [ { "comment_text": "", "digests": { "md5": "a299a68b1afd6541c2e0d61ea7f50c51", "sha256": "c3d4a975e67ff36398b8c9532e07cb223d1ccc321b75b72e0d310cc94745c772" }, "downloads": -1, "filename": "pdf2odt-0.6.tar.gz", "has_sig": false, "md5_digest": "a299a68b1afd6541c2e0d61ea7f50c51", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 59612, "upload_time": "2019-04-02T06:16:40", "url": "https://files.pythonhosted.org/packages/63/be/67bbf8e83bd61655e29037221628e87178302b7cf4b91530a326ff402961/pdf2odt-0.6.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "a299a68b1afd6541c2e0d61ea7f50c51", "sha256": "c3d4a975e67ff36398b8c9532e07cb223d1ccc321b75b72e0d310cc94745c772" }, "downloads": -1, "filename": "pdf2odt-0.6.tar.gz", "has_sig": false, "md5_digest": "a299a68b1afd6541c2e0d61ea7f50c51", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 59612, "upload_time": "2019-04-02T06:16:40", "url": "https://files.pythonhosted.org/packages/63/be/67bbf8e83bd61655e29037221628e87178302b7cf4b91530a326ff402961/pdf2odt-0.6.tar.gz" } ] }