{ "info": { "author": "Fedele Mantuano", "author_email": "mantuano.fedele@gmail.com", "bugtrack_url": null, "classifiers": [ "Intended Audience :: Developers", "License :: OSI Approved :: Apache Software License", "Operating System :: OS Independent", "Programming Language :: Python", "Programming Language :: Python :: 2.7", "Programming Language :: Python :: 3", "Programming Language :: Python :: 3.3", "Programming Language :: Python :: 3.4", "Programming Language :: Python :: 3.5" ], "description": "`PyPI version `__ `Build\nStatus `__\n`Coverage\nStatus `__\n`BCH compliance `__\n\ntika-app-python\n===============\n\nOverview\n--------\n\ntika-app-python is a wrapper for `Apache Tika\nApp `__. With this library you can analyze: -\nfile on disk - payload in base64 - file object (like standard input)\n\nTo use file object function you should use Apache Tika version >= 1.17.\n\nApache 2 Open Source License\n~~~~~~~~~~~~~~~~~~~~~~~~~~~~\n\ntika-app-python can be downloaded, used, and modified free of charge. It\nis available under the Apache 2 license.\n\nAuthors\n-------\n\nMain Author\n~~~~~~~~~~~\n\nFedele Mantuano (**Twitter**:\n[@fedelemantuano](https://twitter.com/fedelemantuano))\n\nInstallation\n------------\n\nClone repository\n\n::\n\n git clone https://github.com/fedelemantuano/tika-app-python.git\n\nand install tika-app-python with ``setup.py``:\n\n::\n\n cd tika-app-python\n\n python setup.py install\n\nor use ``pip``:\n\n::\n\n pip install tika-app\n\nUsage in a project\n------------------\n\nImport ``TikaApp`` class:\n\n::\n\n from tikapp import TikaApp\n\n tika_client = TikaApp(file_jar=\"/opt/tika/tika-app-1.18.jar\")\n\nFor get **content type**:\n\n::\n\n tika_client.detect_content_type(\"your_file\")\n\nFor detect **language**:\n\n::\n\n tika_client.detect_language(\"your_file\")\n\nFor detect **all metadata and content**:\n\n::\n\n tika_client.extract_all_content(\"your_file\")\n\nFor detect **only content**:\n\n::\n\n tika_client.extract_only_content(\"your_file\")\n\nFor detect **only metadata**:\n\n::\n\n tika_client.extract_only_metadata(\"your_file\")\n\nYou can analyze payload in base64 with the same methods, but passing\n``payload`` argument:\n\n::\n\n tika_client.detect_content_type(payload=\"base64_payload\")\n tika_client.detect_language(payload=\"base64_payload\")\n tika_client.extract_all_content(payload=\"base64_payload\")\n tika_client.extract_only_content(payload=\"base64_payload\")\n tika_client.extract_only_metadata(payload=\"base64_payload\")\n\nor you can analyze file object (like standard input) with the same\nmethods, but passing ``objectInput`` argument:\n\n::\n\n tika_client.detect_language(objectInput=\"objectInput\")\n tika_client.extract_all_content(objectInput=\"objectInput\")\n tika_client.extract_only_content(objectInput=\"objectInput\")\n tika_client.extract_only_metadata(objectInput=\"objectInput\")\n\nUsage from command-line\n-----------------------\n\nIf you installed tika-app-python with ``pip`` or ``setup.py`` you can\nuse it with command-line. To use tika-app-python you should submit the\nApache Tika app JAR. You can: - set the enviroment value\n``TIKA_APP_JAR`` - use ``--jar`` switch\n\nThe last one overwrite all the others.\n\nThese are all swithes:\n\n::\n\n usage: tikapp [-h] (-f FILE | -p PAYLOAD | -k) [-j JAR] [-d] [-t] [-l]\n [-m] [-a] [-v]\n\n Wrapper for Apache Tika App.\n\n optional arguments:\n -h, --help show this help message and exit\n -f FILE, --file FILE File to submit (default: None)\n -p PAYLOAD, --payload PAYLOAD\n Base64 payload to submit (default: None)\n -k, --stdin Enable parsing from stdin (default: False)\n -j JAR, --jar JAR Apache Tika app JAR (default: None)\n -d, --detect Detect document type (default: False)\n -t, --text Output plain text content (default: False)\n -l, --language Output only language (default: False)\n -m, --metadata Output only metadata (default: False)\n -a, --all Output metadata and content from all embedded files\n (default: False)\n -v, --version show program's version number and exit\n\nExample from file on disk:\n\n.. code:: shell\n\n $ tikapp -f example_file -a\n\nExample from standard input\n\n.. code:: shell\n\n $ tikapp -a -k < example_file\n\nPerformance tests\n-----------------\n\nThese are the results of performance tests in\n`tests `__\nfolder:\n\n::\n\n (Python 2)\n tika_content_type() 0.704840 sec\n tika_detect_language() 1.592066 sec\n magic_content_type() 0.000215 sec\n tika_extract_all_content() 0.816366 sec\n tika_extract_only_content() 0.788667 sec\n\n (Python 3)\n tika_content_type() 0.698357 sec\n tika_detect_language() 1.593452 sec\n magic_content_type() 0.000226 sec\n tika_extract_all_content() 0.785915 sec\n tika_extract_only_content() 0.766517 sec", "description_content_type": "", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/fedelemantuano/tika-app-python", "keywords": "tika", "license": "Apache License, Version 2.0", "maintainer": "", "maintainer_email": "", "name": "tika-app", "package_url": "https://pypi.org/project/tika-app/", "platform": "Linux", "project_url": "https://pypi.org/project/tika-app/", "project_urls": { "Homepage": "https://github.com/fedelemantuano/tika-app-python" }, "release_url": "https://pypi.org/project/tika-app/1.5.0/", "requires_dist": null, "requires_python": "", "summary": "Python client for Apache Tika App", "version": "1.5.0" }, "last_serial": 4190136, "releases": { "0.5.5": [ { "comment_text": "", "digests": { "md5": "d96954de5b1feec909ab6d847e49a2d8", "sha256": "dfad680a60dd0257b861847c8ca6b38e87ed07ba533d92c620a02f41a370d190" }, "downloads": -1, "filename": "tika-app-0.5.5.tar.gz", "has_sig": false, "md5_digest": "d96954de5b1feec909ab6d847e49a2d8", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5474, "upload_time": "2016-11-24T01:10:16", "url": "https://files.pythonhosted.org/packages/f3/38/841d04c8a1ab8dfb69a447b81e97c7dbd8a9fbb890a1de5833662e807b25/tika-app-0.5.5.tar.gz" } ], "1.0.0": [ { "comment_text": "", "digests": { "md5": "82971dea27a9d006a994026e5c10e364", "sha256": "9b5134a62041d5906641f4324feafcb08442c902128351cca616b0336a361689" }, "downloads": -1, "filename": "tika-app-1.0.0.tar.gz", "has_sig": false, "md5_digest": "82971dea27a9d006a994026e5c10e364", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5666, "upload_time": "2017-02-17T20:02:50", "url": "https://files.pythonhosted.org/packages/65/26/29f4721dd5b28415a02db844ed719505ee1cb63eefa7b042f1c0bee75084/tika-app-1.0.0.tar.gz" } ], "1.0.0.dev0": [ { "comment_text": "", "digests": { "md5": "2be5eb1a2cec0d94c25ff41f89ac401f", "sha256": "a42ab91b75d31920df8f41514edcec1e4fa4993fef6e8f8df2cb3b119af39261" }, "downloads": -1, "filename": "tika-app-1.0.0.dev0.tar.gz", "has_sig": false, "md5_digest": "2be5eb1a2cec0d94c25ff41f89ac401f", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5716, "upload_time": "2017-02-17T18:09:30", "url": "https://files.pythonhosted.org/packages/1a/3f/1b89f37f6c2df81593a25b3ae9f50e2eebb92268b7b0bc226fc52c433a92/tika-app-1.0.0.dev0.tar.gz" } ], "1.0.1": [ { "comment_text": "", "digests": { "md5": "1a64d733429038b2eb8bc4ccebaf22ba", "sha256": "b5c9afe69933417c8ccb8e2e926b7618b3b84ba8f700319bc811c5c0bfa560ae" }, "downloads": -1, "filename": "tika-app-1.0.1.tar.gz", "has_sig": false, "md5_digest": "1a64d733429038b2eb8bc4ccebaf22ba", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5757, "upload_time": "2017-02-23T16:36:43", "url": "https://files.pythonhosted.org/packages/44/41/a1739e4e10285c7a61ea3074829a321d90a660d3519fc87089d99962ae18/tika-app-1.0.1.tar.gz" } ], "1.1.0": [ { "comment_text": "", "digests": { "md5": "ea325527fcdb2cad6faf4b46e786ea39", "sha256": "bb3917de94d0769532bd5794ee7ef695af9f91e9a8e91ff3e3af0876bde02530" }, "downloads": -1, "filename": "tika-app-1.1.0.tar.gz", "has_sig": false, "md5_digest": "ea325527fcdb2cad6faf4b46e786ea39", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5958, "upload_time": "2017-03-17T16:06:17", "url": "https://files.pythonhosted.org/packages/a2/7f/70d8f04be4f36450fe4413f2046702f12d1b4ab9df8304eb337fbf9332f6/tika-app-1.1.0.tar.gz" } ], "1.1.1": [ { "comment_text": "", "digests": { "md5": "2399a597185fffb2bcbaa14772942d93", "sha256": "e93f3782e54eb92b52381507d44cf3afa4714025e375fab6255190df4faf19ac" }, "downloads": -1, "filename": "tika-app-1.1.1.tar.gz", "has_sig": false, "md5_digest": "2399a597185fffb2bcbaa14772942d93", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 6674, "upload_time": "2017-06-25T13:36:23", "url": "https://files.pythonhosted.org/packages/32/34/98986b4fcf0c782d9f4ba5f7ab9a062ac0a35b72483e903c3fd0df697f5a/tika-app-1.1.1.tar.gz" } ], "1.2.0": [ { "comment_text": "", "digests": { "md5": "f7c56f551bf400447b8cf980758e8f46", "sha256": "7c81d67dd0641eb568b7842ceb15317bcf2b89f1d5d0667d3e551d6b696f43d3" }, "downloads": -1, "filename": "tika-app-1.2.0.tar.gz", "has_sig": false, "md5_digest": "f7c56f551bf400447b8cf980758e8f46", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 6761, "upload_time": "2017-12-05T20:31:00", "url": "https://files.pythonhosted.org/packages/e8/e0/23ec99ce04e5db29bf682d6eb7b9997d78e987a7c6972a0726f8e5f802a0/tika-app-1.2.0.tar.gz" } ], "1.3.0": [ { "comment_text": "", "digests": { "md5": "a400835f6702f3683a627ab97c49c6a8", "sha256": "b51cdc59fe4766f134a655df34db492451799eb290515aef7a610525e64dd46b" }, "downloads": -1, "filename": "tika-app-1.3.0.tar.gz", "has_sig": false, "md5_digest": "a400835f6702f3683a627ab97c49c6a8", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 6746, "upload_time": "2017-12-05T23:39:45", "url": "https://files.pythonhosted.org/packages/dc/4c/3b9fe852cf737a6342d5e5bc4bd8ae89868f82143db0d53703af7211900b/tika-app-1.3.0.tar.gz" } ], "1.4.0": [ { "comment_text": "", "digests": { "md5": "47a52f7e3febc75781335827eb59337e", "sha256": "e80dfe2ba07bf104562e72853e9386982bcd0aee6cf553851ebd0071a09ecf2a" }, "downloads": -1, "filename": "tika-app-1.4.0.tar.gz", "has_sig": false, "md5_digest": "47a52f7e3febc75781335827eb59337e", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 7378, "upload_time": "2018-07-24T10:09:20", "url": "https://files.pythonhosted.org/packages/cd/b7/9a5b754b71075feeb17ed1a4c97b520128aa3048f24f4fd51e0db1ab363e/tika-app-1.4.0.tar.gz" } ], "1.5.0": [ { "comment_text": "", "digests": { "md5": "2fa7ba7ee9a32d94b78e1b9fddf977bf", "sha256": "4a9735fc602c13a61a12b99a2b346fe16aa76aba2443c2fec677f97e9dfb6294" }, "downloads": -1, "filename": "tika-app-1.5.0.tar.gz", "has_sig": false, "md5_digest": "2fa7ba7ee9a32d94b78e1b9fddf977bf", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 7476, "upload_time": "2018-08-20T21:34:09", "url": "https://files.pythonhosted.org/packages/9d/35/a8d799c0dea262181a936dd8c6d8e6095a943bfccf147a16c4e19a101a32/tika-app-1.5.0.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "2fa7ba7ee9a32d94b78e1b9fddf977bf", "sha256": "4a9735fc602c13a61a12b99a2b346fe16aa76aba2443c2fec677f97e9dfb6294" }, "downloads": -1, "filename": "tika-app-1.5.0.tar.gz", "has_sig": false, "md5_digest": "2fa7ba7ee9a32d94b78e1b9fddf977bf", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 7476, "upload_time": "2018-08-20T21:34:09", "url": "https://files.pythonhosted.org/packages/9d/35/a8d799c0dea262181a936dd8c6d8e6095a943bfccf147a16c4e19a101a32/tika-app-1.5.0.tar.gz" } ] }