{ "info": { "author": "", "author_email": "", "bugtrack_url": null, "classifiers": [ "Development Status :: 4 - Beta", "License :: OSI Approved :: GNU General Public License (GPL)", "Programming Language :: Python :: 3", "Programming Language :: Python :: 3.6" ], "description": "[![CircleCI](https://circleci.com/gh/avlahop/dask-elk/tree/master.svg?style=svg)](https://circleci.com/gh/avlahop/dask-elk/tree/master)\n\n# dask-elk\nUse dask to fetch data from Elasticsearch in parallel by sending the request to each shard separatelly. \n\n# Table of Contents\n1. [Introduction](#introduction)\n1. [Usage](#usage)\n\n\n\n\n\n\n\n\n## Introduction \nThe library tries to imitate the functionality of the ES Hadoop plugin for spark. `dask-elk` performs a parallel read across all the target indices shards.\nIn order to achieve that it uses Elasticsearch scrolling mechanism. \n\n\n## Usage \nTo use the library and read from an index:\n\n```python\nfrom dask_elk.client import DaskElasticClient\n\n# First create a client\nclient = DaskElasticClient() # localhost Elasticsearch\n\nindex = 'my-index'\ndf = client.read(index=index, doc_type='_doc')\n```\n\nYou can even pass a query to push down to elasticsearch, so that any filtering can be done on the Elasticsearch side. Because `dask-elk` uses scroll mechanism aggregations are not supported\n```python\nfrom dask_elk.client import DaskElasticClient\n\n# First create a client\nclient = DaskElasticClient() # localhost Elasticsearch\nquery = {\n \"query\" : {\n \"term\" : { \"user\" : \"kimchy\" }\n }\n}\nindex = 'my-index'\ndf = client.read(query=query, index=index, doc_type='_doc')\n```\n\nRead documentation [here](https://dask-elk.readthedocs.io)\n\n", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/avlahop/dask-elk", "keywords": "elasticsearch dask parallel", "license": "GPLv3", "maintainer": "Apostolos Vlachopoulos", "maintainer_email": "avlahop@gmail.com", "name": "dask-elk", "package_url": "https://pypi.org/project/dask-elk/", "platform": "", "project_url": "https://pypi.org/project/dask-elk/", "project_urls": { "Documentation": "https://dask-elk.readthedocs.io", "Homepage": "https://github.com/avlahop/dask-elk" }, "release_url": "https://pypi.org/project/dask-elk/0.2.0/", "requires_dist": [ "dask[complete]", "elasticsearch (>=6.2)", "pandas (>=0.19.0)" ], "requires_python": ">=2.7", "summary": "Dask connection with Elasticsearch", "version": "0.2.0" }, "last_serial": 4831368, "releases": { "0.1.0": [ { "comment_text": "", "digests": { "md5": "a7d100f3c620b81af4c333c6210a63bc", "sha256": "3ab68c440520c030d1d081658ba95d6780b0853bb5fc52ea5a75c0c008e4fdf5" }, "downloads": -1, "filename": "dask_elk-0.1.0-py2-none-any.whl", "has_sig": false, "md5_digest": "a7d100f3c620b81af4c333c6210a63bc", "packagetype": "bdist_wheel", "python_version": "py2", "requires_python": ">=2.7", "size": 22974, "upload_time": "2018-10-19T18:56:41", "url": "https://files.pythonhosted.org/packages/a2/db/4b5935b987ede766039eed9d3da96386a17bf03484f8d6e490beb47759d3/dask_elk-0.1.0-py2-none-any.whl" }, { "comment_text": "", "digests": { "md5": "25728489f9f901ff9f7c8aec3aac5c4d", "sha256": "2c6a2dfce02d469cfe5ade27ae8fc70b14b74e4766fc6d9e0cc0ba94d76abab5" }, "downloads": -1, "filename": "dask_elk-0.1.0.tar.gz", "has_sig": false, "md5_digest": "25728489f9f901ff9f7c8aec3aac5c4d", "packagetype": "sdist", "python_version": "source", "requires_python": ">=2.7", "size": 7991, "upload_time": "2018-10-19T18:56:43", "url": "https://files.pythonhosted.org/packages/01/e9/73e144db4cd36112ddefcdee366cddea0efe098ca0fa56a40429fc17f6cd/dask_elk-0.1.0.tar.gz" } ], "0.1.1": [ { "comment_text": "", "digests": { "md5": "ab1b1f325d5f8e055a91c299325f8c4f", "sha256": "9e4ae28e16b73e68c9b5cf588b27129eea2b3fbc8a6edfe40aa129b4804003b4" }, "downloads": -1, "filename": "dask_elk-0.1.1-py2-none-any.whl", "has_sig": false, "md5_digest": "ab1b1f325d5f8e055a91c299325f8c4f", "packagetype": "bdist_wheel", "python_version": "py2", "requires_python": ">=2.7", "size": 23000, "upload_time": "2018-10-25T15:17:02", "url": "https://files.pythonhosted.org/packages/33/c9/7237ae17983edd9a29f352629794c7bec5e11cc954330478e402eb967d3c/dask_elk-0.1.1-py2-none-any.whl" }, { "comment_text": "", "digests": { "md5": "3146a2ce16159ca68a252caa8d75c4e2", "sha256": "bbd896640a8c55f156d89c3783a18221cb0bada5a02339b19ac3eb0605775c47" }, "downloads": -1, "filename": "dask_elk-0.1.1.tar.gz", "has_sig": false, "md5_digest": "3146a2ce16159ca68a252caa8d75c4e2", "packagetype": "sdist", "python_version": "source", "requires_python": ">=2.7", "size": 8847, "upload_time": "2018-10-25T15:17:04", "url": "https://files.pythonhosted.org/packages/44/08/5c264dbb10ba8950f47405cada261a00a8f10bce1baf5a847f18a3a07b85/dask_elk-0.1.1.tar.gz" } ], "0.2.0": [ { "comment_text": "", "digests": { "md5": "4cf5a17120ed50f65103b69da67c3ab0", "sha256": "727551328f0ad24fefee1b403aeb3549f5322483ba9edbb4ac1b987629d5dbb8" }, "downloads": -1, "filename": "dask_elk-0.2.0-py3-none-any.whl", "has_sig": false, "md5_digest": "4cf5a17120ed50f65103b69da67c3ab0", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=2.7", "size": 22992, "upload_time": "2019-02-17T12:15:59", "url": "https://files.pythonhosted.org/packages/4d/56/267fba69cee284d2e53f38309482c43113a7e73ec1fd5c54b6350d6ead94/dask_elk-0.2.0-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "f030028aff977b0265b5d03055952699", "sha256": "cb15ea8af623b4a1ae01d464f546b9770c773290b453e2b78ac29024c1c5b079" }, "downloads": -1, "filename": "dask_elk-0.2.0.tar.gz", "has_sig": false, "md5_digest": "f030028aff977b0265b5d03055952699", "packagetype": "sdist", "python_version": "source", "requires_python": ">=2.7", "size": 7997, "upload_time": "2019-02-17T12:16:01", "url": "https://files.pythonhosted.org/packages/b6/e0/b08c8db2846dc76ab9dc69a5c456f2f9dcc109b09a6059f6f7a090a797be/dask_elk-0.2.0.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "4cf5a17120ed50f65103b69da67c3ab0", "sha256": "727551328f0ad24fefee1b403aeb3549f5322483ba9edbb4ac1b987629d5dbb8" }, "downloads": -1, "filename": "dask_elk-0.2.0-py3-none-any.whl", "has_sig": false, "md5_digest": "4cf5a17120ed50f65103b69da67c3ab0", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=2.7", "size": 22992, "upload_time": "2019-02-17T12:15:59", "url": "https://files.pythonhosted.org/packages/4d/56/267fba69cee284d2e53f38309482c43113a7e73ec1fd5c54b6350d6ead94/dask_elk-0.2.0-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "f030028aff977b0265b5d03055952699", "sha256": "cb15ea8af623b4a1ae01d464f546b9770c773290b453e2b78ac29024c1c5b079" }, "downloads": -1, "filename": "dask_elk-0.2.0.tar.gz", "has_sig": false, "md5_digest": "f030028aff977b0265b5d03055952699", "packagetype": "sdist", "python_version": "source", "requires_python": ">=2.7", "size": 7997, "upload_time": "2019-02-17T12:16:01", "url": "https://files.pythonhosted.org/packages/b6/e0/b08c8db2846dc76ab9dc69a5c456f2f9dcc109b09a6059f6f7a090a797be/dask_elk-0.2.0.tar.gz" } ] }