{ "info": { "author": "Xavier Dupr\u00e9", "author_email": "xavier.dupre@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 5 - Production/Stable", "Intended Audience :: Developers", "License :: OSI Approved :: MIT License", "Programming Language :: Python :: 3", "Topic :: Education", "Topic :: Scientific/Engineering" ], "description": "\n.. _l-README:\n\nREADME\n======\n\n.. image:: https://travis-ci.org/sdpython/pandas_streaming.svg?branch=master\n :target: https://travis-ci.org/sdpython/pandas_streaming\n :alt: Build status\n\n.. image:: https://ci.appveyor.com/api/projects/status/4te066r8ne1ymmhy?svg=true\n :target: https://ci.appveyor.com/project/sdpython/pandas-streaming\n :alt: Build Status Windows\n\n.. image:: https://circleci.com/gh/sdpython/pandas_streaming/tree/master.svg?style=svg\n :target: https://circleci.com/gh/sdpython/pandas_streaming/tree/master\n\n.. image:: https://badge.fury.io/py/pandas_streaming.svg\n :target: http://badge.fury.io/py/pandas_streaming\n\n.. image:: https://img.shields.io/badge/license-MIT-blue.svg\n :alt: MIT License\n :target: http://opensource.org/licenses/MIT\n\n.. image:: https://requires.io/github/sdpython/pandas_streaming/requirements.svg?branch=master\n :target: https://requires.io/github/sdpython/pandas_streaming/requirements/?branch=master\n :alt: Requirements Status\n\n.. image:: https://codecov.io/github/sdpython/pandas_streaming/coverage.svg?branch=master\n :target: https://codecov.io/github/sdpython/pandas_streaming?branch=master\n\n.. image:: http://img.shields.io/github/issues/sdpython/pandas_streaming.png\n :alt: GitHub Issues\n :target: https://github.com/sdpython/pandas_streaming/issues\n\n.. image:: https://badge.waffle.io/sdpython/pandas_streaming.png?label=to%20do&title=to%20do\n :alt: Waffle\n :target: https://waffle.io/sdpython/pandas_streaming\n\n.. image:: http://www.xavierdupre.fr/app/pandas_streaming/helpsphinx/_images/nbcov.png\n :target: http://www.xavierdupre.fr/app/pandas_streaming/helpsphinx/all_notebooks_coverage.html\n :alt: Notebook Coverage\n\n.. image:: https://api.codacy.com/project/badge/Grade/f53b7f4d6a0447aa9ce0c4ad5df659ef\n :target: https://www.codacy.com/app/sdpython/pandas_streaming?utm_source=github.com&utm_medium=referral&utm_content=sdpython/pandas_streaming&utm_campaign=Badge_Grade\n\n`pandas_streaming `_\naims at processing big files with `pandas `_,\ntoo big to hold in memory, too small to be parallelized with a significant gain.\nThe module replicates a subset of `pandas `_ API\nand implements other functionalities for machine learning.\n\n::\n\n from pandas_streaming.df import StreamingDataFrame\n sdf = StreamingDataFrame.read_csv(\"filename\", sep=\"\\t\", encoding=\"utf-8\")\n\n for df in sdf:\n # process this chunk of data\n # df is a dataframe\n print(df)\n\nThe module can also stream an existing dataframe.\n\n::\n\n import pandas\n df = pandas.DataFrame([dict(cf=0, cint=0, cstr=\"0\"),\n dict(cf=1, cint=1, cstr=\"1\"),\n dict(cf=3, cint=3, cstr=\"3\")])\n\n from pandas_streaming.df import StreamingDataFrame\n sdf = StreamingDataFrame.read_df(df)\n\n for df in sdf:\n # process this chunk of data\n # df is a dataframe\n print(df)\n\n**Links:**\n\n* `GitHub/pandas_streaming `_\n* `documentation `_\n* `Blog `_\n\n.. _l-HISTORY:\n\n=======\nHistory\n=======\n\ncurrent - 2018-05-17 - 0.00Mb\n=============================\n\n* `6`: add pandas_groupby_nan from pyensae (2018-05-17)\n\n0.1.66 - 2018-02-05 - 0.02Mb\n============================\n\n* `5`: add random_state parameter to splitting functions (2018-02-04)\n* `2`: add method sample, resevoir sampling (2017-11-05)\n* `3`: method train_test_split for out-of-memory datasets (2017-10-21)\n* `1`: Excited for your project (2017-10-10)\n", "description_content_type": "", "docs_url": null, "download_url": "https://github.com/sdpython/pandas_streaming/", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "http://www.xavierdupre.fr/app/pandas_streaming/helpsphinx/index.html", "keywords": "pandas_streaming", "license": "MIT", "maintainer": "", "maintainer_email": "", "name": "pandas_streaming", "package_url": "https://pypi.org/project/pandas_streaming/", "platform": "", "project_url": "https://pypi.org/project/pandas_streaming/", "project_urls": { "Download": "https://github.com/sdpython/pandas_streaming/", "Homepage": "http://www.xavierdupre.fr/app/pandas_streaming/helpsphinx/index.html" }, "release_url": "https://pypi.org/project/pandas_streaming/0.1.87/", "requires_dist": null, "requires_python": "", "summary": "Streaming operations with pandas.", "version": "0.1.87" }, "last_serial": 3871471, "releases": { "0.1.66": [ { "comment_text": "", "digests": { "md5": "9cb041a6a4a0a6be676c16b31017e201", "sha256": "f8eb4c7daab9a38b061becd1604b29379c5984502dba850e0f886f3f6c539cdc" }, "downloads": -1, "filename": "pandas_streaming-0.1.66-py3-none-any.whl", "has_sig": false, "md5_digest": "9cb041a6a4a0a6be676c16b31017e201", "packagetype": "bdist_wheel", "python_version": "3.6", "requires_python": null, "size": 24832, "upload_time": "2018-02-05T10:11:40", "url": "https://files.pythonhosted.org/packages/b7/50/ac818d9c27c16d84a49f73947799122afabc5c4f8551967bc64d6ffcb967/pandas_streaming-0.1.66-py3-none-any.whl" } ], "0.1.87": [ { "comment_text": "", "digests": { "md5": "d023860b96235060bfae4fa6568d10ff", "sha256": "f54c74252f8613c8ffe28811b15d9214cb2dc5367862e846bfce39247a673a9c" }, "downloads": -1, "filename": "pandas_streaming-0.1.87-py3-none-any.whl", "has_sig": false, "md5_digest": "d023860b96235060bfae4fa6568d10ff", "packagetype": "bdist_wheel", "python_version": "3.6", "requires_python": null, "size": 25499, "upload_time": "2018-05-17T08:45:48", "url": "https://files.pythonhosted.org/packages/29/4a/b29c53aad11c6d7607d0423ae6f909296c03da20c7cb862355198ac9852f/pandas_streaming-0.1.87-py3-none-any.whl" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "d023860b96235060bfae4fa6568d10ff", "sha256": "f54c74252f8613c8ffe28811b15d9214cb2dc5367862e846bfce39247a673a9c" }, "downloads": -1, "filename": "pandas_streaming-0.1.87-py3-none-any.whl", "has_sig": false, "md5_digest": "d023860b96235060bfae4fa6568d10ff", "packagetype": "bdist_wheel", "python_version": "3.6", "requires_python": null, "size": 25499, "upload_time": "2018-05-17T08:45:48", "url": "https://files.pythonhosted.org/packages/29/4a/b29c53aad11c6d7607d0423ae6f909296c03da20c7cb862355198ac9852f/pandas_streaming-0.1.87-py3-none-any.whl" } ] }