{ "info": { "author": "Yifei Huang", "author_email": "yifei.huang@gmail.com", "bugtrack_url": null, "classifiers": [ "License :: OSI Approved :: MIT License", "Operating System :: OS Independent", "Programming Language :: Python :: 3" ], "description": "# Pandas Data Exploration Utility Package\n\n## Table of contents\n * [Overview](#overview)\n * [Installation](#installation)\n * [Usage](#usage)\n \t+ [Visualization Module](#visualization-module)\n\t + [Pareto plot](#pareto-plot)\n\t + [Distribution plot](#distribution-plot)\n\t + [X-Y plot](#x-y-plot)\n * [Recommended development setup](#recommended-development-setup)\n\n## Overview\nPandas Data Exploration Utility is an interactive, notebook-based library for quickly profiling a dataset and exploring the shape of the data and the relationships within it. Using existing APIs from ipywidgets, Plot.ly, and Pandas, it creates a flexible point-and-click widget that lets the user explore and visualize the dataset. \nThis is a work in progress, and I welcome any suggestions on features and/or enhancements.\n\n## Installation\n```\npip install Pandas-Data-Exploration-Utility-Package\n```\n\n## Usage\n\n### Visualization Module\n```\nimport pandas as pd\nimport pandas_exploration_util.viz.explore as pe\n\n# Load the sample dataset, parsing the first column as dates\nglobal_temp = pd.read_csv(\"./data/GlobalTemperatures.csv\", parse_dates = [0], infer_datetime_format=True)\n\n# Render the interactive exploration widget in the notebook\npe.generate_widget(global_temp)\n```\nSee `/test` for sample data and a test Jupyter notebook: \nhttps://github.com/yifeihuang/pandas_exploration_util/tree/master/test\n\n***\n#### Pareto plot\nVisualize the top values of any column, ranked by an aggregation of any other column. Supported aggregation functions include `'count', 'sum', 'mean', 'std', 'max', 'min', 'uniques'`\n


\n\n#### Distribution plot\nVisualize the distribution of any numerical column. Binning is determined automatically by the Plot.ly histogram method.\n


\n\n#### X-Y plot\nVisualize an X-Y scatter of any column vs. an aggregation of any other column. Supported aggregation functions include `'count', 'sum', 'mean', 'std', 'max', 'min', 'uniques'`\n


\n\n\n## Recommended development setup\n\n### Local Dev\n1. Set up virtualenv\n2. Create a virtual environment using `virtualenv /path/to/env/dir`\n3. Activate the virtual environment using `source /path/to/env/dir/bin/activate`\n4. Clone the repo locally\n5. Navigate to the root directory of the repo, where `setup.py` lives\n6. Install the module in development mode using `python setup.py develop`\n7. Run the Jupyter notebook from the virtual environment directory; it should have been installed as one of the module's dependencies\n8. Dev away\n9. When done, uninstall the package using `python setup.py develop --uninstall`\n10. Deactivate the environment using `deactivate`\n\n### Building and distributing\nhttps://packaging.python.org/tutorials/packaging-projects/ \nAssuming all relevant tools are installed and the relevant project files are properly defined:\n1. Build the distribution using `python3 setup.py sdist bdist_wheel`\n2. Upload the distribution using `twine upload dist/*{version}*`\n\n", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/yifeihuang/pandas_exploration_util", "keywords": "", "license": "", "maintainer": "", "maintainer_email": "", "name": "Pandas-Data-Exploration-Utility-Package", "package_url": "https://pypi.org/project/Pandas-Data-Exploration-Utility-Package/", "platform": "", "project_url": "https://pypi.org/project/Pandas-Data-Exploration-Utility-Package/", "project_urls": { "Homepage": "https://github.com/yifeihuang/pandas_exploration_util" }, "release_url": "https://pypi.org/project/Pandas-Data-Exploration-Utility-Package/0.0.3/", "requires_dist": [ "pandas", "plotly", "ipywidgets", "IPython" ], "requires_python": "", "summary": "Utility functions to help with exploratory data analysis on top the Pandas APIs", "version": "0.0.3" }, "last_serial": 4224391, "releases": { "0.0.1": [ { "comment_text": 
"", "digests": { "md5": "0fb906fb1df7a6b09fe2fce52e44c84a", "sha256": "660164551bacb186951ca5152d40900387389eba23e31efc94fbb42ec79b38a5" }, "downloads": -1, "filename": "Pandas_Data_Exploration_Utility_Package-0.0.1-py3-none-any.whl", "has_sig": false, "md5_digest": "0fb906fb1df7a6b09fe2fce52e44c84a", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 3713, "upload_time": "2018-08-12T22:40:44", "url": "https://files.pythonhosted.org/packages/7a/62/f92f94bfee725df91b39700534aed86b3dd9c930b2a90344d6bf0bae6cd9/Pandas_Data_Exploration_Utility_Package-0.0.1-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "752dc7dbf68198665f4ab633a1e5eb0c", "sha256": "273b097380e352bc5f384f6b491c68c4da830bdee58ca8aa0155b05595870b71" }, "downloads": -1, "filename": "Pandas Data Exploration Utility Package-0.0.1.tar.gz", "has_sig": false, "md5_digest": "752dc7dbf68198665f4ab633a1e5eb0c", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2923, "upload_time": "2018-08-12T22:40:45", "url": "https://files.pythonhosted.org/packages/f7/d9/592d8e193f6a83e7d2784deddc097d140e85188cd7cf270ebf027d073d6d/Pandas%20Data%20Exploration%20Utility%20Package-0.0.1.tar.gz" } ], "0.0.2": [ { "comment_text": "", "digests": { "md5": "3c3ef7ea92ef6c6db4bcf1cfca71e603", "sha256": "42a04b43375fcf21852b63b770d13a4e2fb3f85af87b9ce3aedaaf8187a4e1aa" }, "downloads": -1, "filename": "Pandas_Data_Exploration_Utility_Package-0.0.2-py3-none-any.whl", "has_sig": false, "md5_digest": "3c3ef7ea92ef6c6db4bcf1cfca71e603", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 5122, "upload_time": "2018-08-16T23:57:43", "url": "https://files.pythonhosted.org/packages/f8/7c/ce534cb0e66aac58af2a11c957f1b34e1e9dd0cba49bd0d5a6886638a091/Pandas_Data_Exploration_Utility_Package-0.0.2-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "bed848faf64323633e4d41f5aa04704b", "sha256": 
"912b4ea842e04d106a90fcb8f0abcf03f62564151897c413230025bb30c75cc9" }, "downloads": -1, "filename": "Pandas Data Exploration Utility Package-0.0.2.tar.gz", "has_sig": false, "md5_digest": "bed848faf64323633e4d41f5aa04704b", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 4490, "upload_time": "2018-08-16T23:57:45", "url": "https://files.pythonhosted.org/packages/db/3e/fc8c4d537490395c83d2975b4171a5fe7e7f08628f5ba532b7a6c632fbfe/Pandas%20Data%20Exploration%20Utility%20Package-0.0.2.tar.gz" } ], "0.0.3": [ { "comment_text": "", "digests": { "md5": "44e193fd18061d4b8b896141523db132", "sha256": "adb69db3acdc64c73c104146d00a5cd612f9f68f8aa89e01d8bcccd012400e71" }, "downloads": -1, "filename": "Pandas_Data_Exploration_Utility_Package-0.0.3-py3-none-any.whl", "has_sig": false, "md5_digest": "44e193fd18061d4b8b896141523db132", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 6413, "upload_time": "2018-08-30T22:22:06", "url": "https://files.pythonhosted.org/packages/cd/83/a1476af964060b88561e2645ff2487b67aeca5faaab9edbc255a51248541/Pandas_Data_Exploration_Utility_Package-0.0.3-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "a5abcb3081d86bd1e68095cb286e1af1", "sha256": "372acc91c7bcfb192a078af599607acf64dcc09461f5a332d673298e20cfe8e6" }, "downloads": -1, "filename": "Pandas Data Exploration Utility Package-0.0.3.tar.gz", "has_sig": false, "md5_digest": "a5abcb3081d86bd1e68095cb286e1af1", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5479, "upload_time": "2018-08-30T22:22:07", "url": "https://files.pythonhosted.org/packages/04/46/06077509fabcd921e0c7b43b7b6d8455e90d7f6b8681e441004cfd81009a/Pandas%20Data%20Exploration%20Utility%20Package-0.0.3.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "44e193fd18061d4b8b896141523db132", "sha256": "adb69db3acdc64c73c104146d00a5cd612f9f68f8aa89e01d8bcccd012400e71" }, "downloads": -1, 
"filename": "Pandas_Data_Exploration_Utility_Package-0.0.3-py3-none-any.whl", "has_sig": false, "md5_digest": "44e193fd18061d4b8b896141523db132", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 6413, "upload_time": "2018-08-30T22:22:06", "url": "https://files.pythonhosted.org/packages/cd/83/a1476af964060b88561e2645ff2487b67aeca5faaab9edbc255a51248541/Pandas_Data_Exploration_Utility_Package-0.0.3-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "a5abcb3081d86bd1e68095cb286e1af1", "sha256": "372acc91c7bcfb192a078af599607acf64dcc09461f5a332d673298e20cfe8e6" }, "downloads": -1, "filename": "Pandas Data Exploration Utility Package-0.0.3.tar.gz", "has_sig": false, "md5_digest": "a5abcb3081d86bd1e68095cb286e1af1", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 5479, "upload_time": "2018-08-30T22:22:07", "url": "https://files.pythonhosted.org/packages/04/46/06077509fabcd921e0c7b43b7b6d8455e90d7f6b8681e441004cfd81009a/Pandas%20Data%20Exploration%20Utility%20Package-0.0.3.tar.gz" } ] }