{ "info": { "author": "Yifei Huang", "author_email": "yifei.huang@gmail.com", "bugtrack_url": null, "classifiers": [ "License :: OSI Approved :: MIT License", "Operating System :: OS Independent", "Programming Language :: Python :: 3" ], "description": "# Pandas Data Exploration Utility Package\n\n## Table of content\n * [Overview](#overview)\n * [Installation](#installation)\n * [Usage](#usage)\n \t+ [Visualization Module](#visualization-module)\n\t + [Pareto plot](#pareto-plot)\n\t + [Distribution plot](#distribution-plot)\n\t + [X-Y plot](#x-y-plot)\n * [Recommended development setup](#Recommended-development-setup)\n\n## Overview\nPandas Data Exploration utility is an interactive, notebook based library for quickly profiling and exploring the shape of data and the relationships between data. Using existing APIs from IpyWidget, Plot.ly, and Pandas, it creates a flexible point and click widget that allows the user to easily explore and visualize the dataset. \nThis is a work in progress, and I welcome any suggestions on features and/or enhancements.\n\n## Installation\n```\npip install Pandas-Data-Exploration-Utility-Package\n```\n\n## Usage\n\n### Visualization Module\n```\nimport pandas as pd\nimport pandas_exploration_util.viz.explore as pe\n\nglobal_temp = pd.read_csv(\"./data/GlobalTemperatures.csv\", parse_dates = [0], infer_datetime_format=True)\n\npe.generate_widget(global_temp)\n```\nsee `/test` for sample data and test jupyter notebook \nhttps://github.com/yifeihuang/pandas_exploration_util/tree/master/test\n\n***\n#### Pareto plot\nVisualize the top values of any column as ranked by aggregation of any other column. Support aggregation functions include `'count', 'sum', 'mean', 'std', 'max', 'min', 'uniques'`\n
\n
\n
\n
\n
\n
\n