{ "info": { "author": "Truoc Pham", "author_email": "truoc.phamkhac@asnet.com.vn", "bugtrack_url": null, "classifiers": [ "Development Status :: 2 - Pre-Alpha", "Intended Audience :: Developers", "License :: OSI Approved :: MIT License", "Natural Language :: English", "Programming Language :: Python :: 2", "Programming Language :: Python :: 2.7", "Programming Language :: Python :: 3", "Programming Language :: Python :: 3.4", "Programming Language :: Python :: 3.5", "Programming Language :: Python :: 3.6" ], "description": "======================\nData Science Utilities\n======================\n\n\n.. image:: https://img.shields.io/pypi/v/data_science_utilities.svg\n :target: https://pypi.python.org/pypi/data_science_utilities\n\n.. image:: https://img.shields.io/travis/truocphamkhac/data-science-utilities.svg\n :target: https://travis-ci.org/truocphamkhac/data-science-utilities\n\n.. image:: https://readthedocs.org/projects/data-science-utilities/badge/?version=latest\n :target: http://data-science-utilities-python.readthedocs.io/en/latest/?badge=latest\n :alt: Documentation Status\n\n\n\n\nData Science utilities in python.\n\n\n* Free software: MIT license\n* Documentation: http://data-science-utilities-python.readthedocs.io.\n\n\nFeatures\n========\n\nMissing Data Statistic\n----------------------\n\n.. code:: python\n\n from data_science_utilities import data_science_utilities\n\n # make statistic\n missing_data = data_science_utilities.missing_data_stats(df)\n\n # display statistic\n missing_data\n\n\nRead CSV files from path\n------------------------\n\n.. code:: python\n\n from data_science_utilities import data_science_utilities\n\n train_path = '../data/raw/train.csv'\n test_path = '../data/raw/test.csv'\n\n X_train, X_test = data_science_utilities.read_csv_files(train_path, test_path)\n\n\nPlotting distribution normal\n----------------------------\n\n.. code:: python\n\n from data_science_utilities import data_science_utilities\n\n data_science_utilities.plot_dist_norm(dist, 'distribution normal')\n\n\nPlotting correlation matrix\n---------------------------\n\n.. code:: python\n\n from data_science_utilities import data_science_utilities\n\n data_science_utilities.plot_corelation_matrix(data)\n\n\nPlotting top attributes correlation matrix\n------------------------------------------\n\n.. code:: python\n\n from data_science_utilities import data_science_utilities\n\n data_science_utilities.plot_top_corelation_matrix(data, target, k=10, cmap='YlGnBu')\n\n\nPlotting attributes by scatter chart\n------------------------------------\n\n.. code:: python\n\n from data_science_utilities import data_science_utilities\n\n data_science_utilities.plot_scatter(data, column_name, target)\n\n\nPlotting attributes by box bar\n------------------------------\n\n.. code:: python\n\n from data_science_utilities import data_science_utilities\n\n data_science_utilities.plot_box(data, column_name, target)\n\n\nPlotting category by box bar\n----------------------------\n\n.. code:: python\n\n from data_science_utilities import data_science_utilities\n\n data_science_utilities.plot_category_columns(data, limit_bars=10)\n\n\nGenerate a simple plot of the test and traning learning curve\n-------------------------------------------------------------\n\n.. code:: python\n\n from data_science_utilities import data_science_utilities\n\n data_science_utilities.plot_learning_curve(estimator, title, X, y, ylim=None,\n cv=None, train_sizes=np.linspace(.1, 1.0, 5))\n\n\nCredits\n=======\n\nThis package was created with Cookiecutter_ and the `audreyr/cookiecutter-pypackage`_ project template.\n\n.. _Cookiecutter: https://github.com/audreyr/cookiecutter\n.. _`audreyr/cookiecutter-pypackage`: https://github.com/audreyr/cookiecutter-pypackage\n\n\n=======\nHistory\n=======\n\n0.2.4 (2018-05-21)\n------------------\n\n* Fixed render docs on README.\n\n\n0.2.3 (2018-05-21)\n------------------\n\n* Fixed render docs on https://pypi.org/.\n\n\n0.2.2 (2018-05-21)\n------------------\n\n* Fix render docs con't.\n\n\n0.2.1 (2018-05-21)\n------------------\n\n* Fix render docs.\n\n\n0.2.0 (2018-05-14)\n------------------\n\n* Adds utils about visualization.\n\n\n0.1.0 (2018-05-11)\n------------------\n\n* First release on PyPI.\n\n\n", "description_content_type": "", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/truocphamkhac/data-science-utilities", "keywords": "data_science_utilities", "license": "MIT license", "maintainer": "", "maintainer_email": "", "name": "data-science-utilities", "package_url": "https://pypi.org/project/data-science-utilities/", "platform": "", "project_url": "https://pypi.org/project/data-science-utilities/", "project_urls": { "Homepage": "https://github.com/truocphamkhac/data-science-utilities" }, "release_url": "https://pypi.org/project/data-science-utilities/0.2.4/", "requires_dist": [ "Click (>=6.0)" ], "requires_python": "", "summary": "Data Science utilities in python.", "version": "0.2.4" }, "last_serial": 3883588, "releases": { "0.1.0": [ { "comment_text": "", "digests": { "md5": "7b2ee23261c8bdb29f0bf7d5b4f8110e", "sha256": "b0c60c540ba4aac734019c975fb1f89b94cc3d1c79e74f6b012bfe9600ec3e9c" }, "downloads": -1, "filename": "data_science_utilities-0.1.0-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "7b2ee23261c8bdb29f0bf7d5b4f8110e", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 4730, "upload_time": "2018-05-12T04:35:20", "url": "https://files.pythonhosted.org/packages/05/35/7c9a5e2886ae987878ddd5f774cf19ae6d6a4d2160e8643cf947b5bd19f6/data_science_utilities-0.1.0-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "85802f3a8afcbefe88a1a2ea25f242a7", "sha256": "66a07fb460e7168365bbccca27ddb2a2e81debac1b7a6a370b549f4fca6eba2b" }, "downloads": -1, "filename": "data_science_utilities-0.1.0-py3.6.egg", "has_sig": false, "md5_digest": "85802f3a8afcbefe88a1a2ea25f242a7", "packagetype": "bdist_egg", "python_version": "3.6", "requires_python": null, "size": 4843, "upload_time": "2018-05-12T04:35:21", "url": "https://files.pythonhosted.org/packages/d2/1a/cf439a7fe311da36d46d5ceca9ef0fa5c40311d5de0501ced05cfa31b128/data_science_utilities-0.1.0-py3.6.egg" }, { "comment_text": "", "digests": { "md5": "e8352f98f2fccd9b1b55980130bc7018", "sha256": "3e6cf635ef75e4e5d42c22a11b6ae832b929b1df1445fe75ea910ea034a517b2" }, "downloads": -1, "filename": "data_science_utilities-0.1.0.tar.gz", "has_sig": false, "md5_digest": "e8352f98f2fccd9b1b55980130bc7018", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 8966, "upload_time": "2018-05-12T04:35:23", "url": "https://files.pythonhosted.org/packages/22/ce/206e99e063b0bbf6088eed6cf1b2e48c16534a697e9140b54d06629dbcbf/data_science_utilities-0.1.0.tar.gz" } ], "0.2.0": [ { "comment_text": "", "digests": { "md5": "d9c2df41c73fe24c31f482a8d2c6bd92", "sha256": "e194ec0f49852419f0cc9e70ccc1e43d01b485b23cd1fa8ee2e214d799fcf242" }, "downloads": -1, "filename": "data_science_utilities-0.2.0-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "d9c2df41c73fe24c31f482a8d2c6bd92", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 7643, "upload_time": "2018-05-14T02:34:06", "url": "https://files.pythonhosted.org/packages/db/e9/9904d808f3e19d597f374d09d37ae6c7c5ff6637da95cf1581edd6419e6b/data_science_utilities-0.2.0-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "14b9a22a5c9944b73dae871b23ebfddd", "sha256": "1fb0eb6803600e94126ad93958b4cf269a175304b5e228e5700e619a2ae844c7" }, "downloads": -1, "filename": "data_science_utilities-0.2.0.tar.gz", "has_sig": false, "md5_digest": "14b9a22a5c9944b73dae871b23ebfddd", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 11568, "upload_time": "2018-05-14T02:34:09", "url": "https://files.pythonhosted.org/packages/68/aa/0bbbaf32d00d2b984e2a6aaa7a4c7ac1eab6fa8b70c133f53094c80f56dc/data_science_utilities-0.2.0.tar.gz" } ], "0.2.1": [ { "comment_text": "", "digests": { "md5": "d187d1fcfdce4d31ec69d3dd3d8c1d5a", "sha256": "2212182c01930b723a5c42ecb7ca33bbfecfe2c205744040076c4dc98f021dcb" }, "downloads": -1, "filename": "data_science_utilities-0.2.1-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "d187d1fcfdce4d31ec69d3dd3d8c1d5a", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 7720, "upload_time": "2018-05-21T10:06:50", "url": "https://files.pythonhosted.org/packages/76/30/eddc89e23734b819105eb556481dac8f7261405c31f5a172af9ce7dce5b3/data_science_utilities-0.2.1-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "91ee62f654471768e541bc2a0362dbb9", "sha256": "5d3b3af39ad539c046038755b9bf65cc3edcb7b39c8bef293d3b9c2d440ba1d0" }, "downloads": -1, "filename": "data_science_utilities-0.2.1.tar.gz", "has_sig": false, "md5_digest": "91ee62f654471768e541bc2a0362dbb9", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 16926, "upload_time": "2018-05-21T10:06:51", "url": "https://files.pythonhosted.org/packages/7d/56/cea133df793cd2e17f7ef0b5d91e1f2ff337608b2699828d60152d28186f/data_science_utilities-0.2.1.tar.gz" } ], "0.2.2": [ { "comment_text": "", "digests": { "md5": "73ed10b4f6f47d2bb75ff9e10081a024", "sha256": "452f57c7f57013307849dece896dc06930051a7c2781b75eeb19e9d5516efc91" }, "downloads": -1, "filename": "data_science_utilities-0.2.2-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "73ed10b4f6f47d2bb75ff9e10081a024", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 7718, "upload_time": "2018-05-21T10:10:20", "url": "https://files.pythonhosted.org/packages/42/a3/3c43015ce7f7868ae88b6404ef45e79503b107e2d75fc61596c53afdbf35/data_science_utilities-0.2.2-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "0620520ea220f201491c6dd4af7e0dad", "sha256": "7e0588484e3a2fc26e30b490299a6540328bb0a5c8d0ffca57cbc863bf365d97" }, "downloads": -1, "filename": "data_science_utilities-0.2.2.tar.gz", "has_sig": false, "md5_digest": "0620520ea220f201491c6dd4af7e0dad", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 16920, "upload_time": "2018-05-21T10:10:21", "url": "https://files.pythonhosted.org/packages/1b/ec/d7e49a36f9dc8860ef41f78a34709cdc5ea2bb7ad27259e09b91b0061305/data_science_utilities-0.2.2.tar.gz" } ], "0.2.3": [ { "comment_text": "", "digests": { "md5": "91c0d15f3c3eb8105623dc0f074b2c27", "sha256": "78e00f2d815719018064e9f28ee7b3a0249535527bc96c0bcc4c0c61c71ac141" }, "downloads": -1, "filename": "data_science_utilities-0.2.3-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "91c0d15f3c3eb8105623dc0f074b2c27", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 7768, "upload_time": "2018-05-21T10:35:14", "url": "https://files.pythonhosted.org/packages/76/94/11d76e963aa333262374f852bd9ea3a5ac5e16ff9017686f718ce9c08324/data_science_utilities-0.2.3-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "242e1bdef1ebaee8a730f793aceae66c", "sha256": "8c65cc37bd399550bc6c56091ecc604229c65218b9f301d0639b4e487ca47948" }, "downloads": -1, "filename": "data_science_utilities-0.2.3.tar.gz", "has_sig": false, "md5_digest": "242e1bdef1ebaee8a730f793aceae66c", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 17035, "upload_time": "2018-05-21T10:35:15", "url": "https://files.pythonhosted.org/packages/e0/b5/536360ea3c6d2113e04db4064d4a9dacc159e8a58b9c3132aebd8c61f3df/data_science_utilities-0.2.3.tar.gz" } ], "0.2.4": [ { "comment_text": "", "digests": { "md5": "d4bd1765d1862f9dbbaf9e3ea14aeed7", "sha256": "eab4b129b0357e7d32ff16865e7ca60ffc6a889ce338fc5520ac2f2fc1a3ad81" }, "downloads": -1, "filename": "data_science_utilities-0.2.4-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "d4bd1765d1862f9dbbaf9e3ea14aeed7", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 7833, "upload_time": "2018-05-21T14:21:50", "url": "https://files.pythonhosted.org/packages/e4/63/aa3961bb6a87328309ec4cc7f593528e578571a1e2b9e417559459243e27/data_science_utilities-0.2.4-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "9eba17395e0c25544ff98c340ee911ce", "sha256": "618ac654ee265d8a2a65a85a151cc2c25d10c4290b7a4b81dcab442fa64aace6" }, "downloads": -1, "filename": "data_science_utilities-0.2.4.tar.gz", "has_sig": false, "md5_digest": "9eba17395e0c25544ff98c340ee911ce", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 17241, "upload_time": "2018-05-21T14:21:51", "url": "https://files.pythonhosted.org/packages/3d/5d/b9abfb634400464961d7504914a47794de39ab805d078f8ee3d1169ba895/data_science_utilities-0.2.4.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "d4bd1765d1862f9dbbaf9e3ea14aeed7", "sha256": "eab4b129b0357e7d32ff16865e7ca60ffc6a889ce338fc5520ac2f2fc1a3ad81" }, "downloads": -1, "filename": "data_science_utilities-0.2.4-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "d4bd1765d1862f9dbbaf9e3ea14aeed7", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 7833, "upload_time": "2018-05-21T14:21:50", "url": "https://files.pythonhosted.org/packages/e4/63/aa3961bb6a87328309ec4cc7f593528e578571a1e2b9e417559459243e27/data_science_utilities-0.2.4-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "9eba17395e0c25544ff98c340ee911ce", "sha256": "618ac654ee265d8a2a65a85a151cc2c25d10c4290b7a4b81dcab442fa64aace6" }, "downloads": -1, "filename": "data_science_utilities-0.2.4.tar.gz", "has_sig": false, "md5_digest": "9eba17395e0c25544ff98c340ee911ce", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 17241, "upload_time": "2018-05-21T14:21:51", "url": "https://files.pythonhosted.org/packages/3d/5d/b9abfb634400464961d7504914a47794de39ab805d078f8ee3d1169ba895/data_science_utilities-0.2.4.tar.gz" } ] }