{ "info": { "author": "Martin Skarzynski", "author_email": "marskar@gmail.com", "bugtrack_url": null, "classifiers": [ "License :: OSI Approved :: MIT License", "Operating System :: OS Independent", "Programming Language :: Python :: 3" ], "description": "# The `datacamprojects` python package\n\nSkip the boilerplate of scikit-learn machine learning examples.\n\n## Installation\n```bash\npip install datacamprojects\n```\n\n## Usage\n\nIn a shell environment, you can run `datacamprojects`\nwith no arguments to perform a Logistic Regression\non the `digits` dataset.\n\nThis will produce a 10 x 10 confusion matrix\nwith the Accuracy Score at the top.\n\nYou can also pass arguments to datacamprojects at the command line.\n\nFor example, \n```bash\ndatacamprojects -dataset diabetes -model linear_model.Lasso\n# Or\ndatacamprojects -d diabetes -m linear_model.Lasso\n```\nwill run a linear regression with lasso regularization (L1)\non the `diabetes` dataset.\n\nThe `dataset` argument can be any of\nthe following built-in scikit-learn datasets:\n- Regression\n - `boston`\n - `diabetes`\n- Classification\n - `digits`\n - `iris`\n - `wine`\n - `breast_cancer`\n\nThe `model` argument refers to the model type and name from scikit-learn.\nThe first part is the submodule, e.g. \n- `linear_model`\n- `naive_bayes`\n- `ensemble`\n- `svm`\n\nwhile the second is what is actually imported, e.g.\n- `LinearRegression`\n- `GaussianNB`\n- `RandomForestRegressor`\n- `SVC`\n\nSimplify code to a single function call per step:\n```python\nfrom sklearn.metrics import confusion_matrix, accuracy_score\nimport datacamprojects as dcp\n\ndataset = dcp.get_data('digits')\nx_train, x_test, y_train, y_test = dcp.split_data(dataset)\n\nmodel = dcp.get_model(model_type='ensemble',\n model_name='RandomForestClassifier')\n\nfit = model.fit(x_train, y_train)\ndcp.pickle_model(filename='digits_rf.pickle', model=fit)\npredictions = fit.predict(x_test)\n\nconfmat = confusion_matrix(y_true=y_test, y_pred=predictions)\naccuracy = accuracy_score(y_true=y_test, y_pred=predictions)\n\ndcp.confusion_matrix_plot(cm=confmat,\n acc=accuracy,\n filename='digits_rf.png')\n```\n\nOr run a whole pipeline with one function:\n\n```python\nimport datacamprojects as dcp\n\ndcp.classification(dataset='digits',\n model_type='ensemble',\n model_name='RandomForestClassifier',\n pickle_name='digits_rf.pickle',\n plot_name='digits_rf.png')\n```\n\nFor inspiration, look at the example pipeline in the\n[pipeline folder](https://github.com/marskar/datacamprojects/tree/master/pipeline)\nof the\n[datacamprojects repo](https://github.com/marskar/datacamprojects).\n\n\n", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/marskar/datacamprojects", "keywords": "", "license": "", "maintainer": "", "maintainer_email": "", "name": "datacamprojects", "package_url": "https://pypi.org/project/datacamprojects/", "platform": "", "project_url": "https://pypi.org/project/datacamprojects/", "project_urls": { "Homepage": "https://github.com/marskar/datacamprojects" }, "release_url": "https://pypi.org/project/datacamprojects/0.0.1/", "requires_dist": [ "scikit-learn", "matplotlib", "seaborn" ], "requires_python": "", "summary": "Tools for the DataCamp Creating Robust Python Projects course", "version": "0.0.1" }, "last_serial": 4411401, "releases": { "0.0.1": [ { "comment_text": "", "digests": { "md5": "b563cf96ea8a76a725fddd13ac59674e", "sha256": "ca96954f9dcc32567294a848bd8f199f83c1ed702fd39617efed1672bcdbdba3" }, "downloads": -1, "filename": "datacamprojects-0.0.1-py3-none-any.whl", "has_sig": false, "md5_digest": "b563cf96ea8a76a725fddd13ac59674e", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 4599, "upload_time": "2018-10-24T14:46:35", "url": "https://files.pythonhosted.org/packages/5b/1a/4d723cd837a214a13396f5fcb545b07bf3b3532ac3828d1e6ecdadef52c7/datacamprojects-0.0.1-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "d90bab66d434020942dd9414cc6b3a98", "sha256": "93777a733766b35dde7f8752ef3ed0d4397326920deaa3e21b036e2525043a41" }, "downloads": -1, "filename": "datacamprojects-0.0.1.tar.gz", "has_sig": false, "md5_digest": "d90bab66d434020942dd9414cc6b3a98", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 3792, "upload_time": "2018-10-24T14:46:37", "url": "https://files.pythonhosted.org/packages/1c/5c/0c5b2c742445816c15f8df04f4af5ffed1a260875afd79736c66c83f222a/datacamprojects-0.0.1.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "b563cf96ea8a76a725fddd13ac59674e", "sha256": "ca96954f9dcc32567294a848bd8f199f83c1ed702fd39617efed1672bcdbdba3" }, "downloads": -1, "filename": "datacamprojects-0.0.1-py3-none-any.whl", "has_sig": false, "md5_digest": "b563cf96ea8a76a725fddd13ac59674e", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 4599, "upload_time": "2018-10-24T14:46:35", "url": "https://files.pythonhosted.org/packages/5b/1a/4d723cd837a214a13396f5fcb545b07bf3b3532ac3828d1e6ecdadef52c7/datacamprojects-0.0.1-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "d90bab66d434020942dd9414cc6b3a98", "sha256": "93777a733766b35dde7f8752ef3ed0d4397326920deaa3e21b036e2525043a41" }, "downloads": -1, "filename": "datacamprojects-0.0.1.tar.gz", "has_sig": false, "md5_digest": "d90bab66d434020942dd9414cc6b3a98", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 3792, "upload_time": "2018-10-24T14:46:37", "url": "https://files.pythonhosted.org/packages/1c/5c/0c5b2c742445816c15f8df04f4af5ffed1a260875afd79736c66c83f222a/datacamprojects-0.0.1.tar.gz" } ] }