{ "info": { "author": "Soledad Galli", "author_email": "solegalli1@gmail.com", "bugtrack_url": null, "classifiers": [ "Operating System :: OS Independent", "Programming Language :: Python :: 3" ], "description": "# Feature Engine\n\nFeature Engine is a Python library that contains several transformers to engineer features for use in machine learning models.\nThe transformers follow scikit-learn-like functionality. They first learn the imputing or encoding methods from the training set, and subsequently transform the dataset.\nCurrently the transformers include functionality for:\n\n* Missing value imputation\n* Categorical variable encoding\n* Outlier removal\n* Discretisation\n* Numerical Variable Transformation\n\n## Important Links\n\nDocumentation: http://feature-engine.readthedocs.io\n\n### Imputing Methods\n\n* MeanMedianImputer\n* RandomSampleImputer\n* EndTailImputer\n* AddNaNBinaryImputer\n* CategoricalVariableImputer\n* FrequentCategoryImputer\n* ArbitraryNumberImputer\n\n### Encoding Methods\n* CountFrequencyCategoricalEncoder\n* OrdinalCategoricalEncoder\n* MeanCategoricalEncoder\n* WoERatioCategoricalEncoder\n* OneHotCategoricalEncoder\n* RareLabelCategoricalEncoder\n\n### Outlier Handling methods\n* Windsorizer\n* ArbitraryOutlierCapper\n\n### Discretisation methods\n* EqualFrequencyDiscretiser\n* EqualWidthDiscretiser\n* DecisionTreeDiscretiser\n\n### Variable Transformation methods\n* LogTransformer\n* ReciprocalTransformer\n* ExponentialTransformer\n* BoxCoxTransformer\n\n### Installing\n\n```\npip install feature_engine\n```\nor\n\n```\ngit clone https://github.com/solegalli/feature_engine.git\n```\n\n### Usage\n\n```\nfrom feature_engine.categorical_encoders import RareLabelEncoder\n\nrare_encoder = RareLabelEncoder(tol = 0.05, n_categories=5)\nrare_encoder.fit(data, variables = ['Cabin', 'Age'])\ndata_encoded = rare_encoder.transform(data)\n```\n\nSee more usage examples in the Jupyter notebooks in the Examples section.\n\n### Examples\n\nYou can 
find Jupyter notebooks in the examples folder, with directions on how to use this package and its multiple transformers.\n\n### License\n\nBSD 3-Clause\n\n### Authors\n\n* **Soledad Galli** - *Initial work* - [Feature Engineering Online Course](https://www.udemy.com/feature-engineering-for-machine-learning)\n\n\n### References\n\nMost of the engineering and encoding functionality is inspired by this [series of articles from the 2009 KDD competition](http://www.mtome.com/Publications/CiML/CiML-v3-book.pdf)\n\nTo learn more about the rationale, functionality, pros and cons of each imputer, encoder and transformer, refer to the [Feature Engineering Online Course](https://www.udemy.com/feature-engineering-for-machine-learning)\n\nFor a summary of the methods, check this [presentation](https://speakerdeck.com/solegalli/engineering-and-selecting-features-for-machine-learning)\n\n", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "http://github.com/solegalli/feature_engine", "keywords": "", "license": "BSD 3 clause", "maintainer": "", "maintainer_email": "", "name": "feature-engine", "package_url": "https://pypi.org/project/feature-engine/", "platform": "", "project_url": "https://pypi.org/project/feature-engine/", "project_urls": { "Homepage": "http://github.com/solegalli/feature_engine" }, "release_url": "https://pypi.org/project/feature-engine/0.3.0/", "requires_dist": [ "numpy (>=1.12.1)", "pandas (>=0.23.0)", "scikit-learn (>=0.18.1)", "scipy (>=0.19.0)", "statsmodels (>=0.8.0)", "numpydoc (>=0.6.0)", "pytest (==4.3.1)" ], "requires_python": "", "summary": "Feature engineering package that follows sklearn functionality", "version": "0.3.0" }, "last_serial": 5634171, "releases": { "0.1": [ { "comment_text": "", "digests": { "md5": "a5f35c6f5e417316b3daed9bd699a8da", "sha256": "33cc9f4346cb44c62719393d94beb434b7fcbf5c86367a3455266b7b671a38f1" }, 
"downloads": -1, "filename": "feature_engine-0.1-py3-none-any.whl", "has_sig": false, "md5_digest": "a5f35c6f5e417316b3daed9bd699a8da", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 19683, "upload_time": "2018-12-31T14:20:24", "url": "https://files.pythonhosted.org/packages/42/40/e33c1e25d9563f4154afd9d27cdbad414904fb48b086a24aa8908ad169dd/feature_engine-0.1-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "faf1805373070eb50d8746c447da86b8", "sha256": "cd16b71526c6144eaf1a4aa3f87e361e874a85a1c46be8e406504a7ad4bf81a9" }, "downloads": -1, "filename": "feature_engine-0.1.tar.gz", "has_sig": false, "md5_digest": "faf1805373070eb50d8746c447da86b8", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 13232, "upload_time": "2018-12-31T14:20:26", "url": "https://files.pythonhosted.org/packages/eb/08/f459db75cabbd962cd9fac0433701d9ce9f86cbb10f6229712188771e925/feature_engine-0.1.tar.gz" } ], "0.1.1": [ { "comment_text": "", "digests": { "md5": "95fad9d1fa3efc6d4ba97288ad5ef65e", "sha256": "72921a7bea80ffdd0382021b5d86d01a4c8e769f16810d48cbbc7b2d70d60e27" }, "downloads": -1, "filename": "feature_engine-0.1.1-py3-none-any.whl", "has_sig": false, "md5_digest": "95fad9d1fa3efc6d4ba97288ad5ef65e", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 19782, "upload_time": "2019-01-01T15:14:50", "url": "https://files.pythonhosted.org/packages/3d/f8/de1f7aa7a45a209f7a59d6ae3bdfbdd34aca87b060fc61907892e45d19b2/feature_engine-0.1.1-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "0ecb5f7f9f9b901837935e9e7088fb41", "sha256": "4f1149bc32c384d7890cbbe21670c71a4ee4b00f65dbec00c578e99d25e7d566" }, "downloads": -1, "filename": "feature_engine-0.1.1.tar.gz", "has_sig": false, "md5_digest": "0ecb5f7f9f9b901837935e9e7088fb41", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 13425, "upload_time": "2019-01-01T15:14:54", 
"url": "https://files.pythonhosted.org/packages/d5/56/bd5327e4cef92849a3e79f9abf8fa566dac749a66234ede650c587967e4d/feature_engine-0.1.1.tar.gz" } ], "0.2": [ { "comment_text": "", "digests": { "md5": "2e019bbe722115e07fb3b55f747ba97d", "sha256": "2f82ed94596b10e10ea2ce17774d6d4c1f158eba17788e9a31548461c6119e7d" }, "downloads": -1, "filename": "feature_engine-0.2-py3-none-any.whl", "has_sig": false, "md5_digest": "2e019bbe722115e07fb3b55f747ba97d", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 19758, "upload_time": "2019-01-01T15:14:52", "url": "https://files.pythonhosted.org/packages/90/c8/129d645beaf60d1b444d1af1d4aa2fd505e030159d8c7a6a12379284459b/feature_engine-0.2-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "e5cd9bb55dd858e6ff5360b7e170f561", "sha256": "55d5b05e76e7a9c522547fa6affa525678af54bba5b8dc13d8d520eb5d565e4b" }, "downloads": -1, "filename": "feature_engine-0.2.tar.gz", "has_sig": false, "md5_digest": "e5cd9bb55dd858e6ff5360b7e170f561", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 13414, "upload_time": "2019-01-01T15:14:56", "url": "https://files.pythonhosted.org/packages/e2/7d/e8d69e054425ef70782214bfff56501bbb929bc2e5fa2e3bdb4575636fee/feature_engine-0.2.tar.gz" } ], "0.3.0": [ { "comment_text": "", "digests": { "md5": "5b3c610ae6635510aebee96003ab41ff", "sha256": "f6db8f6366aec3f69963bac9f1b9ac5d663774af35a79bc1bbed211ed14a3767" }, "downloads": -1, "filename": "feature_engine-0.3.0-py3-none-any.whl", "has_sig": false, "md5_digest": "5b3c610ae6635510aebee96003ab41ff", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 22965, "upload_time": "2019-08-05T13:01:39", "url": "https://files.pythonhosted.org/packages/3f/be/c2ee5b18e60423f58e20749b40964fc49ce5c12a269fafed5933775f80f2/feature_engine-0.3.0-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "05602569979b6bee0b8edc95e8ad81c4", "sha256": 
"3d482d739a7366e4ac491ce77a4acef851ec504aa09d49e9c43c338845a8aca8" }, "downloads": -1, "filename": "feature_engine-0.3.0.tar.gz", "has_sig": false, "md5_digest": "05602569979b6bee0b8edc95e8ad81c4", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 17798, "upload_time": "2019-08-05T13:01:41", "url": "https://files.pythonhosted.org/packages/a2/89/a672f4d6e5f4005ba911e2f9a6d7cbea84aa70cadae9ca7e4a7df53c7f54/feature_engine-0.3.0.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "5b3c610ae6635510aebee96003ab41ff", "sha256": "f6db8f6366aec3f69963bac9f1b9ac5d663774af35a79bc1bbed211ed14a3767" }, "downloads": -1, "filename": "feature_engine-0.3.0-py3-none-any.whl", "has_sig": false, "md5_digest": "5b3c610ae6635510aebee96003ab41ff", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 22965, "upload_time": "2019-08-05T13:01:39", "url": "https://files.pythonhosted.org/packages/3f/be/c2ee5b18e60423f58e20749b40964fc49ce5c12a269fafed5933775f80f2/feature_engine-0.3.0-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "05602569979b6bee0b8edc95e8ad81c4", "sha256": "3d482d739a7366e4ac491ce77a4acef851ec504aa09d49e9c43c338845a8aca8" }, "downloads": -1, "filename": "feature_engine-0.3.0.tar.gz", "has_sig": false, "md5_digest": "05602569979b6bee0b8edc95e8ad81c4", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 17798, "upload_time": "2019-08-05T13:01:41", "url": "https://files.pythonhosted.org/packages/a2/89/a672f4d6e5f4005ba911e2f9a6d7cbea84aa70cadae9ca7e4a7df53c7f54/feature_engine-0.3.0.tar.gz" } ] }