{ "info": { "author": "AutoDeploy AI", "author_email": "autodeploy.ai@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 4 - Beta", "Environment :: Console", "Environment :: Web Environment", "Intended Audience :: Developers", "Intended Audience :: System Administrators", "License :: OSI Approved :: Apache Software License", "Operating System :: OS Independent", "Programming Language :: Python :: 2.7", "Programming Language :: Python :: 3.5", "Programming Language :: Python :: 3.6", "Programming Language :: Python :: 3.7", "Topic :: Scientific/Engineering :: Artificial Intelligence" ], "description": "# PyPMML-Spark\n\n_PyPMML-Spark_ is a Python PMML scoring library for PySpark as SparkML Transformer, it really is the Python API for [PMML4S-Spark](https://github.com/autodeployai/pmml4s-spark).\n\n## Prerequisites\n - Java >= 1.8\n - Python 2.7 or >= 3.5\n\n## Dependencies\n - PySpark >= 2.4.0\n \n## Installation\n\n```bash\npip install pypmml-spark\n```\n\nOr install the latest version from github:\n\n```bash\npip install --upgrade git+https://github.com/autodeployai/pypmml-spark.git\n```\n\nAfter that, you need to do more to use it in Spark that must know those jars in the package `pypmml_spark.jars`. There are several ways to do that:\n\n1. The easiest way is to run the script `link_pmml4s_jars_into_spark.py` that is delivered with `pypmml-spark`:\n\n ```bash\n link_pmml4s_jars_into_spark.py\n ```\n \n2. Use those config options to specify dependent jars properly. e.g. `--jars`, or `spark.executor.extraClassPath` and `spark.executor.extraClassPath`. See [Spark](http://spark.apache.org/docs/latest/configuration.html) for details about those parameters.\n\n## Usage\n\n1. Load model from various sources, e.g. filename, string, or array of bytes.\n\n ```python\n from pypmml_spark import ScoreModel\n \n # The model is from http://dmg.org/pmml/pmml_examples/KNIME_PMML_4.1_Examples/single_iris_dectree.xml\n model = ScoreModel.fromFile('single_iris_dectree.xml')\n ```\n\n2. Call `transform(dataset)` to run a batch score against an input dataset.\n\n ```python\n # The data is from http://dmg.org/pmml/pmml_examples/Iris.csv\n df = spark.read.csv('Iris.csv', header='true')\n score_df = model.transform(df)\n ```\n\n## Use PMML in Scala or Java\nSee the [PMML4S](https://github.com/autodeployai/pmml4s) project. _PMML4S_ a PMML scoring library for Scala. It provides both Scala and Java Evaluator API for PMML.\n\n## Use PMML in Python\nSee the [PyPMML](https://github.com/autodeployai/pypmml) project. _PyPMML_ is a Python PMML scoring library, it really is the Python API for PMML4S.\n\n## Use PMML in Spark\nSee the [PMML4S-Spark](https://github.com/autodeployai/pmml4s-spark) project. _PMML4S-Spark_ is a PMML scoring library for Spark as SparkML Transformer.\n\n## Deploy PMML as REST API\nSee the [DaaS](https://www.autodeploy.ai/) system that deploys AI & ML models in production at scale on Kubernetes.\n\n## Support\nIf you have any questions about the _PyPMML-Spark_ library, please open issues on this repository.\n\nFeedback and contributions to the project, no matter what kind, are always very welcome. \n\n## License\n_PyPMML-Spark_ is licensed under [APL 2.0](http://www.apache.org/licenses/LICENSE-2.0).", "description_content_type": "text/markdown", "docs_url": null, "download_url": "https://github.com/autodeployai/pypmml-spark/archive/v0.9.3.tar.gz", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/autodeployai/pypmml-spark", "keywords": "", "license": "Apache License 2.0", "maintainer": "", "maintainer_email": "", "name": "pypmml-spark", "package_url": "https://pypi.org/project/pypmml-spark/", "platform": "", "project_url": "https://pypi.org/project/pypmml-spark/", "project_urls": { "Download": "https://github.com/autodeployai/pypmml-spark/archive/v0.9.3.tar.gz", "Homepage": "https://github.com/autodeployai/pypmml-spark" }, "release_url": "https://pypi.org/project/pypmml-spark/0.9.3/", "requires_dist": null, "requires_python": "", "summary": "Python PMML scoring library for PySpark as SparkML Transformer", "version": "0.9.3" }, "last_serial": 5798053, "releases": { "0.9.0": [ { "comment_text": "", "digests": { "md5": "f0d0a01518d61f8b52bc81bb82e8bdea", "sha256": "949960da55c19248158e5e0e99516651c5250ab227018b3f7b081e0df39449bc" }, "downloads": -1, "filename": "pypmml-spark-0.9.0.tar.gz", "has_sig": false, "md5_digest": "f0d0a01518d61f8b52bc81bb82e8bdea", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 4447548, "upload_time": "2019-06-03T14:02:40", "url": "https://files.pythonhosted.org/packages/2c/5b/339c7d64602e0a537b53fddf2b79892308058acf5ae5b68e154081505d1d/pypmml-spark-0.9.0.tar.gz" } ], "0.9.1": [ { "comment_text": "", "digests": { "md5": "510791396838a0c970855a63d852ba29", "sha256": "4d7696ec7aecfd61d95fbd6dd4daa142e727a0ff304f87b832d6e9d9fbbf5059" }, "downloads": -1, "filename": "pypmml-spark-0.9.1.tar.gz", "has_sig": false, "md5_digest": "510791396838a0c970855a63d852ba29", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 4448468, "upload_time": "2019-06-19T01:54:48", "url": "https://files.pythonhosted.org/packages/50/db/24c9ba9eedffa0b4c8d21c9df7020cfdd50534ea212660b9eec17dae3922/pypmml-spark-0.9.1.tar.gz" } ], "0.9.2": [ { "comment_text": "", "digests": { "md5": "be65fbc804782dc86ba1bcc1f9841e2b", "sha256": "b060de4ecf0c6fb7acf399a1c40a025c2bc3d756548bf9c9a01aa1b343fb3945" }, "downloads": -1, "filename": "pypmml-spark-0.9.2.tar.gz", "has_sig": false, "md5_digest": "be65fbc804782dc86ba1bcc1f9841e2b", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 4674065, "upload_time": "2019-07-13T15:40:09", "url": "https://files.pythonhosted.org/packages/83/77/449d6a0e6e85fdb70baa3a3130efee9252735675eb50cd06749f88f6ec5c/pypmml-spark-0.9.2.tar.gz" } ], "0.9.3": [ { "comment_text": "", "digests": { "md5": "81cacefbed643f92e7fb7f57c6a7b5c0", "sha256": "9f6a92955c259d18c7d04e493ae2bddbf6b6eac6645ef9bcf53c64542eec718d" }, "downloads": -1, "filename": "pypmml-spark-0.9.3.tar.gz", "has_sig": false, "md5_digest": "81cacefbed643f92e7fb7f57c6a7b5c0", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 4788230, "upload_time": "2019-09-08T02:26:11", "url": "https://files.pythonhosted.org/packages/8b/89/24bb78afd38716ffa7919eb907afb6bb4f595b504a7fbc565367bcb4a877/pypmml-spark-0.9.3.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "81cacefbed643f92e7fb7f57c6a7b5c0", "sha256": "9f6a92955c259d18c7d04e493ae2bddbf6b6eac6645ef9bcf53c64542eec718d" }, "downloads": -1, "filename": "pypmml-spark-0.9.3.tar.gz", "has_sig": false, "md5_digest": "81cacefbed643f92e7fb7f57c6a7b5c0", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 4788230, "upload_time": "2019-09-08T02:26:11", "url": "https://files.pythonhosted.org/packages/8b/89/24bb78afd38716ffa7919eb907afb6bb4f595b504a7fbc565367bcb4a877/pypmml-spark-0.9.3.tar.gz" } ] }