{ "info": { "author": "Thomas Moerman", "author_email": "thomas.moerman@gmail.com", "bugtrack_url": null, "classifiers": [], "description": ".. image:: img/arboreto.png\n :alt: arboreto\n :scale: 100%\n :align: left\n\n.. image:: https://travis-ci.org/tmoerman/arboreto.svg?branch=master\n :alt: Build Status\n :target: https://travis-ci.org/tmoerman/arboreto\n\n.. image:: https://readthedocs.org/projects/arboreto/badge/?version=latest\n :alt: Documentation Status\n :target: http://arboreto.readthedocs.io/en/latest/?badge=latest\n\n.. image:: https://img.shields.io/badge/pypi-0.1.5-blue.svg\n :alt: PyPI package\n :target: https://pypi.python.org/pypi?:action=display&name=arboreto&version=0.1.5\n\n----\n\n.. epigraph::\n\n *The most satisfactory definition of man from the scientific point of view is probably Man the Tool-maker.*\n\n.. _arboreto: https://arboreto.readthedocs.io\n.. _`arboreto documentation`: https://arboreto.readthedocs.io\n.. _notebooks: https://github.com/tmoerman/arboreto/tree/master/notebooks\n.. _issue: https://github.com/tmoerman/arboreto/issues/new\n\n.. _dask: https://dask.pydata.org/en/latest/\n.. _`dask distributed`: https://distributed.readthedocs.io/en/latest/\n\n.. _GENIE3: http://www.montefiore.ulg.ac.be/~huynh-thu/GENIE3.html\n.. _`Random Forest`: https://en.wikipedia.org/wiki/Random_forest\n.. _ExtraTrees: https://en.wikipedia.org/wiki/Random_forest#ExtraTrees\n.. _`Stochastic Gradient Boosting Machine`: https://en.wikipedia.org/wiki/Gradient_boosting#Stochastic_gradient_boosting\n.. _`early-stopping`: https://en.wikipedia.org/wiki/Early_stopping\n\nInferring a gene regulatory network (GRN) from gene expression data is a computationally expensive task, exacerbated by increasing data sizes due to advances\nin high-throughput gene profiling technology.\n\nThe arboreto_ software library addresses this issue by providing a computational strategy that allows executing the class of GRN inference algorithms\nexemplified by GENIE3_ [1] on hardware ranging from a single computer to a multi-node compute cluster. This class of GRN inference algorithms is defined by\na series of steps, one for each target gene in the dataset, where the most important candidates from a set of regulators are determined from a regression\nmodel to predict a target gene's expression profile.\n\nMembers of the above class of GRN inference algorithms are attractive from a computational point of view because they are parallelizable by nature. In arboreto,\nwe specify the parallelizable computation as a dask_ graph [2], a data structure that represents the task schedule of a computation. A dask scheduler assigns the\ntasks in a dask graph to the available computational resources. Arboreto uses the `dask distributed`_ scheduler to\nspread out the computational tasks over multiple processes running on one or multiple machines.\n\nArboreto currently supports 2 GRN inference algorithms:\n\n1. **GRNBoost2**: a novel and fast GRN inference algorithm using `Stochastic Gradient Boosting Machine`_ (SGBM) [3] regression with `early-stopping`_ regularization.\n2. **GENIE3**: the classic GRN inference algorithm using `Random Forest`_ (RF) or ExtraTrees_ (ET) regression.\n\nReferences\n**********\n\n1. Huynh-Thu VA, Irrthum A, Wehenkel L, Geurts P (2010) Inferring Regulatory Networks from Expression Data Using Tree-Based Methods. PLoS ONE\n2. Rocklin, M. (2015). Dask: parallel computation with blocked algorithms and task scheduling. In Proceedings of the 14th Python in Science Conference (pp. 130-136).\n3. Friedman, J. H. (2002). Stochastic gradient boosting. Computational Statistics & Data Analysis, 38(4), 367-378.\n4. Marbach, D., Costello, J. C., Kuffner, R., Vega, N. M., Prill, R. J., Camacho, D. M., ... & Dream5 Consortium. (2012). Wisdom of crowds for robust gene network inference. Nature methods, 9(8), 796-804.\n\nGet Started\n***********\n\nArboreto was conceived with the working bioinformatician or data scientist in mind. We provide extensive documentation and examples to help you get up to speed with the library.\n\n* Read the `arboreto documentation`_.\n* Browse example notebooks_.\n* Report an issue_.\n\nLicense\n*******\n\nBSD 3-Clause License\n\n\n", "description_content_type": "", "docs_url": null, "download_url": "https://github.com/tmoerman/arboreto/archive/0.1.tar.gz", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/tmoerman/arboreto", "keywords": "gene,regulatory,network,inference,regression,ensemble,scalable,dask", "license": "BSD 3-Clause License", "maintainer": "", "maintainer_email": "", "name": "arboreto", "package_url": "https://pypi.org/project/arboreto/", "platform": "any", "project_url": "https://pypi.org/project/arboreto/", "project_urls": { "Download": "https://github.com/tmoerman/arboreto/archive/0.1.tar.gz", "Homepage": "https://github.com/tmoerman/arboreto" }, "release_url": "https://pypi.org/project/arboreto/0.1.5/", "requires_dist": [ "dask", "distributed", "numpy", "pandas", "scikit-learn", "scipy" ], "requires_python": "", "summary": "Scalable gene regulatory network inference using tree-based ensemble regressors", "version": "0.1.5" }, "last_serial": 3949795, "releases": { "0.1.3": [ { "comment_text": "", "digests": { "md5": "e354aed271b9d26ce2848681ceb30dbc", "sha256": "dfcd35ad665b2bae9eebb80c156d10b0338699b1c4a3832b43ae7f4b7b1b59c1" }, "downloads": -1, "filename": "arboreto-0.1.3-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "e354aed271b9d26ce2848681ceb30dbc", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 16310, "upload_time": "2018-06-01T11:27:00", "url": "https://files.pythonhosted.org/packages/99/a5/cd417498fd510da9760ad823a6c287b8bb4cfaa8d9819eba8f855d4180fe/arboreto-0.1.3-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "78b385ab233aea017e3acf57eb8c002e", "sha256": "4c15fcc4f7d6b8a43322299b7381930153329a0b7205d81d49aa48e4c3c6d19d" }, "downloads": -1, "filename": "arboreto-0.1.3.tar.gz", "has_sig": false, "md5_digest": "78b385ab233aea017e3acf57eb8c002e", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 13198, "upload_time": "2018-06-01T11:27:01", "url": "https://files.pythonhosted.org/packages/84/bc/dc06b7db84c1365e03b550f5aa728c4fa4b672843f6eefb78a5bf76c9c36/arboreto-0.1.3.tar.gz" } ], "0.1.4": [ { "comment_text": "", "digests": { "md5": "954fd5b2e76baf3c61a3875a490e71b8", "sha256": "581d69fe673fd4dedebb8674659ac24dfe2c1961bb1d670bd4324e72b420fcc5" }, "downloads": -1, "filename": "arboreto-0.1.4-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "954fd5b2e76baf3c61a3875a490e71b8", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 16411, "upload_time": "2018-06-05T13:49:34", "url": "https://files.pythonhosted.org/packages/e7/66/f8232d4746be9c6e1ce36a4c5e439fe06ec6ba61c6c6737f31b61ed777de/arboreto-0.1.4-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "88fb591759684d444bc675ca8878a0a7", "sha256": "740091d097f89385d803122ecdba2cd68d4d58344cc4678e27acc69a55beaffb" }, "downloads": -1, "filename": "arboreto-0.1.4.tar.gz", "has_sig": false, "md5_digest": "88fb591759684d444bc675ca8878a0a7", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 13256, "upload_time": "2018-06-05T13:49:35", "url": "https://files.pythonhosted.org/packages/38/09/92a3734fd3744145ce121ecaf875f261767738a7c0109952a4ba7e4839bc/arboreto-0.1.4.tar.gz" } ], "0.1.5": [ { "comment_text": "", "digests": { "md5": "0d050657793ce11dece6a17862f8ba90", "sha256": "6dd0ff4856b776595329db765ac494efe33cfac168da985af18387972965e070" }, "downloads": -1, "filename": "arboreto-0.1.5-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "0d050657793ce11dece6a17862f8ba90", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 16665, "upload_time": "2018-06-11T10:59:38", "url": "https://files.pythonhosted.org/packages/c5/05/e1ae69535379915c9dc2773c7190a33451aca2067872a72a5d1564a9f4ac/arboreto-0.1.5-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "e718f615d07c2e86f0c97a5e0c1c1165", "sha256": "a78c73b367eaae38dadeba150e52ac98aec6fc916df703ce61b79d27fe1015f9" }, "downloads": -1, "filename": "arboreto-0.1.5.tar.gz", "has_sig": false, "md5_digest": "e718f615d07c2e86f0c97a5e0c1c1165", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 13486, "upload_time": "2018-06-11T10:59:39", "url": "https://files.pythonhosted.org/packages/28/60/0f671aec3a3dbf111df3ccf62f5e24c213bb2f574c7f9a26c10a7e928b71/arboreto-0.1.5.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "0d050657793ce11dece6a17862f8ba90", "sha256": "6dd0ff4856b776595329db765ac494efe33cfac168da985af18387972965e070" }, "downloads": -1, "filename": "arboreto-0.1.5-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "0d050657793ce11dece6a17862f8ba90", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 16665, "upload_time": "2018-06-11T10:59:38", "url": "https://files.pythonhosted.org/packages/c5/05/e1ae69535379915c9dc2773c7190a33451aca2067872a72a5d1564a9f4ac/arboreto-0.1.5-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "e718f615d07c2e86f0c97a5e0c1c1165", "sha256": "a78c73b367eaae38dadeba150e52ac98aec6fc916df703ce61b79d27fe1015f9" }, "downloads": -1, "filename": "arboreto-0.1.5.tar.gz", "has_sig": false, "md5_digest": "e718f615d07c2e86f0c97a5e0c1c1165", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 13486, "upload_time": "2018-06-11T10:59:39", "url": "https://files.pythonhosted.org/packages/28/60/0f671aec3a3dbf111df3ccf62f5e24c213bb2f574c7f9a26c10a7e928b71/arboreto-0.1.5.tar.gz" } ] }