{ "info": { "author": "Thomas Moerman", "author_email": "thomas.moerman@gmail.com", "bugtrack_url": null, "classifiers": [], "description": ".. image:: img/arboretum.png\n :alt: arboretum\n :scale: 100%\n :align: left\n\n.. image:: https://travis-ci.org/tmoerman/arboretum.svg?branch=master\n :alt: Build Status\n :target: https://travis-ci.org/tmoerman/arboretum\n\n.. image:: https://readthedocs.org/projects/arboretum/badge/?version=latest\n :alt: Documentation Status\n :target: http://arboretum.readthedocs.io/en/latest/?badge=latest\n\n.. image:: https://img.shields.io/badge/pypi-0.1.3-blue.svg\n :alt: PyPI package\n :target: https://pypi.python.org/pypi?:action=display&name=arboretum&version=0.1.3\n\n----\n\n.. epigraph::\n\n *The most satisfactory definition of man from the scientific point of view is probably Man the Tool-maker.*\n\n.. _arboretum: https://arboretum.readthedocs.io\n.. _`arboretum documentation`: https://arboretum.readthedocs.io\n.. _notebooks: https://github.com/tmoerman/arboretum/tree/master/notebooks\n.. _issue: https://github.com/tmoerman/arboretum/issues/new\n\n.. _dask: https://dask.pydata.org/en/latest/\n.. _`dask distributed`: https://distributed.readthedocs.io/en/latest/\n\n.. _GENIE3: http://www.montefiore.ulg.ac.be/~huynh-thu/GENIE3.html\n.. _`Random Forest`: https://en.wikipedia.org/wiki/Random_forest\n.. _ExtraTrees: https://en.wikipedia.org/wiki/Random_forest#ExtraTrees\n.. _`Stochastic Gradient Boosting Machine`: https://en.wikipedia.org/wiki/Gradient_boosting#Stochastic_gradient_boosting\n.. _`early-stopping`: https://en.wikipedia.org/wiki/Early_stopping\n\nInferring a gene regulatory network (GRN) from gene expression data is a computationally expensive task, exacerbated by increasing data sizes due to advances\nin high-throughput gene profiling technology.\n\nThe arboretum_ software library addresses this issue by providing a computational strategy that allows executing the class of GRN inference algorithms\nexemplified by GENIE3_ [1] on hardware ranging from a single computer to a multi-node compute cluster. This class of GRN inference algorithms is defined by\na series of steps, one for each target gene in the network, where the most important candidates from a set of regulators are determined from a regression\nmodel to predict a target gene's expression profile.\n\nMembers of the above class of GRN inference algorithms are attractive from a computational point of view because they are parallelizable by nature. In arboretum,\nwe specify the parallelizable computation as a dask_ graph [2], a data structure that represents the task schedule of a computation. A dask scheduler assigns the\ntasks in a dask graph to the available computational resources. Arboretum uses the `dask distributed`_ scheduler to\nspread out the computational tasks over multiple processes running on one or multiple machines.\n\nArboretum currently supports 2 GRN inference algorithms:\n\n1. **GRNBoost2**: a novel and fast GRN inference algorithm using `Stochastic Gradient Boosting Machine`_ (SGBM) [3] regression with `early-stopping`_ regularization.\n2. **GENIE3**: the classic GRN inference algorithm using `Random Forest`_ (RF) or ExtraTrees_ (ET) regression.\n\nReferences\n**********\n\n1. Huynh-Thu VA, Irrthum A, Wehenkel L, Geurts P (2010) Inferring Regulatory Networks from Expression Data Using Tree-Based Methods. PLoS ONE\n2. Rocklin, M. (2015). Dask: parallel computation with blocked algorithms and task scheduling. In Proceedings of the 14th Python in Science Conference (pp. 130-136).\n3. Friedman, J. H. (2002). Stochastic gradient boosting. Computational Statistics & Data Analysis, 38(4), 367-378.\n4. Marbach, D., Costello, J. C., Kuffner, R., Vega, N. M., Prill, R. J., Camacho, D. M., ... & Dream5 Consortium. (2012). Wisdom of crowds for robust gene network inference. Nature methods, 9(8), 796-804.\n\nGet Started\n***********\n\nArboretum was conceived with the working bioinformatician or data scientist in mind. We provide extensive documentation and examples to help you get up to speed with the library.\n\n* Read the `arboretum documentation`_.\n* Browse example notebooks_.\n* Report an issue_.\n\nLicense\n*******\n\nBSD 3-Clause License\n\n\n", "description_content_type": null, "docs_url": null, "download_url": "https://github.com/tmoerman/arboretum/archive/0.1.tar.gz", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/tmoerman/arboretum", "keywords": "gene,regulatory,network,inference,regression,ensemble,scalable,dask", "license": "BSD 3-Clause License", "maintainer": "", "maintainer_email": "", "name": "arboretum", "package_url": "https://pypi.org/project/arboretum/", "platform": "any", "project_url": "https://pypi.org/project/arboretum/", "project_urls": { "Download": "https://github.com/tmoerman/arboretum/archive/0.1.tar.gz", "Homepage": "https://github.com/tmoerman/arboretum" }, "release_url": "https://pypi.org/project/arboretum/0.1.3/", "requires_dist": [ "dask", "distributed", "numpy", "pandas", "scikit-learn", "scipy" ], "requires_python": "", "summary": "Scalable gene regulatory network inference using tree-based ensemble regressors", "version": "0.1.3" }, "last_serial": 3376838, "releases": { "0.1.1": [ { "comment_text": "", "digests": { "md5": "9c34b3712a797dbe1262b4da1f55030a", "sha256": "1ddeebd838df42c839995cea2a2bff64fbc2d7c756b579a292fbab54d4326e02" }, "downloads": -1, "filename": "arboretum-0.1.1-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "9c34b3712a797dbe1262b4da1f55030a", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 16231, "upload_time": "2017-11-22T17:25:17", "url": "https://files.pythonhosted.org/packages/9c/bc/7d9058cdcc7b1b1da752b78190dba39d12cf27e2c405e8283b888fd427f8/arboretum-0.1.1-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "720b07a2f7ef4ddc8f2e0cb278f47c50", "sha256": "c7abe2c2d5f5bb65721b448fdd99ff6536fa0bcc9f627d2676b34d7e5b4e1cbf" }, "downloads": -1, "filename": "arboretum-0.1.1.tar.gz", "has_sig": false, "md5_digest": "720b07a2f7ef4ddc8f2e0cb278f47c50", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 11677, "upload_time": "2017-11-22T17:25:19", "url": "https://files.pythonhosted.org/packages/b6/db/06d216d13b6d07150e9400c632845a125648fbc3f6e32772ab514c8a1472/arboretum-0.1.1.tar.gz" } ], "0.1.2": [ { "comment_text": "", "digests": { "md5": "c10d39deb04d99a9b4880e809aa0a2ea", "sha256": "d5b1e5c830ad09ba3336cc219337f0e4af2cd88e8bbe36c22a5dd4f5a75581b0" }, "downloads": -1, "filename": "arboretum-0.1.2-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "c10d39deb04d99a9b4880e809aa0a2ea", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 16034, "upload_time": "2017-11-27T19:32:07", "url": "https://files.pythonhosted.org/packages/d1/d3/74d76233940a237528d60c5fcfb5623205c35fa508cbaf34fa676445ef1f/arboretum-0.1.2-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "56904bf2d552948fcc38cc690fae166b", "sha256": "75c280ae48591f817e082d0eb23ff95888f10daa6713e46d6e2af6cc63b04224" }, "downloads": -1, "filename": "arboretum-0.1.2.tar.gz", "has_sig": false, "md5_digest": "56904bf2d552948fcc38cc690fae166b", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 12976, "upload_time": "2017-11-27T19:32:08", "url": "https://files.pythonhosted.org/packages/41/a4/562cd669cb8836ff059117556fbd1627166f1ee7e8d8529e12bcc02f7d7e/arboretum-0.1.2.tar.gz" } ], "0.1.3": [ { "comment_text": "", "digests": { "md5": "1c6975ac8f383d0ea539dec328fabb59", "sha256": "c2044e3e69641bd283772b95af5900e6a7cec829d89b2f94d7381aecb04dfad9" }, "downloads": -1, "filename": "arboretum-0.1.3-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "1c6975ac8f383d0ea539dec328fabb59", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 16333, "upload_time": "2017-11-30T09:41:00", "url": "https://files.pythonhosted.org/packages/a6/b3/2dc3bf980d96382d39c4d0cb88c75364979125d265202f99b9e0bb19a88a/arboretum-0.1.3-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "39dbbf09e3db4505a7c8e0aa200b9b73", "sha256": "26dd652dfdece868f780cfe5a31c3a226e60b3ef760170e53b225e6f81359789" }, "downloads": -1, "filename": "arboretum-0.1.3.tar.gz", "has_sig": false, "md5_digest": "39dbbf09e3db4505a7c8e0aa200b9b73", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 11852, "upload_time": "2017-11-30T09:41:02", "url": "https://files.pythonhosted.org/packages/4b/f5/afc8528c857dc5acb9ec9d2a226b947f6f198981927c58f82020fb91a2ed/arboretum-0.1.3.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "1c6975ac8f383d0ea539dec328fabb59", "sha256": "c2044e3e69641bd283772b95af5900e6a7cec829d89b2f94d7381aecb04dfad9" }, "downloads": -1, "filename": "arboretum-0.1.3-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "1c6975ac8f383d0ea539dec328fabb59", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 16333, "upload_time": "2017-11-30T09:41:00", "url": "https://files.pythonhosted.org/packages/a6/b3/2dc3bf980d96382d39c4d0cb88c75364979125d265202f99b9e0bb19a88a/arboretum-0.1.3-py2.py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "39dbbf09e3db4505a7c8e0aa200b9b73", "sha256": "26dd652dfdece868f780cfe5a31c3a226e60b3ef760170e53b225e6f81359789" }, "downloads": -1, "filename": "arboretum-0.1.3.tar.gz", "has_sig": false, "md5_digest": "39dbbf09e3db4505a7c8e0aa200b9b73", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 11852, "upload_time": "2017-11-30T09:41:02", "url": "https://files.pythonhosted.org/packages/4b/f5/afc8528c857dc5acb9ec9d2a226b947f6f198981927c58f82020fb91a2ed/arboretum-0.1.3.tar.gz" } ] }