{ "info": { "author": "Sayed Hadi Hashemi", "author_email": "SayedHadiHashemi@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 4 - Beta", "License :: OSI Approved :: Apache Software License", "Operating System :: MacOS :: MacOS X", "Operating System :: POSIX", "Programming Language :: Python :: 3 :: Only", "Topic :: Scientific/Engineering :: Bio-Informatics" ], "description": "# GenoML-core\nGenoML is an Automated Machine Learning (AutoML) for Genomic. This is the core package of GenoML. \nPlease note this repo is under development for \"functional testing\". This is not the end product, just for preliminary evaluation of software logic. \nPackage website and on-going documentation: https://genoml.github.io\n\n## Goals \n\nPlease test the code in different environments and on different datasets. The goal is to resolve the following issues:\n\n - **Dependencies:** did you need to install any package not already [listed](https://github.com/GenoML/genoml-core/blob/master/otherPackages/readMe_otherPackages.txt)? do you get dependency errors? \n - **Errors:** do you get any error? is the error not clear? \n - **Wrong output:** any discrepancies in the expected output? \n - **Corner cases:** did the code break on a particular case you were testing? \n - **Usability:** could we improve the way user interacts with the code? is there any particular feature or document you fit helpful? \n\n## Install\nFor now just download or clone the genoml-core repo and run the genoml with `python genoml.py`.\n\n## Step-by-step examples \nPlease refer to the following quick examples for running GonML (for full `usage`, please refer to [Usage](#usage)):\n\n### Step 1 - genoml data-prune:\nTo perform `data-prune` only on `genotype` and `phenotype` data:\n\n python genoml.py data-prune --geno-prefix=./exampleData/training --pheno-file=./exampleData/training.pheno\n\nTo perform `data-prune` on `genotype`, `phenotype` , `GWAS`, and `covariance` data:\n\n python genoml.py data-prune --geno-prefix=./exampleData/training --pheno-file=./exampleData/training.pheno --gwas-file=./exampleData/example_GWAS.txt \n\nTo perform `data-prune` on `genotype`, `phenotype` , `GWAS`, `covariance`, and `additional` data:\n\n python genoml.py data-prune --geno-prefix=./exampleData/training --pheno-file=./exampleData/training.pheno --cov-file=./exampleData/training.cov --gwas-file=./exampleData/example_GWAS.txt --addit-file=./exampleData/training.addit \n\nTo perform `data-prune` on `genotype`, `phenotype` , `GWAS`, and `additional` data, as well as `Heritability estimate`:\n\n python genoml.py data-prune --geno-prefix=./exampleData/training --pheno-file=./exampleData/training.pheno --gwas-file=./exampleData/example_GWAS.txt --addit-file=./exampleData/training.addit --herit=0.2 \n\nTo perform `data-prune` on `genotype`, `phenotype` , `GWAS`, `covariance`, and `additional` data, as well as `Heritability estimate`:\n\n python genoml.py data-prune --geno-prefix=./exampleData/training --pheno-file=./exampleData/example.pheno --cov-file=./exampleData/training.cov --gwas-file=./exampleData/example_GWAS.txt --addit-file=./exampleData/training.addit --herit=0.5 \n\n### Step 2 - genoml model-train:\nTo perform `model-train` on the output of `data-prune` with the prefix given to you from the prune step `prune-prefix=./tmp/20181225-230052`:\n\n python genoml.py model-train --prune-prefix=./tmp/20181225-230052 --pheno-file=./exampleData/training.pheno \n\n### Step 3 - genoml model-tune:\nTo perform `model-tune` after `model-train` on the output of `data-prune` with the prefix given to you from the prune step `prune-prefix=./tmp/20181225-230052`:\n\n python genoml.py model-tune --prune-prefix=./tmp/20181225-230052 --pheno-file=./exampleData/training.pheno\n\n### Step 4 - genoml model-validation:\nTo perform external `model-validate` only when `genotype` and `phenotype` data present:\n\n python genoml.py model-validate --prune-prefix=./tmp/20181225-230052 --geno-prefix=./exampleData/training --pheno-file=./exampleData/training.pheno --valid-geno-prefix=./exampleData/validation --valid-pheno-file=./exampleData/validation.pheno\n\nTo perform external `model-validate` when `genotype`, `phenotype`, and `GWAS` data present:\n\n python genoml.py model-validate --prune-prefix=./tmp/20181225-230052 --geno-prefix=./exampleData/training --pheno-file=./exampleData/training.pheno --valid-geno-prefix=./exampleData/validation --valid-pheno-file=./exampleData/validation.pheno --gwas-file=./exampleData/example_GWAS.txt\n\nTo perform external `model-validate` when `genotype`, `phenotype`, `GWAS`, and `additional` data present:\n\n python genoml.py model-validate --prune-prefix=./tmp/20181225-230052 --geno-prefix=./exampleData/training --pheno-file=./exampleData/training.pheno --valid-geno-prefix=./exampleData/validation --valid-pheno-file=./exampleData/validation.pheno --gwas-file=./exampleData/example_GWAS.txt --valid-addit-file=./exampleData/validation.addit\n\nTo perform external `model-validate` when `genotype`, `phenotype`, `GWAS`, `additional`, and `covariance` data present:\n\n python genoml.py model-validate --prune-prefix=./tmp/20181225-230052 --geno-prefix=./exampleData/training --pheno-file=./exampleData/training.pheno --valid-geno-prefix=./exampleData/validation --valid-pheno-file=./exampleData/validation.pheno --gwas-file=./exampleData/example_GWAS.txt --valid-addit-file=./exampleData/validation.addit --valid-cov-file=./exampleData/validation.cov\n\n\n## Usage \nFull GenoML usage:\n\n Usage:\n genoml data-prune (--geno-prefix=geno_prefix) (--pheno-file=) [--gwas-file=] [--cov-file=] [--herit=] [--addit-file=] [--temp-dir=]\n genoml model-train (--prune-prefix=prune_prefix) (--pheno-file=) [--n-cores=] [--train-speed=] [--cv-reps=] [--grid-search=] [--impute-data=]\n genoml model-tune (--prune-prefix=prune_prefix) (--pheno-file=) [--cv-reps=] [--grid-search=] [--impute-data=] [--best-model-name=]\n genoml model-validate (--prune-prefix=prune_prefix) (--pheno-file=) (--geno-prefix=geno_prefix) (--valid-geno-prefix=valid_geno_prefix) (--valid-pheno-file=) [--valid-cov-file=] [--gwas-file=] [--valid-addit-file=] [--n-cores=] [--impute-data=] [--best-model-name=]\n genoml -h | --help\n genoml --version\n\n Options:\n --geno-prefix=geno_prefix Prefix with path to genotype files in PLINK format, *.bed, *.bim and *.fam.\n --pheno-file= Path to the phenotype file in PLINK format, *.pheno.\n --gwas-file= Path to the GWAS file, if available.\n --cov-file= Path to the covariance file, if available.\n --herit= Heritability estimate of phenotype between 0 and 1, if available.\n --addit-file= Path to the additional file, if avialable.\n --temp-dir= Directory for temporary files [default: ./tmp/].\n --n-cores= Number of cores to be allocated for computation [default: 1].\n --prune-prefix=prune_prefix Prefix given to you at the end of pruning stage.\n --train-speed= Training speed: (ALL, FAST, FURIOUS, BOOSTED). Run all models, only the fastest models, run slightly slower models, or just run boosted models which usually perform best when using genotype data [default: BOOSTED].\n --cv-reps= Number of cross-validation. An integer greater than 5. Effects the speed [default: 5].\n --impute-data= Imputation: (knn, median). Governs secondary imputation and data transformation [default: median].\n --grid-search= Grid search length for parameters, integer greater than 10, 30 or greater recommended, effects speed of initial tune [default: 10].\n --best-model-name= Name for the best model [default: best_model].\n --valid-geno-prefix=valid_geno_prefix Prefix with path to the validation genotype files in PLINK format, *.bed, *.bim and *.fam.\n --valid-pheno-file= Path to the validation phenotype file in PLINK format, *.pheno.\n --valid-cov-file= Path to the validation covariance file, if available.\n --valid-addit-file= Path to the the validation additional file, if avialable.\n -h --help Show this screen.\n --version Show version.\n\n Examples:\n genoml data-prune --geno-prefix=./exampleData/example --pheno-file=./exampleData/training.pheno\n genoml data-prune --geno-prefix=./exampleData/example --pheno-file=./exampleData/training.pheno --gwas-file=./exampleData/example_GWAS.txt\n genoml data-prune --geno-prefix=./exampleData/example --pheno-file=./exampleData/training.pheno --cov-file=./exampleData/training.cov --gwas-file=./exampleData/example_GWAS.txt --addit-file=./exampleData/training.addit\n genoml data-prune --geno-prefix=./exampleData/example --pheno-file=./exampleData/training.pheno --gwas-file=./exampleData/example_GWAS.txt --addit-file=./exampleData/training.addit --herit=0.2\n genoml data-prune --geno-prefix=./exampleData/example --pheno-file=./exampleData/training.pheno --cov-file=./exampleData/training.cov --gwas-file=./exampleData/example_GWAS.txt --addit-file=./exampleData/training.addit --herit=0.5\n genoml model-train --prune-prefix=./tmp/20181225-230052 --pheno-file=./exampleData/training.pheno\n genoml model-tune --prune-prefix=./tmp/20181225-230052 --pheno-file=./exampleData/training.pheno\n genoml model-validate --prune-prefix=./tmp/20181225-230052 --pheno-file=./exampleData/training.pheno --valid-geno-prefix=./exampleData/validation --valid-pheno-file=./exampleData/validation.pheno\n\n Help:\n For help using this tool, please open an issue on the Github repository:\n https://github.com/GenoML/genoml-core/issues\n\n\n\n\n## Report issues \nPlease report any issue or suggestions on the [GenoML-core issues page](https://github.com/GenoML/genoml-core/issues).\n\n\n", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/genoml/genoml-core", "keywords": "", "license": "Apache-2.0", "maintainer": "Faraz Faghri", "maintainer_email": "faraz.faghri@gmail.com", "name": "genoml", "package_url": "https://pypi.org/project/genoml/", "platform": "", "project_url": "https://pypi.org/project/genoml/", "project_urls": { "Homepage": "https://github.com/genoml/genoml-core" }, "release_url": "https://pypi.org/project/genoml/1.0.3/", "requires_dist": [ "rpy2", "docopt", "pandas-plink", "numpy", "requests", "tqdm" ], "requires_python": "", "summary": "Machine Learning for Genomic", "version": "1.0.3" }, "last_serial": 4977266, "releases": { "1.0.0": [ { "comment_text": "", "digests": { "md5": "25a0cbe8fb915236a9e8a85ad87080be", "sha256": "5994a92d9a4d1eff65a7e73776fc7e68fdb7b21a4bd689cdeb12c1816be20a1b" }, "downloads": -1, "filename": "genoml-1.0.0-py3-none-any.whl", "has_sig": false, "md5_digest": "25a0cbe8fb915236a9e8a85ad87080be", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 103890, "upload_time": "2019-02-26T10:38:45", "url": "https://files.pythonhosted.org/packages/bd/2c/ec6cb2bfb466ff17573c0a6bdd615f09f8d9248315e062a221967a394e39/genoml-1.0.0-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "c21243c491d3516da88c3981cf47d0c9", "sha256": "f1e568a3c47657a5cf3f08ad475a0fdb7077516f849c8fb9b4206b7ab6d1f39e" }, "downloads": -1, "filename": "genoml-1.0.0.tar.gz", "has_sig": false, "md5_digest": "c21243c491d3516da88c3981cf47d0c9", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 78767, "upload_time": "2019-02-26T10:38:55", "url": "https://files.pythonhosted.org/packages/0c/f0/b737653f258e83c3516fad35b1cc157a1b982fc0db477ca6e91bbdee3621/genoml-1.0.0.tar.gz" } ], "1.0.1": [ { "comment_text": "", "digests": { "md5": "10d2d3b812efd15829743cc307d5becc", "sha256": "5d5538e4ddcfe2b6bb49cf69cc8bf4612d031a4da4cbe017acaf4cc7338a8d72" }, "downloads": -1, "filename": "genoml-1.0.1-py3-none-any.whl", "has_sig": false, "md5_digest": "10d2d3b812efd15829743cc307d5becc", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 103882, "upload_time": "2019-02-26T10:55:07", "url": "https://files.pythonhosted.org/packages/43/43/e6deb7f34e893b4ff685cfd687d12970d8c8dd213bde86a2d093433bcdce/genoml-1.0.1-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "aa41616534cf58a669c4c4dcc0e8ea70", "sha256": "70b881a0cc8d82deb4a5a45e90a6d64cb1f24250f91dd7ea42cd8b66bbc22c9f" }, "downloads": -1, "filename": "genoml-1.0.1.tar.gz", "has_sig": false, "md5_digest": "aa41616534cf58a669c4c4dcc0e8ea70", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 78831, "upload_time": "2019-02-26T10:55:09", "url": "https://files.pythonhosted.org/packages/1c/80/f1f5ae3db0306628523ffd104a11ac7427e3bd3b7af99d6eb88206fe3ab7/genoml-1.0.1.tar.gz" } ], "1.0.2": [ { "comment_text": "", "digests": { "md5": "60485941c6fd74940e7857b84c2b9536", "sha256": "fbf03feebf8d779a085d0df894968171e5aafdf419563bf0deaa36173afdb234" }, "downloads": -1, "filename": "genoml-1.0.2-py3-none-any.whl", "has_sig": false, "md5_digest": "60485941c6fd74940e7857b84c2b9536", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 106436, "upload_time": "2019-03-06T17:33:35", "url": "https://files.pythonhosted.org/packages/5c/af/6ec99a60a91476eace9a9002df3a875d41218c31eab8d5a4915ae6dd2388/genoml-1.0.2-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "d4e9ded77a73c8c49721bbc595314b0d", "sha256": "429428b88dcef4d653af1e48bc1ffe455521379050e87b386988caa8b73d3314" }, "downloads": -1, "filename": "genoml-1.0.2.tar.gz", "has_sig": false, "md5_digest": "d4e9ded77a73c8c49721bbc595314b0d", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 81730, "upload_time": "2019-03-06T17:33:37", "url": "https://files.pythonhosted.org/packages/2d/98/3021cce7879b1d0c90f388b5a7c3936371bb2f98888898342a712c0b2287/genoml-1.0.2.tar.gz" } ], "1.0.3": [ { "comment_text": "", "digests": { "md5": "3966d9d19d14545b3c97de96b0ac0fb2", "sha256": "7b3e62bbb4d8704f7aca9874b1b650c6c4ec7203d9190fd821f096c507c76c8d" }, "downloads": -1, "filename": "genoml-1.0.3-py3-none-any.whl", "has_sig": false, "md5_digest": "3966d9d19d14545b3c97de96b0ac0fb2", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 107034, "upload_time": "2019-03-24T04:12:01", "url": "https://files.pythonhosted.org/packages/0b/54/b7a54c47ad34e80feff7fb9efae100d157ae49523984bcbf717159c26f32/genoml-1.0.3-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "d50443fe43460aec2b505eee3af50ddb", "sha256": "8a3231750168e5077e4c34245598232bcbd9f6c4e5f92220a70370570fcb77a2" }, "downloads": -1, "filename": "genoml-1.0.3.tar.gz", "has_sig": false, "md5_digest": "d50443fe43460aec2b505eee3af50ddb", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 82242, "upload_time": "2019-03-24T04:12:06", "url": "https://files.pythonhosted.org/packages/b9/87/af95e596baadc338fc0f7a8b1b6a01367ac31cfdfe43291bb54172191178/genoml-1.0.3.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "3966d9d19d14545b3c97de96b0ac0fb2", "sha256": "7b3e62bbb4d8704f7aca9874b1b650c6c4ec7203d9190fd821f096c507c76c8d" }, "downloads": -1, "filename": "genoml-1.0.3-py3-none-any.whl", "has_sig": false, "md5_digest": "3966d9d19d14545b3c97de96b0ac0fb2", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 107034, "upload_time": "2019-03-24T04:12:01", "url": "https://files.pythonhosted.org/packages/0b/54/b7a54c47ad34e80feff7fb9efae100d157ae49523984bcbf717159c26f32/genoml-1.0.3-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "d50443fe43460aec2b505eee3af50ddb", "sha256": "8a3231750168e5077e4c34245598232bcbd9f6c4e5f92220a70370570fcb77a2" }, "downloads": -1, "filename": "genoml-1.0.3.tar.gz", "has_sig": false, "md5_digest": "d50443fe43460aec2b505eee3af50ddb", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 82242, "upload_time": "2019-03-24T04:12:06", "url": "https://files.pythonhosted.org/packages/b9/87/af95e596baadc338fc0f7a8b1b6a01367ac31cfdfe43291bb54172191178/genoml-1.0.3.tar.gz" } ] }