{ "info": { "author": "Mike Dacre", "author_email": "mike.dacre@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 4 - Beta", "Environment :: Console", "Intended Audience :: End Users/Desktop", "Intended Audience :: System Administrators", "License :: OSI Approved :: MIT License", "Natural Language :: English", "Operating System :: MacOS :: MacOS X", "Operating System :: POSIX", "Programming Language :: Python", "Programming Language :: Python :: 2", "Programming Language :: Python :: 3", "Topic :: Utilities" ], "description": "###############\nEnrich p-values\n###############\n\nA simple script to compare p-values between a test and comparison dataset at a\nvariety of p-value cutoffs. By plotting the enrichment score at a variety of\ncutoffs, it is possible to pick the optimal cutoff for your data.\n\nVersion: 1.0-beta2\n\n.. image:: https://github.com/TheFraserLab/enrich_pvalues/raw/master/enrich_score.gif\n\n.. contents:: **Contents**\n\nAlgorithm\n=========\n\nFor each p-value in the interval between ``max_pval`` (default: 0.05) and\n``min_pval`` (default: 1e-15), we test at intervals of 1 and 5 for each order of\nmagnitute, e.g. 0.05, 0.01, 0.005, 0.001, 5e-4, 1e-4, 5e-5, 1e-6, ... 1e-15.\n\nTo test, we simply take all identities with a p-value less than the cutoff and\ncompare them to all identities in the comparison set with p-values below the\n``comp_set_pvalue``. We simply ask what percentage or the test set are in the\ncomparison set. We then do exactly the same with the entire set of identities in\nthe comparison set that have a p-value greater than 0.98.\n\nThe identities are generally going to be gene or SNP names, but they can be\nanything (e.g. coordinates) as long as they overlap in the test and comparison\ndata.\n\nInstallation\n============\n\nInstall via PyPI:\n\n.. code:: shell\n\n pip install enrich_pvalues\n\nOr install from github:\n\n.. code:: shell\n\n pip install https://github.com/TheFraserLab/enrich_pvalues/tarball/master\n\nIt should work with python 2 or 3, but python 3 is recommended.\n\nRequirements\n------------\n\nIn ``requirements.txt``, we use numpy, pandas, matplotlib, seaborn, tabulate,\nand tqdm.\n\nUsage\n=====\n\nFirst, dump a configuration file to describe your data:\n\n.. code:: shell\n\n enrich_pvalues dump-config enrich_atac.json\n\nThis will also print a help table describing each option. You need to describe\nyour comparison data and your test data, and pick your p-value thresholds.\n\nNext, split your comparison dataset into two tables: significant, and\nnot-significant:\n\n.. code:: shell\n\n enrich_pvalues split -c enrich_atac.json --prefix atac /path/to/comp_data.txt.gz\n\nNow, run the enrichment using those two tables and your test data:\n\n.. code:: shell\n\n enrich_pvalues run -c enrich_atac.json -o atac_scores.xls -p atac /path/to/test_data.txt\n\nNote, the second to last argument is the prefix from the second step.\n\nFinally, plot the data. This can also be done by passing e.g. ``--plot myplot.png``\nto the run step.\n\n.. code:: shell\n\n enrich_pvalues plot --prefix caQTL atac_scores.xls atac_plot.pdf\n\nNote: the scores can be excel format, pickled format, or text format, depending\non the suffix. Also, the prefix in this plot step is different, it is used to\ntitle the plot only, and so can be whatever you want.\n", "description_content_type": "", "docs_url": null, "download_url": "https://github.com/MikeDacre/enrich_pvalues/archive/v1.0b2.tar.gz", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/MikeDacre/enrich_pvalues", "keywords": "statistics p-values biology molecular-biology console", "license": "MIT", "maintainer": "", "maintainer_email": "", "name": "enrich_pvalues", "package_url": "https://pypi.org/project/enrich_pvalues/", "platform": "", "project_url": "https://pypi.org/project/enrich_pvalues/", "project_urls": { "Download": "https://github.com/MikeDacre/enrich_pvalues/archive/v1.0b2.tar.gz", "Homepage": "https://github.com/MikeDacre/enrich_pvalues" }, "release_url": "https://pypi.org/project/enrich_pvalues/1.0b2/", "requires_dist": null, "requires_python": "", "summary": "Compare one dataset to another at a variety of p-value cutoffs", "version": "1.0b2" }, "last_serial": 3915251, "releases": { "1.0b2": [ { "comment_text": "", "digests": { "md5": "76895d80ed73ea5988131ead86775182", "sha256": "8c475ea08ccce7299e6fa152784895d3fd2e781d53bd17275102ce9ce2fafeef" }, "downloads": -1, "filename": "enrich_pvalues-1.0b2.tar.gz", "has_sig": false, "md5_digest": "76895d80ed73ea5988131ead86775182", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 10287, "upload_time": "2018-05-31T00:29:15", "url": "https://files.pythonhosted.org/packages/fa/a3/6c6c9eaeb6cdee70e4ef5ab60e12343ba042d10647744dddbf9dce7918fb/enrich_pvalues-1.0b2.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "76895d80ed73ea5988131ead86775182", "sha256": "8c475ea08ccce7299e6fa152784895d3fd2e781d53bd17275102ce9ce2fafeef" }, "downloads": -1, "filename": "enrich_pvalues-1.0b2.tar.gz", "has_sig": false, "md5_digest": "76895d80ed73ea5988131ead86775182", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 10287, "upload_time": "2018-05-31T00:29:15", "url": "https://files.pythonhosted.org/packages/fa/a3/6c6c9eaeb6cdee70e4ef5ab60e12343ba042d10647744dddbf9dce7918fb/enrich_pvalues-1.0b2.tar.gz" } ] }