{ "info": { "author": "Pranav Khade", "author_email": "pranavk@iastate.edu", "bugtrack_url": null, "classifiers": [ "Development Status :: 4 - Beta", "Intended Audience :: Developers", "License :: OSI Approved :: GNU General Public License v3 (GPLv3)", "Programming Language :: Python :: 2.7", "Topic :: Software Development :: Build Tools" ], "description": "# GOFindBias: Analysis tool for finding bias in the GAF files.\nGOFindBias is developed to provide the user with some insightful statistics about the [GAF](http://www.geneontology.org/page/go-annotation-file-formats) file to determine if the conclusions on the gene ontology studies can be biased because of abstract terms or the high throughput experiments([1](http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1003063))..\n\n\nStatistics Provided by the tool are as follows:\n1. Shannon's equitability.\n2. Top 'n' PubMed and GO terms.\n3. KS test to compare two different [GAF](http://www.geneontology.org/page/go-annotation-file-formats) files.\n4. Mutual terms between two GAF files which are among 'n' most frequent terms and their respective frequencies.\n\n\n### Prerequisites:\n#### Required modules. \n\nModules are available in most GNU/Linux distributions, or from their respective websites.\n\n* [Matplotib](https://matplotlib.org/)\n\n* [Biopython](http://biopython.org/)\n\n### Installation\n\nInstalling from source\n```\ngit clone https://github.com/Pranavkhade/GOFindBias\ncd GOFindBias\npython setup.py install\n```\n\nInstalling with pip\n```\npip install GOFindBias\n```\nOR\n```\npip install git+git://github.com/Pranavkhade/GOFindBias\n```\n\n### Files and instructions\n\n1. Collect the GAF file you wish to analyse. For reference GAF files you can visit ftp://ftp.ebi.ac.uk/pub/databases/GO/goa/\n2. You can also use the GAF files obtained as an output from the [debias](https://github.com/Rinoahu/debias) program.\n3. Instructions and help for the parameter is as follows:\n\n```\nusage: GOFindBias.py [-h]\n (-i GAF_FILE [GAF_FILE ...] | -cmpr FILENAME FILENAME)\n [-ls 1 OR O] [-ts TOP]\n [-e EVIDENCE_CODE [EVIDENCE_CODE ...]]\n\nGOFindBias is an analytical tool built to analyse the .gaf files. Please\nvisit: https://github.com/Pranavkhade/GOFindBias for more details.\n\noptional arguments:\n -h, --help show this help message and exit\n -i GAF_FILE [GAF_FILE ...], --input GAF_FILE [GAF_FILE ...]\n Names of the input GAF file(s).\n -cmpr FILENAME FILENAME, --compare FILENAME FILENAME\n Names of the two GAF files\n -ls 1 OR O, --logscale 1 OR O\n For graphs, 0: Counts in normal scale 1: Counts in log\n scale [default=0]\n -ts TOP, --topstat TOP\n Top n statistics sorted from highest to lowest\n [default=10]\n -e EVIDENCE_CODE [EVIDENCE_CODE ...], --evidence EVIDENCE_CODE [EVIDENCE_CODE ...]\n Accepts Standard Evidence Codes outlined in\n (http://geneontology.org/page/guide-go-evidence-\n codes). All 3 letter code for each standard evidence\n is acceptable. In addition to that EXPEC is accepted\n which will pull out all annotations which are made\n experimentally. COMPEC will extract all annotations\n which have been done computationally. Similarly,\n AUTHEC and CUREC are also accepted.\n```\n### Examples\n\n1. `GOFindBias -i test/2014.gaf -ls 1 -ts 10`\nThis command will parse 2014.gaf file for the analysis and all the GO Term counts will be represented on the Natural Log scale for better comparitive visualisation of the data. The last argument is the number of top 'n' entries with highest count in the GAF file. The output of graphs will be posted in the `/graph_output` folder with names corrosponding to GO/PubMed ID count and the ontology level (F/C/P). File named `Shannon's_Statistics.txt` will have the information about the diversity of a given .gaf file.\n\n2. `GOFindBias -cmpr test/2014.gaf test/2015.gaf -ts 50`\nThis command will compare two files and will give KS(non-parametric) Test p-value. Along with it, it will create `COMPARE.txt` having common GO terms and PMID between top 50(n) most frequent terms from each GAF file.\n\n3. `GOFindBias -i test/2014.gaf test/2015.gaf test/2016.gaf -e EXPEC`\nThis command will parse all the mentioned files and will fetch statistics for only [Experimental Evidence Codes](http://geneontology.org/page/experimental-evidence-codes).\n\n### NOTE\n\n1. If you are using Anaconda environment, make sure that the Python reads libraries from \"~anaconda2/lib/python2.7/site-packages/lib\". You can also simply copy the files in that location to an appropriate path where other python libraries are readable (importable) by Python.\n2. You can find few test .gaf files in the /lib/test/ folders.\n\n\n", "description_content_type": null, "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/Pranavkhade/GOFindBias", "keywords": "GO Annotation", "license": "", "maintainer": "", "maintainer_email": "", "name": "GOFindBias", "package_url": "https://pypi.org/project/GOFindBias/", "platform": "", "project_url": "https://pypi.org/project/GOFindBias/", "project_urls": { "Homepage": "https://github.com/Pranavkhade/GOFindBias" }, "release_url": "https://pypi.org/project/GOFindBias/1.2.3b1/", "requires_dist": [ "Biopython", "matplotlib" ], "requires_python": "", "summary": "Analysis of the .gaf files for futher debias analysis.", "version": "1.2.3b1" }, "last_serial": 3266884, "releases": { "1.2.2b1": [ { "comment_text": "", "digests": { "md5": "2c100fde40fc5af0c94a95acbf782c62", "sha256": "bb49834e5c243359e02bd008a04c4bc125dcc12450f30ab2937efe35510a60e7" }, "downloads": -1, "filename": "GOFindBias-1.2.2b1-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "2c100fde40fc5af0c94a95acbf782c62", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 6504, "upload_time": "2017-09-29T18:08:25", "url": "https://files.pythonhosted.org/packages/51/55/882f62ffcedd312c5b271c186683c9200e11f0af064c7dcb679cfdf83e67/GOFindBias-1.2.2b1-py2.py3-none-any.whl" } ], "1.2.3b1": [ { "comment_text": "", "digests": { "md5": "45629dbb2e6e94f6403cc00e87b296bd", "sha256": "18c3db3a224aa83020011ce3a0945a3b804c5dee6c5f4efe4298018395c0f0ee" }, "downloads": -1, "filename": "GOFindBias-1.2.3b1-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "45629dbb2e6e94f6403cc00e87b296bd", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 9835, "upload_time": "2017-10-20T20:35:00", "url": "https://files.pythonhosted.org/packages/94/c0/b575f7d0a4276beb3ac3ccb668a3538081cc8df484e3672b0203f66cd218/GOFindBias-1.2.3b1-py2.py3-none-any.whl" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "45629dbb2e6e94f6403cc00e87b296bd", "sha256": "18c3db3a224aa83020011ce3a0945a3b804c5dee6c5f4efe4298018395c0f0ee" }, "downloads": -1, "filename": "GOFindBias-1.2.3b1-py2.py3-none-any.whl", "has_sig": false, "md5_digest": "45629dbb2e6e94f6403cc00e87b296bd", "packagetype": "bdist_wheel", "python_version": "py2.py3", "requires_python": null, "size": 9835, "upload_time": "2017-10-20T20:35:00", "url": "https://files.pythonhosted.org/packages/94/c0/b575f7d0a4276beb3ac3ccb668a3538081cc8df484e3672b0203f66cd218/GOFindBias-1.2.3b1-py2.py3-none-any.whl" } ] }