{ "info": { "author": "John J. Farrell", "author_email": "farrell@bu.edu", "bugtrack_url": null, "classifiers": [ "Development Status :: 4 - Beta", "Topic :: Scientific/Engineering :: Bio-Informatics" ], "description": "HLA-Genotyper\n=============\n\nHLA-Genotyper is a python software tool to call 4-digit HLA genotypes from RNA-Seq and DNA-Seq directly from BAM files\n\n\nInstallation with easy_install\n==============================\n\nThe latest version of HLA-Genotyper is available through `pypi `_. To install, simply type::\n\n > sudo easy_install hla-genotyper\n\nOr if you do not have sudo privileges, the software can be installed into your user directory with::\n\n > easy_install --user hla-genotyper\n\nTo install easy_install, the following command line will install it into your system Python::\n\n > sudo curl https://bootstrap.pypa.io/ez_setup.py -o - | python\n\n\nInstallation with pip\n=====================\n\nTo install pip, type the following command::\n\n > sudo curl https://bootstrap.pypa.io/get-pip.py|python\n\nThen HLA-Genotyper can be installed with the pip command line::\n\n > sudo pip install hla-genotyper\n\nor to install in your user directory::\n\n > pip install --user hla-genotyper\n\n\nQuick Start \n===========\n\nFor certain alignment programs, many reads from the HLA genes are not mapped due to the high level of polymorphisms in the exons. \nFor those alignment programs, HLA-Genotyper will need to search among the unmapped reads for HLA read sequences. For example, bwa aln \ndoes not align reads well to HLA genes so searching through the unmapped reads is neccesary. The bwa mem program appears better for the MHC region as is the GEMS mapper. \n\n\nSo for a bam file created with bwa aln, the first step is to create an additional bam file with unmapped reads for hla-genotyper to scan.\n\nThe samtools command can be used to create the bam file of unmapped reads::\n\n > samtools view -u -f 4 mybam.bam > mybam.unmapped.bam \n\nOnce the bam file is available with the unmapped reads, the following commands will complete the HLA genotyping\nof the bam files depending on the type of sequencing experiment.\n\n\nWhole Genome Sequencing::\n\n > hla-genotyper mybam.bam -u mybam.unmapped.bam -e EUR --genome \n\nThe HLA genotyping results for the whole genome sequencing will be placed in 3 files (hla.EUR.mybam.genome.txt,hla.EUR.mybam.genome.dose,hla.EUR.mybam.genome.log)\n\nWhole Exome Sequencing::\n\n > hla-genotyper mybam.bam -u mybam.unmapped.bam -e EUR --exome\n\nThe HLA genotyping results for the whole exome sequencing will be placed in 3 files (hla.EUR.mybam.rnaseq.txt,hla.EUR.mybam.rnaseq.dose,hla.EUR.mybam.rnaseq.log)\n\nRNA-Seq::\n\n > hla-genotyper mybam.bam -u mybam.unmapped.bam -e EUR --rnaseq\n \nThe extra step for scanning unmapped reads is not necessary for the GEMS aligner and possibly the newest version of bwa mem (based on initial testing). In that case,\nthe creation of the unmapped bam file and the -u option can be dropped::\n\n > hla-genotyper mybam.bam -e EUR --rnaseq \n\nTo improve the speed and accuracy of the HLA genotype calls, specifying the ethnicity of the sample is recommended. This option provides a set of \npriors for the various HLA alleles found in the ethnic population. The ethnicity is specified with with the -e or --ethnicity command line option. \nThe available options are 'AFA', 'API', 'EUR', 'HIS' and 'UNK' which correspond to African-American, Asian-Pacific, European, Hispanic and Unknown. \nThese ethnic priors for the HLA alleles are based on transplant registry frequencies and supplemented with HLA alleles also found in 1000 genomes. \nFor the unknown ethnicity priors, priors are simply a uniform distribution for all the HLA alleles found in all 4 populations. However, using the -e UNK option \nwill take much longer due to the increase number of HLA allele combinations that are examined along with lower accuracy.\n\n\nHLA-Genotyper Command line Options\n==================================\n\n+------------------+------------------------------------------------------+\n|Parameter Option | Description of HLA-Genotyper Command Line Options |\n+==================+======================================================+\n|-h, --help | Show this help message |\n+------------------+------------------------------------------------------+\n|-u UNMAPPED_BAM | Bam file containing unmapped reads. This file will |\n| | be searched for reads with exact matches to known HLA|\n| | allele sequences. Recommended for alignments by bwa |\n| | and most aligners. The samtools command line, |\n| | samtools view -u -f 4 mybam.bam > mybam.unmapped.bam |\n| | will create a bam file of unmapped reads |\n+------------------+------------------------------------------------------+\n|-l READLEN\t | This is optional and can be used to specify the read |\n| | length of the reads in the bam file rather than |\n| | letting the program determine this length. This is |\n| | useful when predicting archaic human HLA genotypes |\n| | where there is wide range of variable read lengths. |\n+------------------+------------------------------------------------------+\n|-m MAPQ\t | Minimum mapping quality |\n+------------------+------------------------------------------------------+\n|--qual BQ\t | Minimum base quality within a read |\n+------------------+------------------------------------------------------+\n|--debug\t | For debugging, list HLA matched read probabilities |\n+------------------+------------------------------------------------------+\n|--rnaseq\t | Indicates the bam file is from an RNA-Seq experiment.|\n| | Reads spanning between exons are included in the |\n| | analysis. |\n+------------------+------------------------------------------------------+\n|--exome\t | Indicates the bam file is from a whole exome or |\n| | targeted sequencing experiment. |\n+------------------+------------------------------------------------------+\n|--genome\t | Flag to indicate the input bam file is from a whole |\n| | genome experiment. |\n+------------------+------------------------------------------------------+\n|-s SAMPLE | To provide the study id in the output file if not |\n| | available in the header of the bam file. |\n+------------------+------------------------------------------------------+\n|--cn CN | To provide the Sequencing Center ID for output if not|\n| | available in the header of the bam file. |\n+------------------+------------------------------------------------------+\n|-g GENE | Limits the analysis to one of the HLA genes |\n| | (A,B,C,DRB1, DQB1, DQA1) (Not Recommended) |\n+------------------+------------------------------------------------------+\n|-e ETHNICITY\t | To designate the ethnic priors to be used in the |\n| | na\u00efve Bayes algorithm. The prior probabilities for |\n| | each ethnic group are stored in the ethnic_priors.txt|\n| | file. The ethnicity code specified must match one of |\n| | those in the ethnic_priors.txt file, which are |\n| | presently coded EUR, AFA, API, and HIS. |\n+------------------+------------------------------------------------------+\n| -r --reference | To indicate the reference used in the alignment. The |\n| | options available are 37 or 38 which correspond to |\n| | GRCh37/hg19 and GRCH38/hg38. |\n+------------------+------------------------------------------------------+\n| -o | Directory for output files to be written to. |\n+------------------+------------------------------------------------------+\n\n\nHLA-Genotyper Output\n====================\n\nThe HLA-Genotyperscript generates 3 output files: a log file, a dose file and the HLA genotype file. \n\n * The log file logs the various steps during the completion of the HLA genotyping.\n\n * The dose file is a tab-delimited file useful for reading into R scripts for analysis. \n The file columns correspond to the name of the bam file, id, ethnicity and the list \n of HLA alleles examined. The HLA allele columns contained the count of each HLA allele \n found in the specific population for this individual (0,1,2). \n\n * The genotype file lists per line, the Center, bam filename, sample id, ethnic population, \n the HLA genotype, posterior probability, genotype quality followed by the read counts for\n each allele from mapped and unmapped bam files.\n\n\nReferences\n==========\n\nHLA-Genotyper Prediction of HLA Genotypes from Next Generation Sequencing Data\nJ.J. Farrell, G. Jun, L. A. Farrer, A. DeStefano, P. Sebastiani. (Program #1453M)\nPresented at the 64th Annual Meeting of The American Society of Human Genetics, October 20, 2014, San Diego, CA.\n\nContact\n=======\n\nFor questions and comments contact farrell [ at ] bu.edu", "description_content_type": null, "docs_url": null, "download_url": "UNKNOWN", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "http://github.com/jjfarrell/hla-genotyper", "keywords": null, "license": "MIT", "maintainer": null, "maintainer_email": null, "name": "hla-genotyper", "package_url": "https://pypi.org/project/hla-genotyper/", "platform": "UNKNOWN", "project_url": "https://pypi.org/project/hla-genotyper/", "project_urls": { "Download": "UNKNOWN", "Homepage": "http://github.com/jjfarrell/hla-genotyper" }, "release_url": "https://pypi.org/project/hla-genotyper/0.4.2b1/", "requires_dist": null, "requires_python": null, "summary": "Tool to predict HLA genotypes from RNA-Seq and DNA-Seq Data", "version": "0.4.2b1" }, "last_serial": 2497743, "releases": { "0.3.0b18": [], "0.3.0b19": [ { "comment_text": "", "digests": { "md5": "2e8c8ad1526fa2e4a1ce06bbe5c92e6a", "sha256": "2e40531a6b1da6abbfc0cea169c65173fb007696b44dabdd0e6e8113b0c32e99" }, "downloads": -1, "filename": "hla-genotyper-0.3.0b19.tar.gz", "has_sig": false, "md5_digest": "2e8c8ad1526fa2e4a1ce06bbe5c92e6a", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2723900, "upload_time": "2015-02-24T15:55:32", "url": "https://files.pythonhosted.org/packages/fc/35/392f75f180b59dd207aa08201a6355ef7e499f0193333606c406a7ca7b5b/hla-genotyper-0.3.0b19.tar.gz" } ], "0.4.0b1": [ { "comment_text": "", "digests": { "md5": "ded9d17fa0d273b667eea6cde4ee13ff", "sha256": "fd64d6c4aa1d2374b48fcff095afea268fd3001ad3e12c8c15272f8a0fac397e" }, "downloads": -1, "filename": "hla-genotyper-0.4.0b1.tar.gz", "has_sig": false, "md5_digest": "ded9d17fa0d273b667eea6cde4ee13ff", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2723919, "upload_time": "2015-12-22T21:16:17", "url": "https://files.pythonhosted.org/packages/2c/2f/d55483a296af676a49a0e6d58454b1467adabb63068ab40bd047d80e44f8/hla-genotyper-0.4.0b1.tar.gz" } ], "0.4.1b1": [ { "comment_text": "built for Linux-2.6.32-642.6.2.el6.x86_64-x86_64-with-glibc2.2.5", "digests": { "md5": "219a0b69fd1fcc3467f26948d23a6623", "sha256": "92a5eb1948844ec5d4cb0662833e3a9f3fa26590f52075e9217a72f2c09c48b9" }, "downloads": -1, "filename": "hla-genotyper-0.4.1b1.linux-x86_64.tar.gz", "has_sig": false, "md5_digest": "219a0b69fd1fcc3467f26948d23a6623", "packagetype": "bdist_dumb", "python_version": "any", "requires_python": null, "size": 2760146, "upload_time": "2016-11-10T19:28:44", "url": "https://files.pythonhosted.org/packages/9e/c4/2c7116c6d87c90e076bf577c72b461b0a6e205a839b4c8c8543a3e488390/hla-genotyper-0.4.1b1.linux-x86_64.tar.gz" }, { "comment_text": "", "digests": { "md5": "389d5b4cb28d41160acafcd19f5b69ea", "sha256": "2e88bab27a1dcf6d94e74559f996b7c6e900faf0eb2c41d5454843361befde2a" }, "downloads": -1, "filename": "hla-genotyper-0.4.1b1.tar.gz", "has_sig": false, "md5_digest": "389d5b4cb28d41160acafcd19f5b69ea", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2726140, "upload_time": "2016-11-10T20:44:07", "url": "https://files.pythonhosted.org/packages/04/10/442291c9d5b7b83523903dcaf1063d8f52d437c5b17aec9d96aec7c8938a/hla-genotyper-0.4.1b1.tar.gz" } ], "0.4.2b1": [ { "comment_text": "", "digests": { "md5": "823e4242be04d00a5d5c17c67ac5e875", "sha256": "e743094364eaf9e35ed3869cc3e7f39a1a2f998db3d4a2fa473a4bd451a7fcea" }, "downloads": -1, "filename": "hla-genotyper-0.4.2b1.tar.gz", "has_sig": false, "md5_digest": "823e4242be04d00a5d5c17c67ac5e875", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2735048, "upload_time": "2016-12-03T20:11:44", "url": "https://files.pythonhosted.org/packages/25/9d/c8d01a047076cfd2dc539565132df3bca67a368e7e96e811dacb2dced6ea/hla-genotyper-0.4.2b1.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "823e4242be04d00a5d5c17c67ac5e875", "sha256": "e743094364eaf9e35ed3869cc3e7f39a1a2f998db3d4a2fa473a4bd451a7fcea" }, "downloads": -1, "filename": "hla-genotyper-0.4.2b1.tar.gz", "has_sig": false, "md5_digest": "823e4242be04d00a5d5c17c67ac5e875", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2735048, "upload_time": "2016-12-03T20:11:44", "url": "https://files.pythonhosted.org/packages/25/9d/c8d01a047076cfd2dc539565132df3bca67a368e7e96e811dacb2dced6ea/hla-genotyper-0.4.2b1.tar.gz" } ] }