{ "info": { "author": "Shaun Jackman", "author_email": "sjackman@gmail.com", "bugtrack_url": null, "classifiers": [ "License :: OSI Approved :: GNU General Public License v3 (GPLv3)", "Operating System :: OS Independent", "Programming Language :: Python :: 3" ], "description": "\n\n# Correct misassemblies using linked reads\n\nCut sequences at positions with few spanning molecules.\n\nWritten by [Shaun Jackman](http://sjackman.ca), Lauren Coombe, and Justin Chu.\n\n[bioRxiv doi:10.1101/304253](https://www.biorxiv.org/content/early/2018/04/20/304253) · [Slides](http://sjackman.ca/tigmint-recomb-slides) · [Poster](https://f1000research.com/posters/6-1406)\n\n# Description\n\nTigmint identifies and corrects misassemblies using linked reads from 10x Genomics Chromium. The reads are first aligned to the assembly, and the extents of the large DNA molecules are inferred from the alignments of the reads. The physical coverage of the large molecules is more consistent and less prone to coverage dropouts than that of the short read sequencing data. The sequences are cut at positions that have insufficient spanning molecules. Tigmint outputs a BED file of these cut points, and a FASTA file of the cut sequences.\n\nEach window of a specified fixed size is checked for a minimum number of spanning molecules. Sequences are cut at those positions where a window with sufficient coverage is followed by some number of windows with insufficient coverage is then followed again by a window with sufficient coverage.\n\n# Installation\n\n## Install Tigmint using Brew\n\nInstall [Linuxbrew](http://linuxbrew.sh/) on Linux or Windows Subsystem for Linux (WSL), or \ninstall [Homebrew](https://brew.sh/) on macOS, and then run the command\n\n\tbrew install tigmint\n\n## Install Tigmint from the source code\n\nDownload and extract the source code. Compiling is not needed.\n\n```\ngit clone https://github.com/bcgsc/tigmint && cd tigmint\n```\nor\n```\ncurl -L https://github.com/bcgsc/tigmint/archive/master.tar.gz | tar xz && mv tigmint-master tigmint && cd tigmint\n```\n\n# Dependencies\n\n## Install Python package dependencies\n```sh\npip3 install intervaltree pybedtools pysam statistics\n```\n\nTigmint uses Bedtools, BWA and Samtools. These dependencies may be installed using [Homebrew](https://brew.sh) on macOS or [Linuxbrew](http://linuxbrew.sh) on Linux.\n\n## Install the dependencies of Tigmint\n```sh\nbrew install bedtools bwa samtools\n```\n\n## Install the dependencies of ARCS (optional)\n```sh\nbrew tap brewsci/bio\nbrew install arcs links-scaffolder\n```\n\n## Install the dependencies for calculating assembly metrics (optional)\n```sh\nbrew install abyss seqtk\n```\n\n# Usage\n\nTo run Tigmint on the draft assembly `draft.fa` with the reads `reads.fq.gz`, which have been run through `longranger basic`:\n\n```sh\nsamtools faidx draft.fa\nbwa index draft.fa\nbwa mem -t8 -p -C draft.fa reads.fq.gz | samtools sort -@8 -tBX -o draft.reads.sortbx.bam\ntigmint-molecule draft.reads.sortbx.bam | sort -k1,1 -k2,2n -k3,3n >draft.reads.molecule.bed\ntigmint-cut -p8 -o draft.tigmint.fa draft.fa draft.reads.molecule.bed\n```\n\n- `bwa mem -C` is used to copy the BX tag from the FASTQ header to the SAM tags.\n- `samtools sort -tBX` is used to sort first by barcode and then position.\n\nAlternatively, you can run the Tigmint pipeline using the Makefile driver script `tigmint-make`. To run Tigmint on the draft assembly `myassembly.fa` with the reads `myreads.fq.gz`, which have been run through `longranger basic`:\n\n```sh\ntigmint-make tigmint draft=myassembly reads=myreads\n```\n\nTo run both Tigmint and scaffold the corrected assembly with [ARCS](https://github.com/bcgsc/arcs):\n\n```sh\ntigmint-make arcs draft=myassembly reads=myreads\n```\n\nTo run Tigmint, ARCS, and calculate assembly metrics using the reference genome `GRCh38.fa`:\n\n```sh\ntigmint-make metrics draft=myassembly reads=myreads ref=GRCh38 G=3088269832\n```\n\n# Note\n\n+ `tigmint-make` is a Makefile script, and so any `make` options may also be used with `tigmint-make`, such as `-n` (`--dry-run`).\n+ The file extension of the assembly must be `.fa` and the reads `.fq.gz`, and the extension is not included in the parameters `draft` and `reads`. These specific file name requirements result from implementing the pipeline in GNU Make.\n\n# tigmint-make commands\n\n+ `tigmint`: Run Tigmint, and produce a file named `$draft.tigmint.fa`\n+ `arcs`: Run Tigmint and ARCS, and produce a file name `$draft.tigmint.arcs.fa`\n+ `metrics`: Run, Tigmint, ARCS, and calculate assembly metrics using `abyss-fac` and `abyss-samtobreak`, and produce TSV files.\n\n# Parameters of Tigmint\n\n+ `draft`: Name of the draft assembly, `draft.fa`\n+ `reads`: Name of the reads, `reads.fq.gz`\n+ `span=20`: Number of spanning molecules threshold\n+ `window=1000`: Window size (bp) for checking spanning molecules\n+ `minsize=2000`: Minimum molecule size\n+ `as=0.65`: Minimum AS/read length ratio\n+ `nm=5`: Maximum number of mismatches\n+ `dist=50000`: Maximum distance (bp) between reads to be considered the same molecule\n+ `mapq=0`: Mapping quality threshold\n+ `trim=0`: Number of bases to trim off contigs following cuts\n+ `t=8`: Number of threads\n\n# Parameters of ARCS\n+ `c=5`\n+ `e=30000`\n+ `r=0.05`\n\n# Parameters of LINKS\n+ `a=0.1`\n+ `l=10`\n\n# Parameters for calculating assembly metrics\n\n+ `ref`: Reference genome, `ref.fa`, for calculating assembly contiguity metrics\n+ `G`: Size of the reference genome, for calculating NG50 and NGA50\n\n# Tips\n\n- If your barcoded reads are in multiple FASTQ files, the initial alignments of the barcoded reads to the draft assembly can be done in parallel and merged prior to running Tigmint.\n- When aligning with BWA-MEM, use the `-C` option to include the barcode in the BX tag of the alignments.\n- Sort by BX tag using `samtools sort -tBX`.\n- Merge multiple BAM files using `samtools merge -tBX`.\n\n# Support\n\nAfter first looking for existing issue at , please report a new issue at . Please report the names of your input files, the exact command line that you are using, and the entire output of Tigmint.\n\n# Pipeline\n\n[![Tigmint pipeline illustration](pipeline.gv.png)](pipeline.gv.svg)\n\n\n", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://bcgsc.github.io/tigmint/", "keywords": "", "license": "GPLv3", "maintainer": "", "maintainer_email": "", "name": "tigmint", "package_url": "https://pypi.org/project/tigmint/", "platform": "", "project_url": "https://pypi.org/project/tigmint/", "project_urls": { "Homepage": "https://bcgsc.github.io/tigmint/" }, "release_url": "https://pypi.org/project/tigmint/1.1.2/", "requires_dist": [ "intervaltree", "pybedtools", "pysam", "statistics" ], "requires_python": ">=3", "summary": "Correct misassemblies using linked reads", "version": "1.1.2" }, "last_serial": 4221284, "releases": { "1.1.2": [ { "comment_text": "", "digests": { "md5": "b68ecd45c90719152579b2d97f5455ec", "sha256": "2dadb09a1ab651472c5e17ff1db01592cc1c953cf06cbbca89ca5e0cc474806c" }, "downloads": -1, "filename": "tigmint-1.1.2-py3-none-any.whl", "has_sig": false, "md5_digest": "b68ecd45c90719152579b2d97f5455ec", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3", "size": 14293, "upload_time": "2018-08-30T05:06:55", "url": "https://files.pythonhosted.org/packages/33/3b/d9cbe294e7c895d7b8a4befa10408f58aa630ae7f9114db3d06474d51a83/tigmint-1.1.2-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "e8127911baef8fe547c390ba5b75bb44", "sha256": "17ea87a303e3f4eb0e466c21730a3e3c78bc328b64a4f314f9e53a887b3460f4" }, "downloads": -1, "filename": "tigmint-1.1.2.tar.gz", "has_sig": false, "md5_digest": "e8127911baef8fe547c390ba5b75bb44", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3", "size": 14724, "upload_time": "2018-08-30T05:06:57", "url": "https://files.pythonhosted.org/packages/f3/70/8f92c45efef77a6b82f357be0ec7238a4bb0cf636950d6c49d06fa6475dc/tigmint-1.1.2.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "b68ecd45c90719152579b2d97f5455ec", "sha256": "2dadb09a1ab651472c5e17ff1db01592cc1c953cf06cbbca89ca5e0cc474806c" }, "downloads": -1, "filename": "tigmint-1.1.2-py3-none-any.whl", "has_sig": false, "md5_digest": "b68ecd45c90719152579b2d97f5455ec", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": ">=3", "size": 14293, "upload_time": "2018-08-30T05:06:55", "url": "https://files.pythonhosted.org/packages/33/3b/d9cbe294e7c895d7b8a4befa10408f58aa630ae7f9114db3d06474d51a83/tigmint-1.1.2-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "e8127911baef8fe547c390ba5b75bb44", "sha256": "17ea87a303e3f4eb0e466c21730a3e3c78bc328b64a4f314f9e53a887b3460f4" }, "downloads": -1, "filename": "tigmint-1.1.2.tar.gz", "has_sig": false, "md5_digest": "e8127911baef8fe547c390ba5b75bb44", "packagetype": "sdist", "python_version": "source", "requires_python": ">=3", "size": 14724, "upload_time": "2018-08-30T05:06:57", "url": "https://files.pythonhosted.org/packages/f3/70/8f92c45efef77a6b82f357be0ec7238a4bb0cf636950d6c49d06fa6475dc/tigmint-1.1.2.tar.gz" } ] }