{ "info": { "author": "Diogo N Silva", "author_email": "o.diogosilva@gmail.com", "bugtrack_url": null, "classifiers": [], "description": "# Assemblerflow :whale2::package:\n\n![Nextflow version](https://img.shields.io/badge/nextflow->0.26.0-brightgreen.svg)\n![Python version](https://img.shields.io/badge/python-3.6-brightgreen.svg)\n[![Build Status](https://travis-ci.org/ODiogoSilva/assemblerflow.svg?branch=master)](https://travis-ci.org/ODiogoSilva/assemblerflow)\n[![codecov](https://codecov.io/gh/ODiogoSilva/assemblerflow/branch/master/graph/badge.svg)](https://codecov.io/gh/ODiogoSilva/assemblerflow)\n[![Codacy Badge](https://api.codacy.com/project/badge/Grade/f518854f780b41a08ca2fb1c14e360f0)](https://www.codacy.com/app/o.diogosilva/assemblerflow?utm_source=github.com&utm_medium=referral&utm_content=ODiogoSilva/assemblerflow&utm_campaign=Badge_Grade)\n[![Documentation Status](https://readthedocs.org/projects/assemblerflow/badge/?version=latest)](http://assemblerflow.readthedocs.io/en/latest/?badge=latest)\n[![PyPI](https://img.shields.io/pypi/v/assemblerflow.svg)](https://pypi.python.org/pypi/assemblerflow/1.0.0rc3)\n\nA [Nextflow](https://www.nextflow.io/) pipeline assembler for genomics.\nPick your modules. Assemble them. Run the pipeline.\n\n## The premisse\n\n#### Build a pipeline\n\nWhat if building your own genomics pipeline would be as simple as:\n\n```\nassemblerflow.py build -t \"trimmomatic fastqc skesa pilon\" -o my_pipeline.nf\n```\n\nSeems pretty simple right? What if we could run this pipeline with a single command on any linux machine or cluster by leveraging\nthe awesomeness of [nextflow](https://www.nextflow.io/) and [docker](https://www.docker.com/)/[singularity](http://singularity.lbl.gov/)\ncontainers without having to install any of the pipeline dependencies?\n\n#### Run the pipeline\n\n```\nnextflow run my_pipeline.nf --fastq path/to/fastq\n\nN E X T F L O W ~ version 0.28.0\nLaunching `my_pipeline` [prickly_mcnulty] - revision: 041b7b793f\n\n============================================================\n M Y P I P E L I N E\n============================================================\nBuilt using assemblerflow v1.0.2\n\n Input FastQ : 2\n Input samples : 1\n Reports are found in : ./reports\n Results are found in : ./results\n Profile : standard\n\nStarting pipeline at Sun Apr 08 18:22:24 WEST 2018\n\n[warm up] executor > local\n[7c/eb5f2f] Submitted process > integrity_coverage_1_1 (02AR0553)\n(...)\n[31/7d90a1] Submitted process > compile_pilon_report_1_6\n\nCompleted at: Sun Apr 08 18:43:41 WEST 2018\nDuration : 21m 17s\nSuccess : true\nExit status : 0\n```\n\nCongratulations! You just built and executed your own pipeline with\nonly two commands! :tada:\n\n## Installation\n\nAssemblerflow is available as a bioconda package, which already brings\nnextflow:\n\n```\nconda install assemblerflow\n```\n\n#### Container engines\n\nPipelines built with assemblerflow require at least one container\nengine to be installed, among `docker`, `singularity` or `shifter`.\nIf you already have any one of these installed, you're good to go.\nIf not, we recommend installing singularity, which also has a bioconda\npackage:\n\n```\nconda install singularity\n```\n\n## How to use it\n\nThe complete user guide of assemblerflow can be found on [readthedocs.org](http://assemblerflow.readthedocs.io/en/latest/?badge=latest).\nFor a quick and dirty demonstration, see below.\n\n### Quick guide\n\n#### Building a pipeline\n\nAssemblerflow comes with a number of [ready-to-use components](http://assemblerflow.readthedocs.io/en/latest/user/available_components.html) to build your\nown pipeline. Following some basic rules, such as the output type of one process\nmust match the input type of the next process, assembling a pipeline is done\nusing the `build` mode and the `-t` option:\n\n```\nassemblerflow build -t \"trimmomatic spades abricate\" -o my_pipeline.nf -n \"assembly pipe\"\n```\n\nThis command will generate everything that is necessary to run the\npipeline automatically, but the main pipeline executable\nfile will be `my_pipeline.nf`. This file will contain a nextflow pipeline\nfor genome assembly starts with `trimmomatic` and finishes with anti-microbial\ngene annotation using `abricate`.\n\n#### Wait... what about the software parameters?\n\nEach component in the pipeline has its own set of parameters that can be\nmodified before or when executing the pipeline. These parameters are\ndescribed in the documentation of each process and you can check the options\nof your particular pipeline using the `help` option:\n\n```\nnextflow my_pipeline.nf --help\n\nN E X T F L O W ~ version 0.28.0\nLaunching `my_pipeline.nf` [prickly_keller] - revision: 1b3fec5658\n\n============================================================\n A S S E M B L Y P I P E\n============================================================\nBuilt using assemblerflow v1.0.2\n\n\nUsage:\n nextflow run my_pipeline.nf\n\n --fastq Path expression to paired-end fastq files. (default: fastq/*_{1,2}.*) (integrity_coverage)\n --genomeSize Genome size estimate for the samples. It is used to estimate the coverage and other assembly parameters andchecks (default: 2.1) (integrity_coverage)\n --minCoverage Minimum coverage for a sample to proceed. Can be set to0 to allow any coverage (default: 15) (integrity_coverage)\n --adapters Path to adapters files, if any (default: None) (trimmomatic)\n --trimSlidingWindow Perform sliding window trimming, cutting once the average quality within the window falls below a threshold (default: 5:20) (trimmomatic)\n --trimLeading Cut bases off the start of a read, if below a threshold quality (default: 3 (trimmomatic)\n --trimTrailing Cut bases of the end of a read, if below a threshold quality (default: 3) (trimmomatic)\n --trimMinLength Drop the read if it is below a specified length (default: 55) (trimmomatic)\n --spadesMinCoverage The minimum number of reads to consider an edge in the de Bruijn graph during the assembly (default: 2) (spades)\n --spadesMinKmerCoverage Minimum contigs K-mer coverage. After assembly only keep contigs with reported k-mer coverage equal or above this value (default: 2) (spades)\n --spadesKmers If 'auto' the SPAdes k-mer lengths will be determined from the maximum read length of each assembly. If 'default', SPAdes will use the default k-mer lengths. (default: auto) (spades)\n --abricateDatabases Specify the databases for abricate. (abricate)\n\n```\n\nThis help message is dynamically generated depending on the pipeline you build.\nSince this pipeline starts with `trimmomatic`, which receives fastq files as input,\n`--fastq` is the default parameter for providing paired-end fastq files.\n\n#### Running a pipeline\n\nNow that we have our nextflow pipeline built, we are ready to executed it by\nproviding input data. By default, assemblerflow pipelines will run locally and use\n`singularity` to run the containers of each component. This can be\nchanged in multiple ways, but for convenience assemblerflow has already defined\nprofiles for most configurations of `executors` and `container` engines.\n\nRunning a pipeline locally with `singularity` can be done with:\n\n```\n# Pattern for paired-end fastq is '_1.fastq.gz _2.fastq.gz'\nnextflow run my_pipeline --fastq \"path/to/fastq/*_{1,2}.*\"\n```\n\nIf you want to run a pipeline in a cluster with SLURM and singularity, just use\nthe appropriate profile:\n\n```\nnextflow run my_pipeline --fastq \"path/to/fastq/*_{1,2}.*\" -profile slurm_sing\n```\n\nDuring the execution of the pipeline, the results and reports for each component\nare continuously saved to the `results` and `reports` directory, respectively.\n\n## Why not just write a Nextflow pipeline?\n\nIn many cases, building a static nextflow pipeline is sufficient for our goals.\nHowever, when building our own pipelines, we often felt the need to add dynamism\nto this process, particularly if we take into account how fast new tools arise\nand existing ones change. Our biological goals also change over time and we\nmight need different pipelines to answer different questions. Assemblerflow makes\nthis very easy, by having a set of pre-made and ready-to-use components that can\nbe freely assembled.\n\nFor instance, changing the assembly software in a genome assembly pipeline becomes\nas easy as:\n\n```\n# Use spades\ntrimmomatic spades pilon\n# Use skesa\ntrimmomatic skesa pilon\n```\n\n![example1](https://github.com/ODiogoSilva/assemblerflow/raw/master/docs/resources/example_3.png)\n\nIf you are interested in having some sort of genome annotation, simply add those\ncomponents at the end, using a fork syntax:\n\n```\n# Run prokka and abricate at the end of the assembly\ntrimmomatic spades pilon (prokka | abricate)\n```\n\n![example2](https://github.com/ODiogoSilva/assemblerflow/raw/master/docs/resources/example_1.png)\n\nOn the other hand, if you are interest in just perform allele calling for wgMLST,\nsimply add `chewbbaca`:\n\n```\ntrimmomatic spades pilon chewbbaca\n```\n\n![example3](https://github.com/ODiogoSilva/assemblerflow/raw/master/docs/resources/example_2.png)\n\nSince nextflow handles parallelism of large sets of data so well, simple pipelines\nof two components are also useful to build:\n\n```\ntrimmomatic fastqc\n```\n\nAs the number of existing components grow, so does your freedom to build pipelines.\n\n## Developer guide\n\n### Adding new components\n\nIs there a missing component that you would like to see included? We would love\nto expand! You could make a component request in our\n[issue tracker](https://github.com/ODiogoSilva/assemblerflow/issues).\n\nIf you want to be part of the team, you can contribute with the code as well. Each component\nin assemblerflow can be independently added without having to worry about\nthe rest of the code base. You'll just need to have some knowledge of python\nand nextflow. [Check the developer documentation for how-to guides](http://assemblerflow.readthedocs.io/en/latest/)\n\n\n", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/ODiogoSilva/assemblerflow", "keywords": "", "license": "GPL3", "maintainer": "", "maintainer_email": "", "name": "assemblerflow", "package_url": "https://pypi.org/project/assemblerflow/", "platform": "", "project_url": "https://pypi.org/project/assemblerflow/", "project_urls": { "Homepage": "https://github.com/ODiogoSilva/assemblerflow" }, "release_url": "https://pypi.org/project/assemblerflow/1.1.0.post3/", "requires_dist": [ "argparse", "jinja2" ], "requires_python": "", "summary": "A Nextflow pipeline assembler for genomics. Pick your modules. Assemble them. Run the pipeline.", "version": "1.1.0.post3" }, "last_serial": 3784824, "releases": { "1.0.0rc1": [ { "comment_text": "", "digests": { "md5": "dfd007a48cbc85657f58c1e6442cfe7a", "sha256": "8b86ae555f637f4e1c4247c68003e98a186593853fe0dbbd745184690dd6040c" }, "downloads": -1, "filename": "assemblerflow-1.0.0rc1.tar.gz", "has_sig": false, "md5_digest": "dfd007a48cbc85657f58c1e6442cfe7a", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 48327, "upload_time": "2018-01-22T16:30:27", "url": "https://files.pythonhosted.org/packages/d8/ce/3acdfdfce729e45aa03e445a75b3010aa1eda7c23925ead950e8217b97ff/assemblerflow-1.0.0rc1.tar.gz" } ], "1.0.0rc2": [ { "comment_text": "", "digests": { "md5": "031ee51ebef94c10f4fbdf24def2a12e", "sha256": "3835ef17e848b2cac9f6fc94b443ed6e751a262fac51cee9bf51358aab6116e4" }, "downloads": -1, "filename": "assemblerflow-1.0.0rc2.tar.gz", "has_sig": false, "md5_digest": "031ee51ebef94c10f4fbdf24def2a12e", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 49909, "upload_time": "2018-01-22T16:34:18", "url": "https://files.pythonhosted.org/packages/b3/82/0790c1c41ffe8586b04383c5c5596da2e1b8ec7f3254dde8136c4c2a2db8/assemblerflow-1.0.0rc2.tar.gz" } ], "1.0.0rc3": [ { "comment_text": "", "digests": { "md5": "7c17bc2d865a4cd580d676811a29160d", "sha256": "eb8f06f70858ac2ed830b2024ada9e84a17e1a7132eb4bf7fcc2c9b7e8fd94ac" }, "downloads": -1, "filename": "assemblerflow-1.0.0rc3.tar.gz", "has_sig": false, "md5_digest": "7c17bc2d865a4cd580d676811a29160d", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 52213, "upload_time": "2018-01-22T16:36:30", "url": "https://files.pythonhosted.org/packages/9f/63/d9882751a06d0d43e12d8be490463dd0d65bdb795bb993914c3e0ac36e49/assemblerflow-1.0.0rc3.tar.gz" } ], "1.0.1": [ { "comment_text": "", "digests": { "md5": "8cd28e77d261052a342e9b6fc433ae60", "sha256": "0ececf9f224eff53e18df4a33d6486668bd73a62d99477338acc41889108e543" }, "downloads": -1, "filename": "assemblerflow-1.0.1.tar.gz", "has_sig": false, "md5_digest": "8cd28e77d261052a342e9b6fc433ae60", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 52629, "upload_time": "2018-02-07T17:20:39", "url": "https://files.pythonhosted.org/packages/5b/2b/d33bed73b52df70ddee7de396baf494b933d6f1c690e397c574f01aec73a/assemblerflow-1.0.1.tar.gz" } ], "1.1.0": [ { "comment_text": "", "digests": { "md5": "695f56d98d0148c77027c7fb9a945187", "sha256": "568b4d1f4b874ee6072c42896fd72b3a806d63358c2f9ef84efd7a1db0e2f608" }, "downloads": -1, "filename": "assemblerflow-1.1.0-py3-none-any.whl", "has_sig": false, "md5_digest": "695f56d98d0148c77027c7fb9a945187", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 136803, "upload_time": "2018-04-20T12:21:57", "url": "https://files.pythonhosted.org/packages/c4/30/3d2c635f711001702fbd98341c6b24ea14006eee02f660da82e759aa2c26/assemblerflow-1.1.0-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "63cf96b0cabe881fd9ef7e3801fedf8d", "sha256": "145301eca0425b2e7b742876f64970b41c5ffd2e866c2c50dff4bf6692c17ce5" }, "downloads": -1, "filename": "assemblerflow-1.1.0.tar.gz", "has_sig": false, "md5_digest": "63cf96b0cabe881fd9ef7e3801fedf8d", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 94665, "upload_time": "2018-04-20T12:22:00", "url": "https://files.pythonhosted.org/packages/3b/a3/441a5cea4b5fe3429d2e39ce909e533adfc839c24508e584414e4885a59c/assemblerflow-1.1.0.tar.gz" } ], "1.1.0.post1": [ { "comment_text": "", "digests": { "md5": "8e96685ff43096b72d31412c90956a4b", "sha256": "4bd4ca55c6ef99a9cd91ec2deb0218e6d789e7910611d5f5cf8cb15c64a90b64" }, "downloads": -1, "filename": "assemblerflow-1.1.0.post1-py3-none-any.whl", "has_sig": false, "md5_digest": "8e96685ff43096b72d31412c90956a4b", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 149210, "upload_time": "2018-04-20T14:05:16", "url": "https://files.pythonhosted.org/packages/a5/23/1d032896ec202c547d450a9cf9c4bc565a7d2b767f1c1672db38d070d93d/assemblerflow-1.1.0.post1-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "dfb538a33afec1b1b00c9cde8de53209", "sha256": "1fc14badf68a95136f3c5f169909fd279fbc93b20c6ebf11f923a0a989dce0a4" }, "downloads": -1, "filename": "assemblerflow-1.1.0.post1.tar.gz", "has_sig": false, "md5_digest": "dfb538a33afec1b1b00c9cde8de53209", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 110403, "upload_time": "2018-04-20T14:05:18", "url": "https://files.pythonhosted.org/packages/03/83/18e7d49b8de400bed0dbadf7ba4f372002f0a79740dcf281f77575dfe786/assemblerflow-1.1.0.post1.tar.gz" } ], "1.1.0.post2": [ { "comment_text": "", "digests": { "md5": "d6bbde44aae3c2f516951779a525dd4d", "sha256": "1661d962b19d9f3ec781b2edc18294057fd217a2e306910c8e91420b3327eb58" }, "downloads": -1, "filename": "assemblerflow-1.1.0.post2-py3-none-any.whl", "has_sig": false, "md5_digest": "d6bbde44aae3c2f516951779a525dd4d", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 149209, "upload_time": "2018-04-20T15:02:06", "url": "https://files.pythonhosted.org/packages/2b/82/b9830b5c7c3c0c24190f8e2e96ce44cdad37423be8c72e4fab97da366269/assemblerflow-1.1.0.post2-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "77eecb484a9ca82a0073f057c534f659", "sha256": "529b1e6a60e3b592aba2cbe8fbd8e7c677d98990bafa16fd8f394160c8e9618f" }, "downloads": -1, "filename": "assemblerflow-1.1.0.post2.tar.gz", "has_sig": false, "md5_digest": "77eecb484a9ca82a0073f057c534f659", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 110399, "upload_time": "2018-04-20T15:02:08", "url": "https://files.pythonhosted.org/packages/90/06/9364b51d4684f7750e1582090049dfa5ece0eac487e3422855b59e59f278/assemblerflow-1.1.0.post2.tar.gz" } ], "1.1.0.post3": [ { "comment_text": "", "digests": { "md5": "d0f43f314ecacbe17aae7f1e94d70ac8", "sha256": "d5760eb578afdce1a9696a93cd0fb423d6de17868504f908e4a31cc78b16f608" }, "downloads": -1, "filename": "assemblerflow-1.1.0.post3-py3-none-any.whl", "has_sig": false, "md5_digest": "d0f43f314ecacbe17aae7f1e94d70ac8", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 151044, "upload_time": "2018-04-20T15:29:33", "url": "https://files.pythonhosted.org/packages/f5/13/f0984f7b3d40a909ebfc95d39ccbcedfa45a5ca98261fdd3a581096038f9/assemblerflow-1.1.0.post3-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "fc4796ad457f7e201685ba569fde282b", "sha256": "b0390727602e90dad1c3afa8dd408e5c30471a0f6d35b323989e9df4e8aa70fb" }, "downloads": -1, "filename": "assemblerflow-1.1.0.post3.tar.gz", "has_sig": false, "md5_digest": "fc4796ad457f7e201685ba569fde282b", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 111819, "upload_time": "2018-04-20T15:29:34", "url": "https://files.pythonhosted.org/packages/52/99/c6d141ccbb454b71e912d1c87234178b07f797a88086d998644336aa2ddc/assemblerflow-1.1.0.post3.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "d0f43f314ecacbe17aae7f1e94d70ac8", "sha256": "d5760eb578afdce1a9696a93cd0fb423d6de17868504f908e4a31cc78b16f608" }, "downloads": -1, "filename": "assemblerflow-1.1.0.post3-py3-none-any.whl", "has_sig": false, "md5_digest": "d0f43f314ecacbe17aae7f1e94d70ac8", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 151044, "upload_time": "2018-04-20T15:29:33", "url": "https://files.pythonhosted.org/packages/f5/13/f0984f7b3d40a909ebfc95d39ccbcedfa45a5ca98261fdd3a581096038f9/assemblerflow-1.1.0.post3-py3-none-any.whl" }, { "comment_text": "", "digests": { "md5": "fc4796ad457f7e201685ba569fde282b", "sha256": "b0390727602e90dad1c3afa8dd408e5c30471a0f6d35b323989e9df4e8aa70fb" }, "downloads": -1, "filename": "assemblerflow-1.1.0.post3.tar.gz", "has_sig": false, "md5_digest": "fc4796ad457f7e201685ba569fde282b", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 111819, "upload_time": "2018-04-20T15:29:34", "url": "https://files.pythonhosted.org/packages/52/99/c6d141ccbb454b71e912d1c87234178b07f797a88086d998644336aa2ddc/assemblerflow-1.1.0.post3.tar.gz" } ] }