{ "info": { "author": "Ping Wu", "author_email": "wpwupingwp@outlook.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 4 - Beta", "Intended Audience :: Developers", "Intended Audience :: Science/Research", "License :: OSI Approved :: GNU Affero General Public License v3", "Operating System :: OS Independent", "Programming Language :: Python :: 3.5", "Programming Language :: Python :: 3.6", "Programming Language :: Python :: 3.7", "Topic :: Scientific/Engineering :: Bio-Informatics" ], "description": "# join_and_split.py\n\nHelper for join and split fastq files.\n\nRequired python 3.6 or above.\n\n## Usage\n```\n# Linux\n## split\npython3 join_and_split.py split -m fastq_file\n## join\npython3 join_and_split.py join -f forward.fastq -r reverse.fastq\n# Windows\n## split\npython join_and_split.py split -m fastq_file\n## join\npython join_and_split.py join -f forward.fastq -r reverse.fastq\n```\n\nUse -t to set linker text, by default the program use \"JOINTEXT\".\n\nWhen split, \"fastq_file\" could be multiple files, use \"\\*.fastq\" (include\nquotation mark) to represent all \".fastq\" files in current folder.\n\n# divide.py\n\nDivide NGS data by barcode and primer.\n\n## Prerequisite\n\n* Python 3.5 or above\n* Biopython\n* regex\n* [vsearch](https://github.com/torognes/vsearch) (Optional)\n\nTo install Biopython and regex, run as administrator:\n\n> pip install biopython regex\n\n## Changelog\n\n### v4.6\nSupport ambiguous base.\n\n### v4.5\nExtend vsearch options.\nImprove output\n\n### v4.2\nIntegrate vsearch.\n\n### v4.0\nUse regex instead of BLAST. Faster and easier.\n\n### v3.3\nParallel version, use BLAST.\n\n### v2.1\nSingle core version. Use BLAST.\n\n### v1.0\nDeprecated.\n\n## Sequence structure\n\nIt can handle merged pair-end sequence like this:\n\n>barcode-adapter-primer-sequence-primer-adapter-barcode\n\nOr just handle one direction:\n\n>barcode-adapter-primer-sequence\n\nSequences will be divided by barcode according to given barcode file.\nIf barcode is wrong even only one base, it will be dropped.\n\n## adapter\n\nSome one adds sequence between barcode and primer, if you do not have it, just\nset adapter length to zero by \"--adapter 0\". The default value is 14.\n\n## Barcode mode\n\nUse \"-m\" to set barcode mode, like \"8\\*1\", means barcode with length 5 repeats\nonly 1 times. The default is \"5\\*2\", i.e., 5-base barcode repeats twice.\n\nNote that the forward and reverse barcode may be different sequence, but they\n*SHOULD FOLLOW THE SAME MODE!*\n\n## Strict option\n\nUse \"-s\" or \"--strict\" to use strict version. If set, the program will check\nbarcode in head and tail is equal or not and whether barcode in tail (3') is\ncorrect. If not, it will only check barcode in head (5') of sequence.\n\n## Barcode file\n\nBarcode file looks like this:\n\n> sample,barcode-f,barcode-r\n\n> S0001,ATACG,ATACG\n\n> S0002,ATATA,TATAC\n\n> S0003,ATACG\n\n> ...\n\nThe _barcode-f_ means barcode in 5' direction and _barcode-r_ means barcode in\n3' direction. All sequences should be *forward*.\n\nIf forward and reverse barcode are same, you can omit the reverse barcode in\nthe table.\n\nTo avoid potential error, _please do not use space in sample info_.\n\nAnd notice that here it use **English comma** to seperate two fields rather\nthan **Chinese comma**.\n\n## Primer file\nPrimer file looks like this:\n\n> gene,forward,reverse\n\n> rbcL,ATCGATCGATCGA,TACGTACGTACG\n\n> matK,AAAATTTTCCCC,GGGGTTACCAAAA\n\n> ...\n\nOr:\n\n> gene,sequence\n\n> rbcL-f,ATCGATCGATCGA\n\n> rbcL-r,TACGTACGTACG\n\n\nYou can use Microsoft Excel to prepare these two files and save as CSV format,\nor use any text editor you prefer.\n\n**Make sure you don't miss the first line.**\n\n# task.sh\n\nIf you use PBS task submitting system, you can use this script to submit the\ntask, and you can finish the work from combine two direction sequence by flash and join_fastq.py to divide them.\n\n\n", "description_content_type": "text/markdown", "docs_url": null, "download_url": "", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/wpwupingwp/divide", "keywords": "", "license": "GNU AGPL v3", "maintainer": "", "maintainer_email": "", "name": "divide-seq", "package_url": "https://pypi.org/project/divide-seq/", "platform": "", "project_url": "https://pypi.org/project/divide-seq/", "project_urls": { "Homepage": "https://github.com/wpwupingwp/divide" }, "release_url": "https://pypi.org/project/divide-seq/5.1/", "requires_dist": [ "biopython (>=1.72)", "pip (>=18.0)", "regex (>=2019.08.19)", "wheel (>=0.32.3)" ], "requires_python": "", "summary": "Divide amplicon sequences", "version": "5.1" }, "last_serial": 5942938, "releases": { "5.0": [ { "comment_text": "", "digests": { "md5": "e29659ff8ee061ca91fc978a8783a683", "sha256": "15141c3dcbe52255e0b0342b55289113b1c5c4d7c251bfbd23be6ef01736bcae" }, "downloads": -1, "filename": "divide_seq-5.0-py3-none-any.whl", "has_sig": false, "md5_digest": "e29659ff8ee061ca91fc978a8783a683", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 18800, "upload_time": "2019-10-07T09:26:47", "url": "https://files.pythonhosted.org/packages/c4/d5/61fae31e120cde52d27e9d3416013b55fed29cd60a144c96ece79b282b3f/divide_seq-5.0-py3-none-any.whl" } ], "5.1": [ { "comment_text": "", "digests": { "md5": "825e06e9aaa85863b1e8a38ade21a9d6", "sha256": "62488a9b3f661c15aa8bc4b00ac87da2bd56f5eea4fe6fa8e29454a5dc1a5292" }, "downloads": -1, "filename": "divide_seq-5.1-py3-none-any.whl", "has_sig": false, "md5_digest": "825e06e9aaa85863b1e8a38ade21a9d6", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 18793, "upload_time": "2019-10-08T05:30:32", "url": "https://files.pythonhosted.org/packages/3f/57/5ac6bdaca710309c1941163e557cd40cbc1b7f40a1d5980de745c54c27f7/divide_seq-5.1-py3-none-any.whl" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "825e06e9aaa85863b1e8a38ade21a9d6", "sha256": "62488a9b3f661c15aa8bc4b00ac87da2bd56f5eea4fe6fa8e29454a5dc1a5292" }, "downloads": -1, "filename": "divide_seq-5.1-py3-none-any.whl", "has_sig": false, "md5_digest": "825e06e9aaa85863b1e8a38ade21a9d6", "packagetype": "bdist_wheel", "python_version": "py3", "requires_python": null, "size": 18793, "upload_time": "2019-10-08T05:30:32", "url": "https://files.pythonhosted.org/packages/3f/57/5ac6bdaca710309c1941163e557cd40cbc1b7f40a1d5980de745c54c27f7/divide_seq-5.1-py3-none-any.whl" } ] }