{ "info": { "author": "Thomas Levine", "author_email": "_@thomaslevine.com", "bugtrack_url": null, "classifiers": [], "description": "Qudth randomly samples the lines within a large file and calculates statistics\nabout each line. For example, in a 10-gigabyte text file, you might want to know\nhow long a typical line is.\n\nLine lengths\n-----------------\nIt would be very convenient if line length is what you are interested in, as\nthat is the only thing we implement right now. ::\n\n $ qudth qudth/cli.py -n 5 --bins 8\n \n \u2581 \u2581 \u2582 \u2581 \u2581 \u2581 \u2583 \u2583\n 01 52 59\n Lengths of 5 lines in qudth/cli.py\n (simple random sample with replacement)\n\nBenchmarking\n---------------\n``wc -l`` is equivalent to qudth's line length estimation,\nbut qudth's sampling makes it much faster\non large files. ``big-file.csv`` is 1 gigabyte in size. ::\n\n _:~ t$ time qudth big-file.csv > /dev/null\n\n real 0m0.287s\n user 0m0.161s\n sys 0m0.032s\n _:~ t$ time wc -l big-file.csv > /dev/null\n\n real 0m2.515s\n user 0m1.475s\n sys 0m0.440s\n\nFuture work\n------------\nA more standard thing would perhaps be something that emitted\na random sample to stdout. It could support different sampling\nstrategies perhaps.\n", "description_content_type": null, "docs_url": null, "download_url": "UNKNOWN", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "http://dada.pink/qudth/", "keywords": null, "license": "LGPL", "maintainer": null, "maintainer_email": null, "name": "qudth", "package_url": "https://pypi.org/project/qudth/", "platform": "UNKNOWN", "project_url": "https://pypi.org/project/qudth/", "project_urls": { "Download": "UNKNOWN", "Homepage": "http://dada.pink/qudth/" }, "release_url": "https://pypi.org/project/qudth/0.0.3/", "requires_dist": null, "requires_python": null, "summary": "Estimate the distributions of line lengths of files.", "version": "0.0.3" }, "last_serial": 1678836, "releases": { "0.0.1": [ { "comment_text": "", "digests": { "md5": "48508b2ec98d4b8aeb8ef2565e1908e3", "sha256": "7e9c1f65e83efb76447b4295801067b9b60ad7d4e9ed37f42b849c991d4bda61" }, "downloads": -1, "filename": "qudth-0.0.1.tar.gz", "has_sig": false, "md5_digest": "48508b2ec98d4b8aeb8ef2565e1908e3", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2217, "upload_time": "2015-08-15T19:25:38", "url": "https://files.pythonhosted.org/packages/10/4a/b0c04dc06aad590bd2c6208d2bf24915c4fd91eba1601523748e34dcfa1d/qudth-0.0.1.tar.gz" } ], "0.0.2": [ { "comment_text": "", "digests": { "md5": "fdebd3d916d354a7d2fb14ae80b0f0c5", "sha256": "9b8d2526b1855b43d25926f336e381f0bfc59aa3ed1a69a1e271b8287989bedd" }, "downloads": -1, "filename": "qudth-0.0.2.tar.gz", "has_sig": false, "md5_digest": "fdebd3d916d354a7d2fb14ae80b0f0c5", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2306, "upload_time": "2015-08-15T19:33:41", "url": "https://files.pythonhosted.org/packages/f3/03/2a66e86083854b0c11ca409dcb66cf987346de8e6b66c3750d13d225c3e3/qudth-0.0.2.tar.gz" } ], "0.0.3": [ { "comment_text": "", "digests": { "md5": "4f37cca1229a12091dcf6430545bc863", "sha256": "6dfc361ccd72775fa12e91a558b1815b54920622464b35454e44312444a4ff2d" }, "downloads": -1, "filename": "qudth-0.0.3.tar.gz", "has_sig": false, "md5_digest": "4f37cca1229a12091dcf6430545bc863", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2704, "upload_time": "2015-08-15T19:55:25", "url": "https://files.pythonhosted.org/packages/45/ac/4672947113652f4cec572697f40369594a175a13891f451e3ff6e4a39ba4/qudth-0.0.3.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "4f37cca1229a12091dcf6430545bc863", "sha256": "6dfc361ccd72775fa12e91a558b1815b54920622464b35454e44312444a4ff2d" }, "downloads": -1, "filename": "qudth-0.0.3.tar.gz", "has_sig": false, "md5_digest": "4f37cca1229a12091dcf6430545bc863", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 2704, "upload_time": "2015-08-15T19:55:25", "url": "https://files.pythonhosted.org/packages/45/ac/4672947113652f4cec572697f40369594a175a13891f451e3ff6e4a39ba4/qudth-0.0.3.tar.gz" } ] }