{ "info": { "author": "Paul Makepeace", "author_email": "paulm@paulm.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 3 - Alpha", "Intended Audience :: Developers", "License :: OSI Approved :: GNU General Public License (GPL)", "Operating System :: OS Independent", "Programming Language :: Python", "Topic :: Software Development :: Libraries :: Python Modules", "Topic :: Text Processing" ], "description": "===================================\nGoogle Refine Python Client Library\n===================================\n\nThe Google Refine Python Client Library provides an interface to\ncommunicating with a Google Refine server.\n\nCurrently, the following API is supported:\n\n- project creation/import, deletion, export\n- facet computation\n\n - text\n - text filter\n - numeric\n - blank\n - starred & flagged\n - ... extensible class\n\n- 'engine': managing multiple facets and their computation results\n- sorting & reordering\n- clustering\n- transforms\n- transposes\n- single and mass edits\n- annotation (star/flag)\n- column\n\n - move\n - add\n - split\n - rename\n - reorder\n - remove\n\n- reconciliation\n\n - reconciliation judgment facet\n - guessing column type\n - querying reconciliation services preferences\n - perform reconciliation\n\nConfiguration\n=============\n\nBy default the Google Refine server URL is http://127.0.0.1:3333\nThe environment variables ``GOOGLE_REFINE_HOST`` and ``GOOGLE_REFINE_PORT``\nenable overriding the host & port.\n\nIn order to run all tests, a live Refine server is needed. No existing projects\nare affected.\n\nInstallation\n============\n\n(Someone with more familiarity with python's byzantine collection of installation\nframeworks is very welcome to improve/\"best practice\" all this.)\n\n#. Install dependencies, which currently is ``urllib2_file``:\n\n ``sudo pip install -r requirements.txt``\n\n#. Ensure you have a Refine server running somewhere and, if necessary, set\n the envvars as above.\n\n#. Run tests, build, and install:\n\n ``python setup.py test # to do a subset, e.g., --test-suite tests.test_facet``\n\n ``python setup.py build``\n\n ``python setup.py install``\n\nThere is a Makefile that will do this too, and more.\n\nTODO\n====\n\nThe API so far has been filled out from building a test suite to carry out the\nactions in `David Huynh's Refine tutorial `_ which while certainly showing off a\nwide range of Refine features doesn't cover the entire suite. Notable exceptions\ncurrently include:\n\n- reconciliation support is useful but not complete\n- undo/redo\n- Freebase\n- join columns\n- columns from URL\n\nContribute\n============\n\nPatches welcome! Source is at https://github.com/PaulMakepeace/refine-client-py\n\nUseful Tools\n------------\n\nOne aspect of development is watching HTTP transactions. To that end, I found\n`Fiddler `_ on Windows and `HTTPScoop\n`_ invaluable. The latter won't URL-decode nor nicely\nformat JSON but the `Online JavaScript Beautifier `_\nwill.\n\nCredits\n=======\n\nPaul Makepeace, author, \n\nDavid Huynh, `initial cut `_\n\n`Artfinder `_, inspiration\n\nSome data used in the test suite has been used from publicly available sources,\n\n- louisiana-elected-officials.csv: from\n http://www.sos.louisiana.gov/tabid/136/Default.aspx\n\n- us_economic_assistance.csv: `\"The Green Book\" `_\n\n- eli-lilly.csv: `ProPublica's \"Docs for Dollars\" `_ leading to a `Lilly Faculty PDF `_ processed by `David Huynh's ScraperWiki script `_", "description_content_type": null, "docs_url": null, "download_url": "UNKNOWN", "downloads": { "last_day": -1, "last_month": -1, "last_week": -1 }, "home_page": "https://github.com/PaulMakepeace/refine-client-py", "keywords": null, "license": "UNKNOWN", "maintainer": null, "maintainer_email": null, "name": "refine-client", "package_url": "https://pypi.org/project/refine-client/", "platform": "Any", "project_url": "https://pypi.org/project/refine-client/", "project_urls": { "Download": "UNKNOWN", "Homepage": "https://github.com/PaulMakepeace/refine-client-py" }, "release_url": "https://pypi.org/project/refine-client/0.2.1/", "requires_dist": null, "requires_python": null, "summary": "The Google Refine Python Client Library provides an interface to communicating with a Google Refine server.", "version": "0.2.1" }, "last_serial": 798742, "releases": { "0.1.0": [ { "comment_text": "", "digests": { "md5": "fe917131ecbfc93703af4cc77193b883", "sha256": "e1e079f6675ca99d36dda2f3a25017b31fcc3c1dada546de6845e926a9af7621" }, "downloads": -1, "filename": "refine-client-0.1.0.tar.gz", "has_sig": false, "md5_digest": "fe917131ecbfc93703af4cc77193b883", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 540394, "upload_time": "2011-05-01T21:39:24", "url": "https://files.pythonhosted.org/packages/f4/ec/987c6bfc59c9a8de5ae5647c0a4e16944f233173bece4d5f3007caf33822/refine-client-0.1.0.tar.gz" } ], "0.2.1": [ { "comment_text": "", "digests": { "md5": "901820ddec5afb06959029bfd680b97a", "sha256": "05f82b559b060cc00634423fb5767153aa803bdec3f3e4f769f0442b305e6280" }, "downloads": -1, "filename": "refine-client-0.2.1.tar.gz", "has_sig": false, "md5_digest": "901820ddec5afb06959029bfd680b97a", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 550920, "upload_time": "2011-07-22T12:55:44", "url": "https://files.pythonhosted.org/packages/1e/cc/b5df9928f76fdf13371b4a8ba89c6d44d9ea9c9af96add379971bed8a912/refine-client-0.2.1.tar.gz" } ] }, "urls": [ { "comment_text": "", "digests": { "md5": "901820ddec5afb06959029bfd680b97a", "sha256": "05f82b559b060cc00634423fb5767153aa803bdec3f3e4f769f0442b305e6280" }, "downloads": -1, "filename": "refine-client-0.2.1.tar.gz", "has_sig": false, "md5_digest": "901820ddec5afb06959029bfd680b97a", "packagetype": "sdist", "python_version": "source", "requires_python": null, "size": 550920, "upload_time": "2011-07-22T12:55:44", "url": "https://files.pythonhosted.org/packages/1e/cc/b5df9928f76fdf13371b4a8ba89c6d44d9ea9c9af96add379971bed8a912/refine-client-0.2.1.tar.gz" } ] }