{ "info": { "author": "Stefan Marsiske", "author_email": "stefan.marsiske@gmail.com", "bugtrack_url": null, "classifiers": [ "Development Status :: 4 - Beta", "Environment :: Web Environment", "License :: OSI Approved :: GNU Affero General Public License v3", "Topic :: Internet :: WWW/HTTP", "Topic :: Internet :: WWW/HTTP :: HTTP Servers", "Topic :: Security", "Topic :: Security :: Cryptography" ], "description": "Welcome to urlclean's documentation!\n************************************\n\nurlclean provides functions:\n\n* to follow a http redirect,\n\n* to follow a HTML META redirect,\n\n* to remove Urchin and Facebook tracker URL parameters,\n\n* plugins for futher cleaning power,\n\n* combines all these to unshorten and resolve various URLS\n\nTry it out from the commandline:\n\n python -m urlclean \n\nContents:\n\n\nIndices and tables\n******************\n\n* *Index*\n\n* *Module Index*\n\n* *Search Page*\n\n\nDocumentation for the Code\n==========================\n\nurlcleaner a module that resolves redirected urls and removes tracking\nurl params\n\nurlclean.weedparams(url)\n\n removes Urchin Tracker and Facebook surveillance params from urls.\n\n Args:\n\n url (str): The url to scrub of ugly params\n\n Returns:\n\n (str). The return cleaned url\n\nurlclean.httpresolve(url, ua=None, proxyhost='', proxyport='')\n\n resolve one redirection of a http request.\n\n Args:\n\n url (str): The url to follow one redirect\n\n ua (fn): A function returning a User Agent string (optional)\n\n proxyhost (str): http proxy server (optional)\n\n proxyport (int): http proxy server port (optional)\n\n Returns: (str, httplib.response). The return resolved url, and\n the response from the http query\n\nurlclean.unmeta(url, res)\n\n Finds any meta redirects a httplib.response object that has\n text/html as content-type.\n\n Args:\n\n url (str): The url to follow one redirect\n\n res (httplib.response): a http.response object\n\n Returns: (str). The return resolved url\n\nurlclean.unshorten(url, cache=None, ua=None, >>**<>**<