{ "info": { "author": "Leonard Richardson", "author_email": "leonardr@segfault.org", "bugtrack_url": null, "classifiers": [ "Development Status :: 5 - Production/Stable", "Intended Audience :: Developers", "License :: OSI Approved :: MIT License", "Programming Language :: Python", "Programming Language :: Python :: 2.7", "Programming Language :: Python :: 3", "Topic :: Software Development :: Libraries :: Python Modules", "Topic :: Text Processing :: Markup :: HTML", "Topic :: Text Processing :: Markup :: SGML", "Topic :: Text Processing :: Markup :: XML" ], "description": "Beautiful Soup is a library that makes it easy to scrape information\nfrom web pages. It sits atop an HTML or XML parser, providing Pythonic\nidioms for iterating, searching, and modifying the parse tree.\n\n# Quick start\n\n```\n>>> from bs4 import BeautifulSoup\n>>> soup = BeautifulSoup(\"
SomebadHTML\")\n>>> print soup.prettify()\n\n\n
\nSome\n\nbad\n\nHTML\n\n\n
\n\n\n\n>>> soup.find(text=\"bad\")\nu'bad'\n\n>>> soup.i\nHTML\n\n>>> soup = BeautifulSoup(\"