Python Source
A
Ajax
Aspect-Oriented
 
B
Bloggers
Build Systems
 
C
CMS
Charting & Reporting
Code Analyzers
Connection Pools
 
D
Databases
 
E
ERP & CRM
ESB
 
F
Forum Soft
 
G
General Purpose
 
H
HTML Parsers
 
I
IDEs
Installers
Issue Tracking
 
L
Logging Tools
 
N
Network Clients
Network Servers
 
P
PDF Libraries
Parser Generators
Persistence
Project Management
 
R
RSS & RDF Tools
 
S
Search Engines
Security
Source Control
 
T
Template Engines
Testing Tools
 
W
Web Frameworks
Web Mail
Web Servers
Web Services
Web Testing
Wiki Engines
Workflow Engines
 
X
XML Parsers
 

Open Source HTML Parsers in Python

Beautiful Soup

Beautiful Soup is a Python HTML/XML parser designed for quick turnaround projects like screen-scraping. Features:

 * Beautiful Soup won't choke if you give it bad markup. It yields a parse tree that makes approximately as much sense as your original document. This is usually good enough to collect the data you need and run away.
 * Beautiful Soup provides a few simple methods and Pythonic idioms for navigating, searching, and modifying a parse tree: a toolkit for dissecting a document and extracting what you need. You don't have to create a custom parser for each application.
 * Beautiful Soup automatically converts incoming documents to Unicode and outgoing documents to UTF-8. .

Go To Beautiful Soup