HTML5Lib (Python)

Software Screenshot:
HTML5Lib (Python)
Software Details:
Version: 0.99999 / 1.0b3
Upload Date: 12 May 15
Distribution Type: Freeware
Downloads: 46

Rating: nan/5 (Total Votes: 0)

It follows the original WHATWG official HTML5 specification.

The parser is designed to handle all flavours of HTML and parses invalid documents using well-defined error handling rules compatible with the behaviour of major desktop web browsers.

The output is palced inside a tree structure.

It supports output to ElementTree, DOM and lxml tree formats as well as a simple custom format.

HTML5Lib is packaged with distutils.

HTML5Lib is also available in:

Ruby - download HTML5Lib for Ruby here.
Python - download HTML5Lib for Python here.
PHP - download HTML5Lib for PHP here.

What is new in this release:

  • Parses valid and invalid HTML documents to a tree
  • Support for minidom, ElementTree (including cElementTree and lxml.etree), BeautifulSoup (deprecated) and custom simpletree output formats
  • DOM to SAX converter
  • Reports parse errors
  • Character encoding detection
  • Filtering and serializing of trees
  • HTML+CSS sanitizer
  • Many unit tests

Similar Software

screenfull.js
screenfull.js

10 Dec 15

Compass
Compass

28 Feb 15

Less4j
Less4j

28 Feb 15

Dindent
Dindent

13 Apr 15

Other Software of Developer HTML5Lib Development Team

HTML5Lib (PHP)
HTML5Lib (PHP)

21 Jul 15

Comments to HTML5Lib (Python)

Comments not found
Add Comment
Turn on images!