HTML5Lib (Python)

Software Screenshot:
HTML5Lib (Python)
Software Details:
Version: 0.99999 / 1.0b3
Upload Date: 12 May 15
Distribution Type: Freeware
Downloads: 201

Rating: nan/5 (Total Votes: 0)

It follows the original WHATWG official HTML5 specification.

The parser is designed to handle all flavours of HTML and parses invalid documents using well-defined error handling rules compatible with the behaviour of major desktop web browsers.

The output is palced inside a tree structure.

It supports output to ElementTree, DOM and lxml tree formats as well as a simple custom format.

HTML5Lib is packaged with distutils.

HTML5Lib is also available in:

Ruby - download HTML5Lib for Ruby here.
Python - download HTML5Lib for Python here.
PHP - download HTML5Lib for PHP here.

What is new in this release:

  • Parses valid and invalid HTML documents to a tree
  • Support for minidom, ElementTree (including cElementTree and lxml.etree), BeautifulSoup (deprecated) and custom simpletree output formats
  • DOM to SAX converter
  • Reports parse errors
  • Character encoding detection
  • Filtering and serializing of trees
  • HTML+CSS sanitizer
  • Many unit tests

Similar Software

FluentDOM
FluentDOM

22 Jul 15

HTML(.js)
HTML(.js)

13 Apr 15

SlickMap CSS
SlickMap CSS

21 Jul 15

Other Software of Developer HTML5Lib Development Team

HTML5Lib (PHP)
HTML5Lib (PHP)

21 Jul 15

Comments to HTML5Lib (Python)

Comments not found
Add Comment
Turn on images!