Software Details:
Version: 2.0.0
Upload Date: 6 Jun 15
Distribution Type: Freeware
Downloads: 52
Started out as a simple HTML parser for Node.js, the NodeHtmlParser module has evolved into a fully working RSS and XML parser as well.
NodeHtmlParser recently added support for client-side environments, allowing the library to be called and used inside any browser out there.
Usage examples are included with the package's README file to help developers get started on their own Web crawlers.
What is new in this release:
- Parser output minimized (unnecessary values removed)
- Element attribute list renamed from attribs to attributes
- Node types consolidated; "script" and "style" moved to "tag"
- An order of magnitude more tests, with many targeting the parser rather than just the builders
- Tests consolidated into single files per test type (e.g. parser tests, HTML tests, RSS tests)
- Testing code rewritten (e.g. direct object comparator instead of comparison of and object's JSON)
- Brand new parser, handles edge cases old parser did not
- Parser handlers renamed to builders
- Builder method signature simplified
- Moved element position calculation to builders for efficiency
- Added case-sensitivity options for tag and attribute names
Requirements:
- Python
- Django
Comments not found