Norconex HTTP Collector is a web crawler (spider) designed to crawl websites and extract the information you need for your projects. It is a portable tool that can be used from the command line or as a Java library in order to load a...

Norconex Importer is a Java library and command-line application that parses a file and extracts its content as plain text, whatever its format (for example HTML, PDF, or Word). The program allows you to specify the files that will be parsed and the...

Norconex Committer is a Java library responsible for committing, or applying, the result of a document extraction or transformation to a target data source. It allows programmers to implement this feature in their...

Norconex JEF is first and foremost a Java API library. It is meant to make life easier for developers and integrators who have to build any kind of maintenance task on a server. Norconex JEF lets you build those jobs as you normally would, using Java....

Norconex Commons Lang is a generic Java library containing utility classes that complement the Java API and are not found in commonly available libraries. It is a useful collection of tools that aims to ease the work of programmers and developers....