arachnode.net is open-source web-search software. It builds on C#, MS-SQL 2005/2008, and Lucene.NET, adding a crawler, a link-graph database, parsers for HTML, and an extensible plugin architecture.
Features:
Multi-threaded crawler.
Pre- and post-request crawl rules and actions.
Full-text search via Lucene.NET and SQL Server 2005/2008.
Microsoft Word, PowerPoint, Excel, and Adobe PDF indexing.
Web page parsing.
HTML to XML/XHTML conversion.
EXIF data extraction.
Web and web service search interface.
SSIS packages and CLR functions for term and phrase extraction.
Visual Studio 2008 solution and MS-SQL 2005/2008 database.
What is new in this release:
Dynamic content rendering and DOM interaction capabilities.
Requirements:
Visual Studio 2008, MS-SQL Server 2005/2008
Limitations:
Crawl time is limited