SourceForge Files: Go to SourceForge files. VietSpider is free, open-source software. You can download the source code from here.


If you like the software, please consider a donation. Feel free to donate any amount you'd like. Donors of $50 or more receive the full version with full support.

VietSpider Web Data Extractor

The software crawls data from websites (data scraper), formats it as standard XML (Text, CDATA, ...), and then stores it in a relational database or exports it to MS Excel, CSV, ... through plugins. The product supports a variety of RDBMSs such as Oracle, MySQL, SQL Server, H2, HSQL, Apache Derby, Postgres, ... The VietSpider crawler supports sessions (login, query by form input), multiple simultaneous downloads, JavaScript handling, and proxies (including multiple proxies, found by automatically scanning proxy lists on websites), ...
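To illustrate the "format to XML (Text, CDATA, ...)" step, here is a minimal Java sketch of how scraped text might be written as either an escaped text node or a CDATA section. It is not taken from the VietSpider source: the class name CdataSketch, its methods, and the element names article, title and content are all assumptions made for this example.

```java
public class CdataSketch {

    // Wraps raw text (for example scraped HTML) in a CDATA section.
    // A literal "]]>" inside the text would end the section early,
    // so it is split across two adjacent CDATA sections.
    static String toCdata(String text) {
        return "<![CDATA[" + text.replace("]]>", "]]]]><![CDATA[>") + "]]>";
    }

    // Escapes the characters that are markup-significant in an XML text node.
    // The ampersand must be replaced first so later entities are not re-escaped.
    static String escapeText(String text) {
        return text.replace("&", "&amp;")
                   .replace("<", "&lt;")
                   .replace(">", "&gt;");
    }

    // Builds one record. The element names are hypothetical, not VietSpider's schema.
    static String toRecord(String title, String bodyHtml) {
        return "<article>"
             + "<title>" + escapeText(title) + "</title>"
             + "<content>" + toCdata(bodyHtml) + "</content>"
             + "</article>";
    }

    public static void main(String[] args) {
        System.out.println(toRecord("Tom & Jerry", "<p>Episode 1</p>"));
    }
}
```

Short fields such as a title fit well in an escaped text node, while a CDATA section keeps extracted HTML readable because its markup does not have to be escaped.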

Download Version 3 build 19 (last released: 01/15/2012): Windows (32 Bit - 64 Bit), Linux 32 Bit, Linux 64 Bit, Mac OS X (Cocoa 32), Mac OS X (Cocoa 64).

VietSpider - Vietnamese News Extractor

The new version of VietSpider can crawl and extract articles, news, and blog posts from complex Vietnamese-language sites. It also supports various RDBMSs such as Oracle, MySQL, Postgres, ... Through plugins, VietSpider can post the content to Joomla, Drupal, WordPress, NukeViet, vBulletin, ...

Latest version: build 21 (released 02/11/2015). Download: Windows, Windows with JRE, Linux.

Java HTML Parser

HTMLParser: a pure Java HTML DOM parser that supports HTML 4.01. It is fast, checks syntax, automatically closes elements with optional end tags, and can handle mismatched inline element tags.
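The two repair behaviours named above (closing elements whose end tag is optional, and recovering from mismatched inline tags) can be sketched with a small stack-based tag balancer. This is only an illustration under simplifying assumptions and does not use or reproduce the HTMLParser API. The class OptionalEndTagSketch, its methods, and the reduced rule set (only li and p are implicitly closed, and only a few void elements are known) are hypothetical.

```java
import java.util.ArrayDeque;
import java.util.Arrays;
import java.util.Deque;
import java.util.HashSet;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class OptionalEndTagSketch {

    // Matches a start or end tag; group 1 is "/" for end tags, group 2 is the tag name.
    private static final Pattern TAG = Pattern.compile("<(/?)([a-zA-Z][a-zA-Z0-9]*)[^>]*>");

    // Elements that never have content or an end tag (a deliberately short list).
    private static final Set<String> VOID =
            new HashSet<>(Arrays.asList("br", "hr", "img", "input", "meta", "link"));

    // True when opening `next` implicitly ends the currently open element `open`.
    // Only a small subset of the HTML 4.01 optional-end-tag rules is modelled.
    static boolean impliesEnd(String open, String next) {
        if (open.equals("li")) return next.equals("li");
        if (open.equals("p")) {
            return next.equals("p") || next.equals("ul")
                || next.equals("ol") || next.equals("div");
        }
        return false;
    }

    // Returns the markup with every opened element explicitly closed.
    static String normalize(String html) {
        StringBuilder out = new StringBuilder();
        Deque<String> open = new ArrayDeque<>(); // innermost open element is at the head
        Matcher m = TAG.matcher(html);
        int pos = 0;
        while (m.find()) {
            out.append(html, pos, m.start()); // copy the text before this tag
            pos = m.end();
            String name = m.group(2).toLowerCase();
            if (m.group(1).isEmpty()) { // start tag
                if (!open.isEmpty() && impliesEnd(open.peek(), name)) {
                    out.append("</").append(open.pop()).append(">");
                }
                if (!VOID.contains(name)) open.push(name);
                out.append(m.group());
            } else if (open.contains(name)) { // end tag with a matching open element
                // Close anything still open inside it first (mismatched nesting).
                while (!open.peek().equals(name)) {
                    out.append("</").append(open.pop()).append(">");
                }
                open.pop();
                out.append(m.group());
            }
            // An end tag with no matching open element is dropped.
        }
        out.append(html.substring(pos));
        while (!open.isEmpty()) out.append("</").append(open.pop()).append(">");
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(normalize("<ul><li>a<li>b</ul>"));
        System.out.println(normalize("<b><i>x</b></i>"));
    }
}
```

A real parser builds a DOM tree rather than re-serializing text and applies the full set of HTML 4.01 rules; the stack here only shows the core idea behind both repairs.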

Download Build 10: Java HTML Parser (Open Source).