Name: | tagsoup |
---|---|
Version: | 1.2.1 |
Release: | 8.el6 |
Architecture: | noarch |
Group: | Text Processing/Markup/XML |
Size: | 143062 bytes |
License: | ASL 2.0 and (GPLv2+ or AFL) |
RPM: | tagsoup-1.2.1-8.el6.noarch.rpm |
Source RPM: | tagsoup-1.2.1-8.el6.src.rpm |
Build Date: | Tue Oct 14 2014 |
Build Host: | ca-buildj3.us.oracle.com |
Vendor: | Oracle America |
URL: | http://home.ccil.org/~cowan/XML/tagsoup/ |
Summary: | A SAX-compliant HTML parser written in Java |
Description: | TagSoup is a SAX-compliant parser written in Java that, instead of parsing well-formed or valid XML, parses HTML as it is found in the wild: nasty and brutish, though quite often far from short. TagSoup is designed for people who have to process this stuff using some semblance of a rational application design. By providing a SAX interface, it allows standard XML tools to be applied to even the worst HTML. |
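
The description's point about the SAX interface can be illustrated with a small sketch: a handler written against the standard `org.xml.sax` API does not care which `XMLReader` feeds it events. The class and method names below (`LinkExtractor`, `LinkHandler`, `extractLinks`) are hypothetical and not part of this package. So that the example runs without the TagSoup jar, the convenience overload uses the JDK's built-in SAX parser and therefore only accepts well-formed markup. With the jar on the classpath, an `org.ccil.cowan.tagsoup.Parser` instance could presumably be passed to the two-argument overload instead to handle messy HTML.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper, not shipped with the tagsoup package.
final class LinkExtractor {

    /** Records the href attribute of every <a> element reported via SAX. */
    static final class LinkHandler extends DefaultHandler {
        final List<String> links = new ArrayList<>();

        @Override
        public void startElement(String uri, String localName,
                                 String qName, Attributes atts) {
            if ("a".equalsIgnoreCase(qName)) {
                String href = atts.getValue("href");
                if (href != null) {
                    links.add(href);
                }
            }
        }
    }

    /** Works with any SAX XMLReader; the handler is parser-agnostic. */
    static List<String> extractLinks(XMLReader reader, String markup) throws Exception {
        LinkHandler handler = new LinkHandler();
        reader.setContentHandler(handler);
        reader.parse(new InputSource(new StringReader(markup)));
        return handler.links;
    }

    /**
     * Convenience overload using the JDK's own SAX parser.
     * Assumption: the input is well-formed; real-world HTML would need
     * a tolerant XMLReader such as TagSoup's Parser instead.
     */
    static List<String> extractLinks(String markup) {
        try {
            XMLReader reader =
                SAXParserFactory.newInstance().newSAXParser().getXMLReader();
            return extractLinks(reader, markup);
        } catch (Exception e) {
            throw new IllegalStateException("parse failed", e);
        }
    }

    public static void main(String[] args) {
        String page = "<html><body>"
            + "<a href=\"http://example.org/\">home</a>"
            + "<p><a href=\"/about\">about</a></p>"
            + "</body></html>";
        System.out.println(extractLinks(page));
    }
}
```

Keeping the `XMLReader` as a parameter is the design choice that matters here: swapping parsers changes one constructor call, while the handler and everything downstream stay untouched.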