tagsoup - A SAX-compliant parser written in Java that parses HTML as it is found in the wild: nasty and brutish
| Website: | http://mercury.ccil.org/~cowan/XML/tagsoup/ |
|---|---|
| License: | GPL |
| Vendor: | JPackage Project |
- Description:
TagSoup is a SAX-compliant parser written in Java that, instead of parsing well-formed or valid XML, parses HTML as it is found in the wild: nasty and brutish, though quite often far from short. TagSoup is designed for people who have to process this stuff using some semblance of a rational application design. By providing a SAX interface, it allows standard XML tools to be applied to even the worst HTML.
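As a rough illustration of what "providing a SAX interface" means in practice, the sketch below collects link targets with an ordinary SAX `ContentHandler` that works with any `XMLReader`. The names `LinkExtractor`, `LinkHandler`, and `extractLinks` are hypothetical and are not part of TagSoup. So that the sketch runs without extra jars, `main` uses the JDK's built-in XML parser on well-formed input as a stand-in; that parser would reject real-world HTML.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.XMLReader;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical example class; not shipped with TagSoup.
public class LinkExtractor {

    /** Collects the href values of <a> elements from any SAX event stream. */
    static class LinkHandler extends DefaultHandler {
        final List<String> links = new ArrayList<>();

        @Override
        public void startElement(String uri, String localName, String qName, Attributes atts) {
            // localName is empty when the reader is not namespace-aware, so fall back to qName.
            String name = localName.isEmpty() ? qName : localName;
            if ("a".equalsIgnoreCase(name)) {
                String href = atts.getValue("href");
                if (href != null) {
                    links.add(href);
                }
            }
        }
    }

    /** Parses the markup with the supplied reader and returns the link targets in document order. */
    public static List<String> extractLinks(XMLReader reader, String markup)
            throws IOException, SAXException {
        LinkHandler handler = new LinkHandler();
        reader.setContentHandler(handler);
        reader.parse(new InputSource(new StringReader(markup)));
        return handler.links;
    }

    public static void main(String[] args) throws Exception {
        // Stand-in reader: the JDK's XML parser, which only accepts well-formed input.
        SAXParserFactory factory = SAXParserFactory.newInstance();
        factory.setNamespaceAware(true);
        XMLReader reader = factory.newSAXParser().getXMLReader();

        String markup = "<html><body><a href=\"http://example.org/\">x</a></body></html>";
        System.out.println(extractLinks(reader, markup));
    }
}
```

With the TagSoup jar on the classpath, the stand-in reader can be replaced by `new org.ccil.cowan.tagsoup.Parser()`, which implements `XMLReader`. The handler itself should not need to change, and the same call can then be given malformed HTML.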
Packages
| Package | Size | Changelog |
|---|---|---|
| tagsoup-1.0.1-1jpp.src | 78 KiB | by Sebastiano Vigna (2007-01-20): Upgraded to 1.0.1 |
| tagsoup-1.0.1-1jpp.noarch | 60 KiB | by Sebastiano Vigna (2007-01-20): Upgraded to 1.0.1 |