Links:
Beautiful Soup
Beautiful Soup is a self-contained parser that makes screen-scraping easy. It parses both good and bad HTML and XML and offers methods for traversing the parse tree and extracting specific parts of a document.

ESPX
ESPX is an ECMAScript-coded parser for a subset of XML 1.0.

eZ xml
eZ xml is an XML DOM parser written in pure PHP. It's fully compatible with the libXml parser used in eZ publish, but does not require any external libraries.

HotSAX
HotSAX is a small, fast SAX2 parser for HTML, XHTML and XML.

HXML
HXML is a non-validating XML parser written in Haskell. It is designed for space-efficiency, taking advantage of lazy evaluation to reduce memory requirements. HXML may be used as a drop-in replacement for the HaXml parser in existing programs.

NekoXNI
NekoXNI is a collection of small, useful XML tools written for the Xerces Native Interface (XNI), which is the foundation of the Xerces2 implementation. The NekoXNI tools are written to illustrate the power and flexibility of the XNI framework as well as to provide useful tools for XML application developers.

NQXML
NQXML is a pure Ruby implementation of a non-validating XML processor. It includes an XML tokenizer, a SAX-style streaming XML parser, a DOM-style tree XML parser, and an XML writer.

phpXML
phpXML is a fast XPath parser written in plain PHP.

Python Browsing Probe
Python Browsing Probe is a web test tool based on John J. Lee's mechanize. It exposes the browser functionality at the level of a shell-like interpreter so that testers can quickly write tests in a simple language designed specifically for that purpose.

RapidXml
RapidXml is an attempt to create the fastest XML DOM parser possible, while retaining usability, portability, and reasonable W3C compatibility. It is an in-situ parser written in C++, with parsing speed approaching that of the strlen() function executed on the same data. The entire parser is contained in a single header file, so no building or configuration is necessary. It has no dependencies other than a very small subset of the standard C++ library, which can easily be emulated manually if use of the standard library is not desired.

REXML
Ruby Electric XML: an XML parser written in and for the Ruby programming language.

SAX
SAX is the Simple API for XML, originally a Java-only API. SAX was the first widely adopted API for XML in Java, and is a "de facto" standard.

TestLink
TestLink is a testcase management and execution tracking system built using PHP, MySQL, and several other open source tools. It is a tool that helps streamline the software development process.

Xerces-C++
Xerces-C++ is a validating XML parser written in a portable subset of C++. Xerces-C++ makes it easy to give your application the ability to read and write XML data.

Xerces2
Xerces2 is the next generation of high performance, fully compliant XML parsers in the Apache Xerces family.

xmlwrapp
xmlwrapp is a clean and easy-to-use C++ wrapper around popular XML parsers found in the wild. It is not meant to expose every feature of the backend parser but instead to support a high-level API for working with XML data.