Expat non validating xml parser

XML was initially "developed by a W3C Generic SGML Editorial Review Board formed under the auspices of the W3 Consortium in 1996 and chaired by Jon Bosak of Sun Microsystems, with the very active participation of a Generic SGML Working Group also organized by the W3C." An XML WG (Working Group) under W3C served initially as an editorial board, which received input from an XML Special Interest Group.

As of late 1998, the XML design effort was re-chartered under the direction of an XML Coordination Group and XML Plenary Interest Group to be carried out in five new XML working groups: XML Schema Working Group, XML Fragment Working Group, XML Linking Working Group (XLink and XPointer), XML Information Set Working Group, and XML Syntax Working Group.

Structured information contains both content (words, pictures, etc.) and some indication of what role that content plays (for example, content in a section heading has a different meaning from content in a footnote, which means something different than content in a figure caption or content in a database table, etc.). A markup language is a mechanism to identify structures in a document.

The XML specification defines a standard way to add markup to documents.

The next version of 'HTML' is expected to be reformulated as an XML application, so that it will be based upon XML rather than upon SGML.

As of December 1998, 'Voyager' was the W3C code name for HTML reformulated as an application of XML.

Several introductory and tutorial articles on the Extensible Markup Language (XML) are referenced in the shorter XML Introduction document. "The Extensible Markup Language (XML) is the universal format for structured documents and data on the Web." -- W3C XML Web site, 2000-07-06.

It is an Open Source module, provided under an Apache-style license.

In addition to covering the XML 1.0 Specification, this article outlines related XML specifications, which are evolving.

The article is organized in four main sections plus an appendix. XML is a markup language for documents containing structured information.

(I suspect all the comparisons to comments are kinda misleading/unhelpful.) from an XML document or external parsed entity, a CDATA section is a section of element content that is marked for the parser to interpret as only character data, not markup.

" thus text inside CDATA is seen by the parser but only as characters not as XML nodes. You can use this to escape some characters which otherwise will be treated as regular XML. For example, if you want to pass a URL that contains And as a explanation for why this example uses CData (and not the appropriate pub Data and dc:creator tags)..is for website display using a RSS widget for which we have no real formatting control.