Home > Articles > Web Services > XML

XML Reference Guide

📄 Contents

  1. XML Reference Guide
  2. Overview
  3. What Is XML?
  4. Informit Articles and Sample Chapters
  5. Books and e-Books
  6. Official Documentation
  7. Table of Contents
  8. The Document Object Model
  9. Informit Articles and Sample Chapters
  10. Books and e-Books
  11. Official Documentation
  12. DOM and Java
  13. Informit Articles and Sample Chapters
  14. Books and e-Books
  15. Implementations
  16. DOM and JavaScript
  17. Using a Repeater
  18. Repeaters and XML
  19. Repeater Resources
  20. DOM and .NET
  21. Informit Articles and Sample Chapters
  22. Books and e-Books
  23. Documentation and Downloads
  24. DOM and C++
  25. DOM and C++ Resources
  26. DOM and Perl
  27. DOM and Perl Resources
  28. DOM and PHP
  29. DOM and PHP Resources
  30. DOM Level 3
  31. DOM Level 3 Core
  32. DOM Level 3 Load and Save
  33. DOM Level 3 XPath
  34. DOM Level 3 Validation
  35. Informit Articles and Sample Chapters
  36. Books and e-Books
  37. Documentation and Implementations
  38. The Simple API for XML (SAX)
  39. Informit Articles and Sample Chapters
  40. Books and e-Books
  41. Official Documentation
  42. SAX and Java
  43. Informit Articles and Sample Chapters
  44. Books and e-Books
  45. SAX and .NET
  46. Informit Articles and Sample Chapters
  47. SAX and Perl
  48. SAX and Perl Resources
  49. SAX and PHP
  50. SAX and PHP Resources
  51. Validation
  52. Informit Articles and Sample Chapters
  53. Books and e-Books
  54. Official Documentation
  55. Document Type Definitions (DTDs)
  56. Informit Articles and Sample Chapters
  57. Books and e-Books
  58. Official Documentation
  59. XML Schemas
  60. Informit Articles and Sample Chapters
  61. Books and e-Books
  62. Official Documentation
  63. RELAX NG
  64. Informit Articles and Sample Chapters
  65. Books and e-Books
  66. Official Documentation
  67. Schematron
  68. Official Documentation and Implementations
  69. Validation in Applications
  70. Informit Articles and Sample Chapters
  71. Books and e-Books
  72. XSL Transformations (XSLT)
  73. Informit Articles and Sample Chapters
  74. Books and e-Books
  75. Official Documentation
  76. XSLT in Java
  77. Java in XSLT Resources
  78. XSLT and RSS in .NET
  79. XSLT and RSS in .NET Resources
  80. XSL-FO
  81. Informit Articles and Sample Chapters
  82. Books and e-Books
  83. Official Documentation
  84. XPath
  85. Informit Articles and Sample Chapters
  86. Books and e-Books
  87. Official Documentation
  88. XML Base
  89. Informit Articles and Sample Chapters
  90. Official Documentation
  91. XHTML
  92. Informit Articles and Sample Chapters
  93. Books and e-Books
  94. Official Documentation
  95. XHTML 2.0
  96. Documentation
  97. Cascading Style Sheets
  98. Informit Articles and Sample Chapters
  99. Books and e-Books
  100. Official Documentation
  101. XUL
  102. XUL References
  103. XML Events
  104. XML Events Resources
  105. XML Data Binding
  106. Informit Articles and Sample Chapters
  107. Books and e-Books
  108. Specifications
  109. Implementations
  110. XML and Databases
  111. Informit Articles and Sample Chapters
  112. Books and e-Books
  113. Online Resources
  114. Official Documentation
  115. SQL Server and FOR XML
  116. Informit Articles and Sample Chapters
  117. Books and e-Books
  118. Documentation and Implementations
  119. Service Oriented Architecture
  120. Web Services
  121. Informit Articles and Sample Chapters
  122. Books and e-Books
  123. Official Documentation
  124. Creating a Perl Web Service Client
  125. SOAP::Lite
  126. Amazon Web Services
  127. Creating the Movable Type Plug-in
  128. Perl, Amazon, and Movable Type Resources
  129. Apache Axis2
  130. REST
  131. REST Resources
  132. SOAP
  133. Informit Articles and Sample Chapters
  134. Books and e-Books
  135. Official Documentation
  136. SOAP and Java
  137. Informit Articles and Sample Chapters
  138. Books and e-Books
  139. Official Documentation
  140. WSDL
  141. Informit Articles and Sample Chapters
  142. Books and e-Books
  143. Official Documentation
  144. UDDI
  145. UDDI Resources
  146. XML-RPC
  147. XML-RPC in PHP
  148. Informit Articles and Sample Chapters
  149. Books and e-Books
  150. Official Documentation
  151. Ajax
  152. Asynchronous Javascript
  153. Client-side XSLT
  154. SAJAX and PHP
  155. Ajax Resources
  156. JSON
  157. Ruby on Rails
  158. Creating Objects
  159. Ruby Basics: Arrays and Other Sundry Bits
  160. Ruby Basics: Iterators and Persistence
  161. Starting on the Rails
  162. Rails and Databases
  163. Rails: Ajax and Partials
  164. Rails Resources
  165. Web Services Security
  166. Web Services Security Resources
  167. SAML
  168. Informit Articles and Sample Chapters
  169. Books and e-Books
  170. Specification and Implementation
  171. XML Digital Signatures
  172. XML Digital Signatures Resources
  173. XML Key Management Services
  174. Resources for XML Key Management Services
  175. Internationalization
  176. Resources
  177. Grid Computing
  178. Grid Resources
  179. Web Services Resource Framework
  180. Web Services Resource Framework Resources
  181. WS-Addressing
  182. WS-Addressing Resources
  183. WS-Notifications
  184. New Languages: XML in Use
  185. Informit Articles and Sample Chapters
  186. Books and e-Books
  187. Official Documentation
  188. Google Web Toolkit
  189. GWT Basic Interactivity
  190. Google Sitemaps
  191. Google Sitemaps Resources
  192. Accessibility
  193. Web Accessibility
  194. XML Accessibility
  195. Accessibility Resources
  196. The Semantic Web
  197. Defining a New Ontology
  198. OWL: Web Ontology Language
  199. Semantic Web Resources
  200. Google Base
  201. Microformats
  202. StructuredBlogging
  203. Live Clipboard
  204. WML
  205. XHTML-MP
  206. WML Resources
  207. Google Web Services
  208. Google Web Services API
  209. Google Web Services Resources
  210. The Yahoo! Web Services Interface
  211. Yahoo! Web Services and PHP
  212. Yahoo! Web Services Resources
  213. eBay REST API
  214. WordML
  215. WordML Part 2: Lists
  216. WordML Part 3: Tables
  217. WordML Resources
  218. DocBook
  219. Articles
  220. Books and e-Books
  221. Official Documentation and Implementations
  222. XML Query
  223. Informit Articles and Sample Chapters
  224. Books and e-Books
  225. Official Documentation
  226. XForms
  227. Informit Articles and Sample Chapters
  228. Books and e-Books
  229. Official Documentation
  230. Resource Description Framework (RDF)
  231. Informit Articles and Sample Chapters
  232. Books and e-Books
  233. Official Documentation
  234. Topic Maps
  235. Informit Articles and Sample Chapters
  236. Books and e-Books
  237. Official Documentation, Implementations, and Other Resources
  238. Rich Site Summary (RSS)
  239. Informit Articles and Sample Chapters
  240. Books and e-Books
  241. Official Documentation
  242. Simple Sharing Extensions (SSE)
  243. Atom
  244. Podcasting
  245. Podcasting Resources
  246. Scalable Vector Graphics (SVG)
  247. Informit Articles and Sample Chapters
  248. Books and e-Books
  249. Official Documentation
  250. OPML
  251. OPML Resources
  252. Summary
  253. Projects
  254. JavaScript TimeTracker: JSON and PHP
  255. The Javascript Timetracker
  256. Refactoring to Javascript Objects
  257. Creating the Yahoo! Widget
  258. Web Mashup
  259. Google Maps
  260. Indeed Mashup
  261. Mashup Part 3: Putting It All Together
  262. Additional Resources
  263. Frequently Asked Questions About XML
  264. What's XML, and why should I use it?
  265. What's a well-formed document?
  266. What's the difference between XML and HTML?
  267. What's the difference between HTML and XHTML?
  268. Can I use XML in a browser?
  269. Should I use elements or attributes for my document?
  270. What's a namespace?
  271. Where can I get an XML parser?
  272. What's the difference between a well-formed document and a valid document?
  273. What's a validating parser?
  274. Should I use DOM or SAX for my application?
  275. How can I stop a SAX parser before it has parsed the entire document?
  276. 2005 Predictions
  277. 2006 Predictions
  278. Nick's Book Picks

The Semantic Web is a great idea. It defines a way in which content can be marked up so that it may be understood by various applications that can make sense of data that is currently opaque to analysis. It allows meaningful relationships to be drawn from the data by machines as well as people. But it has one tiny minor little problem.

The web already exists.

Okay, so that's not a tiny minor little problem. It's a huge, honking problem. Why? Well, for two reasons. The first reason is that millions, or perhaps billions of pages already exist, and they don't have Semantic Web information in them. The second reason is that thousands, or perhaps millions of web programmers and content authors already exists, and they don't have the Semantic Web mindset in them.

That's not to say that they don't have an appreciation for what could be achieved with the Semantic Web. Most people would love to see a world that includes Tim Berners-Lee's vision of the Web as a place where you can perform actions such as having an intelligent agent coordinate schedules and make a doctor's appointment for you, but they don't want to have to learn complicated RDF to make it happen.

Enter microformats. Microformats are, on the surface, standard ways of representing commonly published information. For example, the vCard, a standard way of exchanging "business card"-type information has been around for more than a decade, and the iCalendar standard has been around almost that long. Both of them provide a standard, interoperable way of providing information that is commonly published on the Web. If an easy -- and most of all, standard -- way can be found for developers and content authors to represent this information in web pages, it can be available for semantic searches.

Note the use of the lowercase "s" in "semantic". Tantek Çelik has been giving talks on microformats and calling it the "semantic web", emphasizing the lowercase nature of it. This is not the Semantic Web of RDF and OWL, where the emphasis is on making data machine-readable. Instead, the emphasis is all in making data that is human readable first, and machine-readable second.

In essence, the microformats movement has two aspects. The first is to define the information represented by a particular kind of entity. For example, a vCard has the following information:

BEGIN:VCARD
VERSION:3.0
N:Chase;Nicholas
FN:Nicholas Chase
URL:http://nicholaschase.com/
ORG:InformIT
END:VCARD

The hCard microformat takes this information and provides a standard way to represent it in XHTML:

<div class="vcard">
 <a class="url fn" href="http://nicholaschase.com/">Nicholas Chase</a>

 <div class="org">InformIT</div>
</div>

Note the use of classes to specify information. This is a feature of microformats; they try very hard not to change what people are doing, but rather to simply add on in a way that is useful. For example, because this is XHTML with class information, you can simply create a style sheet such as:

<style type="text/css">
   .vcard {border: 1px dotted black}
   .fn {font-weight: bold; text-decoration: none}
   .org {display: inline}
</style>

And the hCard would be displayed as:

Adding information this way is not onerous to content authors, but it provides enough information that an aggregator could easily find specific hCard data.

And that's the other focus of microformats: defining how this information should be represented. The Microformats wiki defines them as:

simple conventions 
for embedding semantic markup 
            for a specific problem domain 
in human-readable (X)HTML/XML documents, Atom/RSS feeds, and "plain" XML 
            that normalize existing content usage patterns 
            using brief, descriptive class names 
            often based on existing interoperable standards 
to enable decentralized development 
            of resources, tools, and services 

And it does seem to be catching on. The wiki lists nine specifications, and at least as many drafts. They fall into two categories: elemental microformats and compound microformats.

Elemental microformats are meant to be composed into compound microformats. For example, the list of elemental microformats includes:

XOXO:
This microformat represents lists. In fact, to the naked eye it is indistinguishable from XHTML lists and definitions, encompassing the ol, ul, li, dl, dt, and dd elements. All are used in their normal manner, but since most people don't use the dl, dt, and dd elements properly, it is a formal declaration that they are to be used as a definition lists, definition terms, and definition descriptions, respectively. (For example, this list is marked up as a definition list.) Also, by adding the xoxo class to lists intended to be part of a microformat, you specify their purpose, and not just their presentation.
RelTag:
This microformat has a leg up in the blogging world, because of its adoption by Technorati, where Tantek Celik is Chief Technologist. It specifies that information in is a "keyword" or, more commonly, a "tag". For example, I might tag this document as:
<a href="http://www.technorati.com/tag/microformats" rel="tag">microformats</a>
to indicate that this was a document about microformats. The rel tag is what specifies that as a tag, and the href attribute is the "tag space", with the tag consisting of the last part of the URL. So while most people use URLs that point back to, say, Technorati, which maintains a listing of pages for each tag, it is only important to point to something relevant. For example, as the specification suggests, we could point to
<a href="http://www.wikipedia.com/microformats" rel="tag">microformats</a>
and still have are relevant tag.
XFN:
This is an interesting little specification that enables you to note your relationship to other people in social networks. I could create a blogroll and specify my relationship with the authors with regards to several axes, such as friendship, whether or not I've met them, geographical, professional, family, and so on. For example:
<a href="http://www.chaosmagnet.com" rel="self">Chaos Magnet</a><br />

<a href=" http://www.squidoo.com/genealogyhowtoinfo" rel="spouse">Sarah Chase</a><br />
<a href="http://www.backstopmedia.com" rel="co-worker friend met">Troy Mott</a><br />
<a href="http://www.workbench.com" rel="acquaintance colleague">Rogers Cadenhead</a><br />
RelLicense:
This format enables you to link to the license for a particular page. For example, my blog is subject to a Creative Commons license, so I could add a link of:
This weblog is licensed under a  <a href="http://creativecommons.org/licenses/by-nd/1.0/" 
rel="license">Creative Commons License</a>

The Microformats group has also specified several compound formats:

hCard:
This is the format we described at the start of this entry.
hCalendar:
Used to mark up events, appointments, and other such calendar-related information, this format encodes the information normally found in the iCalendar specification as XHTML. For example, if I wanted to add a conference on speaking and in June to my blog, I could mark it up as:
<div class="vcalendar">
<div class="vevent">
 <a class="url" href="http://www-306.ibm.com/software/rational/events/rsdc2006/">

  <span class="summary">IBM Rational Software Development Conference 2006</span>: 
  <abbr class="dtstart" title="2006-06-04">June 4</abbr>-
  <abbr class="dtend" title="2006-06-09">8</abbr>,
 at the <span class="location">Walt Disney World Swan and Dolphin Resort Orlando, Florida</span>

 </a>
</div>
<div class="vevent">
...
</div>
</div>
Note that the vCalendar div is optional; you'll want to include it, however, if you're showing more than one event. Also note that it is only the class names that are important. You can just as easily include the event in a span, and list several of them on one line. Note also that the dtend value refers to the day after the event, hence the disconnect between the data and the human readable version.
VoteLinks:
The specification provides a way to link to something without showing your approval of it. Consider this example taken from the specification:
<a rev="vote-for" href="http://ragingcow.blogspot.com"  
   title="neat spoof">Raging Cow</a>
<a rev="vote-for" href="http://ragingcow.blogspot.com"  
   title="the boycott site">I support the Raging Cow</a>

<a rev="vote-against" href="http://ragingcow.com"  
   title="nasty corn syrup drink">Raging Cow</a>
The page links to both favored an unfavored sites, using the rev attribute to distinguish between them. Yes, this is a real attribute for this element, intended to denote a "reverse link", but rarely used.
hReview:
Reviews are perhaps one of the most commonly cited purposes for microformats. This is probably because so much blogging is done to tell people what you think about something. This specification is intentionally minimalist, capturing only properties that are common to all reviews. Consider this example from the specification:
<div class="hreview">
 <span><span class="rating">5</span> out of 5 stars</span>
 <h4 class="summary">Crepes on Cole is awesome</h4>

 <span class="reviewer vcard">Reviewer: <span class="fn">Tantek</span> - 
 <abbr class="dtreviewed" title="20050418T2300-0700">April 18, 2005</abbr></span>
 <div class="description item vcard"><p>

  <span class="fn org">Crepes on Cole</span> is one of the best little 
  creperies in <span class="adr"><span class="locality">San Francisco</span></span>.
  Excellent food and service. Plenty of tables in a variety of sizes 
  for parties large and small.  Window seating makes for excellent 
  people watching to/from the N-Judah which stops right outside.  
  I've had many fun social gatherings here, as well as gotten 
  plenty of work done thanks to neighborhood WiFi.
 </p></div>

 <p>Visit date: <span>April 2005</span></p>
 <p>Food eaten: <span>Florentine crepe</span></p>
</div>
Notice that the hReview specification makes use of other microformats, such as hCard. This is, after all, a compound specification, and does include other attributes for specific types of reviews, such as product reviews or "multidimensional" reviews. See the specification for more information.

Now we need an easy way for authors to generate this information.

InformIT Promotional Mailings & Special Offers

I would like to receive exclusive offers and hear about products from InformIT and its family of brands. I can unsubscribe at any time.