Home > Articles > Web Services > XML

XML Reference Guide

📄 Contents

  1. XML Reference Guide
  2. Overview
  3. What Is XML?
  4. Informit Articles and Sample Chapters
  5. Books and e-Books
  6. Official Documentation
  7. Table of Contents
  8. The Document Object Model
  9. Informit Articles and Sample Chapters
  10. Books and e-Books
  11. Official Documentation
  12. DOM and Java
  13. Informit Articles and Sample Chapters
  14. Books and e-Books
  15. Implementations
  16. DOM and JavaScript
  17. Using a Repeater
  18. Repeaters and XML
  19. Repeater Resources
  20. DOM and .NET
  21. Informit Articles and Sample Chapters
  22. Books and e-Books
  23. Documentation and Downloads
  24. DOM and C++
  25. DOM and C++ Resources
  26. DOM and Perl
  27. DOM and Perl Resources
  28. DOM and PHP
  29. DOM and PHP Resources
  30. DOM Level 3
  31. DOM Level 3 Core
  32. DOM Level 3 Load and Save
  33. DOM Level 3 XPath
  34. DOM Level 3 Validation
  35. Informit Articles and Sample Chapters
  36. Books and e-Books
  37. Documentation and Implementations
  38. The Simple API for XML (SAX)
  39. Informit Articles and Sample Chapters
  40. Books and e-Books
  41. Official Documentation
  42. SAX and Java
  43. Informit Articles and Sample Chapters
  44. Books and e-Books
  45. SAX and .NET
  46. Informit Articles and Sample Chapters
  47. SAX and Perl
  48. SAX and Perl Resources
  49. SAX and PHP
  50. SAX and PHP Resources
  51. Validation
  52. Informit Articles and Sample Chapters
  53. Books and e-Books
  54. Official Documentation
  55. Document Type Definitions (DTDs)
  56. Informit Articles and Sample Chapters
  57. Books and e-Books
  58. Official Documentation
  59. XML Schemas
  60. Informit Articles and Sample Chapters
  61. Books and e-Books
  62. Official Documentation
  63. RELAX NG
  64. Informit Articles and Sample Chapters
  65. Books and e-Books
  66. Official Documentation
  67. Schematron
  68. Official Documentation and Implementations
  69. Validation in Applications
  70. Informit Articles and Sample Chapters
  71. Books and e-Books
  72. XSL Transformations (XSLT)
  73. Informit Articles and Sample Chapters
  74. Books and e-Books
  75. Official Documentation
  76. XSLT in Java
  77. Java in XSLT Resources
  78. XSLT and RSS in .NET
  79. XSLT and RSS in .NET Resources
  80. XSL-FO
  81. Informit Articles and Sample Chapters
  82. Books and e-Books
  83. Official Documentation
  84. XPath
  85. Informit Articles and Sample Chapters
  86. Books and e-Books
  87. Official Documentation
  88. XML Base
  89. Informit Articles and Sample Chapters
  90. Official Documentation
  91. XHTML
  92. Informit Articles and Sample Chapters
  93. Books and e-Books
  94. Official Documentation
  95. XHTML 2.0
  96. Documentation
  97. Cascading Style Sheets
  98. Informit Articles and Sample Chapters
  99. Books and e-Books
  100. Official Documentation
  101. XUL
  102. XUL References
  103. XML Events
  104. XML Events Resources
  105. XML Data Binding
  106. Informit Articles and Sample Chapters
  107. Books and e-Books
  108. Specifications
  109. Implementations
  110. XML and Databases
  111. Informit Articles and Sample Chapters
  112. Books and e-Books
  113. Online Resources
  114. Official Documentation
  115. SQL Server and FOR XML
  116. Informit Articles and Sample Chapters
  117. Books and e-Books
  118. Documentation and Implementations
  119. Service Oriented Architecture
  120. Web Services
  121. Informit Articles and Sample Chapters
  122. Books and e-Books
  123. Official Documentation
  124. Creating a Perl Web Service Client
  125. SOAP::Lite
  126. Amazon Web Services
  127. Creating the Movable Type Plug-in
  128. Perl, Amazon, and Movable Type Resources
  129. Apache Axis2
  130. REST
  131. REST Resources
  132. SOAP
  133. Informit Articles and Sample Chapters
  134. Books and e-Books
  135. Official Documentation
  136. SOAP and Java
  137. Informit Articles and Sample Chapters
  138. Books and e-Books
  139. Official Documentation
  140. WSDL
  141. Informit Articles and Sample Chapters
  142. Books and e-Books
  143. Official Documentation
  144. UDDI
  145. UDDI Resources
  146. XML-RPC
  147. XML-RPC in PHP
  148. Informit Articles and Sample Chapters
  149. Books and e-Books
  150. Official Documentation
  151. Ajax
  152. Asynchronous Javascript
  153. Client-side XSLT
  154. SAJAX and PHP
  155. Ajax Resources
  156. JSON
  157. Ruby on Rails
  158. Creating Objects
  159. Ruby Basics: Arrays and Other Sundry Bits
  160. Ruby Basics: Iterators and Persistence
  161. Starting on the Rails
  162. Rails and Databases
  163. Rails: Ajax and Partials
  164. Rails Resources
  165. Web Services Security
  166. Web Services Security Resources
  167. SAML
  168. Informit Articles and Sample Chapters
  169. Books and e-Books
  170. Specification and Implementation
  171. XML Digital Signatures
  172. XML Digital Signatures Resources
  173. XML Key Management Services
  174. Resources for XML Key Management Services
  175. Internationalization
  176. Resources
  177. Grid Computing
  178. Grid Resources
  179. Web Services Resource Framework
  180. Web Services Resource Framework Resources
  181. WS-Addressing
  182. WS-Addressing Resources
  183. WS-Notifications
  184. New Languages: XML in Use
  185. Informit Articles and Sample Chapters
  186. Books and e-Books
  187. Official Documentation
  188. Google Web Toolkit
  189. GWT Basic Interactivity
  190. Google Sitemaps
  191. Google Sitemaps Resources
  192. Accessibility
  193. Web Accessibility
  194. XML Accessibility
  195. Accessibility Resources
  196. The Semantic Web
  197. Defining a New Ontology
  198. OWL: Web Ontology Language
  199. Semantic Web Resources
  200. Google Base
  201. Microformats
  202. StructuredBlogging
  203. Live Clipboard
  204. WML
  205. XHTML-MP
  206. WML Resources
  207. Google Web Services
  208. Google Web Services API
  209. Google Web Services Resources
  210. The Yahoo! Web Services Interface
  211. Yahoo! Web Services and PHP
  212. Yahoo! Web Services Resources
  213. eBay REST API
  214. WordML
  215. WordML Part 2: Lists
  216. WordML Part 3: Tables
  217. WordML Resources
  218. DocBook
  219. Articles
  220. Books and e-Books
  221. Official Documentation and Implementations
  222. XML Query
  223. Informit Articles and Sample Chapters
  224. Books and e-Books
  225. Official Documentation
  226. XForms
  227. Informit Articles and Sample Chapters
  228. Books and e-Books
  229. Official Documentation
  230. Resource Description Framework (RDF)
  231. Informit Articles and Sample Chapters
  232. Books and e-Books
  233. Official Documentation
  234. Topic Maps
  235. Informit Articles and Sample Chapters
  236. Books and e-Books
  237. Official Documentation, Implementations, and Other Resources
  238. Rich Site Summary (RSS)
  239. Informit Articles and Sample Chapters
  240. Books and e-Books
  241. Official Documentation
  242. Simple Sharing Extensions (SSE)
  243. Atom
  244. Podcasting
  245. Podcasting Resources
  246. Scalable Vector Graphics (SVG)
  247. Informit Articles and Sample Chapters
  248. Books and e-Books
  249. Official Documentation
  250. OPML
  251. OPML Resources
  252. Summary
  253. Projects
  254. JavaScript TimeTracker: JSON and PHP
  255. The Javascript Timetracker
  256. Refactoring to Javascript Objects
  257. Creating the Yahoo! Widget
  258. Web Mashup
  259. Google Maps
  260. Indeed Mashup
  261. Mashup Part 3: Putting It All Together
  262. Additional Resources
  263. Frequently Asked Questions About XML
  264. What's XML, and why should I use it?
  265. What's a well-formed document?
  266. What's the difference between XML and HTML?
  267. What's the difference between HTML and XHTML?
  268. Can I use XML in a browser?
  269. Should I use elements or attributes for my document?
  270. What's a namespace?
  271. Where can I get an XML parser?
  272. What's the difference between a well-formed document and a valid document?
  273. What's a validating parser?
  274. Should I use DOM or SAX for my application?
  275. How can I stop a SAX parser before it has parsed the entire document?
  276. 2005 Predictions
  277. 2006 Predictions
  278. Nick's Book Picks

I hear a lot of complaining about SOAP, and about WSDL, about how it's too complicated, and too hard to use, and how the world would somehow be magically better if SOAP had never been invented. Look at Yahoo! Web Services, they say, they use REST, they're much simpler than Google's SOAP-based system.

Well, the complainers are right about one thing: Yahoo! Web Services does use REST. And if all you want is to request the data and look at it, then yes, it's simpler. But I like to do more than look at my data.

So having said all that, maybe you've got it in your head that using Google's API is going to be complicated, even difficult.

Well, you're wrong.

In fact, when I first set out to put together the sample application for this section, I'd intended for it to be a Java GUI application. I thought that might be a nice example of the fact that you can use these APIs to take data off the web. It wasn't long before I realized that actually using the API -- at least, the Java implementation of it that Google provides -- was only a tiny straightforward fraction of the code. Putting together the GUI was many times more complex in comparison.

So in the spirit of keeping things simple, let's build a small command-line application that requests the top 10 web pages for a particular query. Along the way we'll discuss the various options, and by the time we're through you'll have a good idea of how to incorporate this functionality into your own applications.

(You'll also see how easy it can be to use the kind of API you can generate from WSDL file. This API isn't directly generated, but it's the kind of thing that WSDL lets you do.)

First, let's talk about how this works in general terms. In order to display the items for a particular search, you need to execute these 5 steps:

  1. Create a "search" object.
  2. Set its credentials
  3. Set the query you want to run
  4. Execute the search
  5. Examine each record returned by the search, retrieving the appropriate data

Let's see it in action. Before you start, make sure to download the Google API from http://www.google.com.

First, we'll create the basic application:

public class GoogleWatch {

   public static void main(String args[]){

        String keyword = args[0];
        System.out.println("You entered "+keyword);
 
   }

}

This application runs from the command line and takes a single argument, the query you want to run. (We'll talk more about the actual queries in a few moments.)

Next, we'll create the search and set it's credentials:

import com.google.soap.search.*;

public class GoogleWatch {

   public static void main(String args[]){

        String keyword = args[0];

        GoogleSearch siteSearch = new GoogleSearch();
        siteSearch.setKey("181gQtaOdlLD+83qvh6PnssQR5WlaB9d");

   }

}

Note that in order to compile and run this application, you will need to make sure that the classpath includes the googleapi.jar file downloaded with the SDK.

The actual key you're including with the setKey() method is your Google license key.. If you don't already have one, you'll need to go to http://www.google.com/apis and request one. This key is how Google limits you to 1000 queries a day.

Next we set the query:

import com.google.soap.search.*;

public class GoogleWatch {

   public static void main(String args[]){

        String keyword = args[0];

        GoogleSearch siteSearch = new GoogleSearch();
        siteSearch.setKey("181gQtaOdlLD+83qvh6PnssQR5WlaB9d");

        try {

           siteSearch.setQueryString(keyword);

        } catch (GoogleSearchFault f) {
           System.out.println("The call to the Google Web APIs failed:");
           System.out.println(f.toString());
        }


   }

}

Note that you don't need to learn a whole new means of building queries, the web services API uses the exact same queries the web interface does. So I could simply search for sites that reference "Darth Vader," with a simple query of "Darth Vader" or I can search for sites that link to "The Darth Side," a fantastic piece of fan fiction, with a query of:

link:http://darthside.blogspot.com/

I could even look for syndicated feeds referencing the wayward Jedi:

darth vader (filetype:rss OR filetype:atom OR filetype:rdf)

Notice the boolean-type rules still apply, even though we're not using the web interface.

Now we're ready to perform the actual search:

import com.google.soap.search.*;

public class GoogleWatch {

   public static void main(String args[]){

        String keyword = args[0];

        GoogleSearch siteSearch = new GoogleSearch();
        siteSearch.setKey("181gQtaOdlLD+83qvh6PnssQR5WlaB9d");

        try {
           siteSearch.setQueryString(keyword);
           GoogleSearchResult siteResults = siteSearch.doSearch();
        } catch (GoogleSearchFault f) {
           System.out.println("The call to the Google Web APIs failed:");
           System.out.println(f.toString());
        }


   }

}

The search returns a GoogleSearchResult object, which contains not just the results themselves, but also information about actual the search. For example, we can find out approximately how many results there are for a particular query:

import com.google.soap.search.*;

public class GoogleWatch {

   public static void main(String args[]){

        String keyword = args[0];

        GoogleSearch siteSearch = new GoogleSearch();
        siteSearch.setKey("181gQtaOdlLD+83qvh6PnssQR5WlaB9d");

        try {
            siteSearch.setQueryString(keyword);
            GoogleSearchResult siteResults = siteSearch.doSearch();
            int estimatedSites = siteResults.getEstimatedTotalResultsCount();
            System.out.println("There are "+estimatedSites+" matches for the query "+keyword.");
        } catch (GoogleSearchFault f) {
           System.out.println("The call to the Google Web APIs failed:");
           System.out.println(f.toString());
        }


   }

}

Note that although you can set the starting record for a search -- for example, siteSearch.setStart(50) - you can only retrieve results 1 through 1000. Anything farther down the list is not available through this interface.

The GoogleSearchResult object also holds other useful information, such as searchComments, and searchTips, which provide information intended for the end user, estimateIsExact, which is just what it sounds like, and directoryCategories, which is an array of Open Directory Project categories that match this particular search.

Then there are the results themselves. They are actually returned as an array of GoogleSearchResultElement objects, retrievable using the getResultElements(),method:

import com.google.soap.search.*;

public class GoogleWatch {

   public static void main(String args[]){

        String keyword = args[0];

        GoogleSearch siteSearch = new GoogleSearch();
        siteSearch.setKey("181gQtaOdlLD+83qvh6PnssQR5WlaB9d");

        try {
            siteSearch.setQueryString(keyword);
            GoogleSearchResult siteResults = siteSearch.doSearch();
            int estimatedSites = siteResults.getEstimatedTotalResultsCount();
            GoogleSearchResultElement[] sites = new GoogleSearchResultElement[10];
            sites = siteResults.getResultElements();
            if (sites.length == 0){
                System.out.println("No results found for query '"+keyword+"'");
            } else {
                for (int i = 0; i < sites.length; i++) {
                    GoogleSearchResultElement thisSite = sites[i];
                    String title = thisSite.getTitle();
                    String snippet = thisSite.getSnippet();
                    String url = thisSite.getURL();
                    System.out.println(title);
                    System.out.println(snippet);
                    System.out.println(url);
                    System.out.println("----------------------------");
                }
           }
        } catch (GoogleSearchFault f) {
           System.out.println("The call to the Google Web APIs failed:");
           System.out.println(f.toString());
        }


   }

}

For each result, we can pull specific information. For example, we've pulled the title, snippet, and url for each result. If we run the application, we'll see the results:

>java GoogleWatch "Darth Vader"
Star Wars: Databank | <b>Vader</b>, <b>Darth</b>
Character profile from the official Star Wars site.
http://www.starwars.com/databank/character/darthvader/
----------------------------
<b>Darth</b> <b>Vader</b> - Star Wars.com
The official site of David Prowse, the actor who played <b>Darth</b> <b>Vader</b

>. Includes<br>  character information, biography, and store.
http://www.darthvader-starwars.com/
----------------------------
STAR WARS: Injuries of <b>Darth</b> <b>Vader</b>
My father and <b>Darth</b> <b>Vader</b> fell into the crater and my father was i
nstantly killed.<br>  <b>...</b> At least one major fact has changed: Anakin Sky
walker and <b>Darth</b> <b>Vader</b> <b>...</b>

http://www.theforce.net/swtc/injuries.html
----------------------------
<b>Darth</b> <b>Vader</b> - Wikipedia, the free encyclopedia
<b>Darth</b> <b>Vader</b> was born Anakin Skywalker and is the father of twins L
uke Skywalker<br>  <b>...</b> He is given the name "<b>Darth</b> <b>Vader</
b>" in 19 BBY. Throughout the six films, <b>...</b>

http://en.wikipedia.org/wiki/Darth_Vader
----------------------------
WNC: <b>Darth</b> <b>Vader</b> Grotesque
<b>Darth</b> <b>Vader</b> Drawing Word of the competition was spread nationwide
through National<br>  Geographic World Magazine. The third-place winner was Chri
stopher Rader, <b>...</b>

http://www.cathedral.org/cathedral/discover/darth.shtml
----------------------------
Star Wars - Star Wars <b>Darth</b> <b>Vader</b> Voice Changer Mask
<b>...</b> speak into the helmetà your voice will sound just like <b>Darth</b> <

b>Vader</b>Æs! Quick tips<br>  to help achieve the best performance with the <b>
DARTH</b> <b>VADER</b> VOICE CHANGER <b>...</b>

http://www.hasbro.com/pl/page.viewproduct/product_id.15397/dn/starwars/default.c
fm
----------------------------
'boards - Screening Room
<b>Darth</b> <b>Vader's</b> heart of evil is no match for Hollywood executiv
es'. Credits:.<br>  Agency: Mother Creatives: Yan Elliot, Luke Williamson Pr
oducer: Angela Eleini <b>...</b>

http://www.boardsmag.com/screeningroom/commercials/1659/
----------------------------
The Morning News - <b>Darth</b> <b>Vader</b> Made Me Cry, by Matthew Baldwin
From the first moment you see <b>Darth</b> <b>Vader</b> choke that rebel fighter
, you know heÆs<br>  <b>...</b> <b>Darth</b> <b>Vader</b> was on the second or t
hird floorùI donÆt remember which, <b>...</b>

http://www.themorningnews.org/archives/stories/darth_vader_made_me_cry.php
----------------------------
AtomFilms - <b>Darth</b> <b>Vader's</b> Psychic Hotline
<b>Darth</b> <b>Vader's</b> Psychic Hotline United States | 4:59 | John E. H
udgens <b>...</b> One phone<br>  call solves them all with <b>Darth</b> <b>Vader

's</b> Psychic Hotline no matter whether <b>...</b>
http://www.atomfilms.com/af/content/darth_vaders_psychic
----------------------------
James Earl Jones
Voice of <b>Darth</b> <b>Vader</b>; "3rd Rock from the Sun" (1996) TV
Series (uncredited) .<br>  <b>...</b> <b>Darth</b> <b>Vader</b>; The Greatest Th
ing That Almost Happened (1977) (TV) . <b>...</b>

http://www.imdb.com/name/nm0000469/
----------------------------

Notice that the results include the "bold" tag (<b></b>) around the actual keyword, so if you were displaying this information on a web page, you could easily tell why the result was included. Other information available for each result includes:

  • summary: the Open Directory Sumary, if there is one
  • cachedSize: Google frequently saves a cached copy of pages it indexs. If there is such a copy, this property shows you how big it is
  • relatedInformationPresent: True or false, this property tells you whether Google can provide related pages for this URL (via the related: search term)
  • directoryCategory and directoryTitle: Information on how and where this URL is listed in the Open Directory Project, if applicable

Overall, the Google Web Service API is a great way to incorporate Google results into your web page or application, whether you use the API provided here or use another method, such as using XSLT to transform the data for your own purposes.

InformIT Promotional Mailings & Special Offers

I would like to receive exclusive offers and hear about products from InformIT and its family of brands. I can unsubscribe at any time.