Sams Teach Yourself HTML 4 in 24 Hours

Sams Teach Yourself HTML 4 in 24 Hours

By Dick Oliver

XML: Unity in Diversity

So far in this hour I've noted how HTML is in the right place at the right time to enable several key changes in business and interpersonal communication. As the people and companies of the world become more connected and dependent upon one another, HTML's capability to make all information technology easier to use and less constrained by geography seems almost magical.

Even more magically, HTML has enabled an explosion of new media formats and incompatible file types, while at the same time providing the first truly universal format for exchanging all types of information. The limitations of the HTML language itself have stood in the way of it truly fulfilling this important role, however. To address these limitations, the World Wide Web Consortium has created a new language called XML, or eXtensible Markup Language.

Before you can understand what XML is and why it may be a key to the future of computer communication, you first have to meet its mother. Her name is SGML, or Standard General Markup Language. She is a venerable, time-worn standard for describing other mark-up languages. HTML is just one of many languages that can be defined in SGML. Others include specialized languages for library indexing, print publications management, mathematical formulas, molecular chemistry, and so on. Although no one SGML browser can render all these specialized document types, any SGML browser can read the document type definition (DTD) and figure out how to render those elements of the document type that it understands while ignoring the rest.

It would be ideal if Web browsers could read any SGML document type instead of being limited to HTML documents. The problem with this idea is that SGML is so complex and powerful that it can be very difficult to learn and implement. If this book were about SGML, the title would be something like Sams Teach Yourself SGML in a Year and a Half.

XML was created to bridge the gap between HTML, which is easy to learn but kinda wimpy, and SGML, which gives you God-like powers but may require several reincarnations to master. Like SGML, XML allows you to define your own special-purpose tags and implement complex link relationships between documents. Like HTML, XML is simple enough to learn fairly quickly because it avoids all the more esoteric (and least commonly used) aspects of SGML.

XML is a general-purpose language, which includes all the HTML tags but also allows specialized extensions of the language to be easily defined without losing compatibility with the core language. XML standardizes the format of the most common types of information while freely allowing unlimited special cases for proprietary formats and new technology. This means that you can both ensure complete compatibility between the widest variety of software and easily develop unique information formats to meet your individual needs.

The ability to extend HTML pages with custom data types is far more than a way to embed a nifty movie or virtual reality scene into your Web page. To show how much more, the next section of this hour highlights some of the most exciting up-and-coming uses of XHTML and XML.

Share ThisShare This

Informit Network