Microformat
From Wikipedia, the free encyclopedia
A microformat (sometimes abbreviated μF or uF) is a web-based[1] data formatting approach that seeks to re-use existing content as metadata, using only XHTML and HTML classes[2] and attributes.[3] This approach is intended to allow information intended for end-users (such as contact information, geographic coordinates, calendar events, and the like) to also be automatically processed by software.
Although the content of web pages is technically already capable of "automated processing", and has been since the inception of the web, there are certain limitations. This is because the traditional markup tags used to display information on the web do not describe what the information means.[4] Microformats are intended to bridge this gap by attaching semantics, and thereby obviate other, more complicated methods of automated processing, such as natural language processing or screen scraping. The use, adoption and processing of microformats enables data items to be indexed, searched for, saved or cross-referenced, so that information can be reused or combined.[4]
Current microformats allow the encoding and extraction of events, contact information, social relationships, and so on. More are being developed. Version 3 of the Firefox browser,[5] as well as version 8 of Internet Explorer[6] are expected to include native support for microformats.
Contents |
[edit] Background
Microformats emerged as part of a grassroots movement[4] to make recognizable data items, (such as events, contact details or geographical locations), capable of automated processing by software, as well as directly readable by end-users.[4][7] Link-based microformats emerged first. These include vote links that express opinions of the linked page, which can be tallied into instant polls by search engines.[8]
As the microformats community grew, CommerceNet, a nonprofit organization that promotes electronic commerce on the Internet, helped sponsor and promote the technology and support the microformats community in various ways.[8] CommerceNet also helped co-found the microformats community site microformats.org.[8]
Neither CommerceNet nor Microformats.org is a standards body. The microformats community is an open wiki, mailing list, and Internet relay chat (IRC) channel.[8] Most of the existing microformats were created at the Microformats.org wiki and associated mailing list, by a process of gathering examples of web publishing behaviour, then codifying it. Some other microformats (such as rel=nofollow and unAPI) have been proposed, or developed, elsewhere.
[edit] Technical overview
XHTML and HTML standards allow for semantics to be embedded and encoded within the attributes of markup tags. Microformats take advantage of these standards by indicating the presence of metadata using the following attributes:
class
rel
rev
(in one case, otherwise deprecated in microformats[3])
For example, in the text "The birds roosted at 52.48,-1.89" is a pair of numbers which may be understood, from their context, to be a set of geographic coordinates. By wrapping them in spans (or other HTML elements) with specific class names (in this case geo
, latitude
and longitude
, all part of the geo microformat specification):
The birds roosted at <span class="geo"> <span class="latitude">52.48</span>, <span class="longitude">-1.89</span> </span>
machines can be told exactly what each value represents, and can then index it, look it up on a map, export it to a GPS device, etc.
[edit] Example
Consider the contact information:
<div> <div>Joe Doe</div> <div>The Example Company</div> <div>604-555-1234</div> <a href="http://example.com/">http://example.com/</a> </div>
With hCard microformat markup, that becomes:
<div class="vcard"> <div class="fn">Joe Doe</div> <div class="org">The Example Company</div> <div class="tel">604-555-1234</div> <a class="url" href="http://example.com/">http://example.com/</a> </div>
Here the formatted name (fn), organisation (org), telephone number (tel) and web address (url) have been identified using specific class names; and the whole thing is wrapped in class="vcard"
, which indicates that the other classes form an hCard (short for "HTML vCard)"), and are not just coincidentally named. Other, optional, hCard classes also exist.
It is now possible for software, for example browser plug-ins, to extract the information, and transfer it to other applications, such as an address book.
[edit] Specific microformats
Several microformats have been developed to enable semantic markup of particular types of information.
- hAtom - for marking up Atom (standard) feeds from within standard HTML
- hCalendar - for events
- hCard - for contact information; includes:
-
- adr (microformat) - for postal addresses
- geo (microformat) - for geographical coordinates (latitude, longitude)
- hReview - for reviews
- hResume - for resumes or CVs
- rel-directory (file systems) - for distributed directory creation and inclusion
- rel-nofollow, an attempt to discourage 3rd party content spam (e.g. spam in blogs).
- rel-tag (metadata) - for decentralized tagging (Folksonomy)
- xFolk - for tagged links
- XHTML Friends Network (XFN) - for social relationships
- XOXO - for lists and outlines
[edit] Proposed microformats
Among the many proposed microformats,[9] the following are undergoing active development:
- citation - for citing references
- currency - for amounts of money
- geo extensions - for places on Mars, The Moon, and other such bodies; for altitude; and for collections of waypoints marking routes or boundaries
- species - For the names of living things.
[edit] Uses of microformats
Using microformats within HTML code provides additional formatting and semantic data that can be used by applications. These could be applications that collect data about on-line resources, such as web crawlers, or desktop applications such as e-mail clients or scheduling software.
Several browser extensions, such as Operator, provide the ability to detect microformats within an HTML document and export them into formats compatible with contact management and calendar utilities, such as Microsoft Outlook.
Microsoft expressed a desire to incorporate Microformats into upcoming projects;[10] as have other software companies.
In Wikipedia - and more generally in MediaWiki - microformats are used as part of templates like {{coord}}.
[edit] Evaluation of microformats
Various commentators have offered review and discussion on the design principles and practical aspects of microformats. Additionally, microformats have been compared to other approaches that seek to serve the same or similar purpose.[11] From time to time, there is criticism of a single, or all, microformats.[11] Documented efforts to advocate both the spread and use of microformats are known to exist as well.[12] [13]
[edit] Design principles
Rohit Khare has stated that reduce, reuse, and recycle is "shorthand for several design principles" that motivated the development and practices behind microformats.[8][14] These aspects can be summarized as follows:
- Reduce: favor the simplest solutions and focus attention on specific problems;
- Reuse: work from experience and favor examples of current practice;
- Recycle: encourage modularity and the ability to embed, valid XHTML can be reused in blog posts, RSS feeds, and anywhere else you can access the web.[8]
[edit] Accessibility
Because some microformats make use of title attribute of HTML's abbr
element to conceal machine-readable data (particularly date-times and geographical coordinates) in the "abbr design pattern", the plain text content of the element is inaccessible to those screenreaders that expand abbreviations.[15]
[edit] Intellectual property
As part of the open participation model of the microformats community, some claim a hard-and-fast rule for participation has been that any specifications resulting from community participation be free from restrictions on use based on intellectual property. This means that microformats must not be limited by copyrights licenses or patents, and must be royalty-free. It is not clear what enforceability, if any, the 'no intellectual property' rule has.
[edit] Alternative approaches
Microformats are not the only solution for providing "more intelligent data" on the web. Alternative approaches have been considered and developed as well. For example, the use of XML markup and standards of the semantic web are cited as alternative approaches. [8] These are contrasted with microformats in that they do not necessarily coincide with the design principles of "reduce, reuse, and recycle", at least not to the same extent.[8]
One advocate of microformats characterized a problem with alternative approaches:
“ | Here's a new language we want you to learn, and now you need to output these additional files on your server. It's a hassle. (Microformats) lower the barrier to entry.[4] | ” |
[edit] See also
- COinS
- GRDDL
- Intelligent agents
- RDFa
- S5 file format
- Simple HTML Ontology Extensions
- Web crawlers
- Tag (metadata)
[edit] Notes
- ^ Microformats may be used in web pages and also any other context that supports HTML and XHTML. This includes RSS.
- ^ http://microformats.org/wiki/existing-classes
- ^ a b http://microformats.org/wiki/rel-faq
- ^ a b c d e “What’s the Next Big Thing on the Web? It May Be a Small, Simple Thing — Microformats,” Knowledge@Wharton, 27 July 2005; http://knowledge.wharton.upenn.edu/index.cfm?fa=printArticle&ID=1247
- ^ Microformats (Mozilla Wiki) (2007-05-24). Retrieved on 2007-06-03.
- ^ Bounds, Darren (2007-05-02). Microsoft drops hints about Internet Explorer 8. Retrieved on 2007-05-02.
- ^ In this context, the definition of "End-user" includes a person reading a web page on a computer screen or mobile device, or an assistive technology software program such as a screen reader.
- ^ a b c d e f g h Khare, Rohit. "Microformats: The Next (Small) Thing on the Semantic Web?". IEEE Internet Computing 10 (1): 68-75.
- ^ Microformats' exploratory discussions
- ^ Bill Gates at Mix06 - "We need microformats"
- ^ a b criticism - microformats. Retrieved on 2007-08-15.
- ^ advocacy - microformats. Retrieved on 2007-08-15.
- ^ spread-microformats. Retrieved on 2007-08-15.This includes community resources for marketing microformats such as buttons, banners, wallpaper / desktop screens, logo graphics, etc.
- ^ See, e.g. The Next (Small) Thing on the Semantic Web? pp 71-72.
- ^ .hAccessibility. Web Standards Project Accessibility Task Force article (2007-04-27). Retrieved on 2007-08-16.
[edit] References
- Allsopp, John (2007). Microformats: Empowering Your Markup for Web 2.0. Friends of ED. ISBN 978-1-59059-814-6.
- Orchard, Leslie M (2005). Hacking RSS and Atom. Wiley. ISBN 978-0-7645-9758-9.
- Robbins, Jennifer Niederst (2006). Web Design In A Nutshell, Third Edition, O'Reilly Media. ISBN 978-0-596-00987-8.
- What’s the Next Big Thing on the Web? It May Be a Small, Simple Thing — Microformats". Knowledge@Wharton (27 July 2005).
[edit] Further reading
- Suda, Brian (2006). Using Microformats. O'Reilly Media. ISBN 978-0-596-52817-1.
[edit] External links
- microformats.org
- Microformats Primer
- A four-part discussion of Microformats, UI issues, and possible presentation in Firefox 3 by Alex Faaborg of Mozilla
- "hAccessibility" - Web Standards Project discussion of inaccessible microformat design pattern
- Review of Microformats - Empowering Your Markup for Web 2.0