Yahoo! Developer Network Blog
May 29, 2008
RDF and the Monkey
Besides the existing support for microformats, we have already shared our plans for supporting other standards for embedding metadata into HTML. Today we are announcing the availability of eRDF metadata for SearchMonkey applications, which will soon be followed by support for RDFa. SearchMonkey applications can make direct use of the eRDF data by choosing the com.yahoo.rdf.erdf data source, while RDFa data will appear under com.yahoo.rdf.rdfa. Nothing changes in the way applications are created: because SearchMonkey applications are already built on a triple-based model, the same applications can work on microformat, eRDF, or RDFa data alike.
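To illustrate why nothing changes for applications, here is a minimal Python sketch of a triple-based model. It is not SearchMonkey code: the `Triple` type, the `values_of` helper, the example page URL, and the sample triples are all assumptions made for illustration. Only the two data source names come from this post.

```python
from collections import namedtuple

# Hypothetical in-memory triple; not an actual SearchMonkey type.
Triple = namedtuple("Triple", ["subject", "predicate", "obj"])

PAGE = "http://example.org/page"  # made-up page URL
DC_TITLE = "http://purl.org/dc/elements/1.1/title"  # Dublin Core "title" property

# Made-up extraction results, keyed by the data source names from the post.
sources = {
    "com.yahoo.rdf.erdf": [Triple(PAGE, DC_TITLE, "RDF and the Monkey")],
    "com.yahoo.rdf.rdfa": [Triple(PAGE, DC_TITLE, "RDF and the Monkey")],
}


def values_of(triples, subject, predicate):
    """Return the objects of all triples matching the subject and predicate."""
    return [t.obj for t in triples
            if t.subject == subject and t.predicate == predicate]


# The same lookup runs unchanged on either source, because both are just triples.
titles = {name: values_of(triples, PAGE, DC_TITLE)
          for name, triples in sources.items()}
```

The point of the sketch is that the lookup only sees subject, predicate, and object, so it does not need to know which markup syntax produced the data.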
Content publishers, however, will now have an even wider array of choices for providing metadata inside HTML. It is therefore worthwhile to briefly summarize the key differences between microformats, eRDF and RDFa, and the possible migration paths across these approaches:
- Microformats allow publishers to provide metadata about a fixed set of types. This currently includes addresses (hCard), events (hCalendar), reviews (hReview), feeds (hFeed) and social relations (XFN). The set of properties for each type is also fixed. We recommend using microformats when the information fits one of the microformats supported by SearchMonkey. We are constantly expanding the list of supported microformats, but it is still likely that microformats will not meet all metadata needs.
- eRDF and RDFa in contrast are generic formats that can be used in combination with any RDF/OWL vocabulary and therefore can be used to represent any metadata. Unlike microformats, RDF/OWL vocabularies can also be arbitrarily mixed and extended. The use of a common or industry standard vocabulary is still recommended so that other application developers can easily interpret your metadata.
- While eRDF and RDFa look and feel much the same, they differ in expressivity: RDFa is complete with respect to the RDF model, whereas not all RDF statements can be represented in eRDF. For example, in eRDF all statements have as their subject either the current page or a resource defined within the current page. Further, eRDF has no support for datatypes, which SearchMonkey recommends for, among other things, indicating the currency of monetary values or the units of a measurement.
- Migration is easy from less expressive representations towards more expressive ones, i.e. from microformats to eRDF or RDFa, or from eRDF to RDFa. Migration in the other direction is more problematic and is likely to result in the loss of some metadata. We suggest that publishers who currently have no markup adopt either RDFa (which will soon be picked up by the Yahoo! crawler) or eRDF (which is already available from the search index). RDFa is a specification backed by the World Wide Web Consortium (W3C) and it is supported by a wide range of tools, but eRDF has been around longer - so we wanted to support both.
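The loss of metadata when migrating towards a less expressive format can be sketched in Python as follows. This is a deliberately simplified model that assumes only the two eRDF restrictions mentioned above (subjects limited to the current page or resources defined within it, and no datatypes). The `Triple` type, the `to_erdf` helper, the vocabulary URI, and the sample data are hypothetical and are not part of any eRDF or RDFa tool.

```python
from collections import namedtuple

# Hypothetical triple with an optional datatype URI on the value.
Triple = namedtuple("Triple", ["subject", "predicate", "value", "datatype"])


def to_erdf(triples, page_uri):
    """Split triples into those expressible under the simplified eRDF model
    (with datatypes stripped) and those that have to be dropped."""
    kept, dropped = [], []
    for t in triples:
        # Subject must be the page itself or a fragment defined within it.
        on_page = t.subject == page_uri or t.subject.startswith(page_uri + "#")
        if on_page:
            kept.append(t._replace(datatype=None))  # datatype information is lost
        else:
            dropped.append(t)  # statement about another resource is lost
    return kept, dropped


PAGE = "http://example.org/product"  # made-up page URL
XSD_DECIMAL = "http://www.w3.org/2001/XMLSchema#decimal"

rdfa_triples = [
    Triple(PAGE, "http://example.org/vocab#price", "19.99", XSD_DECIMAL),
    Triple("http://example.org/other",
           "http://purl.org/dc/elements/1.1/title", "Another page", None),
]

kept, dropped = to_erdf(rdfa_triples, PAGE)
```

In this sketch the price statement survives but loses its datatype, and the statement about the other page is dropped entirely. Going the opposite way needs no such filtering, which is why migrating towards RDFa is the easy direction.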
There are plenty of resources for familiarizing yourself with both eRDF and RDFa. The SearchMonkey guide has a brief overview of the topic. The eRDF specification and the RDFa Primer are more technical, but they are also complete, contain plenty of examples and are still fairly easy to read. The tools supporting eRDF are listed on the same page as the specification. Here are some links to the RDFa implementations and tools.
In summary, our support for eRDF and RDFa brings even more choice for publishers while opening up new data sources for application developers!
Peter Mika
Data Architect, SearchMonkey
Posted at May 29, 2008 10:08 AM
Comments
Peter,
Great to see this! A tiny note: regarding RDFa implementations (second-last paragraph) I'd propose to link as well to the Wiki (http://rdfa.info/wiki/Development). This page is the more recent source (and kept up-to-date by the community ;)
Cheers,
Michael
Posted by: Michael Hausenblas at May 30, 2008 12:54 AM
Any chance of Yahoo! starting to parse GRDDL? It's the third and - in my mind - most interesting of the RDF-in-HTML options.
Posted by: Tom Morris at May 30, 2008 4:31 AM
I think the "brief overview of the topic" link above should go to http://developer.yahoo.com/searchmonkey/smguide/rdf.html instead of http://devel.yahoo.com/searchmonkey/smguide/rdf.html
Posted by: Zach Leatherman at May 30, 2008 10:33 AM
@Zach: thanks for catching that; I've fixed the link.
Posted by: Brian Cantoni at May 30, 2008 2:36 PM
Not wishing to use Yahoo!'s pages to have a debate about GRDDL... :)
But...
I disagree with Tom that GRDDL is somehow more interesting than RDFa. GRDDL is a *process* not a format. It indicates how to use XSLT to get from some mark-up in a page, to some semantics. Its most significant use is when authors put semantics into a page without realising it (or at least in some awkward kind of way); we have the power to extract that information using the GRDDL process.
But if we say we are going to 'use GRDDL' to get at the semantics in a page, then we first need to specify the format that we're going to apply XSLT to, and then we need to specify the transformation itself.
Which brings us right back to the original problem that RDFa was trying to solve--coming up with a consistent format for specifying metadata without the pain of having to define (or redefine) vocabularies (i.e., Microformats) or define unique transformation rules (GRDDL).
RDFa is intentionally generic.
Regards,
Mark
Posted by: Mark Birbeck at May 31, 2008 1:29 AM