Yahoo! Developer Network Blog

« Previous | Main | Next »


December 7, 2008

Monkey Finds Microformats and RDF

In a blog post earlier this year, we announced that people could search for just entries with specific microformats.

There are 2 things we'd like to announce regarding that:

  1. We now support adr, geo and tag microformats. Yay!
  2. In the name of progress, we have added an extra namespace to the IDs to make room for other types of extraction. Anyone who is using these for their microformat searches, can now find our structured data with these queries.

And as before, you can look for hResumes with PHP on them, Geocoded pages involving Santa Clara, or RDFa pages related to SearchMonkey

Keep monkeying around.

Paul Tarjan
(|): Chief Technical Monkey :(|)

Posted at December 7, 2008 11:41 PM | Permalink

Bookmark this on Delicious

Comments

The following example provided on this page does not restrict the search results to pages with the searchmonkeyid with the specific value.

http://search.yahoo.com/search?p=searchmonkeyid%3Acom.yahoo.page.uf.geo+santa+clara

The geo microformat just contains properties for latitude and longitude so the example provided appears to be incorrect.

http://microformats.org/wiki/geo-cheatsheet

How would you search for a page with microformat properties of specific values?

For example, find all pages containing an hResume where the location is 'San Francisco' and a skill is 'Java'.

Posted by: Rene Sugar at December 16, 2008 2:54 PM

Hey Rene,

If you view source on any of those pages returned by your query, you should see find at least one element with class="geo" on it.

And yes, we are faking 'Semantic Search' for now. You can only filter by the presence of a microformat, and then do a full text search for the rest of the terms.

Do you know of a good way to SPARQL over 10 Billion documents? :)

Posted by: Paul Tarjan at December 17, 2008 1:14 PM

http://www.utdallas.edu/~lkhan/research.html

"Dr. Laitfur Khan and his group of research students conduct top-notch research in data mining, multimedia information management, semantic web and database systems"

Current Research

"For storing RDF data, we have devised a novel and efficient technique to store and retrieval of a large volume of RDF data. With the explosion of semantic web technologies, large RDF graphs are com- mon place. This poses signicant challenges for the storage and retrieval of RDF graphs. Current frameworks do not scale for large RDF graphs and as a result does not ad- dress these challenges. We describe a frame- work that we built using Hadoop to store and retrieve large numbers of RDF triples by exploiting the cloud comput- ing paradigm. We describe a scheme to store RDF data in Hadoop Distributed File System. We present an algorithm to rewrite SPARQL queries in simpler forms. We present an algorithm to generate the best possible query plan to answer a SPARQL Protocol and RDF Query Language(SPARQL) query based on a cost model. We use Hadoop's MapReduce framework to answer the queries. Our results show that we can store large RDF graphs in Hadoop clusters built with cheap commodity class hardware. Furthermore, we show that our framework is scalable and efficient and can handle large amounts of RDF data, unlike traditional approaches."

Posted by: Rene Sugar at January 16, 2010 10:45 PM

Post a comment

Comment Policy: We encourage comments and look forward to hearing from you. Please note that Yahoo! may, in our sole discretion, remove comments if they are off topic, inappropriate, or otherwise violate our Terms of Service. Fields marked with asterisk '*' are required.

Remember Me?

Subscribe

YDN Blog: Get Yahoo! Developer Network Blog on your personalized My Yahoo! home page.

Add To My RSS Feed

YDN Link Blog: Get Yahoo! Developer Network Linkblog on your personalized My Yahoo! home page.

Add To My RSS Feed

Recent Readers

Copyright © 2010 Yahoo! Inc. All rights reserved. Copyright | Privacy Policy

Help us continue to improve the Yahoo! Developer Network: Send Your Suggestions