Yahoo! Developer Network Blog
« Previous | Main | Next »
December 7, 2008
Monkey Finds Microformats and RDF
In a blog post earlier this year, we announced that people could search for just entries with specific microformats.
There are 2 things we'd like to announce regarding that:
- We now support adr, geo and tag microformats. Yay!
- In the name of progress, we have added an extra namespace to the IDs to make room for other types of extraction. Anyone who is using these for their microformat searches, can now find our structured data with these queries.
- hCard (searchmonkeyid:com.yahoo.page.uf.hcard)
- hCalendar (searchmonkeyid:com.yahoo.page.uf.hcalendar)
- hReview (searchmonkeyid:com.yahoo.page.uf.hreview)
- hAtom (searchmonkeyid:com.yahoo.page.uf.hatom)
- hResume (searchmonkeyid:com.yahoo.page.uf.hresume)
- adr (searchmonkeyid:com.yahoo.page.uf.adr)
- geo (searchmonkeyid:com.yahoo.page.uf.geo)
- tag (searchmonkeyid:com.yahoo.page.uf.tag)
- xfn (searchmonkeyid:com.yahoo.page.uf.xfn)
- RDFa (searchmonkeyid:com.yahoo.page.rdf.rdfa)
- eRDF (searchmonkeyid:com.yahoo.page.rdf.erdf)
And as before, you can look for hResumes with PHP on them, Geocoded pages involving Santa Clara, or RDFa pages related to SearchMonkey
Keep monkeying around.
Paul Tarjan
(|): Chief Technical Monkey :(|)
Posted at December 7, 2008 11:41 PM | Permalink
Comments
The following example provided on this page does not restrict the search results to pages with the searchmonkeyid with the specific value.
http://search.yahoo.com/search?p=searchmonkeyid%3Acom.yahoo.page.uf.geo+santa+clara
The geo microformat just contains properties for latitude and longitude so the example provided appears to be incorrect.
http://microformats.org/wiki/geo-cheatsheet
How would you search for a page with microformat properties of specific values?
For example, find all pages containing an hResume where the location is 'San Francisco' and a skill is 'Java'.
Posted by: Rene Sugar at December 16, 2008 2:54 PM
Hey Rene,
If you view source on any of those pages returned by your query, you should see find at least one element with class="geo" on it.
And yes, we are faking 'Semantic Search' for now. You can only filter by the presence of a microformat, and then do a full text search for the rest of the terms.
Do you know of a good way to SPARQL over 10 Billion documents? :)
Posted by: Paul Tarjan at December 17, 2008 1:14 PM
http://www.utdallas.edu/~lkhan/research.html
"Dr. Laitfur Khan and his group of research students conduct top-notch research in data mining, multimedia information management, semantic web and database systems"
Current Research
"For storing RDF data, we have devised a novel and efficient technique to store and retrieval of a large volume of RDF data. With the explosion of semantic web technologies, large RDF graphs are com- mon place. This poses signicant challenges for the storage and retrieval of RDF graphs. Current frameworks do not scale for large RDF graphs and as a result does not ad- dress these challenges. We describe a frame- work that we built using Hadoop to store and retrieve large numbers of RDF triples by exploiting the cloud comput- ing paradigm. We describe a scheme to store RDF data in Hadoop Distributed File System. We present an algorithm to rewrite SPARQL queries in simpler forms. We present an algorithm to generate the best possible query plan to answer a SPARQL Protocol and RDF Query Language(SPARQL) query based on a cost model. We use Hadoop's MapReduce framework to answer the queries. Our results show that we can store large RDF graphs in Hadoop clusters built with cheap commodity class hardware. Furthermore, we show that our framework is scalable and efficient and can handle large amounts of RDF data, unlike traditional approaches."
Posted by: Rene Sugar at January 16, 2010 10:45 PM
Post a comment
Comment Policy: We encourage comments and look forward to hearing from you. Please note that Yahoo! may, in our sole discretion, remove comments if they are off topic, inappropriate, or otherwise violate our Terms of Service. Fields marked with asterisk '*' are required.
Subscribe
Recent Blog Articles
view all
YQL Open Table for Google Buzz now live
Tue, 09 Feb 2010
INSERT INTO twitter.status ...
Mon, 08 Feb 2010
Announcing the Yahoo! Brasil Open Hack Day 2010, 20-21 March
Mon, 08 Feb 2010
Marketing hacks, linchpins, and tech women of valor
Sun, 07 Feb 2010
Yahoo! India invites you to join the first India Hadoop Summit
Thu, 04 Feb 2010
Recent Links
Appcelerator Titanium + Yahoo YQL on Vimeo
Mon, 08 Feb 2010
Tue, 02 Feb 2010
PhoneGap | Cross platform mobile framework
Sat, 30 Jan 2010
Web developers can rule the iPad - O'Reilly Radar
Sat, 30 Jan 2010
rc3.org - Is the iPad the harbinger of doom for personal computing?
Thu, 28 Jan 2010
Archives
2010
2009
2008
2007
2006
2005
Recent Readers

