Toward Semantic Web data from Wikipedia

Submitted by connolly on Tue, 2006-03-07 17:23.

When I heard about Wikimania 2006 in August in Boston, I put it on my travel schedule, at least tentatively.

Then I had an idea...

Wikipedia:Infobox where the data lives in wikipedia. sparql, anyone? or grddl?

my bookmarks, 2006-02-16

Then I put the idea in a wishlist slide in my presentation on microformats and GRDDL at the W3C technical plenary last week.

The next day, in the SemWeb IG meeting, I met Markus Krötzsch and at lunch I learned he's working on Semantic MediaWiki, a project to do just what I'm hoping for. From our discussion, I think this could work out really well.

For reference, he's 3rd from the left in a photo from Wikimania 2005.

I use Wikipedia quite regularly to look up airport codes, latitudes, longitudes, lists of postal codes, and the like; boy would I love to have it all in RDF... maybe using GRDDL on the individual pages, maybe a SPARQL interface from their DB... maybe both.

Hmm... the RDF export of their San Diego demo page seems to conflate pages with topics of pages. I guess I should file a bug.

Submitted by denny (not verified) on Thu, 2006-03-09 09:32.

Hi Dan,

I don't think we conflate pages with topics of pages -- the topic of the page San Diego is http://wiki.ontoworld.org/index.php/_San_Diego whereas the page is http://wiki.ontoworld.org/index.php/San_Diego

Notice the extra _ just before the article name in the first URI.

Or do you mean something else?

denny

Submitted by connolly (OpenID: http://www.w3.org/People/Connolly/) on Thu, 2006-03-09 15:37.

Indeed, I had missed the extra _. So you're not conflating the two. Very good.
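To make the convention concrete, here is a minimal Python sketch of the mapping denny describes: the topic URI is the page URI with an extra _ in front of the article name. The function names topic_to_page and page_to_topic are made up for illustration, and the sketch assumes the article name is always the last path segment; it is not Semantic MediaWiki's own code.

```python
from urllib.parse import urlsplit, urlunsplit


def topic_to_page(topic_uri: str) -> str:
    """Map a topic URI (article name prefixed with '_') to its page URI."""
    parts = urlsplit(topic_uri)
    # Assumption: the article name is the last path segment.
    head, sep, name = parts.path.rpartition("/")
    if not name.startswith("_"):
        raise ValueError(f"not a topic URI: {topic_uri}")
    return urlunsplit(parts._replace(path=head + sep + name[1:]))


def page_to_topic(page_uri: str) -> str:
    """Map a page URI to the URI of the topic the page is about."""
    parts = urlsplit(page_uri)
    head, sep, name = parts.path.rpartition("/")
    return urlunsplit(parts._replace(path=head + sep + "_" + name))


if __name__ == "__main__":
    print(topic_to_page("http://wiki.ontoworld.org/index.php/_San_Diego"))
    print(page_to_topic("http://wiki.ontoworld.org/index.php/San_Diego"))
```

With the two San Diego URIs from denny's comment, each function should return the other URI.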

Meanwhile, I wonder if you're making the data browseable.

What happens when I GET http://wiki.ontoworld.org/index.php/_San_Diego? Ah... 301 Moved Permanently. Hmm... there might be a better 3xx code.

What if I send Accept: application/rdf+xml when I do that GET? Hmm... same thing. It would be nicer if it redirected to the RDF/XML version in that case. Then I could "follow my nose" thru the data; see Links on the Semantic Web.
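The redirect behavior wished for above could look roughly like the following Python sketch. It is only an illustration under stated assumptions: the function names, the use of 303 See Other as the "better 3xx code", and the RDF export URL (an example.org placeholder) are all assumptions, not what the wiki does. The Accept parsing is deliberately simple and ignores wildcards such as */*.

```python
RDF_XML = "application/rdf+xml"
HTML = "text/html"


def parse_accept(header: str) -> dict:
    """Parse an Accept header into {media type: q value}; q defaults to 1.0."""
    prefs = {}
    for item in header.split(","):
        fields = [f.strip() for f in item.split(";")]
        media_type = fields[0].lower()
        if not media_type:
            continue
        q = 1.0
        for param in fields[1:]:
            name, _, value = param.partition("=")
            if name.strip().lower() == "q":
                try:
                    q = float(value)
                except ValueError:
                    q = 0.0  # treat a malformed q value as "not acceptable"
        prefs[media_type] = q
    return prefs


def redirect_for_topic(page_uri: str, rdf_uri: str, accept_header: str = ""):
    """Pick a redirect target for a GET on a topic URI.

    Returns (status, location). The status 303 is an assumption of this
    sketch; rdf_uri is a hypothetical address for the RDF/XML export.
    """
    prefs = parse_accept(accept_header)
    rdf_q = prefs.get(RDF_XML, 0.0)
    html_q = prefs.get(HTML, 0.0)
    # Send RDF only when the client asks for it at least as strongly as HTML.
    if rdf_q > 0 and rdf_q >= html_q:
        return 303, rdf_uri
    return 303, page_uri


if __name__ == "__main__":
    page = "http://wiki.ontoworld.org/index.php/San_Diego"
    rdf = "http://example.org/rdf/San_Diego"  # hypothetical export URL
    print(redirect_for_topic(page, rdf, "application/rdf+xml"))
    print(redirect_for_topic(page, rdf, "text/html"))
```

A browser sending text/html would land on the page as before, while an RDF-aware client would be sent on to the data, which is what makes following your nose possible.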