Modelling HTTP cache configuration in the Semantic Web

Submitted by connolly on Fri, 2006-12-22 19:10. :: |

The W3C Semantic Web Interest Group is considering URI best practices, whether to use LSIDs or HTTP URIs, etc. I ran into some of them at MIT last week. At first it sounded like they wanted some solution so general it would solve the only two hard things in Computer Science: cache invalidation and naming things , as Phil Karlton would say. But then we started talking about a pretty interesting approach: using the semantic web to model cache configuration. It has long been a thorn in my side that there is no standard/portable equivalent ot .htaccess files, no RDF schema for HTTP and MIME, etc.

At WWW9 in May 2000, I gave a talk on formalizing HTTP caching. Where I used larch there, I'd use RDF, OWL, and N3 rules, today. I made some progress in that direction in August 2000: An RDF Model for GET/PUT and Document Management.

Web Architecture: Protocols for State Distribution is a draft I worked on around 1996 to 1999 wihthout ever really finishing it.

I can't find Norm Walsh's item on wwwoffle config, but I did find his XML 2003 paper Caching in with Resolvers:

This paper discusses entity resolvers, caches, and other strategies for dealing with access to sporadically available resources. Our principle focus is on XML Catalogs and local proxy caches. We’ll also consider in passing the ongoing debate of names and addresses, most often arising in the context of URNs vs. URLs.

In Nov 2003 I worked on Web Architecture Illustrated with RDF diagramming tools.

The tabulator, as it's doing HTTP, propagates stuff like content type, last modified, etc. from javascript into its RDF store. Meanwhile, the accessability evaluation and repair folks just released HTTP Vocabulary in RDF. I haven't managed to compare the tabulator's vocabulary with that one yet. I hope somebody does soon.

And while we're doing this little survey, check out the Uri Template stuff by Joe Gregorio and company. I haven't taken a very close look yet, but I suspect it'll be useful for various problems, if not this one in particular.

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.
Submitted by http://sebastia... (OpenID: http://sebastian.snopek.openid.pl/) on Mon, 2007-07-23 17:37.
Actually I think that URI Templates are a verey interesting idea and it's quite simple to understand it. Basicaly, URI Templates enable you to add variables to an URI that can be later on substituted to create a URI. Surely everyone who has managed to language their own website will have no serious problems with learnig how to use the most common Such Templates appear in many places (for example Open Search, WADL or WSDL) and probalby they will become even more common soon. In near future infrastructure developers writing blogs, wikis, forums, content management systems, and ideally even web servers, proxies, and routers will be able to utilize URI Templates for configuration and more. However, it will be possible only when URI Templates are finalized and made an IETF standard.
Although it may seem to be a dull subject from the outside looking in, the standardization of URI Templates will be probably the most positive thing to happen for URL Design in the last 10 years. Cheers, Sebie
Submitted by http://mnot.myo... (OpenID: http://mnot.myopenid.com/) on Sun, 2007-01-14 03:47.

WRT modelling HTTP caching itself, I've been thinking about writing some cosmogol for it to see how both fare. Want to help? As far as modelling cache configuration, we spent a lot of time working on this at Akamai, leading to URISpace (which wasn't really compatible with the semantic Web, but it did what we needed). The main trick is balancing between editability and readability; it's hard to make it flexible and easy to understand. The other tricky part is mapping between stuff you get in-request (URI, headers), in-response (usually headers), and in out-of-band configuration, as well as clearly establishing precedence with them. This is probably just a use case for the Web configuration problem, to some degree. I've often thought about putting together a translator for URISpace to Apache config, squid accelerator config, robots.txt, p3p.xml, site maps, and so on. WRT URI Templates; they're more one-off; I'd suggest at looking at something like WADL -- which uses them -- for more capable configuration. Cheers,