Tuesday, August 14, 2007

Semantic Technologies Jena Tutorial: source code

At the Jena tutorial I gave at Semantic Technologies 2007, I promised to make the tutorial source code available on my web site. My plan was to spend a little time cleaning up the archive, partly to remove duplication of the Jena libraries (they are in the tree three times, once for the main code, once for Joseki and once for Eyeball). It is still my intention to do that, but I've recognised that other priorities keep intervening. So I've – finally – released the code as-is. It's big archive, 67Mb, so don't download it via your cell phone! And apologies to anyone who has been waiting a long time for me to get around to doing this.

del.icio.us: jena, java, semantic-web, tutorial

Tuesday, August 07, 2007

When collaborative filtering goes bad

Amazon, bless them, have let their CF algorithms get a bit wayward recently. I just received this missive:

Hello, Ian Dickinson,

We've noticed that customers who have purchased or rated Harry Potter: Years 1-4 (4 Disc Box Set) have also purchased Classic Farm & Agricultural Machinery (3 x DVD) [2007] on DVD. For this reason, you might like to know that Classic Farm & Agricultural Machinery (3 x DVD) [2007] will be released on 13 August 2007.

Leaving aside the small matter that people who have bought one thing by definition haven't bought another thing that hasn't been released yet, anyone human looking at the correlation between Harry Potter (I bought the DVD's for the kids, honest) and Classic Farm Machinery is going to do what I did: laugh out loud. Which makes me think that research into the understanding of humour by computers - some of which has been in the press recently - may have a purpose beyond illuminating our understanding of human behaviour. If we create computers that can get jokes, they might be better able to spot stupid errors than automatons that just follow the numbers.