Deep Thoughts in a shallow pond: British Library posts 1.3M scans of book photos

Saturday, May 17, 2014

British Library posts 1.3M scans of book photos

When I visited the British Library a few years ago, I was in awe at the majesty of the place. Not to mention the double-rainbow I saw while sitting outside the RBC Capital Markets building...

Last year, The British Library's Mechanical Curator project, using facial recognition tools and Flickr hosting, published over 1 million historical scans from books of the 1700-1900s. It's a fascinating example of big data at work. Indexing and curating such a large collection would be nearly impossible without crowd-sourcing and complex algorithms to analyze the imagery.

Imagine a world without Google, or any search engine on the internet. How would you find anything? What if you had to browse through 200 year-old books to find a URL to a web site, then manually search through each link on a web site to find a photo?

For all of the 30-50 billion or so web pages on the internet, there is still a world outside that contains far more information than will ever be digitized, categorized, or classified.

After visiting London, my retirement dream is to have a small boat moored for a couple of months each year near the British Museum and British Library. Nothing fancy, as long as it has internet access and a sitting room for reading a book or two...