January 2009
12 posts
Code - Open Blog - NYTimes.com →
A blog at the NYTimes documenting their open source efforts, open API, and open datasets.
Jan 28th
Galago →
Looks like Trevor’s made some progress on Galago, his java-based open-source distributed search engine, to be used with the upcoming book “Search Engines: Information Retrieval In Practice”
Jan 27th
Data Visualization Sketches for Google Search... →
not really search results, but social data.
Jan 27th
SIGIR website submits to mass submission...
I’ve been trying to upload my SIGIR paper for an hour with no luck.  This seems to happen every year. We’re computer scientists.  You’d think we could engineer something that wouldn’t choke when 600 people try to upload PDFs at once.  And we’ve still got 8 hours ‘til the official deadline! Update: Another half hour later and still no luck.  Anyone else have...
Jan 27th
CiteULike: Available datasets  →
(thx Jason)
Jan 27th
Captcha-solving neural net written in Javascript. ... →
(vi DF)
Jan 24th
“Microsoft will eliminate up to 5,000 jobs in R&D, marketing, sales, finance,...”
– Microsoft Reports Second-Quarter Results yikes.
Jan 22nd
How good are you, Turker? →
In a nutshell — Turkers seem to be reliable at self-reporting confidence.
Jan 21st
Publication quality tables in LaTeX (pdf) →
An essay accompanying the ‘booktabs’ LaTeX package, describing what’s wrong with most tables, and what to do about it.
Jan 20th
LaTeX info from loglophile →
Some great tips on typesetting with LaTeX.
Jan 20th
Why Google Employees Quit  →
Interesting for anyone who might be on the job market in the next year or two.
Jan 19th
R, the Software, Finds Fans in Data Analysts →
NYTimes article on R (via Brendan O’Connor’s Blog)
Jan 7th