January 2009
12 posts
Code - Open Blog - NYTimes.com →
A blog at the NYTimes documenting their open source efforts, open API, and open datasets.
Galago →
Looks like Trevor’s made some progress on Galago, his Java-based open-source distributed search engine, to be used with the upcoming book “Search Engines: Information Retrieval in Practice”.
Data Visualization Sketches for Google Search... →
not really search results, but social data.
SIGIR website submits to mass submission...
I’ve been trying to upload my SIGIR paper for an hour with no luck. This seems to happen every year.
We’re computer scientists. You’d think we could engineer something that wouldn’t choke when 600 people try to upload PDFs at once. And we’ve still got 8 hours ‘til the official deadline!
Update: Another half hour later and still no luck. Anyone else have...
CiteULike: Available datasets →
(thx Jason)
Captcha-solving neural net written in JavaScript. ... →
(via DF)
Microsoft will eliminate up to 5,000 jobs in R&D, marketing, sales, finance,...
– Microsoft Reports Second-Quarter Results
yikes.
How good are you, Turker? →
In a nutshell: Turkers seem to be reliable at self-reporting confidence.
Publication quality tables in LaTeX (pdf) →
An essay accompanying the ‘booktabs’ LaTeX package, describing what’s wrong with most tables, and what to do about it.
LaTeX info from loglophile →
Some great tips on typesetting with LaTeX.
Why Google Employees Quit →
Interesting for anyone who might be on the job market in the next year or two.
R, the Software, Finds Fans in Data Analysts →
NYTimes article on R (via Brendan O’Connor’s Blog)