Friday, November 30, 2007
By Rob Kodey
Watch the Netflix Prize Leaderboard, right from your Personal Homepage! NEW: Wide Mode option. NEW: See how team scores are improving. See how teams are moving up and down the Leaderboard. User Preferences control the display. Comments, feedback, and suggestions welcome.
I can now watch my progress without leaving Google. Great.
Also started trialling Mathematica (so that I can hide my mathematical ignorance from my family!). It looks very powerful, but its going to take some time to figure out how to work it.
Wednesday, November 28, 2007
Monday, November 26, 2007
Friday, November 16, 2007
I think we have a new partnership. I come up with the ideas and she figures out the maths and I program the maths. It should speed things up.
Wednesday, November 14, 2007
Complaints about the noise of the computers at night. It looks like the proverbial garage might become a literal garage. Have toyed with the idea of putting the computer in the shed - its good and cold, but unfortunately I think it would get stolen.
Friday, November 9, 2007
This may not be a good think. Sharing information promotes groupthink. The collaborative filtering literature maybe a case in point. There appear to be two underlying psychological models being used.
- If you rate something similarly to someone else then you can use their predictions on unrated movies as an estimate of your rating for that movie plus some noise.
- Your preference for a movie consists of a set of (orthogonal?) preferences for different factors within the movie multiplied by the amount of that factor within the movie plus some noise - (if I've understood the matrix factorization stuff correctly).
The mathematics (and there is vast reams of it) seems to revolve mainly around calcuating similarity in the first instance or working out how best to tackle the noise in the second approach.
Staggeringly, and please correct me someone if I'm wrong, I can find absolutely no discussion whatsover (not even a single paper) that discusses the merits of the two psychological models underlined in 1 and 2. Yet, at best, they are very crude and, imho wrong. One must be able to do better.
Even the competition's organisers seem to think that only the mathematical sciences are important for this problem. I came across this quote from the CEO of Netflix in the New York Times.
"Mr. Hastings said he thought it was important to make the ratings database widely available. “Unless you work at Microsoft research or Yahoo research or for Jim Bennett here at Netflix, you won’t have access to a large data set,” he said. “The beauty of the Netflix prize is you can be a mathematician in Romania or a statistician in Taiwan, and you could be the winner.”"
No mention of psychologists, or even economics only mathematicans or statisticans. Has the computer science world got itself into one big groupthink on the problem?