RSS Reader Wish

8/17/2004 10:55:28 PM

RSS Reader Wish

I subscribe to about 1000 RSS feeds. What would be nice is if my reader could search through the feeds and sort posts based on my probable interest level. In addition, it would categorize those posts into groups of similar content. Currently, most readers provide a search feature to filter posts on a particular keyword to help manage noise.

The reader would analyze the content of a new post, compute via Bayesian techniques the probability of interest given the probability that similar posts with similar keywords and properties from the same feed (and other feeds) have been interesting in the past. There would need to a way to rate each post. Rating could be done explicitly by clicking a set of buttons, though I am not in favor of this approach. The ideal way would implicitly infer a rating from a post through a user's actions such as deleting quickly or saving a post, in the same way that WinFS automatically paints metadata into documents through file operations (copying, for example, a document to a folder automatically generates keywords from the name of all the parent folders). Under the same technique, I could also see which of my feeds tend to generate mostly or all noise, and delete those underperforming feeds.

For categorizing similar posts, this could be done explicitly by setting up individual group that match on certain keywords. Or implicitly, by using one of a number of simple clustering algorithms to collect similar posts.

Another possible mechanism would utilize collaborative filtering, such as when Amazon examines an book that you order and automatically makes additional recommends based on what other buyers have bought. There would need to be centralized site on web that collects aggregate statistics from multiple users; this is probably more relevant for web-based readers like BlogLines than for desktop readers.







My name is Wesner Moise. I am a software entrepreneur developing revolutionary AI desktop applications. I worked as a software engineer in Microsoft Excel group for six years during the 1990s. I worked on PivotTables and wrote the most lines of code in Excel 97-- about 10 times the median developer. I have a Harvard BA in applied math/computer science and a UCLA MBA in technology entrepreneurship. I am a member of the Triple Nine Society, a 99.9 percentile high-IQ society.

Social Media