<?xml version="1.0" encoding="utf-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Aggregators for the New Age</title>
	<atom:link href="http://motespace.com/blog/2006/06/05/aggregators-for-the-new-age/feed/" rel="self" type="application/rss+xml" />
	<link>http://motespace.com/blog/2006/06/05/aggregators-for-the-new-age/</link>
	<description>Disclaimer: The following web space does not contain my own opinions, merely linguistic representations thereof.</description>
	<lastBuildDate>Sun, 13 Mar 2011 07:12:38 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: mote</title>
		<link>http://motespace.com/blog/2006/06/05/aggregators-for-the-new-age/comment-page-1/#comment-2560</link>
		<dc:creator>mote</dc:creator>
		<pubDate>Thu, 08 Jun 2006 15:50:07 +0000</pubDate>
		<guid isPermaLink="false">http://fairuz.isi.edu/blog/index.php/2006/06/05/aggregators-for-the-new-age/#comment-2560</guid>
		<description>(Hmmm... looks like there&#039;s a bug in wordpress&#039;s &quot;\&quot; and &quot;&#039;&quot; escaping)</description>
		<content:encoded><![CDATA[<p>(Hmmm&#8230; looks like there&#8217;s a bug in wordpress&#8217;s &#8220;\&#8221; and &#8220;&#8216;&#8221; escaping)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mote</title>
		<link>http://motespace.com/blog/2006/06/05/aggregators-for-the-new-age/comment-page-1/#comment-2559</link>
		<dc:creator>mote</dc:creator>
		<pubDate>Thu, 08 Jun 2006 15:47:18 +0000</pubDate>
		<guid isPermaLink="false">http://fairuz.isi.edu/blog/index.php/2006/06/05/aggregators-for-the-new-age/#comment-2559</guid>
		<description>Les, thanks for stopping by.  I\&#039;ll be emailing you once I\\\&#039;m ready to start writing the intelligent part of my aggregator--I\&#039;m curious what sort of features you\\\&#039;ve tried for machine learning.  

A few thoughts on what you wrote:
1. I\&#039;m quite sure a straight binary classifier is not the answer.  Interestingness is different from the spam/ham problem because the answer is fuzzy rather than black &amp; white.  My naive guess is that the best user interface will be one that subtly marks to the user that an article is worth reading (say, a brighter red used for the entry header) or not worth reading (a duller grey for the background).  Ordering (higher noise-to-signal stuff down near the bottom) is also a possibility.  I find it helpful, when dealing with AI judgements that are not too accurate, to let the user interface be as vague as possible.

2. I definitely agree with you about the advances since 2003--the mechanical turk has been great!  I\&#039;ve been thinking of different ways to use digg or del data, but I\&#039;m not sure if Joshua or Kevin would be too happy with me pounding their server every time a new feed item comes in.</description>
		<content:encoded><![CDATA[<p>Les, thanks for stopping by.  I\&#8217;ll be emailing you once I\\\&#8217;m ready to start writing the intelligent part of my aggregator&#8211;I\&#8217;m curious what sort of features you\\\&#8217;ve tried for machine learning.  </p>
<p>A few thoughts on what you wrote:<br />
1. I\&#8217;m quite sure a straight binary classifier is not the answer.  Interestingness is different from the spam/ham problem because the answer is fuzzy rather than black &#038; white.  My naive guess is that the best user interface will be one that subtly marks to the user that an article is worth reading (say, a brighter red used for the entry header) or not worth reading (a duller grey for the background).  Ordering (higher noise-to-signal stuff down near the bottom) is also a possibility.  I find it helpful, when dealing with AI judgements that are not too accurate, to let the user interface be as vague as possible.</p>
<p>2. I definitely agree with you about the advances since 2003&#8211;the mechanical turk has been great!  I\&#8217;ve been thinking of different ways to use digg or del data, but I\&#8217;m not sure if Joshua or Kevin would be too happy with me pounding their server every time a new feed item comes in.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: l.m.orchard</title>
		<link>http://motespace.com/blog/2006/06/05/aggregators-for-the-new-age/comment-page-1/#comment-2547</link>
		<dc:creator>l.m.orchard</dc:creator>
		<pubDate>Wed, 07 Jun 2006 14:22:09 +0000</pubDate>
		<guid isPermaLink="false">http://fairuz.isi.edu/blog/index.php/2006/06/05/aggregators-for-the-new-age/#comment-2547</guid>
		<description>One thing, about Bayes and aggregators:  I don&#039;t think anyone&#039;s found a way to apply Bayesian filtering to RSS aggregation that produces a satisfying result - at least not one worth cheering to the web about.  

I know I haven&#039;t - and I&#039;m up to my 6th private attempt or so at different arrangements with Bayes in particular.  I think a different form of filtering is what&#039;s needed.  Maybe LSI, maybe some other form of valued scoring that doesn&#039;t result in a flat spam/ham answer.  (ie. &quot;interestingness&quot;, as you say.)

But, in the time since 2003, what&#039;s really shown promise are more and more varied ways of soliciting and exploiting human intelligence in the course of finding and filtering news and feed items.  See: del.icio.us, digg, etal.  The best news lately is pre-scanned and filtered by human domain experts.</description>
		<content:encoded><![CDATA[<p>One thing, about Bayes and aggregators:  I don&#8217;t think anyone&#8217;s found a way to apply Bayesian filtering to RSS aggregation that produces a satisfying result &#8211; at least not one worth cheering to the web about.  </p>
<p>I know I haven&#8217;t &#8211; and I&#8217;m up to my 6th private attempt or so at different arrangements with Bayes in particular.  I think a different form of filtering is what&#8217;s needed.  Maybe LSI, maybe some other form of valued scoring that doesn&#8217;t result in a flat spam/ham answer.  (ie. &#8220;interestingness&#8221;, as you say.)</p>
<p>But, in the time since 2003, what&#8217;s really shown promise are more and more varied ways of soliciting and exploiting human intelligence in the course of finding and filtering news and feed items.  See: del.icio.us, digg, etal.  The best news lately is pre-scanned and filtered by human domain experts.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mote</title>
		<link>http://motespace.com/blog/2006/06/05/aggregators-for-the-new-age/comment-page-1/#comment-2524</link>
		<dc:creator>mote</dc:creator>
		<pubDate>Tue, 06 Jun 2006 20:42:06 +0000</pubDate>
		<guid isPermaLink="false">http://fairuz.isi.edu/blog/index.php/2006/06/05/aggregators-for-the-new-age/#comment-2524</guid>
		<description>Also, I&#039;m noticing that a lot of things (e.g. the rise of folksonomy, &quot;web 2.0&quot; (whatever that means) ) have changed since 3 years back when these ideas were first being thrown around.  Untapped potential there, too.</description>
		<content:encoded><![CDATA[<p>Also, I&#8217;m noticing that a lot of things (e.g. the rise of folksonomy, &#8220;web 2.0&#8243; (whatever that means) ) have changed since 3 years back when these ideas were first being thrown around.  Untapped potential there, too.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

