January 10, 2005
RSS Market Share
There has been much discussion and lively debate in the blogosphere and other spheres of note lately regarding RSS traffic and aggregator market share. Since our job is to help publishers understand how their feed is being consumed in as accurate a way possible, we've decided to make our own understanding of traffic patterns and market share public. As with any micro-report like this, there will be much gnashing of teeth and waving of hands, and in the end, this is just as likely to confuse as many people as it helps. Transparency can only assist the discussion, however, so here we go.
First, a lot of context. There are a number of aggregators that currently consume feeds on behalf of multiple subscribers. Some of these (eg, Bloglines, My.Yahoo, NewsGator Online) relate the number of subscribers on behalf of whom they're polling when they make http requests, but others do not. Therefore, straight away, we can tell you that the market share numbers are skewed to the extent that the currently non-reporting proxies have sizeable numbers of subscribers.
Next, an aggregator's ping/hit frequency has zero bearing on its market share. FeedDemon, for example, is an extremely efficient aggregator and makes far fewer polling attempts with no loss of timely data updates. Our market share data below is based on aggregate circulation as of January 6, 2005 across the top 800 feeds we manage. FeedBurner measures RSS circulation by creating unique combinations of user-agent and IP addresses within a single 24 hour period and then factoring for anomalies (example anomaly: the Danger hiptop for T-mobile user-agent comes through a single ip proxy on behalf of all its users, so that's not a circulation of 1). So, if an Opera user-agent at ip address X polls a feed every minute on the minute all day (to use a realistic example), we only count that as a circulation of 1. This has obvious limitations, but it allows us to create a reasonably accurate view over time for a feed's circulation.
It should be pointed out that web-based aggregators may have an overstated market share to some unknown degree, because if there are 100 web aggregator subscribers to a feed, we are counting all of them every day when the web aggregator checks in and tells us the feed has 100 subscribers, regardless of how many people actually read the feed that day. Clearly, there's a need to dive deeper on stats tracking to start to get a better sense for how widely viewed an item is, how many registered subscribers are actually viewing the content as opposed to just retrieving it, etc. Since we wouldn't mention this unless we were doing something about it, look for a premium offering on this front in the near future.
Ok, we could go on and on with context, but instead we'll just make this topic a regular post here, and each time we'll explore some other piece of the puzzle. Now, some key summary points and then some stats.
Key points:
- RSS Client market is not yet consolidating, it's expanding. There were 409 different clients polling the top 800 FeedBurner feeds in September and now there are 719 different clients. FeedBurner actively catalogs the behavior and specifications for hundreds of these user-agents.
- Mobile RSS readers are already growing in popularity. While none yet crack our top 20 list, there are thousands of users of the FeedBurner Mobile Feed Reader alone.
- Of the feeds that we currently manage, in aggregate, RSS circulation is growing by about 1% every weekday. We are also adding to our total number of feeds under management at the rate of about 1% every weekday. If the widely discussed Pew research estimates are correct, we currently reach about 5% of the total RSS subscribers in the US, although we currently manage just over 1% of the world’s active feeds.
- We currently manage a significant percentage of the world's active podcasts with our SmartCast service. The podcasting phenomenon is accelerating. Enclosures are an important trend. More on all this in a separate post.
- We have been producing this list internally for quite a while, and it can and does change dramatically in a short period of time, both because of the changes in the sorts of feeds we manage and the changing trends in client popularity.
- This list is heavily skewed toward aggregators used on blog feeds, since most of our feeds are from blogs. This list might read quite differently for more traditional media feeds such as Reuters, NYT, CNET, etc. On a similar theme, individual publishers will notice that the overall market share may be wildly different from their own feed's market share. Simply removing our top 10 feeds from this data results in a wildly different market share list, possibly because of clients that ship with one or more of our top 10 feeds as a default. All of this pointing to the caution not to read too much into this single data point. We could make qualifications about everything on the list. Your mileage may vary, caveat emptor, mea culpa, c'est la vie.
Top 20 RSS clients across FeedBurner most highly subscribed 800 feeds as of January 6, 2005
Aggregator Name (Market Share Percentage)
1. Bloglines (32.86%)
2. NetNewsWire (16.95%)
3. Firefox Live Bookmarks (7.78%)
4. Pluck (7.20%)
5. NewsGator Online(4.45%)
6. (not identified) (4.07%)*
7. FeedDemon (3.83%)
8. SharpReader (3.27%)
9. My Yahoo (2.58%)
10. iPodder (2.42%)
11. NewsGator (2.23%)
12. Thunderbird (2.13%)
13. RSS Bandit (1.12%)
14. NewsFire (1.05%)
15. iPodderX (1.02%)
16. Sage (0.71%)
17. FeedReader (0.67%)
18. RssReader (0.54%)
19. LiveJournal (0.46%)
20. Opera RSS Reader (0.45%)
* (not identified) represents an amalgam of clients that request feeds with a blank user-agent field. This total likely represents tens or hundreds of different clients and is not one particuarly large stealth project.
Comments
Hey Dick (Costolo, I assume), it's Randy, one of the original employees at 724. Great to see you blogging. Congrats on the success of Feedburner.
Looking at these results, it would seem to be that Bloglines (a Web-based aggregator) is badly misbehaving. According to the algorithm you gave, this should penalize Bloglines, yet they remain #1 and by far. Any thoughts?
Interesting stats in light of Mark Fletchers conversation with Gary Peterson last year over the bloglines biz model (http://weblogs.jupiterresearch.com/analysts/peterson/archives/005523.html) Im a blogliner so im glad to see them doing well :-)
I'm actually a little surprised, based on the obvious fact that Mac users love RSS, that NewsFire isn't on the list at all.
Could it be that people, including myself, enjoy looking at NewsFire and not actually use it?
First off, thanks for noting that FeedDemon is "an extremely efficient aggregator" - it's nice to see that recognized :)
I'm not about to claim that FeedDemon should be on top here - I know it has far fewer users than Bloglines - but there are two things I feel the need to mention:
1) Since FeedDemon only updates feeds in the active channel group, it's quite likely that many feeds aren't being updated "in a 24 hour period" in FeedDemon. For example, I have a group that contains some of your top 10 feeds, and I don't update this group every day. Am I correct in thinking that this would affect your chart (assuming many other FeedDemon users do the same thing)?
2) FeedDemon 1.5 enables you to access your feeds through Bloglines. Your server logs would show these hits as coming from Bloglines despite the fact that the feeds are being read in FeedDemon. NetNewsWire will soon offer the same feature, so it will be affected by this, too.
Maybe a pie chart with the top 10 aggregators would be illuminating, too.
Needless to say, these stats are fascinating to me - as the person who started the whole aggregator market share thread :-) So thanks for publishing this.
On Nick Bradbury's point, the 24-hr period issue was one that concerned my in my original analysis in Dec. Is it possible to enable Feedburner users to increase the timespan of the circulation numbers to, say, a week? This would probably give a fairer indication of circulation for the desktop aggregators, plus give another option to Feedburner users to track their stats.
Your projected Bloglines market share seems unusually high. I have a question about your methodology. If a FeedDemon user subscribes to 5 feeds, it is easy to count that as a single user since you can match against their IP. If a Bloglines user subscribes to 5 feeds, it is impossible to determine this just from the information in the HTTP request. I assume that in such situations you end up counting the user 5 times.
This would explain the hyperinflated Bloglines numbers.
Hi Dare,
Actually, that user that's using FeedDemon (or RSSBandit *smile*) will contribute to 5 different circulation numbers, so it ends up being the same. That is, when we calculate these aggregate numbers, we're taking the circulation number for each feed individually and summing them, so that FeedDemon user will add "5" to the overall count, just like the Bloglines subscriber.
Did that make any sense?
I would like to see the market share divided in two parts: one for the on line aggregator and one for the offline aggregator.
Nice idea - I was first suprised that NewsIsFree doesn't show up at all - then I realized that this is due to the underlying technique.
We don't show up at the top as our spider just hits a feedburner source ONCE in a defined period. The results get stored in our database and users/readers served from there. No matter how many users really subscribed to it. Bloglines hits it per subscriber. Not really bandwith friendly - but nice for such stats :)
Best
Pino Calzo
NewsIsFree
Nice idea - I was first suprised that NewsIsFree doesn't show up at all - then I realized that this could be due to the underlying technique.
Our spider just hits a feedburner source once in a defined period. The results get stored in our database and users/readers served from there. No matter how many users really subscribed to it.
wonder if that could explain it (but i can't believe that bloglines hits a source with every user subscribed?)
I never thought that RSS would ever have this much complexity. I saw a presentation of it at Baruch College here in NYC and it was fairly direct. In fact we're having a presentation of it at the NYPC General Mtg. here on Thurs. this week. If you want to find who's talking, etc. go to www.nypc.org . The NY PC Users' Group is a great organization, has a lot of SIG's and is well worth a few bucks per year. I certainly want to learn more about RSS and find out its advantage on websites. That's my big question. Any comments on that issue would be appreciated. And how is it made easy for the "newbie".
I never thought, that RSS will ever have it a lot of complexity. Thanks you! Owing to you I have solved the problem.
Can we expect an update analysis, this is more than a year old and would love to see what has changed since then.
