Shows that PLoS One has an overall equal rate of citation to PLoS Biology, and that more of One's articles have been published in a recent year.
- Mike Chelen
Has anyone generated a slightly nicer data object out of this data yet? Been thinking of graphing the correlations of downloads versus citations versus whatever and similar for different journals which really requires a bit of cleaning up the data to be effective but if someone has already done it?
- Cameron Neylon
Cameron: what else needs to be done to make the data more usable? the source data here is available as TSV http://manyeyes.alphaworks.ibm.com/manyeye... and CSV or XLS too, is there any other format that would be better?
- Mike Chelen
I was wanting to do some analysis that included comparing papers based on time of publication i.e. "what is the average trajectory of downloads?" as well as comparing these across journals so I was hoping someone might have converted to either SQL and/or a set of python objects containing lists of downloads/citations/pageviews by month. Not difficult to do myself but just wondered whether someone else had already.
- Cameron Neylon
That would certainly do one of the things I had in mind but the big problem I was having was with wanting to come up with average initial rates and saturation points to see if there are any characteristics of "hot" vs "slow-burn" papers. I saw some evidence of this in the very crude graph analysis I did when the stats first came out.
- Cameron Neylon
from twhirl
how could the change in rates for each article be determined given only the totals? while the plos website includes a chart of an article's recent history, the data released so far can show how older and newer articles compare in terms of downloads per day like PDF files http://friendfeed.com/mikeche...
- Mike Chelen
Cameron: here can be seen which years and journals have articles with the most downloads per day http://friendfeed.com/mikeche... is this close to what you have in mind?
- Mike Chelen
Looks very nice Mike. We should have all the 'missing' usage data (pre Aug 2005 and first 200 PLoS ONE articles) added sometime this week.
- Peter Binfield
Peter: great thanks, looking forward to it! any preferences or suggestions about where people might want to look for or share data analysis results?
- Mike Chelen
Mike, just realised that I've got a somewhat different dataset that I think hasn't been publicly released yet which includes all these parameters by month - but as Pete points out there are some dates missing.
- Cameron Neylon
Actually Cameron, you dont. We have released the usage data down to the month level (and you may be referring to that), but not the citation/bookmarks/blogs/comment/notes etc data (although we track cumulative data on these items, by the month, we only started tracking it in March, so dont really have enough monthly data to release - though we could if people felt it was valuable)
- Peter Binfield
Allows complex search and filtering through imported data, and output of search results to RSS and other standard formats.
- Mike Chelen
from Bookmarklet