2007-04-04 04:30:48
How long is data exported from the projects stored in BONCstats? I've been thinking about storing the various daily . gz files from the projects on a local drive for a sort of longitudinal study of BOINC.
I did a ballpark, generous estimate of 500MB of total zipped data exported per day from all BOINC projects (taking a look at several large projects' .../stats/ directories). I'm not sure about the rate of change on that 500MB, but will assume for simplicity's sake that it's constant. That sets a 200GB harddrive having capacity for about 400 days (200GB/(0.5GB/day)).
With that data I could do detailed analyses on BOINC projects, and make pretty graphs. In the long term I'd like to incorporate graphs (bell curves, etc. - more than simply chronological bar charts and pie charts) of total credit and RAC, active user retention rates, statistics on active CPUs (omitting skewing inactive ones), and other things.
For those who would like, they could also give information to permit access to their detailed statistics in the .../home.php pages on various projects, to give information on individual work unit completion times and credit earned. US participants could offer postal codes to show regional contributions and compete; similar services could be applied for other countries.
Thus far I'm only automatically downloading some .gz files from the projects to my local drive, but those extra things could be implemented with time. Any insight on the data storage aspect of BOINCstats would be a great help. Thanks.