Advanced search

Forums : News : Recent outage explanation
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile Marius
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 29 Jun 15
Posts: 465
Credit: 4,276
RAC: 0
Message 21924 - Posted: 11 Sep 2018, 13:40:41 UTC

Hi all,

Over the last week we suffered a database corruption due to some disk errors. I've spent the last several days recovering the database from backups and from the corrupted files. Unfortunately, records of workunits from the last several weeks were lost, which means you will not receive credit for any of these jobs. I greatly apologize for this, and we've taken steps to make sure this doesn't happen again. The good news is that this was the only thing which could not be recovered, everything else is fine.

We're continuing to monitor things as the server comes back online, please report any problems you may find here.

Marius
ID: 21924 · Report as offensive     Reply Quote
Profile Marius
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 29 Jun 15
Posts: 465
Credit: 4,276
RAC: 0
Message 21925 - Posted: 11 Sep 2018, 14:43:40 UTC - in response to Message 21924.  
Last modified: 11 Sep 2018, 17:14:39 UTC

Note that work generation will likely struggle to keep up with the demand over the next several hours as everyone's computers are requesting work. This may cause you to receive a message that C@H has no available workunits, which should be temporary.
ID: 21925 · Report as offensive     Reply Quote
Tim Kunz

Send message
Joined: 20 Dec 07
Posts: 12
Credit: 6,583,271
RAC: 1,885
Message 21927 - Posted: 13 Sep 2018, 21:51:49 UTC - in response to Message 21924.  

Really? I lost thousands in credit...wasted CPU and power. We should have been notified so we could have switched to other projects in the interim.
ID: 21927 · Report as offensive     Reply Quote

Forums : News : Recent outage explanation