1) Forums : News : Recent outage explanation (Message 21925)
Posted 7 days ago by Profile Marius
Post:
Note that work generation will likely struggle to keep up with the demand over the next several hours as everyone's computers are requesting work. This may cause you to receive a message that C@H has no available workunits, which should be temporary.
2) Forums : News : Recent outage explanation (Message 21924)
Posted 7 days ago by Profile Marius
Post:
Hi all,

Over the last week we suffered a database corruption due to some disk errors. I've spent the last several days recovering the database from backups and from the corrupted files. Unfortunately, records of workunits from the last several weeks were lost, which means you will not receive credit for any of these jobs. I greatly apologize for this, and we've taken steps to make sure this doesn't happen again. The good news is that this was the only thing which could not be recovered; everything else is fine.

We're continuing to monitor things as the server comes back online. Please report any problems you may find here.

Marius
3) Forums : General Topics : Cross-Project stats page broken. :( (Message 21918)
Posted 22 days ago by Profile Marius
Post:
Will take a look, thanks for catching that.
4) Forums : Technical Support : Vbox not working here (Message 21900)
Posted 13 Aug 2018 by Profile Marius
Post:
Thanks for that heroic undertaking, and sorry it's still not working. I'm not sure I understand why. I've just marked your computer as having extensions enabled by hand on the server. Maybe that will force you to get a VM job and the whole thing will get reset, but I'm not sure. Let me know if you notice any changes in what's happening.
5) Forums : Technical Support : Vbox not working here (Message 21896)
Posted 12 Aug 2018 by Profile Marius
Post:
Thing is, on my BOINC no matter if I remove that and save the file and rename the old file something else, BOINC rewrites <p_vm_extensions_disabled>

So still stuck with legacy tasks. It's almost like VM app is not communicating with BOINC but yet BOINC knows it is there. 8/12/2018 11:10:44 AM | | VirtualBox version: 5.2.16

This looks like your machine:
http://www.cosmologyathome.org/show_host_detail.php?hostid=362858

It states: "CPU does not have hardware virtualization support".

You probably just need to enable it in your motherboard BIOS.


The C@H website will still say this because the underlying problem is that he *has* enabled it but it is not recognized by BOINC (hence receiving no workunits). Once you get your local client to recognize that extensions are enabled, this will propagate to the C@H server and you'll then start receiving workunits.
6) Forums : Technical Support : Vbox not working here (Message 21891)
Posted 12 Aug 2018 by Profile Marius
Post:
The work generation is back after a temporary stoppage. Were you able to get the p_vm_extensions_disabled thing to stop appearing? Another thing you might try there is to also delete everything in the slots/ directory before restarting the BOINC client. In any case, apologies for that bug. I know it's extremely annoying and we definitely need to fix it. Unfortunately, despite knowing about it for a while, we haven't had the manpower to do it. It's high on my priority list though.
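
For anyone attempting the manual edit discussed in this thread, here is a minimal sketch of stripping the flag from the text of client_state.xml while the BOINC client is stopped. It assumes the flag is written either as `<p_vm_extensions_disabled>1</p_vm_extensions_disabled>` or as a self-closing tag; check your own file first, since the exact form is an assumption here. The helper name `strip_vm_flag` is hypothetical. As reported above, the client may simply write the flag back on restart, so this is not a guaranteed fix.

```python
import re

# Assumption: the flag is either a self-closing element or an element with a
# value, on a single line. Both forms are matched together with their
# indentation and trailing newline so no blank line is left behind.
_FLAG = re.compile(
    r"[ \t]*<p_vm_extensions_disabled\s*/>[ \t]*\n?"
    r"|[ \t]*<p_vm_extensions_disabled>.*?</p_vm_extensions_disabled>[ \t]*\n?"
)


def strip_vm_flag(xml_text: str) -> str:
    """Return xml_text with every p_vm_extensions_disabled element removed."""
    return _FLAG.sub("", xml_text)


# Illustrative fragment only, not a real client_state.xml.
sample = (
    "<host_info>\n"
    "    <p_ncpus>4</p_ncpus>\n"
    "    <p_vm_extensions_disabled>1</p_vm_extensions_disabled>\n"
    "</host_info>\n"
)
cleaned = strip_vm_flag(sample)
print(cleaned)
```

Keep a backup copy of the original file before writing any edited text back.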
7) Forums : News : camb_legacy tasks temporarily suspended (Message 21882)
Posted 10 Aug 2018 by Profile Marius
Post:
I see two of your computers do not have hardware virtualization support, which means they won't be able to receive camb_boinc2docker workunits. You have one which does have support, but you need to enable it. Afterwards, you may also need to perform the steps here.
8) Forums : Technical Support : No tasks (Message 21879)
Posted 10 Aug 2018 by Profile Marius
Post:
Fixed now, sorry for the temporary hiccup.
9) Forums : News : All tasks functioning normally (Message 21878)
Posted 10 Aug 2018 by Profile Marius
Post:
If you're interested in the details, the camb_legacy problem was fixed here: https://github.com/BOINC/boinc/pull/2616

The server upgrades were based on updating to use boinc-server-docker 3.0.0, which is now based on PHP7/Debian Stretch (old version was PHP5/Debian Jessie) and MariaDB 10.3.8 (old version was MySQL 5.7.19).

These things need not mean anything to you to continue meaningful crunching for C@H :)
10) Forums : News : All tasks functioning normally (Message 21877)
Posted 10 Aug 2018 by Profile Marius
Post:
The problem affecting the camb_legacy tasks has been resolved. The server also recently received some under-the-hood upgrades which should improve performance and security.
11) Forums : News : camb_legacy tasks temporarily suspended (Message 21876)
Posted 10 Aug 2018 by Profile Marius
Post:
If camb_legacy is offline, then what is this?
8/10/2018 1:10:53 AM | Cosmology@Home | No tasks are available for planck_param_sims and why is it not online?

If things don't get fixed soon, I think I will have to leave the project, because when you do come back online, your going to clog my system with a backlog of tasks because you have been offline so long, which in turn knocks all my other projects backwards.


Hi Greg, the camb_legacy tasks have been back online for a few weeks. Prompted by your message, let me post a news item to avoid any confusion. The planck_param_sims app is an older app which we are unlikely to use again in the future (I will likely deprecate it soon, although up until somewhat recently it was possible we were going to run a few more of those workunits, but that now looks unlikely). In either case, when we add new workunits it shouldn't affect normal resource sharing between your other projects. Please let us know if anything weird is happening.
12) Forums : Technical Support : How do you stop a C@H VM task so it will restart on the next PC bootup? (Message 21865)
Posted 3 Aug 2018 by Profile Marius
Post:
Indeed, we don't currently do checkpointing. Instead, the jobs are just kept very short so losing one keeps the wasted computing resources to a minimum (and on our end, there's absolutely no scientific loss from losing any one particular job). That said, it's definitely on the wishlist and I imagine we'll have it at some point.
13) Forums : General Topics : Formula BOINC sprint at Cosmology@Home, July 2018 (Message 21841)
Posted 22 Jul 2018 by Profile Marius
Post:
Hi all,

It looks like the validator has reduced the 160,000 backlog down to something like 83,000, so that's really good news for the Sprint...

Maybe the validator has been fixed or at least some "tweaks" have been made so that tasks are being rewarded with credits...

Yea, I figured out the problem Friday and have set it to work on the backlog since then. It won't completely finish by the end of today but better than nothing.

So, well done to the project admins if that's the case :-)

Thank you!
14) Forums : Technical Support : Hostname problems (Message 21833)
Posted 21 Jul 2018 by Profile Marius
Post:
I suspect the project scheduler sent broken replies for a brief while, which messed up client state. It happened around 12:48 UTC, if I recall correctly.

Sorry about that, that's exactly what happened for about 20 minutes.


A possible fix which worked for me was to shut down the client, correct all corrupted download URLs in client_state.xml using a text editor or a tool like sed, then start the client again.

If you were affected by this, another solution is just to abort your camb_boinc2docker tasks, abort the stalled transfers in the Transfers tab, then detach and reattach the project. Let me know if anyone has any remaining issues.
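
For those who prefer the manual route of correcting the URLs in client_state.xml, here is a minimal sketch of a prefix replacement on the file's text, done with the client shut down. The thread does not say what the corrupted URLs looked like, so the broken and corrected prefixes below are placeholders. The `<download_url>` element name and the helper `fix_download_urls` are also assumptions; check which tags and values your own file contains before applying anything like this.

```python
import re


def fix_download_urls(xml_text: str, bad_prefix: str, good_prefix: str) -> str:
    """Replace bad_prefix with good_prefix at the start of each <download_url> value."""
    pattern = re.compile(r"(<download_url>\s*)" + re.escape(bad_prefix))
    # A function is used as the replacement so that good_prefix is inserted
    # literally, even if it contains backslashes.
    return pattern.sub(lambda m: m.group(1) + good_prefix, xml_text)


# Illustrative fragment with placeholder hosts, not real project URLs.
sample = (
    "<file>\n"
    "    <name>example.zip</name>\n"
    "    <download_url>http://broken.invalid/download/example.zip</download_url>\n"
    "</file>\n"
)
fixed = fix_download_urls(sample, "http://broken.invalid/", "http://project.example/")
print(fixed)
```

Restricting the replacement to the URL elements avoids touching unrelated text that happens to contain the same string.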
15) Forums : News : camb_legacy tasks temporarily suspended (Message 21815)
Posted 20 Jul 2018 by Profile Marius
Post:
Hello,
Thanks for info.
But because legacy WU validator is off, you exclude a lot of people to participate at the sprint on Formula !!!
In fact you exclude all who do not want (or can not) install Virtual box !


I do apologize for that, but at the same time there's not much I can do when the sprint chose to run C@H even though we had had a notice up for a few weeks stating that camb_legacy was currently broken.
16) Forums : General Topics : Formula BOINC sprint at Cosmology@Home, July 2018 (Message 21813)
Posted 20 Jul 2018 by Profile Marius
Post:
I'm very happy to have C@H participate in the Formula BOINC sprint. However, I do wish the organizers had contacted me earlier; then we could have built a much bigger work pool in preparation (by the time I saw this post yesterday it was too late). We could have also communicated the ongoing issues with the camb_legacy app to the sprinters better, and pointed everyone toward running only camb_boinc2docker.

To clarify / confirm what has been said, camb_boinc2docker validation is running completely smoothly. Work generation is also running at normal pace, but demand is very high so it is likely slow for clients to grab workunits. camb_legacy validation on the other hand is currently broken and has been for several weeks. For those participating in the sprint, it's unlikely I can fix it in time for the Sunday deadline, unfortunately.
17) Forums : News : camb_legacy tasks temporarily suspended (Message 21812)
Posted 20 Jul 2018 by Profile Marius
Post:
Some updates:

* The camb_legacy work generator was turned back on by accident. It's off now, since I'm still unable to fix the validation problem (the validator is sporadically segfaulting and I've not been able to figure out why yet).

* The camb_boinc2docker workunits *are* being generated as normal. However, there is a Formula BOINC sprint happening which is draining them about as fast as we can generate them. This should be over by Sunday, July 22, 22:00 UTC as I understand it.
18) Forums : News : camb_legacy tasks temporarily suspended (Message 21779)
Posted 16 Jul 2018 by Profile Marius
Post:
Hello,
Credits here are ridiculous to compare with other project.
I compare bteween project , based on the same host.(355352)
And on all my four host, everywhere the same : this project give the lowest credit/WU.

Credit/WU isn't a good metric since WUs across different projects can be different lengths (our WUs are quite short, for example, hence credit/WU is fairly small). In any case, since every project is free to award credits as it pleases, you should not put too much stock in cross-project comparison of credits. In case you weren't familiar with it, Cosmology@Home uses the BOINC Credit New system; I don't think it's perfect, but I would say it's the best we have. Discussion about that is probably best for the BOINC message boards / email lists.
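
To illustrate why credit/WU can mislead, here is a small sketch comparing credit per hour instead. All numbers are made up for illustration and do not describe any real project. The helper `credit_per_hour` is hypothetical.

```python
def credit_per_hour(credit_per_wu: float, hours_per_wu: float) -> float:
    """Normalize credit by runtime so WUs of different lengths are comparable."""
    return credit_per_wu / hours_per_wu


# Hypothetical project with short WUs: low credit per WU, 15 minutes each.
short_wu_rate = credit_per_hour(credit_per_wu=10.0, hours_per_wu=0.25)

# Hypothetical project with long WUs: high credit per WU, 4 hours each.
long_wu_rate = credit_per_hour(credit_per_wu=120.0, hours_per_wu=4.0)

print(short_wu_rate, long_wu_rate)
```

In this made-up case the short-WU project pays 12 times less per WU but slightly more per hour of computing.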

Also, apologies that I've not solved the camb_legacy issues yet; it's proving somewhat difficult. I will get it sorted out, but in the meantime I encourage everyone to take a look at running camb_boinc2docker jobs, which are of more use to us scientifically anyway.
19) Forums : News : camb_legacy tasks temporarily suspended (Message 21778)
Posted 16 Jul 2018 by Profile Marius
Post:
Greetings. Since the camb_legacy tasks were "tweaked", my cosmology@home workload has been heavy and all jobs appear to complete successfully. However, I'm receiving very little credit for the work done. Where I once was receiving between 5,000-6,900 credits daily, I now receive only a few hundred credits per day, if any at all, even though I continue to process as many jobs as before. I suspect there might be something wrong with the accreditation process. Can you please address this issue? Thanks. - Gene Weber

Hi Gene, looking at your hosts, the credits/job seems normal to me, the problem is simply that we don't have many camb_legacy jobs being sent out since I haven't been able to fix the problem. Are you sure the issue isn't simply that you are crunching fewer jobs and so not receiving as much credit? You could also look into setting up your computers to run camb_boinc2docker jobs, which are running fine at the moment.
20) Forums : Technical Support : Can't get any tasks to download (Message 21772)
Posted 8 Jul 2018 by Profile Marius
Post:
Unfortunately, I started getting the "didn't enter an online state in a timely fashion" error next. I solved that by uninstalling Trusteer.

Oh sorry, I misread your message, you did solve that error. It's interesting that uninstalling an antivirus did that; I can possibly recommend that to others who see the same thing. Thanks for reporting that.

