Forums :
Technical Support :
Cosmology@home tasks fail with Postponed: VM job unmanageable, restarting later if vboxsvc priority is set to idle
Message board moderation
Previous · 1 · 2
Author | Message |
---|---|
computezrmle Send message Joined: 4 Dec 08 Posts: 3 Credit: 12,475,043 RAC: 0 |
Volunteers frequently affected by the postponed issue may try a different vboxwrapper. BOINC's wiki pages mention communication problems between vboxwrapper and VirtualBox 6.x, especially on Windows. They offer premade executables that may solve the problems: https://boinc.berkeley.edu/trac/wiki/VboxApps#Premadevboxwrapperexecutables It would be the job of the project developers to test those vboxwrappers and distribute them to the clients. As long as this is not done volunteers could use the following steps as a workaround: 1. Download an alternative vboxwrapper from the page mentioned above (or use one you got from another project, e.g. LHC@home) 2. Start the BOINC client but suspend computing 3. Change to the project directory, e.g. projects/www.cosmologyathome.org, and replace the vboxwrapper there with the test version; the filename must be the name of the old vboxwrapper 4. Resume computing -> check the logfiles of tasks started after the patch Each restart of the BOINC client will replace the patch with the original vboxwrapper from the project server. This can be avoided setting <dont_check_file_sizes>1</dont_check_file_sizes> in cc_config.xml, but then all other automatic updates will also not work. |
Jim1348 Send message Joined: 17 Nov 14 Posts: 134 Credit: 5,412,499 RAC: 14 |
That is a masterpiece of concision. I have taken the liberty of posting it on Rosetta. They are in desperate (desperate) need of it for their python work units. Thanks, it saves me various other work-arounds, mainly involving going back to VBox 5.2.44. That is easy in Windows, but not so easy in Ubuntu. |
maeax Send message Joined: 21 Dec 17 Posts: 31 Credit: 8,383,702 RAC: 38,024 ![]() |
1. Download an alternative vboxwrapper from the page mentioned above (or use one you got from another project, e.g. LHC@home) There are three vboxwrapper (Theory, Atlas and CMS) |
Jonathan Send message Joined: 27 Sep 17 Posts: 185 Credit: 8,306,392 RAC: 5,376 ![]() |
The wrapper version reported in the logs doesn't match the wrapper file name in a lot of cases I have check. Log shows 2021-12-07 09:27:23 (10616): Detected: vboxwrapper 26202 but I was running 26203 downloaded from Boinc. I have seen this on other projects too. On Windows, do a properties of the wrapper and look at the details tab. LHC Atlas has wrapper as vboxwrapper_26198ab7_windows_x86_64 and it reports version as 26197 in the logs. I was running 4 concurrent 2 core tasks on my Intel i7 920. 4 Cores, 8 with HT. I used 26203 and the machine was stable with no other projects running. I didn't use that machine for anything else as it was just crunching. |
Peter Hucker of the Scottish Boinc Team Send message Joined: 5 Jul 11 Posts: 22 Credit: 892,222 RAC: 10,376 ![]() |
Same nonsense happening here - VM job unmanageable, restarting later. This is on 3 machines, all with plenty RAM, running latest VB. When I see loads stuck, I restart Boinc. Otherwise apparently they retry once a day. Not prepared to do the workaround as it seems to have side effects. |
Peter Hucker of the Scottish Boinc Team Send message Joined: 5 Jul 11 Posts: 22 Credit: 892,222 RAC: 10,376 ![]() |
And after giving them a shove, some produce computation errors. |
.clair. Send message Joined: 4 Nov 07 Posts: 626 Credit: 12,068,402 RAC: 0 |
And after giving them a shove, some produce computation errors. Is this the same three that will or will not run rosetta python. or is virtual blox giving you a new set of problems? |
Peter Hucker of the Scottish Boinc Team Send message Joined: 5 Jul 11 Posts: 22 Credit: 892,222 RAC: 10,376 ![]() |
I have 7 PCs. 1 runs Rosetta Python, 6 fail. Cosmology fails on all of them about 50% of the time. Since other people have similar problems, I'm not blaming my end. LHC works perfectly, but they're the only project that knows how to use the buggy Oracle crap properly. Not sure why Oracle still exists, they've never written anything decent.And after giving them a shove, some produce computation errors. |
Peter Hucker of the Scottish Boinc Team Send message Joined: 5 Jul 11 Posts: 22 Credit: 892,222 RAC: 10,376 ![]() |
Maybe things have improved - I just got cosmology VB tasks for my oldest and newest computers. The oldest one failed them all, but the newest one is succeeding on 12/14 tasks. |
.clair. Send message Joined: 4 Nov 07 Posts: 626 Credit: 12,068,402 RAC: 0 |
Just had a look at your computers list to look at the result files and nothing shows. is that because of your team/group setup ? |
Peter Hucker of the Scottish Boinc Team Send message Joined: 5 Jul 11 Posts: 22 Credit: 892,222 RAC: 10,376 ![]() |
I use a gridcoin pool, hence my computers are in his account, not mine. They are: http://www.cosmologyathome.org/show_host_detail.php?hostid=448254 http://www.cosmologyathome.org/show_host_detail.php?hostid=448255 http://www.cosmologyathome.org/show_host_detail.php?hostid=448252 http://www.cosmologyathome.org/show_host_detail.php?hostid=448253 http://www.cosmologyathome.org/show_host_detail.php?hostid=448261 http://www.cosmologyathome.org/show_host_detail.php?hostid=448263 http://www.cosmologyathome.org/show_host_detail.php?hostid=448260 |