1) Forums : News : Docker-based applications upgrade (Message 21082)
Posted 2 Jun 2016 by Thunder
Post:
I'm glad to hear that explanation, because I was coming to say that my rate of error tasks had greatly increased (not decreased). I'll keep an eye on it for the next few days and see if that stops.
2) Forums : News : Server temporarily degraded (Message 20977)
Posted 31 Mar 2016 by Thunder
Post:
I came looking for why one of my machines decided to grab a single legacy task. This probably explains it. (Yeah, I was too lazy to look in its log.)

Glad to know "whatever" is all worked out now. :-)
3) Forums : Technical Support : Postponed: VM hypervisor failed (Message 20974)
Posted 31 Mar 2016 by Thunder
Post:
Well, that was after I also reinstalled VirtualBox, so perhaps the combination of the two. I did that after detaching from the project.
4) Forums : Technical Support : Postponed: VM hypervisor failed (Message 20972)
Posted 29 Mar 2016 by Thunder
Post:
Detaching from the project and re-attaching appears to be what fixed it.

I had tried rebooting (no success), shutting down the BOINC client and re-installing VirtualBox (no success), and finally detaching from and re-attaching to the project (that finally did it).

I had to replace my app_config.xml in the project directory, but fortunately had made a copy of it.
5) Forums : Technical Support : Virtual Box Questions (Message 20969)
Posted 27 Mar 2016 by Thunder
Post:
You would simply install VirtualBox for OS X hosts from https://www.virtualbox.org/wiki/Downloads (the link is to the right of each description).

Once installed, there is no other software necessary.

Make sure you've enabled VT-x in the BIOS (most PCs do not have it enabled by default) and then follow the FAQ to remove any <p_vm_extensions_disabled> line from your client_state.xml file.
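
For anyone who would rather script that edit than do it by hand, here is a minimal sketch. It only assumes the tag name given above; the sample text is a made-up excerpt, not a real client_state.xml, and the helper name strip_vm_disabled is my own. Shut down the BOINC client and keep a backup copy before changing the real file.

```python
# Tag name taken from the post above; everything else is illustrative.
TAG = "p_vm_extensions_disabled"


def strip_vm_disabled(xml_text: str) -> str:
    """Return xml_text without any line that opens the TAG element."""
    kept = [
        line
        for line in xml_text.splitlines(keepends=True)
        if "<" + TAG not in line  # matches both <tag>...</tag> and <tag/>
    ]
    return "".join(kept)


# Hypothetical excerpt, NOT copied from a real client_state.xml.
sample = (
    "<host_info>\n"
    "    <p_ncpus>4</p_ncpus>\n"
    "    <p_vm_extensions_disabled>1</p_vm_extensions_disabled>\n"
    "</host_info>\n"
)

cleaned = strip_vm_disabled(sample)
print(cleaned)
```

To apply it for real you would read the file into a string, pass it through the function, and write the result back while the client is stopped. Where client_state.xml lives depends on your installation.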

I found on all of my machines that letting VirtualBox use every CPU core slowed the computer down terribly. If you see the same, just follow the instructions under "How can I limit the number of CPUs used?" in the FAQ. Leaving just one core free (a limit of 3 cores on a 4-core machine, 5 on a 6-core, 7 on an 8-core, etc.) was sufficient for me.

The instructions only cover the "camb_boinc2docker" application; if you're running the Planck sims too, duplicate the entries for "lsplitsims".
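
As a rough illustration of the "one core free, same entries for both applications" idea, the sketch below generates an app_config.xml body. The element names (app_config, app_version, app_name, plan_class, avg_ncpus) are the standard BOINC client ones, but the FAQ's own entries are authoritative. The helper build_app_config is hypothetical, and plan_class is left out by default because I'm not certain of the right value for these applications.

```python
import xml.etree.ElementTree as ET


def build_app_config(app_names, total_cores, plan_class=None):
    """Build app_config.xml text capping each app at total_cores - 1 CPUs."""
    limit = max(1, total_cores - 1)  # leave one core free, never below 1
    root = ET.Element("app_config")
    for name in app_names:
        version = ET.SubElement(root, "app_version")
        ET.SubElement(version, "app_name").text = name
        if plan_class is not None:
            # Assumption: take the plan class from the FAQ, do not guess it.
            ET.SubElement(version, "plan_class").text = plan_class
        ET.SubElement(version, "avg_ncpus").text = str(limit)
    if hasattr(ET, "indent"):  # pretty-printing exists in Python 3.9+
        ET.indent(root)
    return ET.tostring(root, encoding="unicode")


# Application names as given in the post; a 4-core machine gets a limit of 3.
config = build_app_config(["camb_boinc2docker", "lsplitsims"], total_cores=4)
print(config)
```

The output goes into app_config.xml in the project directory. Keep a copy of it somewhere else, since detaching from the project removes it.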

As for the slowing down, etc.: no. The point behind virtualization (which has been around for 10 years or so) is to allow software in "virtual" machines to access the system's hardware very nearly as fast as if it were running on the native OS.
6) Forums : Technical Support : Postponed: VM hypervisor failed (Message 20964)
Posted 26 Mar 2016 by Thunder
Post:
Unfortunately another 20 Planck tasks exhibited the same behavior after completely restarting the (physical) machine.

For now I've detached from the project on this machine. I'll try on Monday to re-install VirtualBox and attach to the project again.
7) Forums : Technical Support : Postponed: VM hypervisor failed (Message 20961)
Posted 25 Mar 2016 by Thunder
Post:
I just had 14 tasks fail (I finally aborted them) with: "task postponed 86400.000000 sec: Communication with VM Hypervisor failed."

This is a machine that's actually been updated to VirtualBox 5.0.16, so I know it's not a latest-version issue.

It's host 280861 and there are plenty of tasks to view the stderr output from.
8) Forums : Technical Support : Long Running Planck Tasks (Message 20927)
Posted 10 Mar 2016 by Thunder
Post:

Haha... Unfortunately I realize though, the aim of these jobs is to find the minimum function value. So literally, these jobs are trying to minimize fun :(


As my wife would say, "There you go, getting all detail-ly on me." (She insists it's a word.)

Alas, if minimizing the fun gets the science done, I suppose I'll just have to live with it. ;-)
9) Forums : Technical Support : Long Running Planck Tasks (Message 20925)
Posted 10 Mar 2016 by Thunder
Post:
2889 is in the top 0.1% of longest jobs it looks like!

Whoa... Task 37153848:

2016-02-28 10:14:31 (9908): Guest Log: nfev: 3523


Now now... nfev may be interesting, but I'm enjoying the next output more:

fun: 194.45174567033487

I've always wondered if my boxes enjoy their work. Now I not only know that they do, but have a quantifiable measure of just how much! ;-)

*sarcasm mode now off*
10) Forums : Technical Support : errors on download (Message 7659)
Posted 17 Nov 2008 by Thunder
Post:
WOW! My 13 work computers all went for the entire weekend without a single http error on a download! :D

I'm fairly sure I've never gone that long without an error since the problem first began, so I'm cautiously optimistic that you've cracked this nut!

Thanks for working to get it resolved. :)
11) Forums : Technical Support : errors on download (Message 7623)
Posted 11 Nov 2008 by Thunder
Post:
Well, when I'm wrong, I'm definitely wrong, and now is one of those times...

Cosmology@Home 11/11/2008 12:31:29 PM Temporarily failed download of params_110708_131017_0.ini: http error
Cosmology@Home 11/11/2008 12:31:29 PM Backing off 5 min 40 sec on download of params_110708_131017_0.ini


Note that 11/07/08 is AFTER Scott believes he made a fix for this problem, so it's at least an indication that it may not have been successful.
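
The date reading above can be checked mechanically. This is only a sketch: it assumes the file names follow a params_MMDDYY_HHMMSS_N.ini layout, which is inferred from the names in this thread rather than documented anywhere I know of, and the cutoff is a placeholder (6 Nov 2008 is just the date of the reply to Scott's announcement, not the exact time of his fix).

```python
import re
from datetime import datetime

# Assumed layout: params_MMDDYY_HHMMSS_N.ini (inferred, not documented).
NAME_RE = re.compile(r"params_(\d{6})_(\d{6})_\d+\.ini$")


def created_at(filename: str) -> datetime:
    """Parse the timestamp embedded in a params file name."""
    match = NAME_RE.search(filename)
    if match is None:
        raise ValueError(f"unexpected file name: {filename!r}")
    return datetime.strptime(match.group(1) + match.group(2), "%m%d%y%H%M%S")


# Placeholder cutoff; the real time of the server-side fix is not known here.
fix_cutoff = datetime(2008, 11, 6)

stamp = created_at("params_110708_131017_0.ini")
print(stamp, "after cutoff:", stamp > fix_cutoff)
```

Running the same function over every failed file name in the message log would show quickly whether failures are confined to files created before the fix.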

It does seem to be happening less frequently at least... :\
12) Forums : Technical Support : errors on download (Message 7619)
Posted 10 Nov 2008 by Thunder
Post:
it seems that only october work units are affected by this problem. Could you let me know if that is NOT true, ie if you've seen counterexamples?
...
Ben


Well, it's not entirely accurate to say that it's only "October" WUs; rather, the only ones that seem affected are among those created prior to the date/time of Scott's post re: the upload handler.

I have not personally seen any units created after his post have download errors, so tentatively I would say that it fixed the problem. A majority of the WUs that were experiencing the problem have been cancelled by now anyhow under the "too many error" rules.

Unless it's more widespread than I'm seeing, I'd say the problem is pretty close to working itself out at this point.
13) Forums : Technical Support : errors on download (Message 7578)
Posted 6 Nov 2008 by Thunder
Post:
Looks as though the file_upload_handler has been segfaulting. I just replaced it with the newest version of the server code, so we'll see if that changes things.


Thanks for the update, Scott. I had 3 failed downloads on the work machines when I arrived this morning, but it's entirely likely they were attempted (or at the very least created) before you made that change. Since the problem began, I've never gone more than a few hours without a failed download, so I'm sure it won't take long to tell if this made a difference.
14) Forums : Technical Support : errors on download (Message 7569)
Posted 4 Nov 2008 by Thunder
Post:
Guess I just had a bad day then. I re-enabled work based on your comments, phoneman, and about 2/3 of the computers received .ini files successfully. The other 1/3 received good ones after I aborted the bad ones.

<shrug> Guess I'll just keep aborting downloads as I have the chance to. The only project that seems to suffer is Cosmology, so I guess they'll do something about it when they want to be more productive.
15) Forums : Technical Support : errors on download (Message 7562)
Posted 3 Nov 2008 by Thunder
Post:
Okay, scratch that... at the moment, every single computer I have that is trying to get new work is getting http errors on every single download attempt. I've set "no new work" on all of them until I hear that something has changed.
16) Forums : Technical Support : errors on download (Message 7561)
Posted 3 Nov 2008 by Thunder
Post:
I came in on Monday to find that nearly every single work computer had a failed download pending from sometime over the span of the weekend.

This is definitely one of those issues that is going to force me to suspend the project on all but a single computer (for testing purposes).

I'll keep them on if the project folks think it would help or if they're working on finding a solution, but otherwise, I just don't see that I have any choice but to back off this one for a while.
17) Forums : Technical Support : errors on download (Message 7542)
Posted 28 Oct 2008 by Thunder
Post:
This problem continues, with no discernible pattern, often enough that I'm going to stop posting every time it happens unless the project folks need more specific examples.
18) Forums : Technical Support : errors on download (Message 7538)
Posted 27 Oct 2008 by Thunder
Post:
Another, about 24 hours old (just discovered it this morning):

Cosmology@Home 10/26/2008 6:25:02 AM Started download of params_102208_152954_0.ini
Cosmology@Home 10/26/2008 6:25:06 AM Temporarily failed download of params_102208_152954_0.ini: http error
Cosmology@Home 10/26/2008 6:25:06 AM Backing off 1 min 0 sec on download of params_102208_152954_0.ini
19) Forums : Technical Support : errors on download (Message 7520)
Posted 23 Oct 2008 by Thunder
Post:
Cosmology@Home 10/23/2008 3:40:12 PM Started download of params_101808_233947_1.ini
Cosmology@Home 10/23/2008 3:40:14 PM Temporarily failed download of params_101808_233947_1.ini: http error


I'm still getting this sporadically. Here was another occurrence.

20) Forums : Technical Support : errors on download (Message 7514)
Posted 22 Oct 2008 by Thunder
Post:
Yep, just happened on another:

Cosmology@Home 10/22/2008 11:14:40 AM Sending scheduler request: To fetch work. Requesting 27 seconds of work, reporting 1 completed tasks
Cosmology@Home 10/22/2008 11:14:45 AM Scheduler request succeeded: got 1 new tasks
Cosmology@Home 10/22/2008 11:14:47 AM Started download of params_101908_002837_0.ini
Cosmology@Home 10/22/2008 11:14:48 AM Temporarily failed download of params_101908_002837_0.ini: http error
Cosmology@Home 10/22/2008 11:14:48 AM Backing off 1 min 0 sec on download of params_101908_002837_0.ini

