Advanced search

Message boards : Technical Support : Camb_boinc2docker Errors

Author Message
ritterm
Send message
Joined: 30 May 08
Posts: 25
Credit: 896,913
RAC: 0
Message 21395 - Posted: 4 May 2017, 1:32:16 UTC
Last modified: 4 May 2017, 1:56:16 UTC

Three of my hosts are throwing off computation errors. Two AMD-Linux hosts have lines similar to the following the stderr output:

2017-05-03 17:59:37 (2320): Guest Log: Note: The following floating-point exceptions are signalling: IEEE_UNDERFLOW_FLAG
2017-05-03 17:59:37 (2320): Guest Log: ERROR STOP Stopped due to parameter error
2017-05-03 17:59:37 (2320): Guest Log: Optical depth is strange. You have: 9.1423397432200005E-003
2017-05-03 17:59:38 (2320): Guest Log: boinc_app exited (1)

Are some tasks naturally going to error out like this? If so, what's the expected error rate?

The errors for an Intel-Win10 host are access violations and I'm not sure there's much I can do.

Jim1348
Send message
Joined: 17 Nov 14
Posts: 48
Credit: 2,358,299
RAC: 0
Message 21397 - Posted: 4 May 2017, 6:17:44 UTC - in response to Message 21395.

Are some tasks naturally going to error out like this? If so, what's the expected error rate?

The errors for an Intel-Win10 host are access violations and I'm not sure there's much I can do.

I get three or four errors a day on my Windows 7 64-bit machine (4 cores of an i7-4771), and less than one error a day on my Ubuntu 16.10 machine (3 cores of an i7-4770). The error codes are either 1 (0x00000001) Unknown error code, or else 139 (0x0000008B) Unknown error code.

This has been the same for several years, so I expect it is normal.

Profile Marius
Project administrator
Project developer
Project scientist
Avatar
Send message
Joined: 29 Jun 15
Posts: 427
Credit: 4,276
RAC: 0
Message 21398 - Posted: 4 May 2017, 13:48:43 UTC - in response to Message 21397.

Are some tasks naturally going to error out like this? If so, what's the expected error rate?

The errors for an Intel-Win10 host are access violations and I'm not sure there's much I can do.

I get three or four errors a day on my Windows 7 64-bit machine (4 cores of an i7-4771), and less than one error a day on my Ubuntu 16.10 machine (3 cores of an i7-4770). The error codes are either 1 (0x00000001) Unknown error code, or else 139 (0x0000008B) Unknown error code.

This has been the same for several years, so I expect it is normal.

Its true, this occasional failure has been around for a while, this reminds thought that its actually fixable on our end, just with a bit of work, which I will try to get to shortly!

ritterm
Send message
Joined: 30 May 08
Posts: 25
Credit: 896,913
RAC: 0
Message 21400 - Posted: 4 May 2017, 15:53:47 UTC - in response to Message 21398.
Last modified: 4 May 2017, 15:57:51 UTC

Jim1348 wrote:
This has been the same for several years, so I expect it is normal.

Marius wrote:
Its true, this occasional failure has been around for a while, this reminds thought that its actually fixable on our end, just with a bit of work, which I will try to get to shortly!

Thanks for the feedback, guys.

With regard to the access violation errors, these are happening only on my Intel-Win10 hosts. One is 2-core/4-thread/8GB RAM and the other is a 4-core/4GB RAM. Is there anything I can do about that? Generally speaking, neither has trouble on other projects I've run recently (e.g., WCG, LHC, Asteroids, Universe).

Message boards : Technical Support : Camb_boinc2docker Errors