Message boards : Technical Support : Strange, is it a website problem?

andy_taximan
Joined: 10 Mar 16
Posts: 4
Credit: 966,037
RAC: 4,013
Message 21541 - Posted: 13 Oct 2017, 18:32:02 UTC

I'm seeing tasks that are validated by me where the same task also shows as aborted by me!
https://www.cosmologyathome.org/workunit.php?wuid=44332843 but what these links show can change if I post them elsewhere to show who really aborted the task. Very strange.
(Screenshot, if allowed: https://imgur.com/T8VadYc)

andy_taximan
Joined: 10 Mar 16
Posts: 4
Credit: 966,037
RAC: 4,013
Message 21542 - Posted: 13 Oct 2017, 18:41:42 UTC

https://www.cosmologyathome.org/workunit.php?wuid=44335629

Jonathan
Joined: 27 Sep 17
Posts: 19
Credit: 1,197,985
RAC: 12,102
Message 21543 - Posted: 13 Oct 2017, 21:37:53 UTC - in response to Message 21542.

Are you having computer problems? Is the machine getting stuck and not responding? Your computer shows 192 processors, which is a lot. I am wondering if you are actually getting the errors and then just getting the same work units right back.
The number of errors is high: 158 when I looked.

andy_taximan
Joined: 10 Mar 16
Posts: 4
Credit: 966,037
RAC: 4,013
Message 21544 - Posted: 14 Oct 2017, 3:23:32 UTC

No problems. I didn't abort them; they were aborted by grcpool. They just show up as my PC, which should be impossible? Err, I'm running 32x6 CPUs and they validate fine lol

Jonathan
Joined: 27 Sep 17
Posts: 19
Credit: 1,197,985
RAC: 12,102
Message 21545 - Posted: 14 Oct 2017, 7:21:57 UTC - in response to Message 21544.

You are getting a lot of errors like the one below. Check through all your attached computers for problems. You don't have a common setup since you are using grcpool.

<core_client_version>7.8.2</core_client_version>
<![CDATA[
<message>
aborted by user
</message>
<stderr_txt>
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting

</stderr_txt>
]]>
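
For anyone wanting to check many task pages for this pattern, here is a minimal sketch that pulls the exit message and the number of heartbeat timeouts out of one stderr block. It assumes the block looks like the one quoted above; the function name `summarize_stderr` and the `SAMPLE` constant are made up for illustration, and real task pages may be laid out differently.

```python
import re

# Sample copied from the post above; other tasks may format this differently.
SAMPLE = """<core_client_version>7.8.2</core_client_version>
<![CDATA[
<message>
aborted by user
</message>
<stderr_txt>
No heartbeat from core client for 30 sec - exiting
No heartbeat from core client for 30 sec - exiting

</stderr_txt>
]]>
"""


def summarize_stderr(text):
    """Return (message, heartbeat_timeouts) for one task's stderr block.

    Hypothetical helper: uses regular expressions rather than an XML
    parser because the block is not a well-formed XML document.
    """
    msg = re.search(r"<message>\s*(.*?)\s*</message>", text, re.DOTALL)
    body = re.search(r"<stderr_txt>(.*?)</stderr_txt>", text, re.DOTALL)
    lines = body.group(1).splitlines() if body else []
    # Count the lines where the science app gave up waiting for the client.
    timeouts = sum(
        1 for line in lines if line.startswith("No heartbeat from core client")
    )
    return (msg.group(1) if msg else None, timeouts)


if __name__ == "__main__":
    print(summarize_stderr(SAMPLE))
```

A task that reports "aborted by user" together with heartbeat timeouts may point at the client on that machine stalling rather than at anyone clicking abort, though that is only a guess from this one sample.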

andy_taximan
Joined: 10 Mar 16
Posts: 4
Credit: 966,037
RAC: 4,013
Message 21546 - Posted: 14 Oct 2017, 8:30:04 UTC

I'm not using grcpool though, and I haven't aborted any tasks since 3rd Oct.

Marius
Project administrator
Project developer
Project scientist
Joined: 29 Jun 15
Posts: 430
Credit: 4,276
RAC: 0
Message 21547 - Posted: 16 Oct 2017, 10:43:48 UTC

"I am wondering if you are actually getting the errors and then just getting the same work units right back"

Yeah, this is plausible. When a task is aborted, it gets resent to another user, and that could just randomly have been you again. I'd suspect something is wrong only if this is happening a lot (is it?).
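
To put a rough number on "a lot", here is a back-of-the-envelope sketch. It assumes each resent task is handed out independently and lands back on the same host with a fixed probability `host_share`, and that the server allows a host to receive the same work unit again; both are simplifying assumptions, not a description of how this project's scheduler actually works. The 1% share is invented for illustration, and the 158 errors mentioned earlier in the thread are used as a stand-in for the number of resent tasks.

```python
def expected_returns(resent_tasks, host_share):
    """Expected number of resent tasks that land back on the same host."""
    return resent_tasks * host_share


def prob_at_least_one(resent_tasks, host_share):
    """Chance that at least one resent task comes back to the same host."""
    return 1 - (1 - host_share) ** resent_tasks


if __name__ == "__main__":
    resent = 158   # error count quoted earlier in the thread (stand-in)
    share = 0.01   # hypothetical: host receives 1% of all assigned work
    print(f"expected repeats: {expected_returns(resent, share):.2f}")
    print(f"P(at least one repeat): {prob_at_least_one(resent, share):.2f}")
```

Under these assumptions a handful of repeats would be unremarkable for a host that requests this much work, while repeats on most of the failed tasks would be much harder to put down to chance.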
