Advanced search

Forums : Technical Support : URGENT PROBLEMS THREAD (2009 and after)
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 6 · 7 · 8 · 9

AuthorMessage
Klimax

Send message
Joined: 24 Oct 07
Posts: 22
Credit: 648,291
RAC: 0
Message 11048 - Posted: 13 May 2012, 17:47:03 UTC
Last modified: 13 May 2012, 17:58:50 UTC

Hello.

I'm getting an error on two files for upload. Each file is from different WU. About five attempts throught the day were made.

Errors:
13.5.2012 19:40:27 | Cosmology@Home | Temporarily failed upload of wu_051212_051616_0_0_0_3: transient HTTP error

13.5.2012 19:40:27 | Cosmology@Home | Temporarily failed upload of wu_051212_051616_0_1_0_3: transient HTTP error

Rest of files uploaded correctly.
(WUs http://www.cosmologyathome.org/result.php?resultid=21477251 and http://www.cosmologyathome.org/result.php?resultid=21477248)

Any idea what went wrong?
Thanks.

ETA: Captured http trafic by fiddler and it saw two messages - first reporting filesize, which succeeded and second message for upload itself, which timed out without any reply from server.
ID: 11048 · Report as offensive     Reply Quote
Klimax

Send message
Joined: 24 Oct 07
Posts: 22
Credit: 648,291
RAC: 0
Message 11049 - Posted: 13 May 2012, 20:58:15 UTC - in response to Message 11048.  

Hello.

I'm getting an error on two files for upload. Each file is from different WU. About five attempts throught the day were made.

Errors:
13.5.2012 19:40:27 | Cosmology@Home | Temporarily failed upload of wu_051212_051616_0_0_0_3: transient HTTP error

13.5.2012 19:40:27 | Cosmology@Home | Temporarily failed upload of wu_051212_051616_0_1_0_3: transient HTTP error

Rest of files uploaded correctly.
(WUs http://www.cosmologyathome.org/result.php?resultid=21477251 and http://www.cosmologyathome.org/result.php?resultid=21477248)

Any idea what went wrong?
Thanks.

ETA: Captured http trafic by fiddler and it saw two messages - first reporting filesize, which succeeded and second message for upload itself, which timed out without any reply from server.

Update: Only first one remains. The other got through.
ID: 11049 · Report as offensive     Reply Quote
Profile cykodennis

Send message
Joined: 31 May 10
Posts: 234
Credit: 4,896,378
RAC: 0
Message 11050 - Posted: 14 May 2012, 12:50:38 UTC - in response to Message 11039.  

Any news on this? Did the download errors become less frequent or---dare I say it---disappear?

Best,
Ben


I've sent you a log of a cykomachine per PM, related to the last seven days.
There are still download errors.
You can search for them by copying the text into a textfile and search for the word "giving".

ID: 11050 · Report as offensive     Reply Quote
Profile cykodennis

Send message
Joined: 31 May 10
Posts: 234
Credit: 4,896,378
RAC: 0
Message 11051 - Posted: 14 May 2012, 13:44:12 UTC

Now about the upload errors
Task ID 21462987 is one of this WUs, which fails always since it was finished.
My Boinc-Client just says "HTTP Error"
ID: 11051 · Report as offensive     Reply Quote
Klimax

Send message
Joined: 24 Oct 07
Posts: 22
Credit: 648,291
RAC: 0
Message 11069 - Posted: 16 May 2012, 4:38:43 UTC - in response to Message 11049.  

...
Update: Only first one remains. The other got through.

That one still can't finish uploading and now another two can't be uploaded.
Server times out in all three cases.
ID: 11069 · Report as offensive     Reply Quote
Profile cykodennis

Send message
Joined: 31 May 10
Posts: 234
Credit: 4,896,378
RAC: 0
Message 11070 - Posted: 16 May 2012, 7:08:41 UTC

My exampletask is shown by the project as "in progress".
So, it will reach its deadline on May 26, ending in a "timeout - no response", triggering the next downloaderror WU.

ID: 11070 · Report as offensive     Reply Quote
Klimax

Send message
Joined: 24 Oct 07
Posts: 22
Credit: 648,291
RAC: 0
Message 11071 - Posted: 16 May 2012, 14:05:14 UTC - in response to Message 11069.  

...
Update: Only first one remains. The other got through.

That one still can't finish uploading and now another two can't be uploaded.
Server times out in all three cases.


Final hopefully update: All files uploaded.
ID: 11071 · Report as offensive     Reply Quote
Profile Michael Roberts
Avatar

Send message
Joined: 27 Sep 07
Posts: 1
Credit: 231,400
RAC: 0
Message 12742 - Posted: 30 Mar 2013, 22:03:51 UTC

Sat 30 Mar 2013 10:58:46 PM CET | Cosmology@Home | Finished download of params_033013_121527_2.ini
Sat 30 Mar 2013 10:58:46 PM CET | Cosmology@Home | Giving up on download of params_031513_022037_0.ini: permanent HTTP error
Sat 30 Mar 2013 10:58:46 PM CET | Cosmology@Home | Started download of params_032613_124844_1.ini
Sat 30 Mar 2013 10:58:47 PM CET | Cosmology@Home | Giving up on download of params_032613_124844_1.ini: permanent HTTP error

Recent messages mention failed uploads. I am getting quite a lot of failed downloaads as shown above. No similar problems with other projects.
ID: 12742 · Report as offensive     Reply Quote
.clair.

Send message
Joined: 4 Nov 07
Posts: 607
Credit: 11,335,070
RAC: 15,880
Message 12743 - Posted: 31 Mar 2013, 21:28:12 UTC

The download errors are a long term problem with C@H,
we all get them,
Just ignore them,
The problem is at the server and will be fixed, errm, sometime :(
ID: 12743 · Report as offensive     Reply Quote
Jesse Viviano

Send message
Joined: 29 Nov 14
Posts: 35
Credit: 781,933
RAC: 0
Message 20182 - Posted: 16 Dec 2014, 23:18:20 UTC

The project's assimilator has been down for a long time. As of this writing, 297,507 work units are still waiting for assimilation. This could cause two possible problems in my mind that could be critical.

The first possible problem is that the deleter cannot do its job if work units are not assimilated. This could eventually cause the disk or disks to get full, causing availability issues.

The second possible problem is that without assimilation and deletion, old work units linger around in the database, which can keep it bloated with old work units and slow it down because the deleter cannot delete database entries for completed work units until the assimilator has processed them.

The third possible problem is that if assimilation feeds the results into the work generator in a feedback loop to improve the work being generated like some other BOINC projects, then plenty of our work is getting wasted because the results of our work is not improving the work being generated. I do not know if this project uses a feedback loop or not, so this point is invalid if there is no feedback loop or if the feedback loop is generated by the validator instead of the assimilator.
ID: 20182 · Report as offensive     Reply Quote
kararom

Send message
Joined: 9 Jan 09
Posts: 69
Credit: 29,506,700
RAC: 0
Message 20187 - Posted: 14 Jan 2015, 20:50:06 UTC - in response to Message 20182.  

What about assimiliator, Ben?
ID: 20187 · Report as offensive     Reply Quote
Jesse Viviano

Send message
Joined: 29 Nov 14
Posts: 35
Credit: 781,933
RAC: 0
Message 20204 - Posted: 10 Feb 2015, 3:47:40 UTC

I see that the assimilator was turned back on, but it has failed again. There are 102,190 work units waiting for assimilation as of this writing.
ID: 20204 · Report as offensive     Reply Quote
Profile C@H Sceptic

Send message
Joined: 23 Jan 15
Posts: 17
Credit: 101,772
RAC: 0
Message 20205 - Posted: 14 Feb 2015, 9:40:19 UTC - in response to Message 20204.  

I see that the assimilator was turned back on, but it has failed again. There are 102,190 work units waiting for assimilation as of this writing.


Makes you wonder where the half million jobs went; did someone save them or just delete the queue when restarting the process?
ID: 20205 · Report as offensive     Reply Quote
Jesse Viviano

Send message
Joined: 29 Nov 14
Posts: 35
Credit: 781,933
RAC: 0
Message 20206 - Posted: 15 Feb 2015, 4:02:54 UTC - in response to Message 20205.  

According to http://www.cosmologyathome.org/forum_thread.php?id=7208&nowrap=true#20193 which was written by someone who is not a native writer of English, it appears that the assimilator was turned on. If my guess is correct, the finished work units have been sent to some sort of post processing like being copied to a database, copied to long term storage, and/or being fed into the work unit generator as part of a feedback loop. After this, they can be deleted because their jobs are done.
ID: 20206 · Report as offensive     Reply Quote
Jesse Viviano

Send message
Joined: 29 Nov 14
Posts: 35
Credit: 781,933
RAC: 0
Message 20214 - Posted: 15 Mar 2015, 14:38:20 UTC

The assimilator is down again.
ID: 20214 · Report as offensive     Reply Quote
Profile C@H Sceptic

Send message
Joined: 23 Jan 15
Posts: 17
Credit: 101,772
RAC: 0
Message 20215 - Posted: 17 Mar 2015, 22:49:45 UTC - in response to Message 20214.  

The assimilator is down again.


Hmm, not an Urgent Problem, been here since the project started.
ID: 20215 · Report as offensive     Reply Quote
Previous · 1 . . . 6 · 7 · 8 · 9

Forums : Technical Support : URGENT PROBLEMS THREAD (2009 and after)