Advanced search

Forums : Technical Support : download of params_101908_003016_0.ini failed since 2 days ?
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Barraud Denis
Avatar

Send message
Joined: 5 Oct 07
Posts: 4
Credit: 194,800
RAC: 0
Message 7556 - Posted: 2 Nov 2008, 22:12:54 UTC
Last modified: 2 Nov 2008, 22:21:31 UTC

One of new WU failed to download the ini file, shoud i abort this WU ? Do You have a problem ?
xp 32 bits , boinc 6.3.19
Messages :
02/11/2008 23:06:52|Cosmology@Home|Started download of params_101908_003016_0.ini
02/11/2008 23:06:53|Cosmology@Home|Temporarily failed download of params_101908_003016_0.ini: HTTP error
02/11/2008 23:06:53|Cosmology@Home|Backing off 1 hr 38 min 18 sec on download of params_101908_003016_0.ini
ID: 7556 · Report as offensive     Reply Quote
Jim Weisert

Send message
Joined: 1 Oct 07
Posts: 3
Credit: 194,536
RAC: 0
Message 7575 - Posted: 6 Nov 2008, 0:08:25 UTC - in response to Message 7556.  

One of new WU failed to download the ini file, shoud i abort this WU ? Do You have a problem ?
xp 32 bits , boinc 6.3.19
Messages :
02/11/2008 23:06:52|Cosmology@Home|Started download of params_101908_003016_0.ini
02/11/2008 23:06:53|Cosmology@Home|Temporarily failed download of params_101908_003016_0.ini: HTTP error
02/11/2008 23:06:53|Cosmology@Home|Backing off 1 hr 38 min 18 sec on download of params_101908_003016_0.ini


Similar problem, different file (although this just started today for me).
05/11/2008 2:10:07 PM|Cosmology@Home|Temporarily failed download of params_102208_150208_1.ini: http error
[...]
05/11/2008 4:48:01 PM|Cosmology@Home|Temporarily failed download of params_102208_150208_1.ini: http error
ID: 7575 · Report as offensive     Reply Quote
adrianxw
Avatar

Send message
Joined: 25 Aug 07
Posts: 49
Credit: 302,769
RAC: 0
Message 7579 - Posted: 6 Nov 2008, 19:43:02 UTC
Last modified: 6 Nov 2008, 19:45:07 UTC

06/11/2008 20:38:54|Cosmology@Home|Started download of params_101908_011851_0.ini
06/11/2008 20:38:55|Cosmology@Home|Temporarily failed download of params_101908_011851_0.ini: http error
06/11/2008 20:38:55|Cosmology@Home|Backing off 2 hr 34 min 51 sec on download of params_101908_011851_0.ini

...me too. I aborted it.
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.
ID: 7579 · Report as offensive     Reply Quote
Barraud Denis
Avatar

Send message
Joined: 5 Oct 07
Posts: 4
Credit: 194,800
RAC: 0
Message 7580 - Posted: 7 Nov 2008, 0:55:00 UTC - in response to Message 7579.  

06/11/2008 20:38:54|Cosmology@Home|Started download of params_101908_011851_0.ini
06/11/2008 20:38:55|Cosmology@Home|Temporarily failed download of params_101908_011851_0.ini: http error
06/11/2008 20:38:55|Cosmology@Home|Backing off 2 hr 34 min 51 sec on download of params_101908_011851_0.ini

...me too. I aborted it.



==> An others that failed download :
07/11/2008 01:47:41|Cosmology@Home|Temporarily failed download of params_101908_002043_0.ini: HTTP error
07/11/2008 01:47:41|Cosmology@Home|Backing off 3 hr 1 min 6 sec on download of params_101908_002043_0.ini


==> I ask me if the serial WU named : params_101908_*.INI are all Bugged ?
ID: 7580 · Report as offensive     Reply Quote
adrianxw
Avatar

Send message
Joined: 25 Aug 07
Posts: 49
Credit: 302,769
RAC: 0
Message 7585 - Posted: 7 Nov 2008, 6:57:03 UTC

07/11/2008 07:49:42|Cosmology@Home|Started download of params_102608_201937_0.ini
07/11/2008 07:49:43|Cosmology@Home|Temporarily failed download of params_102608_201937_0.ini: http error
07/11/2008 07:49:43|Cosmology@Home|Backing off 2 hr 28 min 27 sec on download of params_102608_201937_0.ini


... and again...
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.
ID: 7585 · Report as offensive     Reply Quote
Profile Jord
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 15 Jun 07
Posts: 345
Credit: 50,500
RAC: 0
Message 7587 - Posted: 7 Nov 2008, 12:01:07 UTC

Just caught this one stuck in downloading:
07-Nov-08 12:25:32|Cosmology@Home|Temporarily failed download of params_110108_110240_0.ini: HTTP error

ID: 7587 · Report as offensive     Reply Quote
adrianxw
Avatar

Send message
Joined: 25 Aug 07
Posts: 49
Credit: 302,769
RAC: 0
Message 7591 - Posted: 7 Nov 2008, 16:12:50 UTC

07/11/2008 17:08:11|Cosmology@Home|Started download of params_102608_203255_1.ini
07/11/2008 17:08:14|Cosmology@Home|Temporarily failed download of params_102608_203255_1.ini: http error
07/11/2008 17:08:14|Cosmology@Home|Backing off 3 hr 3 min 31 sec on download of params_102608_203255_1.ini


... tum-ti-tum-ti-tum.....
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.
ID: 7591 · Report as offensive     Reply Quote
Profile Jord
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 15 Jun 07
Posts: 345
Credit: 50,500
RAC: 0
Message 7593 - Posted: 7 Nov 2008, 17:13:14 UTC

07-Nov-08 18:06:30|Cosmology@Home|[sched_op_debug] Reason: Unrecoverable error for result wu_102608_203853_0_2_1 (WU download error: couldn\'t get input files:<file_xfer_error> <file_name>params_102608_203853_0.ini</file_name> <error_code>-197</error_code> <error_message>user requested transfer abort</error_message></file_xfer_error>)


Now that\'s an error... :-)
ID: 7593 · Report as offensive     Reply Quote
Ruud van der Kroef

Send message
Joined: 25 Aug 07
Posts: 12
Credit: 3,572,642
RAC: 828
Message 7614 - Posted: 9 Nov 2008, 22:16:57 UTC - in response to Message 7593.  

I am seeing this error already for weeks.
If you look at my current Tasks for user list, you will find 35 WU\'s with an Outcome=Client error and Client state=Downloading or Aborted by user. I think at least the Aborted by user WU\'s have indeed been aborted by me while they were trying to download, sometimes already for days.

If you look at the list, it seems that the WU\'s that fail to download come in bursts.
I also noticed, when looking on BOINC Manager\'s Transfers tab, the transfer always seems to halt at about 19%.

As I already mentioned, the WU\'s were sometimes trying to download already for days. This happened especially the first couple of times I noticed this problem.
If I recall I think that I even waited for the deadline to pass before I killed the download.
I have never seen one of these WU\'s succeed in downloading.

Another thing I just noticed white checking one of my computers is, looking on BOINC Manager\'s Transfers tab, there is a C@H WU trying to download and sits at 19.06%. The filename is params_101808_232532_0.ini.
If I look on the Tasks tab, the WU is not there!!
When I check the Workunit details of this computer I cannot really find it. The only one that could be this one is 11355177.

Oh, and there is yet another thing:
I seem to have noticed that when you have such a C@H WU that is repeately trying to download, it is blocking all other downloads.
This means, when you have a single CPU system, and a C@H WU is trying to download, no other WU\'s will be downloaded, even not for other projects.
I have no proof for it yet.

In any case, the current state of the project is, that it needs a lot of operator intervention and is not really suited to leave it running unattended.

My (well, a little bit more than) 2 cents.

Greetings,
Ruud
ID: 7614 · Report as offensive     Reply Quote
Ruud van der Kroef

Send message
Joined: 25 Aug 07
Posts: 12
Credit: 3,572,642
RAC: 828
Message 7615 - Posted: 9 Nov 2008, 22:40:10 UTC - in response to Message 7614.  

Meanwhile I found this thread: errors on download.

From the title it was not clear that this tread is about this specific problem.

Anyway, I noticed that the effects of the failing downloads: halt at 19% and blocking other donwloads, also have seen by others.

Sorry for the dubbling.

Ruud
ID: 7615 · Report as offensive     Reply Quote
Anshul Kanakia
Volunteer moderator
Project administrator
Project developer

Send message
Joined: 30 Sep 08
Posts: 70
Credit: 164,860
RAC: 0
Message 7617 - Posted: 10 Nov 2008, 11:44:40 UTC - in response to Message 7615.  

Meanwhile I found this thread: errors on download.

From the title it was not clear that this tread is about this specific problem.

Anyway, I noticed that the effects of the failing downloads: halt at 19% and blocking other donwloads, also have seen by others.

Sorry for the dubbling.

Ruud


Hi everyone,
In case you have not already noticed from all the messages I put up, this WU download error has had me dumbfounded for the last week or so. I run Cosmology@Home on my machine at home too and I have had several WUs fail downloading on me. A certain trend seems to show up on these failing WU\'s though and I wanted to your help to confirm. It looks to me like this happens on WUs that have been lying around in the db after creation for a few days and not picked up. So you will typically not see this on newer ones, ie. No WUs created within the past week should fail download.
You can check this by looking at the date of creation of the WU which is part of the filename: params___<#>.ini
I would greatly appreciate your help in this matter. Please let me know as soon as possible if a WU created within the week fails download. Scott and I are working our level best to get this problem fixed as well as getting the new validator up and running. Thanks again.

Anshul
ID: 7617 · Report as offensive     Reply Quote
Phoneman1

Send message
Joined: 5 Nov 07
Posts: 113
Credit: 3,100,327
RAC: 0
Message 7618 - Posted: 10 Nov 2008, 12:33:29 UTC - in response to Message 7617.  


I would greatly appreciate your help in this matter. Please let me know as soon as possible if a WU created within the week fails download. Scott and I are working our level best to get this problem fixed as well as getting the new validator up and running.

Hi Anshul,

Problem is still occuring today. In addition to the 3 I reported yesterday in the errors on download thread there have been 5 this morning in the 11xx08 range including 2 from 110708. You are seeing more % of the old ones causing triouble as most but not all the good ones have been crunched. I successfully got 5 102608 units this morning because the first cruncher never replied.

Whilst you and Scott work on fixing the problem you could inhibit re-sends, that will reduce the number tied up in downloads for all and give you space to find the fix.

Phoneman1
ID: 7618 · Report as offensive     Reply Quote
TCU Computer Science

Send message
Joined: 28 Oct 07
Posts: 3
Credit: 9,446,460
RAC: 0
Message 7622 - Posted: 11 Nov 2008, 2:51:37 UTC - in response to Message 7617.  

No WUs created within the past week should fail download.
You can check this by looking at the date of creation of the WU which is part of the filename: params_<date of creation>_<wu id>_<#>.ini
I would greatly appreciate your help in this matter. Please let me know as soon as possible if a WU created within the week fails download. Scott and I are working our level best to get this problem fixed as well as getting the new validator up and running. Thanks again.

Anshul


params_110508_151153_1.ini
params_110508_151250_0.ini
params_110508_151717_0.ini
params_110508_152105_1.ini
params_110708_130330_1.ini
params_110708_130500_1.ini
params_110708_130651_0.ini
params_110708_130651_2.ini
params_110708_130640_0.ini
params_110708_130649_0.ini

ID: 7622 · Report as offensive     Reply Quote
TCU Computer Science

Send message
Joined: 28 Oct 07
Posts: 3
Credit: 9,446,460
RAC: 0
Message 7629 - Posted: 13 Nov 2008, 2:20:50 UTC - in response to Message 7617.  

Please let me know as soon as possible if a WU created within the week fails download.


params_111108_210459_0.ini
ID: 7629 · Report as offensive     Reply Quote
Viking69
Avatar

Send message
Joined: 26 Jun 07
Posts: 5
Credit: 110,787
RAC: 0
Message 7630 - Posted: 13 Nov 2008, 2:45:40 UTC

my last WU failure

11/12/2008 6:23:45 PM|Cosmology@Home|Temporarily failed download of params_110108_110431_0.ini: http error

so yes, this is a week old, but as I see in this thread, it is not always consistant.
ID: 7630 · Report as offensive     Reply Quote
sygopet

Send message
Joined: 2 Aug 08
Posts: 27
Credit: 204,771
RAC: 0
Message 7631 - Posted: 13 Nov 2008, 9:48:26 UTC - in response to Message 7617.  



Hi everyone,
. . . A certain trend seems to show up on these failing WU\'s though and I wanted to your help to confirm. It looks to me like this happens on WUs that have been lying around in the db after creation for a few days and not picked up. So you will typically not see this on newer ones, ie. No WUs created within the past week should fail download.
Anshul


Sorry, Anshul - I see no correlation between \"age\" of workunit and the problems with downloading. Newly created units are, in my experience, just as prone to cause trouble. There will, indeed, be a greater proportion of failing units which are a week (or more) old but that is simply because the same unit has probably been around the system several times and, unless the user is in a position to spot, and abort, such units quickly there could be up to 10 days before it is timed out and resent to the next \"victim\". A temporary reduction to one failure (causes the unit to be removed from circulation) would cut down on the problems experienced considerably.
It seems to me that you have access to the database that should show if failing units are generated on particular days, or are from particular batches, or are truly random.
ID: 7631 · Report as offensive     Reply Quote
Barraud Denis
Avatar

Send message
Joined: 5 Oct 07
Posts: 4
Credit: 194,800
RAC: 0
Message 7632 - Posted: 13 Nov 2008, 9:53:11 UTC - in response to Message 7631.  
Last modified: 13 Nov 2008, 9:57:46 UTC

[quote][quote]

two other WU fallling to be download.

13/11/2008 10:48:12|Cosmology@Home|Temporarily failed download of params_110908_172127_0.ini: HTTP error
13/11/2008 10:48:12|Cosmology@Home|Backing off 1 hr 1 min 10 sec on download of params_110908_172127_0.ini

13/11/2008 10:51:36|Cosmology@Home|Temporarily failed download of params_101908_013803_0.ini: HTTP error
13/11/2008 10:51:36|Cosmology@Home|Backing off 1 min 0 sec on download of params_101908_013803_0.ini

I sugest to add in log trace the http error mentionned inside, perhaps an bad http header that failed the download.
ID: 7632 · Report as offensive     Reply Quote
Phoneman1

Send message
Joined: 5 Nov 07
Posts: 113
Credit: 3,100,327
RAC: 0
Message 7633 - Posted: 13 Nov 2008, 10:53:44 UTC - in response to Message 7631.  

A temporary reduction to one failure (causes the unit to be removed from circulation) would cut down on the problems experienced considerably.
It seems to me that you have access to the database that should show if failing units are generated on particular days, or are from particular batches, or are truly random.


I agree inhibiting re-sends pending a full fix is justified here.

This morning I downloaded a total of 67 tasks of these 9 failed. 1 of these was dated November 9th - 2 days after the change mentioned earlier.
The Nov 9th wu that failed to download

All 9 that failed had been \"round the houses\" so to speak before I got them. Some lying abandoned for 10 days by their temporary \"owners\".

Inhibiting re-sends will also stop units which did download Ok but were not reported in time from being re-sent of course. But if the policy had een in place this morning I would have got the same 58 units that did download - they were all suffix 0.

Phoneman1
ID: 7633 · Report as offensive     Reply Quote
Ruud van der Kroef

Send message
Joined: 25 Aug 07
Posts: 12
Credit: 3,572,642
RAC: 828
Message 7635 - Posted: 13 Nov 2008, 13:29:03 UTC - in response to Message 7631.  



Hi everyone,
. . . A certain trend seems to show up on these failing WU\'s though and I wanted to your help to confirm. It looks to me like this happens on WUs that have been lying around in the db after creation for a few days and not picked up. So you will typically not see this on newer ones, ie. No WUs created within the past week should fail download.
Anshul


Sorry, Anshul - I see no correlation between \"age\" of workunit and the problems with downloading. Newly created units are, in my experience, just as prone to cause trouble. There will, indeed, be a greater proportion of failing units which are a week (or more) old but that is simply because the same unit has probably been around the system several times and, unless the user is in a position to spot, and abort, such units quickly there could be up to 10 days before it is timed out and resent to the next \"victim\". A temporary reduction to one failure (causes the unit to be removed from circulation) would cut down on the problems experienced considerably.
It seems to me that you have access to the database that should show if failing units are generated on particular days, or are from particular batches, or are truly random.

I have a whole list of failing WUs:

params_110108_112743_0.ini..\\
params_102208_145705_1.ini..| synstar04
params_110108_110516_0.ini..|
params_101908_010406_0.ini../
params_101908_004058_1.ini
params_102608_202237_0.ini
params_102608_215836_0.ini
params_102608_201617_1.ini
params_101508_182911_0.ini
params_102608_194839_1.ini..\\
params_102608_221244_0.ini..|
params_110108_110410_1.ini...> synstar07
params_102608_195322_0.ini..|
params_102608_221236_0.ini../
params_102608_221037_0.ini
params_102608_201533_0.ini

You can see 9 are of the 102608 batch, 3 of the 110108, and 2 of the 101908 batch.

I also placed brackets at the end of 2 groups.
I had aborted the first one of each group at the corresponding computer, and waited for the project to update. The next would try to download and failed immediately. I aborted this one and waited again.
You can see this happened 3 times on the synstar04 and 4 times on the synstar07 before one downloaded successfully.

Ruud
ID: 7635 · Report as offensive     Reply Quote
Anshul Kanakia
Volunteer moderator
Project administrator
Project developer

Send message
Joined: 30 Sep 08
Posts: 70
Credit: 164,860
RAC: 0
Message 7638 - Posted: 13 Nov 2008, 18:21:45 UTC

Please refer to Message 7637 - Posted 13 Nov 2008 18:19:25 UTC in the Permission problem (download errors) thread.
ID: 7638 · Report as offensive     Reply Quote
1 · 2 · Next

Forums : Technical Support : download of params_101908_003016_0.ini failed since 2 days ?