Unit Verified But Status is Still Pending
Author | Message |
---|---|
arcturus Joined: 28 Aug 07 Posts: 35 Credit: 666,900 RAC: 0 |
http://www.cosmologyathome.org/workunit.php?wuid=732100 http://www.cosmologyathome.org/workunit.php?wuid=724700 http://www.cosmologyathome.org/workunit.php?wuid=723981 http://www.cosmologyathome.org/workunit.php?wuid=723887 http://www.cosmologyathome.org/workunit.php?wuid=722973 http://www.cosmologyathome.org/workunit.php?wuid=722910 http://www.cosmologyathome.org/workunit.php?wuid=722891 http://www.cosmologyathome.org/workunit.php?wuid=719822 http://www.cosmologyathome.org/workunit.php?wuid=717999 http://www.cosmologyathome.org/workunit.php?wuid=717236 http://www.cosmologyathome.org/workunit.php?wuid=716501 http://www.cosmologyathome.org/workunit.php?wuid=715030 http://www.cosmologyathome.org/workunit.php?wuid=713611 http://www.cosmologyathome.org/workunit.php?wuid=711137 http://www.cosmologyathome.org/workunit.php?wuid=708313 http://www.cosmologyathome.org/workunit.php?wuid=707925 http://www.cosmologyathome.org/workunit.php?wuid=670314 |
Jim Weisert Joined: 1 Oct 07 Posts: 3 Credit: 194,536 RAC: 0 |
http://www.cosmologyathome.org/workunit.php?wuid=732100 I didn't check all of these WUs, but the ones I looked at are probably situations in which either your result or your wingman's result failed to upload properly, so the results did not validate. Eventually, a 3rd result will be used for validation. There appears to be a bug in the BOINC client that caused uploads to be aborted sometime during the Cosmology@Home server failure. See this thread |
Volunteer moderator Volunteer tester Joined: 15 Jun 07 Posts: 345 Credit: 50,500 RAC: 0 |
There appears to be a bug in the BOINC client that caused uploads to be aborted sometime during the Cosmology@Home server failure. See this thread It's not that simple. It's a problem with one of the file_upload_handlers on the project, combined with the client timing out and giving up on the upload after a couple of retries. The client side has now been adjusted so that an upload problem no longer makes the client give up for good, but is handled as a transient error (as is anything to do with uploading/downloading). That still leaves it up to the project to make sure they have sufficient file_upload_handlers in place. |
arcturus Joined: 28 Aug 07 Posts: 35 Credit: 666,900 RAC: 0 |
OK, fine. When can we expect points to be granted (or not)? |
Volunteer moderator Project administrator Project developer Joined: 1 Apr 07 Posts: 662 Credit: 13,742 RAC: 0 |
You have an Athlon XP CPU, right? Are you overclocking it (or anything on your machine, for that matter)? The reason I ask is that a lot of your results are considered "inconclusive" right now, meaning that your result and the other one did not agree to the desired accuracy. This doesn't mean that it was necessarily your machine's fault (nor does it necessarily mean that you won't get credit when the next result is compared), but it can be if you're overclocking or have increased your RAM voltage or something along those lines. Scott Kruger Project Administrator, Cosmology@Home |
Hefto99 Joined: 24 Jun 07 Posts: 7 Credit: 6,391,524 RAC: 0 |
You have an Athlon XP CPU, right? Are you overclocking it (or anything on your machine, for that matter)? Scott, I think it is not related to OC or anything else. I have the same problem on 2 machines: all results uploaded during the recent server problems and reported Nov 15 are invalid. |
arcturus Joined: 28 Aug 07 Posts: 35 Credit: 666,900 RAC: 0 |
You have an Athlon XP CPU, right? Are you overclocking it (or anything on your machine, for that matter)? Yes, it's an overclocked XP, but why *only* those units submitted at 1:39:26 UTC on the 16th? All other units submitted subsequently (and previously) have received points and the PC remains untouched, so it seems unlikely the OC is relevant in this case. |
Joined: 28 Aug 07 Posts: 169 Credit: 2,093,665 RAC: 3,105 |
I have the same problem with my work units uploaded on the 15th, and none of my machines are overclocked; all are standard. It is related to the server problems that were fixed on the 15th: the constant retries to upload seem to have caused some irregularities in the returned results, possibly by giving up on the upload, so that when the WU was uploaded not all files were included? I have over 35 work units affected by this. All work prior and since is not having problems and is being validated OK. This is about the 3rd thread with people having this problem, so it is not a problem isolated to one user. If you check your database you will see lots and lots of work units that have not been verified on the 15th/16th after they uploaded when the server came back online. |
cwhyl Joined: 26 Jul 07 Posts: 3 Credit: 296,700 RAC: 0 |
You have an Athlon XP CPU, right? Are you overclocking it (or anything on your machine, for that matter)? Yep, my lost results have nothing to do with overclocking. I run BOINC with a command window + manager and saw the exact same things as Matthias Lehmkuhl reported in the other thread: "giving up on uploading...files not found" and then the WU state switched from "Uploading" to "Ready to report" in the manager after numerous tries. |
Volunteer moderator Project administrator Project developer Joined: 1 Apr 07 Posts: 662 Credit: 13,742 RAC: 0 |
Just a thought =) I'm turning on some debugging features of the validator to see which files aren't validating and why. Hopefully I'll have some answers soon. Scott Kruger Project Administrator, Cosmology@Home |
Joined: 1 Jul 07 Posts: 37 Credit: 208,284 RAC: 0 |
Just a thought =) Scott, if it helps, the problem will be that these results have some or all of the output files missing. The clients affected will probably be 5.10.21 and later, and the work units will have been downloaded prior to the outage and reported back to the scheduler after the outage. I suspect that the project connected up a temporary server on the 12th to inform users what the problem was. This server would not have had a file_upload_handler program installed, and this caused these later clients to think that they had contacted the server and that the files were no longer needed, so they auto-deleted any file that they tried to upload. The code has now been changed and should be in the next release after 5.10.30. Dave. |
Joined: 1 Jul 07 Posts: 37 Credit: 208,284 RAC: 0 |
Scott, A final thought before I go to bed. What happens if two clients have reported the same WU but have both failed to upload the output files? Will the validator spot the lack of output files and invalidate them both? If not then you could have a real headache trying to sort them out of your database! Dave. |
Volunteer moderator Project administrator Project developer Joined: 1 Apr 07 Posts: 662 Credit: 13,742 RAC: 0 |
Scott, A final thought before I go to bed. What happens if two clients have reported the same WU but have both failed to upload the output files? Will the validator spot the lack of output files and invalidate them both? If not then you could have a real headache trying to sort them out of your database! Dave. If there aren't exactly 6 output files, the result is invalidated. If both results fail to have exactly 6 output files, then both are invalidated. Anyway, the validator shows that some of the results are failing because CAMB will output "NaN" (not a number) instead of real values in a number of the output files, meaning that the integrations have failed. This seems to be an issue with the compiler; we compiled CAMB with ifort 10 instead of ifort 9 like usual. We might try compiling it the other way around, but that might cause more problems than it solves. As it stands, only about 1% of the results end up being invalidated (for the last couple of weeks, we had around 250 invalid results out of 25000, which is pretty good, considering). I'll keep you posted, though. Scott Kruger Project Administrator, Cosmology@Home |
arcturus Joined: 28 Aug 07 Posts: 35 Credit: 666,900 RAC: 0 |
Any update? |
Volunteer moderator Project administrator Project developer Joined: 1 Apr 07 Posts: 662 Credit: 13,742 RAC: 0 |
Any update? I've been going through the invalidated results and I can't find any real correlation between them, other than that certain hosts seem to send back invalid results most of the time. However, the invalidation rate has gone down to about 0.4% in the last couple of weeks, so I think I'd rather not fix what isn't that broken. Scott Kruger Project Administrator, Cosmology@Home |
arcturus Joined: 28 Aug 07 Posts: 35 Credit: 666,900 RAC: 0 |
How is this thread different from the thread HERE, where that guy seems to have a similar issue and you mention a fix? |
Joined: 28 Aug 07 Posts: 169 Credit: 2,093,665 RAC: 3,105 |
Any update? G'Day Scott, The validation problem that I understood this thread to be most concerned about is all the results from the 15th (possibly the 16th for some) of November. This is when we were able to upload our results after the server went down. Very few of those results have validated for me; even ones I had completed before the crash are still sitting there due to another computer returning its result on the 15th. So currently the validator is going great, but back on the 15th/16th it was sick. I have not noticed any of my results from the 15th having been resent to a 3rd or 4th host (and a couple will be 5th and 6th) as yet. (I suppose I am trying not to lose 3,900 cobblestones (39 results x 100 granted credits)). Thanks Conan, Keep smiling, it makes others wonder what you have been up to. |
Volunteer moderator Project administrator Project developer Joined: 1 Apr 07 Posts: 662 Credit: 13,742 RAC: 0 |
Any update? Ahh... OK. For whatever reason, I assumed this was a general problem with validation, not just with the ones on the 15/16. I can get that fixed fairly easily. I will make those results valid, grant them credit, and then cancel them so that they don't have to get sent out again. This will have to wait until tomorrow evening, though, since I'm a bit busy with school work right now. Scott Kruger Project Administrator, Cosmology@Home |
Joined: 28 Aug 07 Posts: 169 Credit: 2,093,665 RAC: 3,105 |
Any update? That will be fine, thanks Scott, much appreciated. |
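
For readers following along, the validation rules Scott describes earlier in the thread (a result needs exactly 6 output files, NaN values mean the integration failed, and two results must agree to the desired accuracy) can be sketched roughly as below. This is only an illustration in Python, not the project's actual validator, which is not shown in the thread. The `Result` layout (one list of numbers per output file), the function names, and the tolerance `REL_TOL` are all assumptions made up for this example.

```python
import math
from typing import List

# Hypothetical layout: a result is a list of output files,
# each already parsed into a list of numbers.
Result = List[List[float]]

EXPECTED_OUTPUT_FILES = 6  # from the thread: exactly 6 output files are required
REL_TOL = 1e-5             # assumed tolerance; the real "desired accuracy" is not stated


def is_well_formed(result: Result) -> bool:
    """Reject results with the wrong number of files or with NaN values."""
    if len(result) != EXPECTED_OUTPUT_FILES:
        return False
    return not any(math.isnan(v) for output_file in result for v in output_file)


def results_agree(a: Result, b: Result, rel_tol: float = REL_TOL) -> bool:
    """Two results agree if both are well formed and match value by value."""
    if not (is_well_formed(a) and is_well_formed(b)):
        return False
    for file_a, file_b in zip(a, b):
        if len(file_a) != len(file_b):
            return False
        if not all(math.isclose(x, y, rel_tol=rel_tol) for x, y in zip(file_a, file_b)):
            return False
    return True


def invalid_rate(invalid: int, total: int) -> float:
    """Fraction of results invalidated, e.g. 250 out of 25000."""
    return invalid / total


# Made-up sample data for illustration only.
good = [[1.0, 2.0] for _ in range(6)]
close = [[1.000001, 2.0] for _ in range(6)]
truncated = good[:4]                          # some output files never uploaded
with_nan = good[:5] + [[float("nan"), 2.0]]   # a failed integration

print(results_agree(good, close))
print(is_well_formed(truncated))
print(is_well_formed(with_nan))
print(f"{invalid_rate(250, 25000):.1%}")
```

Under these assumptions, a result that lost output files during the failed uploads of the 15th/16th is rejected by the file-count check before any numerical comparison happens, which matches what several posters report seeing.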