Advanced search

Forums : Technical Support : URGENT Problems Discussion Thread
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 14 · 15 · 16 · 17 · 18 · Next

AuthorMessage
Profile UBT - Rick Horn
Volunteer tester
Avatar

Send message
Joined: 8 Jun 07
Posts: 23
Credit: 473,547
RAC: 0
Message 6166 - Posted: 13 May 2008, 14:40:55 UTC - in response to Message 6165.  

Detaching and then reloading hasn\'t solved the glitch either, so suspending the project here :(


Nope, neither has re-booting, or resetting the project. It all seems to be the \'051...\' work units, I still have some of the \'050...\' and they\'re fine.

I\'m not suspending, leaving them all open if the problem is resolved quickly enough I won\'t miss a beat.

The lack of stats output is another issue too.



If we abort the 051s and we don`t get any more WUs, we`ll be out in the cold, and won`t get any credits from.
I think I`ll just carry on and see what happens.
ID: 6166 · Report as offensive
Greg C. TNO

Send message
Joined: 6 Apr 08
Posts: 5
Credit: 1,033,430
RAC: 0
Message 6167 - Posted: 13 May 2008, 14:51:53 UTC - in response to Message 6166.  

Detaching and then reloading hasn\'t solved the glitch either, so suspending the project here :(


Nope, neither has re-booting, or resetting the project. It all seems to be the \'051...\' work units, I still have some of the \'050...\' and they\'re fine.

I\'m not suspending, leaving them all open if the problem is resolved quickly enough I won\'t miss a beat.

The lack of stats output is another issue too.



If we abort the 051s and we don`t get any more WUs, we`ll be out in the cold, and won`t get any credits from.
I think I`ll just carry on and see what happens.


Pretty much what I\'m doing, letting everything run. I\'m also using the time to see if Ibercivis is less buggy now. The problems here are being adressed already, (what with that news posting right on the main page.) I hope it isn\'t something too horrible to fix. :-)



ID: 6167 · Report as offensive
Profile Jayargh
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 25 Jun 07
Posts: 508
Credit: 2,282,158
RAC: 0
Message 6170 - Posted: 13 May 2008, 17:20:41 UTC

Seems that every time I get work units from C@H that end in computation errors it trashes work across a wide range of other projects on a number of my hosts....this latest round trashed a score of Milkyways and 1 CPDN model which is pretty frustrating considering I was 30% into a 600hr workunit. :( Be aware that sending out bad work has a greater global effect than just in here.
ID: 6170 · Report as offensive
Greg C. TNO

Send message
Joined: 6 Apr 08
Posts: 5
Credit: 1,033,430
RAC: 0
Message 6171 - Posted: 13 May 2008, 18:08:51 UTC - in response to Message 6170.  

Seems that every time I get work units from C@H that end in computation errors it trashes work across a wide range of other projects on a number of my hosts....this latest round trashed a score of Milkyways and 1 CPDN model which is pretty frustrating considering I was 30% into a 600hr workunit. :( Be aware that sending out bad work has a greater global effect than just in here.


Ouch. Kind of like when LHC went rabid last month and was crashing all my boxen...
ID: 6171 · Report as offensive
computerguy09

Send message
Joined: 6 Oct 07
Posts: 5
Credit: 1,177,089
RAC: 1,320
Message 6172 - Posted: 13 May 2008, 19:44:48 UTC - in response to Message 6170.  
Last modified: 13 May 2008, 19:47:04 UTC

Seems that every time I get work units from C@H that end in computation errors it trashes work across a wide range of other projects on a number of my hosts....this latest round trashed a score of Milkyways and 1 CPDN model which is pretty frustrating considering I was 30% into a 600hr workunit. :( Be aware that sending out bad work has a greater global effect than just in here.


I find it difficult to understand how the bad WU in Cosmo effects other projects, unless:

1) Your system crashes and corrupts files all over your disk
2) You intentionally uninstall BOINC, detach from all projects, and cause other projects to fail.
3) You don\'t have \"Keep applications in Memory\" checked in your BOINC preferences
4) Some combination of the above.

I\'ve been running BOINC for quite some time, on 10-15 systems currently, and have at least a basic understanding of how BOINC works, and have seen lots of bad things happen to a single project (SETI, LHC and others), and haven\'t had any problem with one project spilling over to the others on the same box. I had a rash of the \'051 errors today on several boxes, and none of my other projects had any hiccups at all.

These comments are only based on my experience and not intended to reflect badly on others....

Mark
ID: 6172 · Report as offensive
Profile Mixer[SG]

Send message
Joined: 22 Feb 08
Posts: 1
Credit: 102,830
RAC: 0
Message 6173 - Posted: 13 May 2008, 20:10:59 UTC

It seems that only the workunits \"wu_051208...\" have errors.
ID: 6173 · Report as offensive
Profile Copycat-Digital for WCG*
Avatar

Send message
Joined: 25 Sep 07
Posts: 17
Credit: 1,471,530
RAC: 0
Message 6174 - Posted: 13 May 2008, 20:18:55 UTC
Last modified: 13 May 2008, 20:39:03 UTC

Getting massage:

2008/05/13 09:41:26 PM|Cosmology@Home|Restarting task wu_051208_203621_8_3 using camb version 212
2008/05/13 09:41:26 PM|Cosmology@Home|Restarting task wu_051208_203449_34_3 using camb version 212
2008/05/13 09:41:27 PM|Cosmology@Home|Task wu_051208_203621_8_3 exited with zero status but no \'finished\' file
2008/05/13 09:41:27 PM|Cosmology@Home|If this happens repeatedly you may need to reset the project.
2008/05/13 09:41:27 PM|Cosmology@Home|Task wu_051208_203449_34_3 exited with zero status but no \'finished\' file
2008/05/13 09:41:27 PM|Cosmology@Home|If this happens repeatedly you may need to reset the project.

Rebooted - detatched twice> same error message
This is on a AMD 4200+ box With WinXP
All workunits issued 051208 do this
Aborted all wu before the 13 th
Workunits issued on 051308 seems to run OK
Intelbox (e2200) runs OK but all workunits downloaded is 051308

Same on Intel box
ID: 6174 · Report as offensive
Profile Jayargh
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 25 Jun 07
Posts: 508
Credit: 2,282,158
RAC: 0
Message 6175 - Posted: 13 May 2008, 20:59:22 UTC - in response to Message 6172.  

Seems that every time I get work units from C@H that end in computation errors it trashes work across a wide range of other projects on a number of my hosts....this latest round trashed a score of Milkyways and 1 CPDN model which is pretty frustrating considering I was 30% into a 600hr workunit. :( Be aware that sending out bad work has a greater global effect than just in here.


I find it difficult to understand how the bad WU in Cosmo effects other projects, unless:

1) Your system crashes and corrupts files all over your disk
2) You intentionally uninstall BOINC, detach from all projects, and cause other projects to fail.
3) You don\'t have \"Keep applications in Memory\" checked in your BOINC preferences
4) Some combination of the above.

I\'ve been running BOINC for quite some time, on 10-15 systems currently, and have at least a basic understanding of how BOINC works, and have seen lots of bad things happen to a single project (SETI, LHC and others), and haven\'t had any problem with one project spilling over to the others on the same box. I had a rash of the \'051 errors today on several boxes, and none of my other projects had any hiccups at all.

These comments are only based on my experience and not intended to reflect badly on others....

Mark


None of the 4 listed apply....most of the problems occur on my Linux boxes...Cosmo somehow corrupts the Boinc client and I don\'t know how... been on Boinc for 4 years now and this is the only project it occurs on other than when LHC crashed everyone as Greg mentioned.
ID: 6175 · Report as offensive
Profile Scott
Volunteer moderator
Project administrator
Project developer
Avatar

Send message
Joined: 1 Apr 07
Posts: 662
Credit: 13,742
RAC: 0
Message 6176 - Posted: 13 May 2008, 21:33:34 UTC

Just cancelled the bad WUs.
Scott Kruger
Project Administrator, Cosmology@Home
ID: 6176 · Report as offensive
Profile Scott
Volunteer moderator
Project administrator
Project developer
Avatar

Send message
Joined: 1 Apr 07
Posts: 662
Credit: 13,742
RAC: 0
Message 6347 - Posted: 18 Jun 2008, 6:06:07 UTC

Going to take a look at the no work issue tomorrow morning.
Scott Kruger
Project Administrator, Cosmology@Home
ID: 6347 · Report as offensive
Profile Scott
Volunteer moderator
Project administrator
Project developer
Avatar

Send message
Joined: 1 Apr 07
Posts: 662
Credit: 13,742
RAC: 0
Message 6356 - Posted: 18 Jun 2008, 19:45:01 UTC

6/18/2008 1:43:11 PM|Cosmology@Home|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 11 completed tasks
6/18/2008 1:43:26 PM|Cosmology@Home|Scheduler request succeeded: got 0 new tasks
6/18/2008 1:43:26 PM|Cosmology@Home|Message from server: Server error: feeder not running

Can\'t seem to report any of my results this morning, and it seems like the server status page is screaming with red errors...

I had to restart the mysql server this morning. It seems to be working much better now.
Scott Kruger
Project Administrator, Cosmology@Home
ID: 6356 · Report as offensive
Profile Seejay
Avatar

Send message
Joined: 22 Dec 07
Posts: 13
Credit: 115,740
RAC: 0
Message 6623 - Posted: 18 Jul 2008, 18:06:13 UTC

How come the SERVER STATUS liar says that there are only 2 results waiting for validation, when I\'ve got 59 (just counted) myself??? Wot\'s goin\' on?? 0+0
ID: 6623 · Report as offensive
Profile Cappy [Team Musketeers]
Avatar

Send message
Joined: 8 Jul 07
Posts: 15
Credit: 880,250
RAC: 0
Message 6624 - Posted: 18 Jul 2008, 18:27:29 UTC

same here machines are starving,,hay

if they cant get fed here ill take um e;sewhere :P
ID: 6624 · Report as offensive
Profile m4rtyn
Avatar

Send message
Joined: 23 Aug 07
Posts: 18
Credit: 372,460
RAC: 0
Message 6625 - Posted: 18 Jul 2008, 20:27:42 UTC - in response to Message 6623.  
Last modified: 18 Jul 2008, 20:29:01 UTC

How come the SERVER STATUS liar says that there are only 2 results waiting for validation, when I\'ve got 59 (just counted) myself??? Wot\'s goin\' on?? 0+0


Hi Seejay,

I just took a quick look at your pending wu\'s. As far as I can see non have made quorum yet. The server status only lists pending wu\'s that have made quorum and for some reason are still awaiting validation. You also have one in the checked but no concensus yet state.
m4rtyn
************************** *************************
ID: 6625 · Report as offensive
Profile Seejay
Avatar

Send message
Joined: 22 Dec 07
Posts: 13
Credit: 115,740
RAC: 0
Message 6626 - Posted: 18 Jul 2008, 20:43:00 UTC - in response to Message 6625.  

\"M4artyn\" wrote:
\"Seejay\" wrote:
How come the SERVER STATUS liar says that there are only 2 results waiting for validation, when I\'ve got 59 (just counted) myself??? Wot\'s goin\' on?? 0+0


Hi Seejay,

I just took a quick look at your pending wu\'s. As far as I can see non have made quorum yet. The server status only lists pending wu\'s that have made quorum and for some reason are still awaiting validation. You also have one in the checked but no concensus yet state.


Hi M4artyn, long time.....

You\'re right, I didn\'t even bother checking the status of the WUs in question. Oooops!! I suppose I owe an apology to \"Darkmatter\" !! ;-)
I just assumed, after all the validator problems recently, etc. that there was an error!! Amazing what flaky and vanishing projects can do to your trust in said projects!!

Take care,

Chris :)

ID: 6626 · Report as offensive
STE\/E
Volunteer tester

Send message
Joined: 12 Jun 07
Posts: 375
Credit: 16,522,388
RAC: 0
Message 6627 - Posted: 18 Jul 2008, 22:15:28 UTC

Guy\'s, not to get on your Cases but this isn\'t supposed to be a Discussion Thread about the Urgent Problems the Project has. The Urgent Problem Discussion Thread is here > http://www.cosmologyathome.org/forum_thread.php?id=140

The problem with all the Continual General Discussion in this Thread is you have to sift thru all the Posts to actually find out whats a Urgent Problem & whats just a lot of prattle ... :)
ID: 6627 · Report as offensive
Profile m4rtyn
Avatar

Send message
Joined: 23 Aug 07
Posts: 18
Credit: 372,460
RAC: 0
Message 6628 - Posted: 18 Jul 2008, 22:48:39 UTC - in response to Message 6627.  


just a lot of prattle ... :)


Hi Poorboy,

[comencing_prattle]

Could\'nt help noticing you\'ve added a bit of your own as well :) Seriously though Seejays belief that the validator was screwed again certainly qualifies as an urgent problem, not that the project admin is likly to ever read this thread.

[/end_prattle]
m4rtyn
************************** *************************
ID: 6628 · Report as offensive
STE\/E
Volunteer tester

Send message
Joined: 12 Jun 07
Posts: 375
Credit: 16,522,388
RAC: 0
Message 6629 - Posted: 18 Jul 2008, 23:11:01 UTC
Last modified: 18 Jul 2008, 23:15:31 UTC

LOL, looks like the Mod\'s are on the job and moving Posts ... Yes, I\'m Guilty too & have made a few Prattle Posts myself in the Urgent Problems Thread, but I\'m going to go to Rehab and try to stop ... :)

Seriously though Seejays belief that the validator was screwed again certainly qualifies as an urgent problem, not that the project admin is likly to ever read this thread.


Actually Seejays claim to having 50 Wu\'s waiting for Validation isn\'t true, his Pending Wu\'s waiting for a Wingman to return their Result doesn\'t really Qualify as a Wu Waiting for Validation, at least I don\'t think it does.

A Wu waiting for Validation is 1 that the Quorum of 2 (2 people that have turned in a Result) has been meet & then that makes it a Wu waiting for Validation if both Wu\'s are a success (ie Valied) ... :)

I didn\'t see anything of the sort in seejay\'s Wu\'s, all I saw was Pending Wu\'s waiting as I said for a Wingman to return theirs, which going full circle doesn\'t make it a Wu waiting for Validation to the Server ... :)
ID: 6629 · Report as offensive
Nothing But Idle Time

Send message
Joined: 27 Aug 07
Posts: 84
Credit: 148,380
RAC: 0
Message 6636 - Posted: 19 Jul 2008, 11:09:12 UTC

I completed college in 1970 so maybe our language has changed? My definition of \"urgent\" and that of this project is quite different. And, my definitions of \"quality\", \"attentive\", \"communicative\", \"professional\", \"dedicated\", ..., are also quite different. Maybe academic pursuits have no deadlines nor profit motive, nor any employee risking being fired and replaced by someone more goal oriented.
ID: 6636 · Report as offensive
Profile Dany

Send message
Joined: 13 Mar 08
Posts: 4
Credit: 184,450
RAC: 0
Message 6660 - Posted: 21 Jul 2008, 4:17:35 UTC

Still nothing...

Well, if by tomorrow night I haven\'t got any WU, or at least an explanation, then I shall drop the ball. Something the founders seemed to have done...

I\'m not expecting an improvement so I\'ll give my regards to everyone here and a Happy Cruching.

Hopefully it won\'t come to that, but past experience is in my favor.

C ya.
ID: 6660 · Report as offensive
Previous · 1 . . . 14 · 15 · 16 · 17 · 18 · Next

Forums : Technical Support : URGENT Problems Discussion Thread