Advanced search

Forums : Technical Support : URGENT PROBLEMS THREAD (2009 and after)
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 3 · 4 · 5 · 6 · 7 · 8 · 9 · Next

AuthorMessage
Profile Faik

Send message
Joined: 3 Oct 11
Posts: 2771
Credit: 140,700
RAC: 0
Message 10054 - Posted: 11 Feb 2012, 1:13:32 UTC - in response to Message 10053.  

Dear Bill,

It is not that I have failed you; it is that I do not know. I know that Professor Wandelt is away currently, and he has promised me to work on this as soon as he is back. I will certainly forward your message to him.

Best wishes,

Faik
---
Member of the C@H Team!
ID: 10054 · Report as offensive     Reply Quote
Bill & Patsy
Avatar

Send message
Joined: 27 Jul 08
Posts: 25
Credit: 1,045,640
RAC: 0
Message 10055 - Posted: 11 Feb 2012, 1:46:40 UTC - in response to Message 10054.  

Thank you!
--Bill

ID: 10055 · Report as offensive     Reply Quote
Profile Faik

Send message
Joined: 3 Oct 11
Posts: 2771
Credit: 140,700
RAC: 0
Message 10056 - Posted: 11 Feb 2012, 3:03:08 UTC - in response to Message 10055.  

You welcome!

Faik
---
Member of the C@H Team!
ID: 10056 · Report as offensive     Reply Quote
Neoka

Send message
Joined: 3 Feb 12
Posts: 3
Credit: 118,491
RAC: 0
Message 10058 - Posted: 11 Feb 2012, 9:48:22 UTC
Last modified: 11 Feb 2012, 9:49:56 UTC

Hi,
what's the problem. I added a new project Cosmology, I have prepared two tasks. For more than five days, the Boinc manager can not send the results (Cosmology @ Home | [error] Error Reported upload the file server: The server is out of disk space). It also can not download new tasks.
Other projects work well. What should I do?
ID: 10058 · Report as offensive     Reply Quote
Profile cykodennis

Send message
Joined: 31 May 10
Posts: 234
Credit: 4,896,378
RAC: 0
Message 10060 - Posted: 11 Feb 2012, 16:07:28 UTC - in response to Message 10058.  

Just wait.
It is not on your side. The serverharddisk is full and the techincal staff on travel.
ID: 10060 · Report as offensive     Reply Quote
w1hue

Send message
Joined: 28 Sep 09
Posts: 9
Credit: 1,124,314
RAC: 0
Message 10066 - Posted: 11 Feb 2012, 22:43:23 UTC - in response to Message 10050.  


I think that if the wu's are not reported back before deadline they will marked as 'unvalid' and sent out again and there is nothing I can do to prevent that.

I recall that other projects (SETI and Milkyway, in particular) have been able to give credit to WUs that went past their deadlines when the projects were having server problems...
ID: 10066 · Report as offensive     Reply Quote
Rapture
Avatar

Send message
Joined: 27 Oct 07
Posts: 85
Credit: 661,330
RAC: 0
Message 10069 - Posted: 11 Feb 2012, 23:23:08 UTC - in response to Message 10054.  

Do you know when Ben Wandelt will return?
ID: 10069 · Report as offensive     Reply Quote
Bill & Patsy
Avatar

Send message
Joined: 27 Jul 08
Posts: 25
Credit: 1,045,640
RAC: 0
Message 10076 - Posted: 12 Feb 2012, 1:19:49 UTC - in response to Message 10069.  

Do you know when Ben Wandelt will return?

Good grief. Have you read even the recent posts in this thread?
--Bill

ID: 10076 · Report as offensive     Reply Quote
Tomas

Send message
Joined: 18 Aug 08
Posts: 30
Credit: 827,240
RAC: 0
Message 10079 - Posted: 12 Feb 2012, 16:19:11 UTC - in response to Message 10066.  


I think that if the wu's are not reported back before deadline they will marked as 'unvalid' and sent out again and there is nothing I can do to prevent that.

I recall that other projects (SETI and Milkyway, in particular) have been able to give credit to WUs that went past their deadlines when the projects were having server problems...


The credit is not that important, wasted work done is. Will wu's that is not reported back before the deadline trigger the server to send them out again as long that part of the server is running?

ID: 10079 · Report as offensive     Reply Quote
Neoka

Send message
Joined: 3 Feb 12
Posts: 3
Credit: 118,491
RAC: 0
Message 10107 - Posted: 13 Feb 2012, 21:22:10 UTC

I decided to stop working on your project until you remove the problems. If your problems because of ready tasks fail and lose credits for work unsubscribe from your project.
ID: 10107 · Report as offensive     Reply Quote
tgm

Send message
Joined: 20 Dec 07
Posts: 2
Credit: 757,457
RAC: 0
Message 10112 - Posted: 14 Feb 2012, 5:42:08 UTC

How do I kindly suggest that there may be some project management issues associated with the care and feeding of the servers used to support this project. The indications we have seen suggest that there is no disk space left. This should have been predicted long before now and we should not be the people noticing it first. But this may not be the actual issue. The fact that nobody knows what is really the underlying cause more than 10 days after the crash certainly goes beyond the term 'problematic'. My guess is that someone may believe that the loss of time, data, etc. may be relatively inconsequential. But that someone should also consider the number of people with their computing resources could go away as quickly as they came and extend the reprocessing time to finish even further. I doubt that the management of this system crash will be enthusiastically viewed by the listed sponsors of the endeavor either (JPL, NSF, and NASA).
ID: 10112 · Report as offensive     Reply Quote
Profile cykodennis

Send message
Joined: 31 May 10
Posts: 234
Credit: 4,896,378
RAC: 0
Message 10113 - Posted: 14 Feb 2012, 6:20:19 UTC - in response to Message 10112.  

You are right.
But improvements are on their way.
Announcement

If it ends all in a better managment situation, it might have been the annoying worth.
ID: 10113 · Report as offensive     Reply Quote
josullivan

Send message
Joined: 22 Jun 09
Posts: 1
Credit: 120,960
RAC: 0
Message 10118 - Posted: 14 Feb 2012, 13:36:35 UTC

Sorry but I have had enough of this time wasting now, if you lot can't get it sorted by now you never will.

I'm off!!
ID: 10118 · Report as offensive     Reply Quote
Jon

Send message
Joined: 21 Sep 11
Posts: 5
Credit: 1,199,422
RAC: 0
Message 10122 - Posted: 15 Feb 2012, 2:35:22 UTC - in response to Message 10054.  

Not knowing the problem, it is clear that the end result is a full disk somewhere. One short term fix would be to run a scheduled job to move teh files out of the full file system onto another volume. That way, at least you'd get the files.

A lot of people are leaving over this fiasco. I don;t intend to do that, but I've joined other projects that I won't leave now that I've started them. You've lost a lot of CPU right there, and I consider myself loyal.

I need to repurpose at least one of the devices. I've been holding off for weeks now in order to upload your files. I'm running out of wiggle room. I'd hate for days of data to be lost. How much longer must I hold them?


What bothers me, and I suspect many here, is that nothing is being done. It's 2012 and VPN technology is not new. If for some reason that is not available, I suspect the IT dept of the U of IL knows a thing or two about computers.

For whatever reason, it hasn't been looked at. If it has been looked at, no useful information has been shared and no short-term work-around has been implemented. That's tragic.

We are all for you guys. We wouldn't dedicate our clock cycles if we were not. Because we do this we have a stake in the project. What we are asking, is what is happening in the short term? "Nothing" is an unacceptable answer. Has IT been asked? Is there a comp sci professor or comp sci student who might be able to look at it? Even if it is a chargeable call, it has to be cheaper than the CPU time you are losing in both the short and long term.

Even if you can't do any of those things, someone should have an idea of what the first day this might actually be looked at. Can we at least narrow it down to 1 to 5 days, 6 to 10 days, 11 to 15 days, etc. ??? Maybe tell us what's been tried thus far? Please?



ID: 10122 · Report as offensive     Reply Quote
Profile cykodennis

Send message
Joined: 31 May 10
Posts: 234
Credit: 4,896,378
RAC: 0
Message 10124 - Posted: 15 Feb 2012, 7:38:34 UTC - in response to Message 10122.  

I totally agree with what you write Jon.
And this is something we have to talk about with Ben, as soon as he is back again. Obviously, Ben Wandelt seems to be the only project-involved person, who has the permissions and abilities, which are needed to look at the problem.

However, it is not completely right that there was no information.
Faik told us, that he has informed Ben Wandelt, who is actually traveling. This was...erm..ca. 10 days ago (?).
Faik also made clear that there is no one else who is able to fix the problem.
So we can answer our questions ourselves:
- The problem will be fixed as soon as Ben is back
- We do not know when Ben is back
- Nothing has been done so far (because Ben is away)
- The problem seems to be simple
- As Faik has signalized, the "crew" (Faik&Ben) has learned from this issue, and searches for a competent third man proactively

ID: 10124 · Report as offensive     Reply Quote
JLConawayII

Send message
Joined: 27 Apr 10
Posts: 19
Credit: 1,086,670
RAC: 0
Message 10136 - Posted: 15 Feb 2012, 18:14:04 UTC

I always get a kick out of all the people that take any problem like this as a personal attack on them, and announce loudly that they're leaving as if they're so important the project will collapse without their presence. Stop acting like children, the problem will get fixed when it gets fixed. In the meantime there are many other worthwhile projects to which you can dedicate your resources, like SETI@home, Einstein@home, MilkyWay@home, etc.
ID: 10136 · Report as offensive     Reply Quote
Jon

Send message
Joined: 21 Sep 11
Posts: 5
Credit: 1,199,422
RAC: 0
Message 10137 - Posted: 15 Feb 2012, 19:45:38 UTC - in response to Message 10124.  

I agree that some info has been shared, but I've been sitting on repurposing a device for 2 weeks now. If I knew it was fruitless to wait, I would repurpose it now. I am hoping it gets resolved and am holding off on this, but every day I wait the harder it will be for me to catch up. This one device is not a significant total of my work units as they are spread accross several devices. I don't want to screw cosmology@home out of days of one computer's clock cycles if I don't have to.

No one knowing when Ben is going to return seems a little odd. One would think, for example, Ben might have an idea. I really would like to make plans based on good information. If we are talking weeks, I am going to wipe it. If we are talking hours or days I'll hold off.

I'd like to know which of these it might be.
ID: 10137 · Report as offensive     Reply Quote
w1hue

Send message
Joined: 28 Sep 09
Posts: 9
Credit: 1,124,314
RAC: 0
Message 10141 - Posted: 15 Feb 2012, 21:59:43 UTC - in response to Message 10136.  

I always get a kick out of all the people that take any problem like this as a personal attack on them, and announce loudly that they're leaving as if they're so important the project will collapse without their presence. Stop acting like children, the problem will get fixed when it gets fixed. In the meantime there are many other worthwhile projects to which you can dedicate your resources, like SETI@home, Einstein@home, MilkyWay@home, etc.

I second that! This sort of crap appears on all project's message boards whenever a problem pops up. Come on folks -- most BOINC projects are under funded and the researchers can't just drop everything the second a server barfs or a disk fills up! If you want to do work for a project that is relatively well funded and problem free, give "World Community Grid" a look.
ID: 10141 · Report as offensive     Reply Quote
Paul John

Send message
Joined: 1 Jan 12
Posts: 1
Credit: 58,800
RAC: 0
Message 10142 - Posted: 15 Feb 2012, 22:30:59 UTC

Wot no ini file? Every time I get a new job lately the download failes because it can't find the ini file the last one being "params_013112_081643_1.ini" it comes up with "giving up on download...file not found" What's happening?
ID: 10142 · Report as offensive     Reply Quote
Profile cykodennis

Send message
Joined: 31 May 10
Posts: 234
Credit: 4,896,378
RAC: 0
Message 10143 - Posted: 15 Feb 2012, 22:49:57 UTC - in response to Message 10141.  

/sign

I always get a kick out of all the people that take any problem like this as a personal attack on them, and announce loudly that they're leaving as if they're so important the project will collapse without their presence. Stop acting like children, the problem will get fixed when it gets fixed. In the meantime there are many other worthwhile projects to which you can dedicate your resources, like SETI@home, Einstein@home, MilkyWay@home, etc.

I second that! This sort of crap appears on all project's message boards whenever a problem pops up. Come on folks -- most BOINC projects are under funded and the researchers can't just drop everything the second a server barfs or a disk fills up! If you want to do work for a project that is relatively well funded and problem free, give "World Community Grid" a look.

ID: 10143 · Report as offensive     Reply Quote
Previous · 1 . . . 3 · 4 · 5 · 6 · 7 · 8 · 9 · Next

Forums : Technical Support : URGENT PROBLEMS THREAD (2009 and after)