Advanced search

Forums : Technical Support : Longer / heavier WUs?
Message board moderation

To post messages, you must log in.

1 · 2 · 3 · 4 · Next

AuthorMessage
WimTea

Send message
Joined: 16 Nov 07
Posts: 27
Credit: 408,380
RAC: 0
Message 8030 - Posted: 8 Apr 2009, 17:44:55 UTC

After picking up some WUs again after a few months I notice they tend to take longer and use / reserve more memory than they used to. Especially the memory requirement is a bit of concern to me as my older hosts are very well suited for Cosmo work CPU wise but also a bit short in the RAM department. I've seen WUs take 240-350 Mb of RAM where before I saw 120 as the max. The VM size used to be close to the max amount in use, but is currently much bigger, eg. 224 Mb used / 410 Mb VM @ 40-50% completion.

Has there been an change?
ID: 8030 · Report as offensive     Reply Quote
Profile Conan
Avatar

Send message
Joined: 28 Aug 07
Posts: 169
Credit: 1,256,874
RAC: 0
Message 8031 - Posted: 8 Apr 2009, 18:12:13 UTC

From what I can see is that it is the Windows application work units that are taking over 50% longer to do (I have one running for over 17 hours at the moment with 3 still to go), whilst the Linux work units on the same spec computer are taking 6 to 8 hours, like they use to.

Memory usage is about 176 MB in the early stage of the WU on Linux and going up to 350 MB in the later stages on both Windows and Linux.

The problem this brings up is awarded credit for these long running work units with the 140 limit that is now in place.
I have already had one claim for 225 credits and got 140 (9 cr/h) on the Windows machine.

I am wondering why have the Windows work units started to take up to 50% longer to run ???
Has only started with the new work units.
ID: 8031 · Report as offensive     Reply Quote
WimTea

Send message
Joined: 16 Nov 07
Posts: 27
Credit: 408,380
RAC: 0
Message 8032 - Posted: 8 Apr 2009, 21:36:34 UTC

Some more details about the Cosmo RAM usage on Windows...
I just noticed a WU was removed from memory due to lack of memory. Altering the BOINC setting to increase mem usage for BOINC brought the WU back to life again and it resumed @ 60%. It almost immediately started claiming wild varying amounts of memory, anywhere between 100 and 440 Mb, currently 189 Mb @ 63.5%, but almost right at the beginning continuously with a VM size of 765 Mb (!).

Hope this helps to solve this issue, unfortunately in the meantime it means NNT for my 2 elderly hosts...
ID: 8032 · Report as offensive     Reply Quote
WimTea

Send message
Joined: 16 Nov 07
Posts: 27
Credit: 408,380
RAC: 0
Message 8036 - Posted: 8 Apr 2009, 22:51:14 UTC - in response to Message 8030.  

update: 305 Mb @ 77%, VM @ 765 Mb. As stated before it never reached 150 Mb.
ID: 8036 · Report as offensive     Reply Quote
Profile Ageless
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 15 Jun 07
Posts: 345
Credit: 50,500
RAC: 0
Message 8038 - Posted: 9 Apr 2009, 2:26:03 UTC
Last modified: 9 Apr 2009, 2:33:48 UTC

Mine is now taking up 383MB of RAM and 383MB of VM. It's slowing my system down, even while I have 2GB of RAM. I wonder what it does on my TV server with only 1GB of RAM, probably swapping a lot to VM.

Hmm, that one only uses 106MB and 200MB VM. Comparable run time though, both tasks are at 23%. The big one is using WinXP, the smaller one Win2k.
Jord.
ID: 8038 · Report as offensive     Reply Quote
Profile Conan
Avatar

Send message
Joined: 28 Aug 07
Posts: 169
Credit: 1,256,874
RAC: 0
Message 8039 - Posted: 9 Apr 2009, 5:20:47 UTC - in response to Message 8031.  

From what I can see is that it is the Windows application work units that are taking over 50% longer to do (I have one running for over 17 hours at the moment with 3 still to go), whilst the Linux work units on the same spec computer are taking 6 to 8 hours, like they use to.

Memory usage is about 176 MB in the early stage of the WU on Linux and going up to 350 MB in the later stages on both Windows and Linux.

The problem this brings up is awarded credit for these long running work units with the 140 limit that is now in place.
I have already had one claim for 225 credits and got 140 (9 cr/h) on the Windows machine.

I am wondering why have the Windows work units started to take up to 50% longer to run ???
Has only started with the new work units.


Well that 17 hour work unit ended after 24 hours, claimed 372.80 and got 140.00.

Memory usage has now shot up to between 540 MB to 587 MB at the 10 hour mark on two current work units.
Another now at 17 hours is using 247 MB.

I have 2 GB RAM but for the first time with Cosmology I am now running into a problem I sometimes get on The Lattice Project when I run more than 1 WU at a time ""Waiting For Memory"".

This is most unusual and my Linux machines still only take the normal times of 6 to 8 hours not 15 to 24 hours.
Will have to stop work on the Windows machine till this gets sorted as it is dropping to a crawl and i can't use it.

ID: 8039 · Report as offensive     Reply Quote
Profile Ananas

Send message
Joined: 19 Jan 08
Posts: 180
Credit: 2,500,290
RAC: 0
Message 8040 - Posted: 9 Apr 2009, 5:40:01 UTC

Probably time to Akosify the windows application ;-)
ID: 8040 · Report as offensive     Reply Quote
Lexxikon
Avatar

Send message
Joined: 4 Nov 07
Posts: 4
Credit: 225,729
RAC: 0
Message 8042 - Posted: 9 Apr 2009, 8:43:38 UTC

Similar situation here. But, Cosmo is using almost 1 gig ram for 2 WU in progress. Luckily, I have 4 gigs total. Hopefully, the WU's will get shortened or more credit is given for a 17 to 20 hour WU.
ID: 8042 · Report as offensive     Reply Quote
Profile Misfit
Volunteer tester
Avatar

Send message
Joined: 9 Jun 07
Posts: 150
Credit: 237,789
RAC: 0
Message 8045 - Posted: 9 Apr 2009, 8:50:03 UTC - in response to Message 8040.  

Probably time to Akosify the windows application ;-)

It takes days just to get the correct app available. They might not be able to handle optimization. Ever see Scanners?
me@rescam.org
ID: 8045 · Report as offensive     Reply Quote
Profile sysfried

Send message
Joined: 24 Jun 07
Posts: 114
Credit: 5,296,905
RAC: 18
Message 8050 - Posted: 9 Apr 2009, 19:01:38 UTC

I don't have a problem with the RAM size, but I do have to complain about the "claimed vs. granted"

http://cosmologyathome.org/workunit.php?wuid=6279002
380 claimed vs. 140 granted.

That's a "no new WUs" for me.

Sincerely,

sysfried
Happy member of Team: Planet 3D Now!

ID: 8050 · Report as offensive     Reply Quote
rroonnaalldd

Send message
Joined: 10 Apr 08
Posts: 18
Credit: 147,580
RAC: 0
Message 8061 - Posted: 10 Apr 2009, 3:59:49 UTC

Message from server: CAMB needs 476.84 MB RAM but only 379.65 MB is available for use.

Says all. The Host has 512MB but is limited for Boinc because it is not a dedicated cruncher.
ID: 8061 · Report as offensive     Reply Quote
Profile Conan
Avatar

Send message
Joined: 28 Aug 07
Posts: 169
Credit: 1,256,874
RAC: 0
Message 8070 - Posted: 10 Apr 2009, 14:12:10 UTC - in response to Message 8039.  

From what I can see is that it is the Windows application work units that are taking over 50% longer to do (I have one running for over 17 hours at the moment with 3 still to go), whilst the Linux work units on the same spec computer are taking 6 to 8 hours, like they use to.

Memory usage is about 176 MB in the early stage of the WU on Linux and going up to 350 MB in the later stages on both Windows and Linux.

The problem this brings up is awarded credit for these long running work units with the 140 limit that is now in place.
I have already had one claim for 225 credits and got 140 (9 cr/h) on the Windows machine.

I am wondering why have the Windows work units started to take up to 50% longer to run ???
Has only started with the new work units.


Well that 17 hour work unit ended after 24 hours, claimed 372.80 and got 140.00.


Memory usage has now shot up to between 540 MB to 587 MB at the 10 hour mark on two current work units.
Another now at 17 hours is using 247 MB.

I have 2 GB RAM but for the first time with Cosmology I am now running into a problem I sometimes get on The Lattice Project when I run more than 1 WU at a time ""Waiting For Memory"".

This is most unusual and my Linux machines still only take the normal times of 6 to 8 hours not 15 to 24 hours.
Will have to stop work on the Windows machine till this gets sorted as it is dropping to a crawl and I can't use it.


My last couple of Windows work units have taken from 18 to 24 hours, but now I have my first 2 Linux work units also going past normal times (one at 10 hours/41% done and one at 17 hours/86% done).

The two Linux work units start ""wu_040709_03014"", all other Linux types seem to run ok.
ID: 8070 · Report as offensive     Reply Quote
mikes

Send message
Joined: 12 Jan 08
Posts: 1
Credit: 2,987,752
RAC: 7
Message 8076 - Posted: 11 Apr 2009, 0:52:34 UTC
Last modified: 11 Apr 2009, 0:52:56 UTC

My Fedora 10 linux box that I have working on Cosmo is using anywhere from 350-580MB per WU, and with two cores running WU, that is causing serious performance problems with the other apps I use (2GB RAM). Time to turn off Cosmo for the time being until the app gets fixed.
ID: 8076 · Report as offensive     Reply Quote
[HWU]Flotta Stellare - Starfleet

Send message
Joined: 22 Apr 08
Posts: 8
Credit: 1,043,050
RAC: 0
Message 8077 - Posted: 11 Apr 2009, 2:27:50 UTC - in response to Message 8076.  
Last modified: 11 Apr 2009, 2:29:25 UTC

My E9450@3.4ghz is using from 380-5580MB per WU too, and with 4 cores running is impossible continuing. I'm turning off Cosmo for the time being until the app gets fixed. :(
ID: 8077 · Report as offensive     Reply Quote
Profile Ageless
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 15 Jun 07
Posts: 345
Credit: 50,500
RAC: 0
Message 8078 - Posted: 11 Apr 2009, 3:27:40 UTC - in response to Message 8077.  

5580MB per WU

That must be a record.

I'm now running at 705MB RAM and 705MB VM. And still only at 70%, after 30h 35m wall time/23h 4m CPU time.
Jord.
ID: 8078 · Report as offensive     Reply Quote
WimTea

Send message
Joined: 16 Nov 07
Posts: 27
Credit: 408,380
RAC: 0
Message 8082 - Posted: 11 Apr 2009, 20:45:24 UTC

My last WU until a(ny) response from the project admins is @ 45hrs and 95% now. Mem usage gradually diminished to 150 Mb / 300 Mb. Time between last 2 checkpoints was around 8 (!) CPU hours. Luckily this particular host has hibernate active and working just fine...
ID: 8082 · Report as offensive     Reply Quote
Jim Wilkins

Send message
Joined: 22 Nov 07
Posts: 5
Credit: 62,080
RAC: 0
Message 8085 - Posted: 12 Apr 2009, 4:54:03 UTC - in response to Message 8082.  

After 20 hours with 3 hours to go, mine reset to 25% and about 30 hours to go. Looks like I will be in high priority mode for some time
ID: 8085 · Report as offensive     Reply Quote
Profile sysfried

Send message
Joined: 24 Jun 07
Posts: 114
Credit: 5,296,905
RAC: 18
Message 8088 - Posted: 12 Apr 2009, 20:34:57 UTC

Could we get an official feedback on this? I'm still on no new workunits... I'm afraid this is some kind of bug and I'd be wasting my time and energy.

Sysfried
Happy member of Team: Planet 3D Now!

ID: 8088 · Report as offensive     Reply Quote
vass

Send message
Joined: 29 Oct 07
Posts: 1
Credit: 2,300,548
RAC: 0
Message 8094 - Posted: 14 Apr 2009, 0:04:56 UTC

Running new WUs is impossible on 4cores, need 3.5 G RAM !!.
The credit is also very low
I turned cosmo off until things turn the way they were
ID: 8094 · Report as offensive     Reply Quote
Profile Ageless
Volunteer moderator
Volunteer tester
Avatar

Send message
Joined: 15 Jun 07
Posts: 345
Credit: 50,500
RAC: 0
Message 8100 - Posted: 15 Apr 2009, 0:11:41 UTC

I have had a small talk with Anshul about these long/high memory tasks.

Some of the files that were copied back onto the server seem to have been horribly corrupted. If the small fixes Anshul and Scott are putting though don't fix this, they'll go for rebuild of the project and its files.

In the mean time, these tasks can be run to completion (if you have the memory available). Other tasks going out seem to run in much less time than normal. So there is definitely something wrong.

It's being worked on.
Jord.
ID: 8100 · Report as offensive     Reply Quote
1 · 2 · 3 · 4 · Next

Forums : Technical Support : Longer / heavier WUs?