Advanced search

Message boards : Technical Support : VM tasks fail to run

Author Message
Profile Opolis
Send message
Joined: 31 Jan 17
Posts: 3
Credit: 139,072
RAC: 0
Message 21392 - Posted: 3 May 2017, 16:05:02 UTC

I am new to Cosmology and am experiencing a strange issue.

I have Virtual Box setup (use it for other projects without any issues).
I was able to complete a handful of camb_boinc2docker tasks successfully.
Suddenly, all camb_boinc2docker and planck_param_sims tasks would hang at 0.01%.
I have VB extension installed and the VM shows a blank screen.

I have tried various VB versions 5.1.18, 5.1.22, 5.1.16, completely uninstalling each time.

I cannot get an VB tasks to run now.

Here is an example of stderr from a task that reported back "Error while computing".

<core_client_version>7.6.33</core_client_version>
<![CDATA[
<message>
finish file present too long
</message>
<stderr_txt>
2017-05-02 16:05:13 (5804): vboxwrapper (7.7.26197): starting
2017-05-02 16:05:13 (5804): Feature: Checkpoint interval offset (70 seconds)
2017-05-02 16:05:13 (5804): Detected: VirtualBox COM Interface (Version: 5.1.22)
2017-05-02 16:05:13 (5804): Detected: Minimum checkpoint interval (600.000000 seconds)
2017-05-02 16:05:13 (5804): Create VM. (boinc_1d378dec26331a8f, slot#1)
2017-05-02 16:05:13 (5804): Updating drive controller type and model for desired configuration.
2017-05-02 16:05:13 (5804): Setting Memory Size for VM. (2048MB)
2017-05-02 16:05:13 (5804): Setting CPU Count for VM. (10)
2017-05-02 16:05:13 (5804): Setting Chipset Options for VM.
2017-05-02 16:05:13 (5804): Setting Boot Options for VM.
2017-05-02 16:05:13 (5804): Enabling VM Network Access.
2017-05-02 16:05:13 (5804): Setting Network Configuration for NAT.
2017-05-02 16:05:13 (5804): Disabling USB Support for VM.
2017-05-02 16:05:13 (5804): Disabling COM Port Support for VM.
2017-05-02 16:05:13 (5804): Disabling LPT Port Support for VM.
2017-05-02 16:05:13 (5804): Disabling Audio Support for VM.
2017-05-02 16:05:13 (5804): Disabling Clipboard Support for VM.
2017-05-02 16:05:13 (5804): Disabling Drag and Drop Support for VM.
2017-05-02 16:05:13 (5804): Adding storage controller(s) to VM.
2017-05-02 16:05:13 (5804): Adding virtual ISO 9660 disk drive to VM. (vm_isocontext.iso)
2017-05-02 16:05:13 (5804): Adding VirtualBox Guest Additions to VM.
2017-05-02 16:05:13 (5804): Adding network bandwidth throttle group to VM. (Defaulting to 1024GB)
2017-05-02 16:05:13 (5804): Enabling shared directory for VM.
2017-05-02 16:05:13 (5804): Starting VM. (boinc_1d378dec26331a8f, slot#1)
2017-05-02 16:05:26 (5804): Guest Log: BIOS: VirtualBox 5.1.22
2017-05-02 16:05:26 (5804): Guest Log: BIOS: Boot : bseqnr=1, bootseq=0032
2017-05-02 16:05:26 (5804): Guest Log: int13_harddisk: function 02, unmapped device for ELDL=80
2017-05-02 16:05:26 (5804): Guest Log: BIOS: Boot from Hard Disk 0 failed
2017-05-02 16:05:26 (5804): Guest Log: BIOS: Boot : bseqnr=2, bootseq=0003
2017-05-02 16:05:26 (5804): Guest Log: BIOS: Booting from CD-ROM...
2017-05-02 16:05:26 (5804): Guest Log: BIOS: KBD: unsupported int 16h function 03
2017-05-02 16:05:26 (5804): Guest Log: BIOS: AX=0305 BX=0000 CX=0000 DX=0000
2017-05-02 16:05:26 (5804): Successfully started VM. (PID = '2884')
2017-05-02 16:05:26 (5804): Reporting VM Process ID to BOINC.
2017-05-02 16:05:36 (5804): VM state change detected. (old = 'poweroff', new = 'running')
2017-05-02 16:05:56 (5804): Preference change detected
2017-05-02 16:05:56 (5804): Setting CPU throttle for VM. (100%)
2017-05-02 16:05:56 (5804): Setting checkpoint interval to 600 seconds. (Higher value of (Preference: 60 seconds) or (Vbox_job.xml: 600 seconds))
2017-05-02 16:07:06 (5804): Guest Log: vgdrvHeartbeatInit: Setting up heartbeat to trigger every 2000 milliseconds
2017-05-02 16:07:06 (5804): Guest Log: vboxguest: misc device minor 56, IRQ 20, I/O port d020, MMIO at 00000000f0400000 (size 0x400000)
2017-05-02 16:07:06 (5804): Guest Log: net.ipv4.ip_forward = 1
2017-05-02 16:07:06 (5804): Guest Log: sysctl: cannot stat /proc/sys/net/ipv6/conf/all/forwarding: No such file or directory
2017-05-02 16:07:06 (5804): Guest Log: sysctl: setting key "cannot stat %s": No such file or directory
2017-05-02 16:07:06 (5804): Guest Log: sysctl: "cannot stat %s" is an unknown key
2017-05-02 16:07:06 (5804): Guest Log: sysctl: setting key "cannot stat %s": No such file or directory
2017-05-02 16:07:06 (5804): Guest Log: Segmentation fault
2017-05-02 16:07:06 (5804): Guest Log: automount ...
2017-05-02 16:07:06 (5804): Guest Log: Is the disk unpartitioned?, test for the 'boot2docker format-me' string
2017-05-02 16:07:06 (5804): Guest Log: automount over.
2017-05-02 16:07:06 (5804): Guest Log: Setting hostname to boot2docker Done.
2017-05-02 16:07:06 (5804): Guest Log: Tue May 2 16:07:00 UTC 2017 dhcp -------------------------------
2017-05-02 16:07:16 (5804): Guest Log: udevadm settle - timeout of 5 seconds reached, the event queue contains:
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/device:00/PNP0700:00 (1117)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00 (1350)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXCPU:00 (1351)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXCPU:01 (1352)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXCPU:02 (1353)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXCPU:03 (1354)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXCPU:04 (1355)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXCPU:05 (1356)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXCPU:06 (1357)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXCPU:07 (1358)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXCPU:08 (1359)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXCPU:09 (1360)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXPWRBN:00 (1361)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXPWRBN:00/input/input0 (1362)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXPWRBN:00/input/input0/event0 (1363)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSLPBN:00 (1364)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSLPBN:00/input/input1 (1365)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSLPBN:00/input/input1/event1 (1366)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00 (1367)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00 (1368)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/ACPI0003:00 (1369)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/ACPI0003:00/power_supply/AC (1370)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/LNXVIDEO:00 (1371)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/LNXVIDEO:00/device:01 (1372)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/LNXVIDEO:00/input/input6 (1373)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/LNXVIDEO:00/input/input6/event4 (1374)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/PNP0400:00 (1375)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/PNP0400:01 (1376)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/PNP0501:00 (1377)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/PNP0501:01 (1378)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/PNP0501:02 (1379)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/PNP0501:03 (1380)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/PNP0C02:00 (1381)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/PNP0C0A:00 (1382)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/PNP8390:00 (1383)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/device:00 (1384)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/device:00/APP0001:00 (1385)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/device:00/PNP0000:00 (1386)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/device:00/PNP0100:00 (1387)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/device:00/PNP0103:00 (1388)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/device:00/PNP0200:00 (1389)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/device:00/PNP0303:00 (1390)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/device:00/PNP0700:00 (1391)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/device:00/PNP0B00:00 (1392)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/device:00/PNP0F03:00 (1393)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A03:00/device:02 (1394)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0C0F:00 (1395)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0C0F:01 (1396)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0C0F:02 (1397)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:00/PNP0C0F:03 (1398)
2017-05-02 16:07:16 (5804): Guest Log: /sys/devices/LNXSYSTM:00/LNXSYBUS:01 (1399)
2017-05-02 16:07:16 (5804): Guest Log: Tue May 2 16:07:06 UTC 2017 dhcp -------------------------------
2017-05-02 17:05:10 (5804): VM state change detected. (old = 'running', new = 'paused')
2017-05-02 17:05:20 (5804): Error 0x80bb0002 in vbox51::VBOX_VM::resume (c:\src\boinc\boinc\samples\vboxwrapper\vbox_mscom_impl.cpp:1596)
2017-05-02 17:05:20 (5804): Error Source : ConsoleWrap
2017-05-02 17:05:20 (5804): Error Description: Cannot resume the machine as it is not paused (machine state: Saving)
2017-05-02 17:05:20 (5804): VM state change detected. (old = 'paused', new = 'saving')
2017-05-02 17:05:22 (5804): VM is no longer is a running state. It is in 'saved'.
2017-05-02 17:05:22 (5804): VM state change detected. (old = 'saving', new = 'saved')
2017-05-02 17:05:22 (5804): Powering off VM.
2017-05-02 17:05:22 (5804): Deregistering VM. (boinc_1d378dec26331a8f, slot#1)
2017-05-02 17:05:22 (5804): Removing virtual disk drive(s) from VM.
2017-05-02 17:05:22 (5804): Error 0x80bb0002 in vbox51::VBOX_VM::deregister_vm (c:\src\boinc\boinc\samples\vboxwrapper\vbox_mscom_impl.cpp:995)
2017-05-02 17:05:22 (5804): Error Source : SessionMachine
2017-05-02 17:05:22 (5804): Error Description: The machine is not mutable or running (state is Saved)
2017-05-02 17:05:22 (5804): Error 0x80bb0002 in vbox51::VBOX_VM::deregister_vm (c:\src\boinc\boinc\samples\vboxwrapper\vbox_mscom_impl.cpp:995)
2017-05-02 17:05:22 (5804): Error Source : SessionMachine
2017-05-02 17:05:22 (5804): Error Description: The machine is not mutable or running (state is Saved)
2017-05-02 17:05:22 (5804): Removing network bandwidth throttle group from VM.
2017-05-02 17:05:22 (5804): Removing storage controller(s) from VM.
2017-05-02 17:05:23 (5804): Removing VM from VirtualBox.
2017-05-02 17:05:28 (5804): Virtual machine exited.
17:05:33 (5804): called boinc_finish(0)
2017-05-02 17:27:02 (9416): vboxwrapper (7.7.26197): starting
2017-05-02 17:27:02 (9416): Feature: Checkpoint interval offset (394 seconds)
2017-05-02 17:27:02 (9416): Detected: VirtualBox COM Interface (Version: 5.1.22)
2017-05-02 17:27:02 (9416): Detected: Minimum checkpoint interval (600.000000 seconds)
2017-05-02 17:27:02 (9416): Create VM. (boinc_1d378dec26331a8f, slot#1)
2017-05-02 17:27:02 (9416): Updating drive controller type and model for desired configuration.
2017-05-02 17:27:02 (9416): Setting Memory Size for VM. (2048MB)
2017-05-02 17:27:02 (9416): Setting CPU Count for VM. (10)
2017-05-02 17:27:02 (9416): Setting Chipset Options for VM.
2017-05-02 17:27:02 (9416): Setting Boot Options for VM.
2017-05-02 17:27:02 (9416): Enabling VM Network Access.
2017-05-02 17:27:02 (9416): Setting Network Configuration for NAT.
2017-05-02 17:27:02 (9416): Disabling USB Support for VM.
2017-05-02 17:27:02 (9416): Disabling COM Port Support for VM.
2017-05-02 17:27:02 (9416): Disabling LPT Port Support for VM.
2017-05-02 17:27:02 (9416): Disabling Audio Support for VM.
2017-05-02 17:27:02 (9416): Disabling Clipboard Support for VM.
2017-05-02 17:27:02 (9416): Disabling Drag and Drop Support for VM.
2017-05-02 17:27:02 (9416): Adding storage controller(s) to VM.
2017-05-02 17:27:02 (9416): Adding virtual ISO 9660 disk drive to VM. (vm_isocontext.iso)
2017-05-02 17:27:02 (9416): Adding VirtualBox Guest Additions to VM.
2017-05-02 17:27:02 (9416): Adding network bandwidth throttle group to VM. (Defaulting to 1024GB)
2017-05-02 17:27:02 (9416): Enabling shared directory for VM.
2017-05-02 17:27:02 (9416): Starting VM. (boinc_1d378dec26331a8f, slot#1)
2017-05-02 17:27:16 (9416): Guest Log: BIOS: VirtualBox 5.1.22
2017-05-02 17:27:16 (9416): Guest Log: BIOS: Boot : bseqnr=1, bootseq=0032
2017-05-02 17:27:16 (9416): Guest Log: int13_harddisk: function 02, unmapped device for ELDL=80
2017-05-02 17:27:16 (9416): Guest Log: BIOS: Boot from Hard Disk 0 failed
2017-05-02 17:27:16 (9416): Guest Log: BIOS: Boot : bseqnr=2, bootseq=0003
2017-05-02 17:27:16 (9416): Guest Log: BIOS: Booting from CD-ROM...
2017-05-02 17:27:16 (9416): Guest Log: BIOS: KBD: unsupported int 16h function 03
2017-05-02 17:27:16 (9416): Guest Log: BIOS: AX=0305 BX=0000 CX=0000 DX=0000
2017-05-02 17:27:16 (9416): Successfully started VM. (PID = '11448')
2017-05-02 17:27:16 (9416): Reporting VM Process ID to BOINC.
2017-05-02 17:27:26 (9416): VM state change detected. (old = 'poweroff', new = 'running')

</stderr_txt>
]]>

Profile Opolis
Send message
Joined: 31 Jan 17
Posts: 3
Credit: 139,072
RAC: 0
Message 21394 - Posted: 3 May 2017, 23:07:38 UTC

Interestingly, after sitting idle for over 8hrs., planck tasks have started to run successfully...

Profile Opolis
Send message
Joined: 31 Jan 17
Posts: 3
Credit: 139,072
RAC: 0
Message 21399 - Posted: 4 May 2017, 15:08:17 UTC

One change I made was to add an app_config with help from a teammate.
Prior to this I was letting the project run only governed by BOINC manager settings (10 cores).
It has been running pretty smoothly for a day now.

<app_config>
<app>
<name>camb_boinc2docker</name>
<max_concurrent>1</max_concurrent>
</app>
<app_version>
<app_name>camb_boinc2docker</app_name>
<cmdline>-t 8</cmdline>
<avg_ncpus>8</avg_ncpus>
<max_ncpus>8</max_ncpus>
<plan_class>vbox64_mt</plan_class>
</app_version>

<app>
<name>lsplitsims</name>
<max_concurrent>1</max_concurrent>
</app>
<app_version>
<app_name>lsplitsims</app_name>
<cmdline>-t 8</cmdline>
<avg_ncpus>8</avg_ncpus>
<max_ncpus>8</max_ncpus>
<plan_class>vbox64_mt</plan_class>
</app_version>
</app_config>

Message boards : Technical Support : VM tasks fail to run