Message: VM VM Hypervisor failed to enter an online state in a timely fashion

Message boards : Number crunching : Message: VM VM Hypervisor failed to enter an online state in a timely fashion
Message board moderation

To post messages, you must log in.

AuthorMessage
[VENETO] boboviz

Send message
Joined: 7 Apr 17
Posts: 54
Credit: 26,471
RAC: 0
Message 251 - Posted: 9 Apr 2019, 13:08:43 UTC

All my wus stop at 88% with the message "Posponed: VM Hypervisor failed....."
I'm using Win 10 1089 and Virtual Box 5.1.10
ID: 251 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
[VENETO] boboviz

Send message
Joined: 7 Apr 17
Posts: 54
Credit: 26,471
RAC: 0
Message 252 - Posted: 9 Apr 2019, 15:29:37 UTC - in response to Message 251.  
Last modified: 9 Apr 2019, 15:30:01 UTC

Only 2 wus continue to crunch, but at the end, error 197 like this thread

Please, fix the app
ID: 252 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
conf [MM]

Send message
Joined: 5 May 19
Posts: 1
Credit: 56,129
RAC: 0
Message 296 - Posted: 12 May 2019, 8:26:19 UTC

Same problem for me.
I`m using VirtualBox 5.2.28 and nearly all of my WUs stopped at around 80%.
After 30 minutes I had 40 VirtualMachines and my RAM ( 16GB ) began
to get crazy --> restart the PC with alt strg entf.
Maybe you can set a limit for WUs at the same time as other VM Projects do ?
ID: 296 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Hal Bregg

Send message
Joined: 24 Apr 19
Posts: 53
Credit: 114,639
RAC: 2
Message 297 - Posted: 12 May 2019, 13:45:22 UTC - in response to Message 251.  

All my wus stop at 88% with the message "Posponed: VM Hypervisor failed....."
I'm using Win 10 1089 and Virtual Box 5.1.10


VirtualBox 5.2.8 works fine for me.
ID: 297 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
[VENETO] boboviz

Send message
Joined: 7 Apr 17
Posts: 54
Credit: 26,471
RAC: 0
Message 398 - Posted: 20 Aug 2019, 8:29:49 UTC - in response to Message 297.  

VirtualBox 5.2.8 works fine for me.


I tried and it works!!
ID: 398 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mmonnin

Send message
Joined: 16 Jan 18
Posts: 23
Credit: 305,743
RAC: 266
Message 400 - Posted: 20 Aug 2019, 20:20:13 UTC

I'm seeing more and more of these along with ones that run for so long and abort. Task conditions have changed the past couple of days.
ID: 400 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
gvdsm

Send message
Joined: 15 Aug 20
Posts: 5
Credit: 0
RAC: 0
Message 605 - Posted: 21 Aug 2020, 8:30:27 UTC - in response to Message 297.  

Hi,

I am trying to get nanoHub@Home working without the given error message "Posponed: VM Hypervisor failed.....".

Are you still using this Virtualbox 5.2.8 to execute their WU's?
ID: 605 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
gvdsm

Send message
Joined: 15 Aug 20
Posts: 5
Credit: 0
RAC: 0
Message 606 - Posted: 21 Aug 2020, 8:31:29 UTC - in response to Message 398.  

Hi,

I am trying to get nanoHub@Home working without the given error message "Posponed: VM Hypervisor failed.....".

Are you still using this Virtualbox 5.2.8 to execute their WU's?
ID: 606 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mikey
Avatar

Send message
Joined: 18 Nov 18
Posts: 11
Credit: 31,781
RAC: 0
Message 608 - Posted: 21 Aug 2020, 11:03:17 UTC - in response to Message 606.  

Hi,

I am trying to get nanoHub@Home working without the given error message "Posponed: VM Hypervisor failed.....".

Are you still using this Virtualbox 5.2.8 to execute their WU's?


If you are getting that message than your VB version is too new
ID: 608 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jim1348

Send message
Joined: 11 Jan 17
Posts: 98
Credit: 224,673
RAC: 9
Message 609 - Posted: 22 Aug 2020, 6:09:51 UTC - in response to Message 251.  

How many work units are you trying to run? They take about 2 GB each.
You have 24 cores and 20 GB of memory.
ID: 609 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
PaoloNasca

Send message
Joined: 25 Aug 19
Posts: 3
Credit: 20,859
RAC: 18
Message 611 - Posted: 24 Aug 2020, 20:23:54 UTC

45 WUs are stuck in the “Posponed: VM Hypervisor failed....." (33% of progress).

Windows Server 2016
Boinc client 7.16.7
Virtualbox 5.2.8
1 CPU 4 core (8 thread)
32GB RAM
ID: 611 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jim1348

Send message
Joined: 11 Jan 17
Posts: 98
Credit: 224,673
RAC: 9
Message 612 - Posted: 24 Aug 2020, 21:49:32 UTC - in response to Message 611.  

45 WUs are stuck in the “Posponed: VM Hypervisor failed....." (33% of progress).

In BOINC settings, are you allowing enough memory to be used? I set it to 95%.

I haven't seen a problem with my Ryzen 2700, with 8/16 cores and 32 GB (Ubuntu 18.04 and VBox 5.2.42),
though I usually limit it to a maximum of 8 work units at a time.
ID: 612 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
PaoloNasca

Send message
Joined: 25 Aug 19
Posts: 3
Credit: 20,859
RAC: 18
Message 613 - Posted: 24 Aug 2020, 22:33:54 UTC - in response to Message 612.  

In BOINC settings, are you allowing enough memory to be used? I set it to 95%.

The memory usage was 90%, so I'm going to increase it.

I usually limit it to a maximum of 8 work units at a time

How can I do it?
ID: 613 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mikey
Avatar

Send message
Joined: 18 Nov 18
Posts: 11
Credit: 31,781
RAC: 0
Message 614 - Posted: 24 Aug 2020, 23:25:47 UTC - in response to Message 613.  

In BOINC settings, are you allowing enough memory to be used? I set it to 95%.

The memory usage was 90%, so I'm going to increase it.

I usually limit it to a maximum of 8 work units at a time

How can I do it?


Something like this except I'm not sure the <name> or <app_name> is correct:

<app_config>

<app>
<name>boinc2docker</name>
<max_concurrent>1</max_concurrent>
<fraction_done_exact/>
</app>
<app_version>
<app_name>boinc2docker</app_name>
<cmdline>-t4</cmdline>
</app_version>

<report_results_immediately/>

</app_config>

The above would run 1 workunit at a time, the <max_concurrent> part, and then use 14 cpu cores for each task, the <cmdline> part.

You would place this file in C:\program data\Boinc\projects\(thenanohubfolder), to do that copy and paste it into Notepad, I don't know Linux so this is for Windows, and save the file as app_config.xml in the folder. Be sure Notepad does NOT append .txt on the end when it saves it, it has a habit of doing that, and then in the Boinc Manager go to Options, read config files. and Boinc should find it, check the Event Log under Tools to make sure and that it doesn't have an error other than 'app not found' if you do not have any nanhub workunits. The change will apply when you get new tasks from the Server NOT to any existing tasks.
ID: 614 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jim1348

Send message
Joined: 11 Jan 17
Posts: 98
Credit: 224,673
RAC: 9
Message 615 - Posted: 25 Aug 2020, 1:53:13 UTC - in response to Message 614.  

I usually limit it to a maximum of 8 work units at a time

How can I do it?

I use a simple version:

<app_config>
  <app>
  <name>boinc2docker</name>
  <max_concurrent>8</max_concurrent>
  </app>
</app_config>

It runs a maximum of 8 works units, with one CPU core per work unit.
You can add the other items as mikey suggests if you wish.
ID: 615 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
jdzukley

Send message
Joined: 20 Apr 19
Posts: 1
Credit: 36,650
RAC: 0
Message 616 - Posted: 25 Aug 2020, 17:44:51 UTC

This happens to me from time to time. It seams to always happen when I am doing a number of concurrent things on the computer, and not limited boinc to take into account my activities, AND disk usage is at capacity from the nanohub tasks in progress. Bottom line, the message makes sense, the VM did not start the next task within the allotted time because too much was going on - on the computer.

The fix is very simple. exit boinc, and then restart boinc. Since the tasks in execution have the disk drive at capacity (temporarily) I prefer to suspend all tasks, underway (started) and not started before exiting boinc. That way when you restart boinc, it starts without having any jobs in que. I start the task manager, and release tasks as disk capacity is available. It usually only takes a 1 to 2 minutes.

Finally, I set my overall configuration to keep it under 75% CPU capacity, as I have the system set up to automatically overdrive. So even with 74% capacity, the overall CPU capacity on the computer is often in the 95+% range. I limit the number of tasks running by using

Options
Computer Preferences
Computing
Usage limits: x% of CPU's.

Currently, I had this set at 60% and I still got this message. Checking disk usage was pegged at 100%. Meaning the current batch of tasks from nanohub were slaming my disk drive.

No big deal. Shut down boinc and restart it.
ID: 616 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Jim1348

Send message
Joined: 11 Jan 17
Posts: 98
Credit: 224,673
RAC: 9
Message 617 - Posted: 25 Aug 2020, 20:03:22 UTC - in response to Message 616.  

Currently, I had this set at 60% and I still got this message. Checking disk usage was pegged at 100%. Meaning the current batch of tasks from nanohub were slaming my disk drive..

Yes, nanoHUB does hammer the disk drive. I posted on it a long time ago. The write rate is high, but the work units are short. I don't know if it will damage the drive or not. But I use SSDs, which are fast enough to avoid this problem, but may need to be protected. In Linux, I increase the size of the built-in write buffer to around 4 GB in size and an hour write-delay. In Windows, I use the Samsung Rapid Mode cache (included in their Magician utility), or even better PrimoCache, which allows for a bigger write buffer (similar to the Linux values). I think the Crucial SSDs have a cache included in their utility too.
ID: 617 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote

Message boards : Number crunching : Message: VM VM Hypervisor failed to enter an online state in a timely fashion


©2021 COPYRIGHT 2017-2018 NCN