Posts by mmonnin

1) Message boards : News : Running again (Message 719)
Posted 14 Jun 2021 by mmonnin
Post:
How much work is going to be created this round? There has been work for a couple of days straight, which is a fairly long period for this project.
2) Questions and Answers : Windows : Message: Postponed: VM job unmanageable, restarting later (Message 488)
Posted 6 Nov 2019 by mmonnin
Post:
There isn't. Here is a screenshot of my event log. I see that it found it, and something's missing. Should I have the config file in the nanohub file, or a folder up?


The log file said it found the file. Its in the right spot.

This is the content of mine and it works.

<app_config>
   <project_max_concurrent>10</project_max_concurrent>
   <report_results_immediately/>
</app_config>
3) Questions and Answers : Windows : Message: Postponed: VM job unmanageable, restarting later (Message 486)
Posted 5 Nov 2019 by mmonnin
Post:
So, I have a .xml file with just,

<app_config>
<project_max_concurrent>3</project_max_concurrent>
</app_config>

in it, and I'm still having the same issue, despite restarts and using the read config option in BOINC. I'm confident that the .xml file is actually an .xml file, since I went to codebeautify.org to enter in the above code, add in the necessary syntax, and download it as an XML file. It is in C:\ProgramData\BOINC\projects\boinc.nanohub.org_nanoHUB_at_home, and I'm pretty sure that's the right location. Can anyone suggest what I'm doing wrong?


Did you re-read the config files to tell BOINC to look at the file?
After doing that check the Event log. There will be an error or a line saying BOINC found the app_config.xml file

87331 nanoHUB_at_home 11/5/2019 6:24:39 PM Found app_config.xml
4) Message boards : News : Request for feedback (Message 429)
Posted 20 Sep 2019 by mmonnin
Post:
The tasks only run for several minutes on a CPU, in a VM. A GPU version is not the solution to every compute problem. You're a person with only a hammer as a tool and only see nails. Stop with the GPU suggestions.
5) Message boards : Number crunching : Validate error (Message 425)
Posted 15 Sep 2019 by mmonnin
Post:
I just abort tasks that run for more than ~15min. Some run and complete with longer run times than that but most time out.
6) Message boards : News : Request for feedback (Message 424)
Posted 15 Sep 2019 by mmonnin
Post:
Would it be possible to know something about the project we are currently working on? That would be very interesting and also provide an incentive to keep working on the project.
Thanks!


https://www.itap.purdue.edu/newsroom/190903_nanoHUB.html

https://nanohub.org/
7) Questions and Answers : Unix/Linux : How to preview headless VirtualBox machine (Message 417)
Posted 7 Sep 2019 by mmonnin
Post:
I haven't done it before but have heard the extension pack is needed to view the VM.
8) Message boards : Number crunching : Validate error (Message 412)
Posted 23 Aug 2019 by mmonnin
Post:
My PCs have several versions of vbox. Only one has validate errors but not every single task. At 2gb per task that's only enough to run maybe 1 depending on other uses.
9) Message boards : Number crunching : Validate error (Message 401)
Posted 20 Aug 2019 by mmonnin
Post:
Tasks use 1.8GB each. At a min we need 2GB per task or limit the # of tasks
10) Message boards : Number crunching : Message: VM VM Hypervisor failed to enter an online state in a timely fashion (Message 400)
Posted 20 Aug 2019 by mmonnin
Post:
I'm seeing more and more of these along with ones that run for so long and abort. Task conditions have changed the past couple of days.
11) Message boards : Number crunching : Validate error (Message 393)
Posted 19 Aug 2019 by mmonnin
Post:
Try to stick to VBox version supplied with Boinc installation file and limit amount of tasks running consecutively.
nanoHUB is real memory hogger.


Running out of memory would cause errors, not invalids.
12) Message boards : Number crunching : Validate error (Message 390)
Posted 18 Aug 2019 by mmonnin
Post:
Hello,

about 90% of my tasks are with "Validate error" mark and only few are validated as correct.

Where is the problem?

https://boinc.nanohub.org/nanoHUB_at_home/results.php?userid=50955


1 - That is a private link only you can see.
2 - You're nowhere close to 90%. 754/1286 is 58%. Still pretty bad.
3 - 3 of my 4 hosts have zero invalids. The 4th has 18% invalid rate.

All of mine are on Linux. 2x AMD Zen PCs, a 3570k and the 2P 2670v1 with Invalids. The 2P is the only one on Mint 17 while the others are on Ubuntu 18. It also has the most memory at 128GB so it won't ever run out.

You're on Win10 and gemini is on some Linux version. I don't really see a connection between OS/hardware. Your VBox version is 5.2.8, gemini8's is 5.1.12r112440 and my good ones are 5.2.26 or 6.0. The bad one is 5.0.2.
13) Message boards : Number crunching : Are failed WUs processed again (Message 375)
Posted 15 Jul 2019 by mmonnin
Post:
I've begun aborting anything that is not _0. I won't help if I'm the initial user but most _1's I've had timeout. An admin and fix the issue if the don't want this type of behavior.
14) Questions and Answers : Unix/Linux : EXIT_TIME_LIMIT_EXCEEDED still present (Message 370)
Posted 29 Jun 2019 by mmonnin
Post:
The only thing we can do (have done, will do) is increase the time limit for all WUs produced by the nanoHUB tools that have a lot of failures with the same error. For example, yesterday two tools produced most of the time limit errors (1% of all WUs), so we increased the time limit for all WUs produced by those tools.


Why not just set it to the same as the deadline and be done with it? Why is there an artificial limit that wastes resources?
https://boinc.nanohub.org/nanoHUB_at_home/workunit.php?wuid=2369266
15) Message boards : Number crunching : Invalid rate. (Message 244)
Posted 31 Mar 2019 by mmonnin
Post:
Every task for me aborted after 105 minutes from the last batch. No reported CPU usage.
16) Message boards : News : Request for feedback (Message 209)
Posted 27 Feb 2019 by mmonnin
Post:
https://boinc.nanohub.org/nanoHUB_at_home/result.php?resultid=827199

This says aborted by client but I sure did not abort it.

Like 7 of those and 163 of the 'Validate error'. 170 out of 170 with errors.
17) Message boards : Number crunching : Recently, the WU spend more and more time/memory (Message 199)
Posted 14 Feb 2019 by mmonnin
Post:
Detach the project.
18) Message boards : Number crunching : Recently, the WU spend more and more time/memory (Message 197)
Posted 12 Feb 2019 by mmonnin
Post:
Many projects give that message but it can be ignored.
19) Message boards : Number crunching : Very short return time (Message 140)
Posted 17 Dec 2018 by mmonnin
Post:
There are plenty of options via BOINC Manager and app_config.xml files to limit the # of tasks.
20) Message boards : Number crunching : Huge number of downloads (Message 139)
Posted 17 Dec 2018 by mmonnin
Post:
ETAs are typically junk until tasks are completed.


Next 20


©2024 COPYRIGHT 2017-2018 NCN