Request for feedback
Author | Message |
---|---|
Joined: 27 Sep 18 Posts: 58 Credit: 0 RAC: 0 |
While examining the BOINC job records we noticed that some jobs are failing with the exit code EXIT_ABORTED_VIA_GUI. If you have aborted a nanoHUB@Home job in your client, what was the reason? In the same analysis we are working to improve the time and disk estimates for nanoHUB tools that produce WUs that fail often. |
Joined: 24 Sep 18 Posts: 5 Credit: 288,351 RAC: 0 |
Primarily the appalling rate of 'Validate error' status for results that look just like Valid results in the logs. |
Joined: 9 Nov 18 Posts: 4 Credit: 706,619 RAC: 0 |
Pretty sure in the past I have aborted a wu or two that showed 100% completed with a run time in the days. But not in the past few months, I've not closely monitored this project. |
Joined: 16 Jan 18 Posts: 23 Credit: 305,743 RAC: 0 |
https://boinc.nanohub.org/nanoHUB_at_home/result.php?resultid=827199 This says aborted by client but I sure did not abort it. Like 7 of those and 163 of the 'Validate error'. 170 out of 170 with errors. |
Joined: 2 Mar 19 Posts: 1 Credit: 33,234 RAC: 0 |
Job completed at 100% but continued to run until canceled at "timed out". BOINC listed "timed out" as a computational error. |
Joined: 26 Sep 18 Posts: 7 Credit: 9,829 RAC: 0 |
I like this project but I have not been running it primarily because of VBox. As I run a mix of projects, VBox does not "play well with others in the sandbox". It is terribly inefficient especially on the threadrippers. Even with 64GB memory wu's are often "waiting for memory". The only way I can get it to run consistently is to give the project an entire machine and I am not willing to do that. I would gladly support nanoHUB if it were to become more "cruncher friendly". Good luck. |
Joined: 8 Jan 19 Posts: 24 Credit: 2,501 RAC: 0 |
There certainly WERE lots of validate errors, but I've not had one recently. Certainly the error count I have on my results page here now is VASTLY larger than at any other project I have on here, in fact, it is probably larger than I have accumulated across all my projects in the last 20 years. I've had a few long runners as well, but not nearly as many. |
Joined: 18 Jan 18 Posts: 2 Credit: 441,272 RAC: 0 |
Dataman, What is your host OS? I know vbox plays a lot nicer on my Linux systems than it does on my Windows systems. The MacOS version is somewhere in-between the two. Specifically on the Windows systems, I believe it has to do with the games Windows plays with memory reservation and such. |
Joined: 26 Sep 18 Posts: 7 Credit: 9,829 RAC: 0 |
Dataman, Win10 & Ubuntu. Actually the last batches have run much better than the previous ones. |
Joined: 8 Jan 19 Posts: 24 Credit: 2,501 RAC: 0 |
Another load of work units arrived, and another load of errors and invalids, of various types. I have set "no new tasks"; the project needs more work to be done before it releases work again. |
Joined: 26 Sep 18 Posts: 7 Credit: 9,829 RAC: 0 |
Just my opinion but the last batch was much better for me. I received over 100 wu's with 11 errors and 4 I aborted because they would eventually error. It is annoying that some ran for 8 hours before the error but after all it is an alpha project and errors are expected. The credits suck!!! LOL It would help if there was more communication from the project about what is going on or my fellow crunchers will continue to lose interest in the project. Cheers |
Joined: 20 Jan 17 Posts: 1 Credit: 7,316 RAC: 0 |
177 wu OK, 2 wu error while computing, and 1 wu aborted (10+ hour runtime) so far, and 438 in progress :) |
Joined: 26 Sep 18 Posts: 7 Credit: 9,829 RAC: 0 |
The batches just released are 100% successful for me. Bravo! |
Joined: 8 Jan 19 Posts: 24 Credit: 2,501 RAC: 0 |
Sounds promising, I'll re-enable the project on this machine so I can watch it. |
Joined: 24 Sep 18 Posts: 5 Credit: 288,351 RAC: 0 |
Don't be fooled, there are still rubbish WUs out there that will run for hours before failing. If all you want is Wuprop hours go ahead, if you don't want to waste resources, electricity, time then you need to watch any machine running them and be ready to spot and abort the dross. |
Joined: 11 Jan 17 Posts: 99 Credit: 224,673 RAC: 0 |
I have had 200 valid and only 2 invalid in the last two days, with no long runners. It may depend on OS/CPU/VBox version, or whatever, but they are certainly not all bad. Also, make sure you have enough memory. They take almost 2 GB per work unit. People with hidden computers should not expect too much help though. |
Joined: 24 Sep 18 Posts: 5 Credit: 288,351 RAC: 0 |
"I have had 200 valid and only 2 invalid in the last two days, with no long runners." You seem to have got lucky, well done. Dataman is having to abort units, look at the errors: https://boinc.nanohub.org/nanoHUB_at_home/results.php?hostid=497 Here is an unknown user, again look at the errors: https://boinc.nanohub.org/nanoHUB_at_home/results.php?hostid=247 I'm running only 1 or 2 WUs at a time, the same OS/CPU/VBox and some work and some don't. If you're willing to watch and check what you're getting you can abort the bad ones. |
Joined: 25 Feb 18 Posts: 1 Credit: 89,273 RAC: 0 |
Many "exceeded elapsed time limit" errors. |
Joined: 26 Sep 18 Posts: 7 Credit: 9,829 RAC: 0 |
Yep, yesterday's wu's ran great. Today I have no invalids except for re-sends but I still have occasional ones that exceed the time limit. They are easy to spot by batch numbers and even easier to abort. ;) Some of the errors you see in my stats go way back to Dec. Sloppy housekeeping. LOL Cheers |
Joined: 6 Mar 18 Posts: 1 Credit: 60,163 RAC: 0 |
I was running vbox 6 w/out really paying attention. As of yesterday, I reverted to vbox 5.1.28 w/excellent results. I aborted the last few tasks to make the reversion. |
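
For anyone sizing a host around the "almost 2 GB per work unit" figure mentioned earlier in the thread, here is a rough sketch of the arithmetic. The function name, the 2 GB default, and the 4 GB held back for the OS and other projects are all assumptions for illustration, not project settings.

```python
def max_concurrent_tasks(total_ram_gb, per_task_gb=2.0, reserved_gb=4.0):
    """Estimate how many work units fit in RAM at once.

    per_task_gb: assumed footprint per work unit (thread reports "almost 2 GB").
    reserved_gb: assumed headroom for the OS and other BOINC projects.
    """
    usable_gb = max(total_ram_gb - reserved_gb, 0.0)
    return int(usable_gb // per_task_gb)


if __name__ == "__main__":
    for ram in (8, 16, 64):
        print(f"{ram} GB host: about {max_concurrent_tasks(ram)} tasks at once")
```

This only bounds memory; it does not account for the VirtualBox overhead or the mix of other projects that posters above describe.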
©2025 COPYRIGHT 2017-2018 NCN