Request for feedback

Nonexistent_Admin
Volunteer moderator
Project administrator

Joined: 27 Sep 18
Posts: 58
Credit: 0
RAC: 0
Message 204 - Posted: 21 Feb 2019, 14:56:33 UTC

While examining the BOINC job records we noticed that some jobs are failing with the exit code EXIT_ABORTED_VIA_GUI. If you have aborted a nanoHUB@Home job in your client, what was the reason?

In the same analysis we are working to improve the time and disk estimates for nanoHUB tools that produce WUs that fail often.
PDW

Joined: 24 Sep 18
Posts: 5
Credit: 288,351
RAC: 0
Message 206 - Posted: 24 Feb 2019, 8:47:23 UTC - in response to Message 204.  

Primarily the appalling rate of 'Validate error' status for results that look just like Valid results in the logs.
Paul

Joined: 9 Nov 18
Posts: 4
Credit: 706,619
RAC: 0
Message 207 - Posted: 26 Feb 2019, 17:12:29 UTC

Pretty sure in the past I have aborted a WU or two that showed 100% completed with a run time in the days. But not in the past few months; I've not closely monitored this project.
mmonnin

Joined: 16 Jan 18
Posts: 23
Credit: 305,743
RAC: 0
Message 209 - Posted: 27 Feb 2019, 23:25:26 UTC

https://boinc.nanohub.org/nanoHUB_at_home/result.php?resultid=827199

This says aborted by client but I sure did not abort it.

I have 7 of those and 163 of the 'Validate error' kind. That is 170 out of 170 with errors.
ghenj

Joined: 2 Mar 19
Posts: 1
Credit: 33,234
RAC: 0
Message 211 - Posted: 3 Mar 2019, 10:39:29 UTC - in response to Message 204.  

The job completed at 100% but continued to run until it was canceled as "timed out". BOINC listed "timed out" as a computation error.
Dataman
Joined: 26 Sep 18
Posts: 7
Credit: 9,829
RAC: 0
Message 212 - Posted: 4 Mar 2019, 18:16:30 UTC

I like this project but I have not been running it, primarily because of VBox. As I run a mix of projects, VBox does not "play well with others in the sandbox". It is terribly inefficient, especially on the Threadrippers. Even with 64 GB of memory, WUs are often "waiting for memory". The only way I can get it to run consistently is to give the project an entire machine, and I am not willing to do that. I would gladly support nanoHUB if it were to become more "cruncher friendly". Good luck.

adrianxw

Joined: 8 Jan 19
Posts: 24
Credit: 2,501
RAC: 0
Message 213 - Posted: 5 Mar 2019, 21:09:22 UTC - in response to Message 206.  
Last modified: 5 Mar 2019, 21:10:47 UTC

There certainly WERE lots of validate errors, but I've not had one recently. Certainly the error count I have on my results page here now is VASTLY larger than at any other project I have on here; in fact, it is probably larger than I have accumulated across all my projects in the last 20 years. I've had a few long runners as well, but not nearly as many.
smithnp

Joined: 18 Jan 18
Posts: 2
Credit: 441,272
RAC: 0
Message 217 - Posted: 15 Mar 2019, 11:55:05 UTC - in response to Message 212.  

Dataman,

What is your host OS? I know vbox plays a lot nicer on my Linux systems than it does on my Windows systems. The MacOS version is somewhere in-between the two.

Specifically on the Windows systems, I believe it has to do with the games Windows plays with memory reservation and such.
Dataman
Joined: 26 Sep 18
Posts: 7
Credit: 9,829
RAC: 0
Message 220 - Posted: 19 Mar 2019, 18:30:12 UTC - in response to Message 217.  

Win10 & Ubuntu. Actually the last batches have run much better than the previous ones.

adrianxw

Joined: 8 Jan 19
Posts: 24
Credit: 2,501
RAC: 0
Message 221 - Posted: 20 Mar 2019, 5:57:37 UTC

Another load of work units arrived, and another load of errors and invalids, of various types. I have set "no new tasks"; the project needs more work to be done before it releases work again.
Dataman
Joined: 26 Sep 18
Posts: 7
Credit: 9,829
RAC: 0
Message 222 - Posted: 20 Mar 2019, 13:21:38 UTC
Last modified: 20 Mar 2019, 13:37:48 UTC

Just my opinion, but the last batch was much better for me. I received over 100 WUs, with 11 errors and 4 I aborted because they would eventually error. It is annoying that some ran for 8 hours before the error, but after all it is an alpha project and errors are expected. The credits suck!!! LOL

It would help if there were more communication from the project about what is going on, or my fellow crunchers will continue to lose interest in the project.

Cheers

morgan

Joined: 20 Jan 17
Posts: 1
Credit: 7,316
RAC: 0
Message 226 - Posted: 21 Mar 2019, 21:28:40 UTC
Last modified: 21 Mar 2019, 21:31:21 UTC

177 WUs OK, 2 WUs with error while computing, and 1 WU aborted (10+ hour runtime) so far, and 438 in progress :)
Dataman
Joined: 26 Sep 18
Posts: 7
Credit: 9,829
RAC: 0
Message 227 - Posted: 21 Mar 2019, 22:53:30 UTC

The batches just released are 100% successful for me. Bravo!

adrianxw

Joined: 8 Jan 19
Posts: 24
Credit: 2,501
RAC: 0
Message 228 - Posted: 22 Mar 2019, 9:13:37 UTC

Sounds promising, I'll re-enable the project on this machine so I can watch it.
PDW

Joined: 24 Sep 18
Posts: 5
Credit: 288,351
RAC: 0
Message 229 - Posted: 22 Mar 2019, 17:42:35 UTC - in response to Message 228.  

Don't be fooled, there are still rubbish WUs out there that will run for hours before failing.
If all you want is WUProp hours, go ahead; if you don't want to waste resources, electricity, and time, then you need to watch any machine running them and be ready to spot and abort the dross.
Jim1348

Joined: 11 Jan 17
Posts: 99
Credit: 224,673
RAC: 0
Message 230 - Posted: 22 Mar 2019, 18:07:20 UTC
Last modified: 22 Mar 2019, 18:09:11 UTC

I have had 200 valid and only 2 invalid in the last two days, with no long runners.
It may depend on OS/CPU/VBox version, or whatever, but they are certainly not all bad.

Also, make sure you have enough memory. They take almost 2 GB per work unit.
People with hidden computers should not expect too much help though.
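
To make the memory point concrete, here is a minimal sketch that estimates how many work units could run side by side on a host. It assumes roughly 2 GB per work unit (the figure quoted above); the function name `max_concurrent_tasks` and the 4 GB reserve for the host OS and other projects are hypothetical choices for illustration, not project settings.

```python
import math


def max_concurrent_tasks(total_ram_gb: float,
                         per_task_gb: float = 2.0,
                         reserved_gb: float = 4.0) -> int:
    """Estimate how many work units fit in RAM at the same time.

    per_task_gb: assumed memory per work unit (about 2 GB, per the post above).
    reserved_gb: hypothetical amount held back for the host OS and other work.
    """
    if per_task_gb <= 0:
        raise ValueError("per_task_gb must be positive")
    usable_gb = total_ram_gb - reserved_gb
    # Never report a negative count when the reserve exceeds total RAM.
    return max(0, math.floor(usable_gb / per_task_gb))


if __name__ == "__main__":
    for ram_gb in (8, 16, 64):
        print(f"{ram_gb} GB host: up to {max_concurrent_tasks(ram_gb)} work units at once")
```

This is only a ceiling from memory alone; other projects running on the same host will lower the practical number.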
PDW

Joined: 24 Sep 18
Posts: 5
Credit: 288,351
RAC: 0
Message 231 - Posted: 22 Mar 2019, 18:19:01 UTC - in response to Message 230.  

You seem to have got lucky, well done.

Dataman is having to abort units, look at the errors: https://boinc.nanohub.org/nanoHUB_at_home/results.php?hostid=497

Here is an unknown user, again look at the errors: https://boinc.nanohub.org/nanoHUB_at_home/results.php?hostid=247

I'm running only 1 or 2 WUs at a time, with the same OS/CPU/VBox, and some work and some don't.
If you're willing to watch and check what you're getting, you can abort the bad ones.
Trotador

Joined: 25 Feb 18
Posts: 1
Credit: 89,273
RAC: 0
Message 232 - Posted: 22 Mar 2019, 19:12:11 UTC

Many "exceeded elapsed time limit" errors.
Dataman
Joined: 26 Sep 18
Posts: 7
Credit: 9,829
RAC: 0
Message 233 - Posted: 22 Mar 2019, 21:10:12 UTC
Last modified: 22 Mar 2019, 21:16:55 UTC

Yep, yesterday's WUs ran great. Today I have no invalids except for re-sends, but I still have occasional ones that exceed the time limit. They are easy to spot by batch numbers and even easier to abort. ;)
Some of the errors you see in my stats go way back to Dec. Sloppy housekeeping. LOL
Cheers
William Kahler

Joined: 6 Mar 18
Posts: 1
Credit: 60,163
RAC: 0
Message 234 - Posted: 22 Mar 2019, 22:10:47 UTC

I was running vbox 6 w/out really paying attention. As of yesterday, I reverted to vbox 5.1.28 w/ excellent results. I aborted the last few tasks to make the reversion.