Joined: 27 Sep 18
We are back up again. The BOINC server for this project is connected to several other key parts of the nanoHUB infrastructure. We had to upgrade the BOINC server in January to support the volume of WUs our crunchers consume. The WU's are created from nanoHUB input files by another server, and we had to do significant upgrades to that server. The database of in-progress jobs (via BOINC and other venues) was consuming all the memory on that machine, leading to swapping. That server is critical for all nanoHUB jobs, including courses at several universities, so the DB upgrade had to be thoroughly tested before rolling it out. We are tiptoeing back into the creation of WUs. TLDR: our crunchers consume jobs at a rate at least one order of magnitude higher than our other computing venues, so it stresses all the infrastructure. We're watching everything closely.
Joined: 26 Sep 18
Thanks for the update!
I received new batches this morning but although they are now completing I am getting ~40% "invalids". I will keep it running.
Thanks again for keeping us informed.
©2024 COPYRIGHT 2017-2018 NCN