bugfix: database breakage during update #511

twagner9 · 2024-07-12T15:35:13Z

Description

The corresponding PR to Issue #510.

Fixes #388

Edit: It took me awhile to find a suitable method for tracking this update via the GUI, and I landed on an interface as the best medium to avoid exposing any GUI code to the database, avoiding increasing compilation time and strange dependency issues.

I also had been underestimating the scale of this upgrade; for a single refinement package with ~1.2 million particles, if any classification and refinements have been run using it, it’s much more accurate to imagine the number of column changes on the scale of 100 million or more, into the billions – each CLASSIFICATION_RESULT_% and REFINEMENT_RESULT_% table has several columns that need to be changed for every row, which would be every particle in the package.

Because of this, upgrade times are substantially slower. I was able to use a project that was being used for 3D classification. It only contained REFINEMENT_RESULT_% tables (no CLASSIFICATION_RESULT_% tables), and refinement packages contained anywhere from 10,000 to 200,000+ particles, for a total of ~460 million row updates. The upgrade takes around 1h:30m to complete. I spoke with @timothygrant80 about this, and he said he thinks this is an uncommon enough case that this upgrade time is okay.

I have rebased my feature branch to be current with the master branch using to minimize conflicts and headaches

yes
no

Which compilers were tested

g++
icpc
clang
other (please specify)

These changes are isolated to the

gui
core library
gpu core library
program it modifies

How has the functionality been tested?

Please describe the tests that you ran to verify your changes. Please also note any relevant details for your test configuration.

Checklist:

I have not changed anything that did not need to be changed
I have performed a self-review of my own code
I have commented my code, (w.r.t. why), particularly in hard-to-understand areas
I have made corresponding changes to the documentation {Ok to pass for now}
My changes generate no new warnings
Any dependent changes have been merged and published in downstream modules

Previously, updating a database created from the beta cisTEM version would result in a crash at the outset as output_pixel_size was being improperly added to REFINEMENT_PACKAGE_CONTAINED_PARTICLES_ tables (it was out of order). Additionally, REFINEMENT_RESULTS_ and CLASSIFICATION_RESULTS_ have pixel_size, aberration, microscope voltage, and amplitude contrast updated as well, making all operations usable post-update.

Some columns still remained empty; updated to either give appropriate (already existent) value, or filled with 0.0.

Using an interface that MainFrame inherits from and Database is aware of, the schema update is now accurately tracked when upgrading a database originally created in cisTEM to the current devel via a OneSecondProgress dialog.

bHimes

I can't comment much on the logic of the implementation, but the implementation details themselves look good. I've pointed out a few small things that should be quick to address.

src/gui/MainFrame.cpp

src/gui/UpdateProgressTracker.h

-Made dedicated custom dialog for updating the database -Retained same dialog format when no schema changes are detected -Added Database::CopyDatabase for backing up the database -Functionalized the update tracking logic as it can now be used for Update only and Backup and Update

-Changed backup function name in database.cpp/h -Changed how failing backup is handled in MainFrame.cpp

bHimes · 2024-10-14T22:28:16Z

looks good

jojoelfe · 2024-10-15T13:29:26Z

This is great, you can close #388 with this.

twagner9 mentioned this pull request Aug 12, 2024

node.js/GLIBC mismatch: automatic GCC compilation fails #513

Closed

twagner9 added 3 commits October 3, 2024 10:21

Filling in more columns

661c4dd

Some columns still remained empty; updated to either give appropriate (already existent) value, or filled with 0.0.

gui: added a progress dialog for schema update

68ae856

Using an interface that MainFrame inherits from and Database is aware of, the schema update is now accurately tracked when upgrading a database originally created in cisTEM to the current devel via a OneSecondProgress dialog.

twagner9 force-pushed the output_pixel_size_db_update_fix branch from 8e73ce4 to 68ae856 Compare October 3, 2024 15:23

Update comments, refactor some variables

f80b4e8

twagner9 requested a review from bHimes October 9, 2024 14:41

bHimes requested changes Oct 9, 2024

View reviewed changes

src/gui/MainFrame.cpp Outdated Show resolved Hide resolved

src/gui/MainFrame.cpp Outdated Show resolved Hide resolved

src/gui/UpdateProgressTracker.h Outdated Show resolved Hide resolved

src/gui/UpdateProgressTracker.h Show resolved Hide resolved

twagner9 added 2 commits October 11, 2024 17:37

fix: variable names and comments, backup fail

4aa4595

-Changed backup function name in database.cpp/h -Changed how failing backup is handled in MainFrame.cpp

bHimes approved these changes Oct 14, 2024

View reviewed changes

twagner9 merged commit f151ba2 into master Oct 15, 2024
6 checks passed

twagner9 deleted the output_pixel_size_db_update_fix branch October 15, 2024 13:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bugfix: database breakage during update #511

bugfix: database breakage during update #511

twagner9 commented Jul 12, 2024 •

edited

Loading

bHimes left a comment

bHimes commented Oct 14, 2024

jojoelfe commented Oct 15, 2024

bugfix: database breakage during update #511

bugfix: database breakage during update #511

Conversation

twagner9 commented Jul 12, 2024 • edited Loading

Description

I have rebased my feature branch to be current with the master branch using to minimize conflicts and headaches

Which compilers were tested

These changes are isolated to the

How has the functionality been tested?

Checklist:

bHimes left a comment

Choose a reason for hiding this comment

bHimes commented Oct 14, 2024

jojoelfe commented Oct 15, 2024

twagner9 commented Jul 12, 2024 •

edited

Loading