From ee6326c9d0ce9631b34e1133ce47aa51f5f4db25 Mon Sep 17 00:00:00 2001 From: Pavel Raiskup Date: Thu, 28 Nov 2024 07:51:35 +0100 Subject: [PATCH] doc: fixes during the F39 to F41 infrastructure update pt.2 --- doc/how_to_upgrade_persistent_instances.rst | 35 +++++++++++++++++++++ 1 file changed, 35 insertions(+) diff --git a/doc/how_to_upgrade_persistent_instances.rst b/doc/how_to_upgrade_persistent_instances.rst index 3d704fa40..9f7b2cd21 100644 --- a/doc/how_to_upgrade_persistent_instances.rst +++ b/doc/how_to_upgrade_persistent_instances.rst @@ -35,6 +35,33 @@ Announce the outage See a specific document :ref:`announcing_fedora_copr_outage`, namely the "planned" outage state. +Check the hot-fixes +------------------- + +The old set of instances (especially prod) has been running for quite some time, +likely accumulating several hotfixes over that period. Research the applied +hotfixes and determine which of them need to be manually implemented on the N+2 +boxes (if any, note them). + +First, check the `hot-fixed issues and PRs `_. +Then, check the file-system modifications:: + + # over ssh on the _old_ box, search for weird things (ignore config changes + # and /boot) + [root@copr-be-dev ~][STG]# rpm -Va | grep -v -e /etc/ -e /boot/ + ... + S.5....T. /var/www/cgi-resalloc + ... + S.5....T. /usr/lib/python3.12/site-packages/copr_backend/pulp.py + ... + +E.g., the ``/var/www/cgi-resalloc`` file is a weird change, but that in +particular is covered `in playbooks `_. +The ``pulp.py`` change is important to note though! You may consult the +``dnf diff copr-backend`` output, find the corresponding upstream PR on GitHub, +and tag the PR with ``hot-fixed`` label (if not already done). + + Preparation ----------- @@ -251,6 +278,10 @@ It's possible that the playbook fails, but it typically isn't crucial now. If provisioning at least reaches the end of the ``base`` role, revert the ``birthday=yes`` commit and proceed with the next steps. +The playbooks above have not automatically updated the systems. If you prefer +to start on Fedora N+2 with up-2-date set of packages, do the ``dnf update`` now +(manual step over ssh). + Get it working -------------- @@ -273,6 +304,10 @@ Post-upgrade By this point, every Copr service should be operational. +It's a good idea to test ``/usr/sbin/reboot`` now to debug potential boot issues +during the outage window, as future reboots are likely to occur at the most +inconvenient times. + Rename the instance names -------------------------