Contents
- Centos 7, RockyLinux 8, Ubuntu 18.04, 20.04, 22.04, known to work.
- Hostname of Docker host must exist on the network. Must use bridge networking if Docker host is a Virtual Machine.
- User running
pavics-compose.sh
below must not beroot
but a regular user belonging to thedocker
group. - Install latest docker-ce and docker-compose for the chosen distro (not the version from the distro).
- Have a real SSL Certificate, self-signed SSL Certificate do not work properly. Let's Encrypt offers free SSL Certificate.
- If using Let's Encrypt, port 80 and 443 and hostname should be accessible publicly over the internet before requesting a certificate with Let's Encrypt. Let's Encrypt will need to access your hostname at port 80 and 443 in order to verify and provide the SSL certificate.
# Assuming Docker already installed, networking, hostname, firewall, open ports configured properly.
git clone https://github.com/bird-house/birdhouse-deploy.git
cd birdhouse-deploy/birdhouse
cp env.local.example env.local
$EDITOR env.local
# Set the following variables at the minimun:
#SSL_CERTIFICATE='/path/to/cert.pem'
#PAVICS_FQDN='<full qualified hostname of the current host>'
# Only needed if using LetsEncrypt SSL certificate
#SUPPORT_EMAIL='a real email to receivez LetsEncrypt renewal notification'
# Get the SSL Cert from LetsEncrypt, written to path of var SSL_CERTIFICATE.
FORCE_CERTBOT_E2E=1 FORCE_CERTBOT_E2E_NO_START_PROXY=1 deployment/certbotwrapper
# Start the full stack.
./pavics-compose.sh up -d
To run docker-compose
for PAVICS, the pavics-compose.sh (:download:`download </birdhouse/pavics-compose.sh>`) wrapper script must be used.
This script will source the env.local
file, apply the appropriate variable substitutions on all the configuration files
".template", and run docker-compose
with all the command line arguments given to pavics-compose.sh (:download:`download </birdhouse/pavics-compose.sh>`).
See env.local.example (:download:`download </birdhouse/env.local.example>`) for more details on what can go into the env.local
file.
If the file env.local is somewhere else, symlink it here, next to docker-compose.yml (:download:`download </birdhouse/docker-compose.yml>`) because many scripts assume this location.
To follow infrastructure-as-code, it is encouraged to source control the above env.local file and any override needed to customized this PAVICS deployment for your organization. For an example of possible override, see how the emu service (:download:`download </birdhouse/optional-components/emu/docker-compose-extra.yml>`) (README) can be optionally added to the deployment via the override mechanism. Ouranos specific override can be found in this birdhouse-deploy-ouranos repo.
Suggested deployment layout:
├── birdhouse-deploy/ # this repo │ ├── birdhouse/ │ │ ├── env.local # relative symlink to env.local.real below │ │ ├── pavics-compose.sh │ │ ├── (...) ├── private-config/ # your private config and override: sibling level of this repo │ ├── docker-compose-extra.yml │ ├── env.local.real │ ├── .git/
The automatic deployment is able to handle multiple repos, so will trigger if
this repo or your private-personalized-config repo changes, giving you
automated continuous deployment. See the continuous deployment setup section
below and the variable AUTODEPLOY_EXTRA_REPOS
in env.local.example (:download:`download </birdhouse/env.local.example>`).
The automatic deployment of the PAVICS platform, of the Jupyter tutorial notebooks and of the automatic deployment mechanism itself can all be enabled by following the scheduling instructions.
Resource usage monitoring (CPU, memory, ..) and alerting for the host and each of the containers can be enabled by following the monitoring instructions.
To launch all the containers, use the following command:
./pavics-compose.sh up -d
If you get a 'No applicable error code, please check error log'
error from the WPS processes, please make sure that the WPS databases exists in the
postgres instance. See create-wps-pgsql-databases.sh (:download:`download </birdhouse/scripts/create-wps-pgsql-databases.sh>`).
RAM: at least 128 GB, Thredds 32+ GB, Geoserver 8+ GB, leaving spaces for other components and all the various Jupyter users
CPU: at least 48 cores for parallel computations
Disk: at least 100 TB, depending how much data is hosted on Thredds and Geoserver and storage for the various Jupyter users
In general, the more users, the more cpu cores and memory needed. The more data, more memory and bigger and faster disks needed.
All WPS requests should be completed within
proxy_read_timeout
of the Nginx proxy, see nginx.conf (:download:`download <birdhouse/config/proxy/nginx.conf>`). Any WPS requests that will take longer should use the async mode.Default value
PROXY_READ_TIMEOUT_VALUE
in default.env (:download:`download <birdhouse/default.env>`).Overrideable in
env.local
file, as usual for all values indefault.env
file.
Use create-magpie-users (:download:`download </birdhouse/scripts/create-magpie-users>`) or follow manual instructions below.
config.yml
file if using create-magpie-users
:
users: - username: < value of JUPYTER_DEMO_USER in `env.local` > password: < you decide, at least 12 in length > email: < anything is fine > group: anonymous
Manual instructions:
- Go to
https://<PAVICS_FQDN>/magpie/ui/login
and login with theadmin
user. The password should be inenv.local
. - Then go to
https://<PAVICS_FQDN>/magpie/ui/users/add
. - Fill in:
- User name: <value of JUPYTER_DEMO_USER in
env.local
> - Email: < anything is fine >
- Password: < you decide >
- User group:
anonymous
- User name: <value of JUPYTER_DEMO_USER in
- Click "Add User".
An end-to-end integration test suite is available at https://github.com/Ouranosinc/PAVICS-e2e-workflow-tests with pre-configured Jenkins at https://github.com/Ouranosinc/jenkins-config.
For that test suite to pass, run the script scripts/bootstrap-instance-for-testsuite (:download:`download </birdhouse/scripts/bootstrap-instance-for-testsuite>`) to prepare your new instance. Further documentation inside the script.
Optional components
all-public-access
and secure-thredds
also need to be enabled in env.local
using EXTRA_CONF_DIRS
variable.
ESGF login is also needed for https://github.com/Ouranosinc/pavics-sdi/blob/master/docs/source/notebooks/esgf-dap.ipynb part of test suite. ESGF credentials can be given to Jenkins via https://github.com/Ouranosinc/jenkins-config/blob/aafaf6c33ea60faede2a32850604c07c901189e8/env.local.example#L11-L13
The canarie monitoring link
https://<PAVICS_FQDN>/canarie/node/service/stats
can be used to confirm the
instance is ready to run the automated end-to-end test suite. That link should
return the HTTP response code 200
.
Vagrant allows us to quickly spin up a VM to easily reproduce the runtime environment for testing or to have multiple flavors of PAVICS with slightly different combinations of the parts all running simultaneously in their respective VM, allowing us to see the differences in behavior.
See vagrant_variables.yml.example (:download:`download </vagrant_variables.yml.example>`) for what's configurable with Vagrant.
If using Centos box, follow disk-resize (:download:`download </birdhouse/vagrant-utils/disk-resize>`) after
first vagrant up
failure due to disk full. Then vagrant reload && vagrant
provision
to continue. If using Ubuntu box, no manual steps required,
everything just works.
Install VirtualBox, both the platform and the extension pack, and Vagrant.
One time setup:
# Clone this repo and checkout the desired branch. # Follow instructions and fill up infos in vagrant_variables.yml cd .. # to the folder having the Vagrantfile cp vagrant_variables.yml.example vagrant_variables.yml
Starting and managing the lifecycle of the VM:
# start everything, this is the only command needed to bring up the entire # PAVICS platform vagrant up # get bridged IP address vagrant ssh -c "ip addr show enp0s8|grep 'inet '" # get inside the VM # useful to manage the PAVICS platform as if Vagrant is not there # and use pavics-compose.sh as before # ex: cd /vagrant/birdhouse; ./pavics-compose.sh ps vagrant ssh # power-off VM vagrant halt # delete VM vagrant destroy # reload Vagrant config if vagrant_variables.yml or Vagrantfile changes vagrant reload # provision again (because all subsequent vagrant up won't provision again) # useful to test all provisioning scripts or to bring a VM at unknown state, # maybe because it was provisioned too long ago, to the latest state. # not needed normally during tight development loop vagrant provision
Core features of the platform has tests to prevent regressions.
To run the tests:
python3 -m pip install -r tests/requirements.txt
pytest tests/
Some tests require internet access (to access JSON schemas used to validate JSON structure). If you need to run tests offline, you can skip the tests that require internet access by using the -k 'not online' pytest option.
We are trying to follow the standard of semantic versioning.
The standard is for one application. Here we have a collection of several apps with different versions and we want to track which combination of versions works together. So we need a slight modification to the definition of the standard.
Given a version number MAJOR.MINOR.PATCH, increment the:
- MAJOR version when the API or user facing UI changes that requires significant documentation update and/or re-training of the users. Also valid when a big milestone has been reached (ex: DACCS is released).
- MINOR version when we add new components or update existing components that also require change to other existing components (ex: new Magpie that also force Twitcher and/or Frontend update) or the change to the existing component is a major one (ex: major refactoring of Twitcher, big merge with corresponding upstream component from birdhouse project).
- PATCH version when we update existing components without impact on other existing components and the change is a minor change for the existing component.
To help properly update versions in all files that could reference to the latest tag,
the bump2version utility is employed.
Running this tool will modify versions in files referencing to the latest revision
(as defined in .bumpversion.cfg) and apply change logs
updates by moving Unreleased
items under a new version matching the new version.
In order to handle auto-update of the releaseTime
value simultaneously to the
generated release version, the bump2version
call is wrapped in Makefile.
One of the following commands should be used to generate a new version.
# bump to a specific semantic version
make VERSION="<MAJOR>.<MINOR>.<PATCH>" bump
# bump the next semantic version automatically
make bump (major|minor|patch)
# test result without applying it
make VERSION="<MAJOR>.<MINOR>.<PATCH>" bump dry
To validate, you can look up the resulting version and release time that will be written to RELEASE.txt. The current version can also be requested using the following command.
make version
Once the version as been bumped and the PR is merged, a corresponding version tag should be added
to the commit generated by the merge. This step is intentionally manual instead of leaving it up
to bump2version
to auto-generate the tag in other to apply it directly on master
branch
(onto the merge commit itself), instead of onto the commits in the PR prior merging.
- Pull/merge latest
master
to make sure modifications are applied in CHANGES.md, in next step, are under the most recent "unreleased" section. - Update CHANGES.md, commit, push.
- Open a PR with the new content from CHANGES.md as the PR description. PR description can have more pertinent info, ex: test results, staging server location, other discussion topics, that might or might not be relevant in CHANGES.md. Use your judgement.
- Wait for a PR approval.
- Review PR description if something needs to be added or updated after the PR review process. The goal is for the PR description to capture all the essential informations for someone else not participating in the PR review process to understand it easily. This "someone else" might even be your future self trying to understand what was going through your mind when you opened this PR :)
- Only when you are ready to merge the PR immediately, you can continue with
the following steps to. Doing the following steps too early and you might
lose the "push race" if someone else is also trying to release at the same
time. Also, in the spirit of not losing the "push race", execute all these
steps together, do not take a break in the middle.
- Merge with
master
branch, if needed, so nextmake bump <major|minor|patch>
step will bump to the proper next version. Might need to review the places where CHANGES.md items were inserted following merge to make sure the new ones by this PR are under "unreleased". - Run
make bump <major|minor|patch>
with appropriate options, as described in "Tagging policy" section above. Push. - Merge this PR, copying the entire PR description into the merge commit description. This is so that the page https://github.com/bird-house/birdhouse-deploy/tags will contain relevant info nicely. That page was previously used as an ad-hoc changelog before CHANGES.md was formally introduced.
- Run
git tag
on the commit created the by merge, with the same tag asmake bump <major|minor|patch>
generated. - Run
git push --tags
to upload the new version.
- Merge with