Workflow for deploying changes to lab server #19

domenkozar · 2016-02-17T19:43:00Z

Here are the scenarios in which we'd like to handle lab server modifications:

deploying a change to all lab servers (as an admin)
deploying a change to all lab servers (via PR)
deploying a change to a subset of lab servers (as a temporary change for the duration of an experiment)

Proposal:

Once we have Hydra set up on a supporting server (see #8), it would poll snabblab-nixos for git changes and build the whole machine cluster. If the build is successful, the channel would update. Meanwhile, all lab servers would poll for the latest channel every 15 minutes and upgrade if a new channel is available.
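
On the lab server side, the polling step could be sketched with the stock NixOS `system.autoUpgrade` module. This is only an illustration: the channel URL is a placeholder and not the real Snabb channel, and the 15-minute schedule is taken from the proposal above.

```nix
{ config, pkgs, ... }:
{
  system.autoUpgrade = {
    enable = true;
    # Placeholder URL (assumption); it would point at the channel published by Hydra.
    channel = "https://hydra.example.org/channel/custom/snabblab";
    # systemd calendar expression: run at minutes 0, 15, 30 and 45 of every hour.
    dates = "*:0/15";
  };
}
```

With a setup like this, a server only changes state when the channel it tracks has advanced, which in the proposal happens only after a successful Hydra build.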

Pros

by building the whole machine we can be sure there are no Nix eval errors
changes can still be deployed outside the workflow, but the next channel update will reset that state
no need for shared space; git is the place where changes really happen

Cons

slight delay between the change and its deployment (usually it shouldn't take more than 30 minutes)

Questions

How can the change be tested locally before it's deployed?

By choosing a different NixOps backend, it could be deployed into VirtualBox or QEMU via libvirt.

Another option is to apply the change to only one server, test it, then move it to modules/lab-configuration.nix.
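
As a rough sketch of the first option, a NixOps network expression can target VirtualBox by setting `deployment.targetEnv`. The machine name `testlab`, the relative import path, and the memory size are assumptions made for illustration only.

```nix
{
  network.description = "snabblab local test (sketch)";

  # Hypothetical machine name.
  testlab = { config, pkgs, ... }: {
    # Reuse the shared lab configuration; the path is assumed relative to this file.
    imports = [ ./modules/lab-configuration.nix ];

    # Deploy into a local VirtualBox VM instead of a physical server.
    deployment.targetEnv = "virtualbox";
    deployment.virtualbox.memorySize = 2048; # in megabytes
  };
}
```

Setting `deployment.targetEnv` to `"libvirtd"` would select the libvirt backend instead.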

cc @lukego

domenkozar · 2016-03-17T14:33:13Z

Whatever we decide, we should have a boot test to avoid issues like NixOS/nixpkgs#12949

lukego · 2016-03-20T12:22:11Z

👍 sounds good!

domenkozar · 2016-04-18T09:33:33Z

I did some research over the weekend (I'm excited about this, since it will reduce the room for human error on my part during deployment).

Aszlig implemented this for his cluster and upstreamed the Hydra part. Here are the necessary parts:

domenkozar · 2016-05-03T23:02:05Z

A preliminary implementation is now in the customchannel branch and on the Snabb Hydra.

domenkozar · 2016-05-05T19:53:35Z

The prototype plan is to deploy build-{1,2,3,4} machines using this workflow, then gradually switch over one lugano server and then the rest of the lab.

The channel is being generated at https://hydra.snabb.co/eval/674#tabs-new

What's left to do for the prototype:

figure out how the Hetzner NixOps backend generates filesystem units
refactor deployment code to handle NixOps and channels
test a single machine rebuild & restart
be able to deploy an auto-upgradable channel using NixOps
merge the two NixOps deployments (eiger + lab)

For later on:

NixOS test
channel versioning (nixpkgs+custom)
monitor upgrade logs
document this workflow upstream

domenkozar mentioned this issue Mar 20, 2016

How to store secrets #25

Open

domenkozar mentioned this issue May 16, 2016

[WIP] self-upgrading snabblab built using Hydra #39

Closed

11 tasks
