Specific instructions to run a GCE cluster

NOTE: GCE VM lifecycle management has been deprecated in favor of Google Kubernetes Engine. Please refer to these instructions https://github.com/HSF/harvester/wiki/GKE-setup

Setup your PanDA queue in AGIS. You can take CERN-EXTENSION_GOOGLE_HARVESTER as example.
Setup the standard Harvester server following the general instructions in this Wiki
Install and configure the NGINX side service. The ports specified in the ngingx.conf template, need to be accessible from GCE.
Install the GCE python API following Google's documentation
Download the CernVM4 image for GCE and upload it to the image repository in GCE account. It's important to take version 4 (not version 3), since the contextualization mechanisms changed.
Create a service account under the "IAM & admin" tab in the GCE console. From our experience, the best Role is "Compute Admin", but roles evolve with time and you might find more suitable Roles. Download the service account json file and place it on the harvester server. Anyone with this json can start/stop VMs with your account, so keep the file safe.
Edit and place the user data file on the harvester machine
Be sure the GCE block in the harvester configuration is correct and all the files specified are in place.

[googlecloud]

# zone where you are booting up your VMs and storage, e.g. us-east1-b
zone = <the GCE zone your VMs will run on, e.g. us-east1-b>
# project defined in the google compute account, where the activity will be billed
project = <your GCE project ID>
# private service account json generated in the google cloud management console
service_account_file = /path/to/service_file.json
# file with the user data to send to CERN VM
user_data_file = /path/to/user_data.txt
# image to use
image = <URL to the CernVM4 image in your project>
# harvester frontend
harvester_frontend = your_server_name:nginx_listening_port

Add your queue configuration to the harvester queueconfig. One example for a cloud queue can be found here. Pay attention to the submitter parameters ram_per_core, disk_size and preemptible, which are optional and can be used to tune your VMs.

...
        "submitter": {
			"name": "GoogleSubmitter",
			"module": "pandaharvester.harvestersubmitter.cloud_google_submitter",
			"logBaseURL": "https://aipanda011.cern.ch:25443/cache/",
                        "ram_per_core": 4,
                        "disk_size": 100,
			"preemptible": true
		},
...

Home

Getting started
Installation and configuration
Testing and running
Debugging
Work with Middleware
Admin FAQ

Developer pages
Code structure
DB structure
DB-schema-changes
State and sequence diagrams
Plugin API specifications
Agents and Plugins descriptions
Plugin utilities
Workflows supported by harvester
Developer Q&A
Release

Development guides
Development workflow
Tagging

Production & commissioning
Scale up submission
Condor experiences
Commissioning on the grid
Production servers
Service monitoring
Auto Queue Configuration with CRIC
SSH+RPC middleware setup

Kubernetes section
Kubernetes setup
X509 credentials
AWS setup
GKE setup
CERN setup
CVMFS installation
Generic service accounts

Advanced payloads
Horovod integration

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Specific instructions to run a GCE cluster

Clone this wiki locally