A Unified Approach to Virtual Machine Placement and Migration in the Cloud using Deep Reinforcement Learning
This repo contains the Gym environment for VM placement and migration. You can train or evaluate DQN and PPO agents for automatic VM placement and migration in this environment.
Experiment data are located in `data`. Plots are located in `plots`.
To create the environment for training on CPU:

```shell
conda env create --name vm --file=cpu.yml
```

To create the environment for training on GPU:

```shell
conda env create --name vm --file=gpu.yml
```
To see help:

```shell
python main.py -h
```
By default, the config file is `config/r2.yml`.
To test the random agent and save results in `results/random.json`:

```shell
python main.py -a random -e -o results/random.json
```
To test the DQN agent and save results in `results/dqn.json`:

```shell
python main.py -a dqn -e -o results/dqn.json
```
Inspect and update the experiment parallelisability settings in `exp_config.py` depending on your machine. If you have fewer than 8 cores, lower `cores`. If you have less than 40 GB of memory, reduce `multiruns`.
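As a rough sketch, the two knobs interact like this (variable names `cores` and `multiruns` come from the README; the rest is illustrative, not the repo's actual `exp_config.py`):

```python
import os

cores = 8       # worker processes the experiment runner may spawn;
                # lower this on machines with fewer CPU cores
multiruns = 5   # repeated runs held in memory concurrently;
                # reduce this if you have less than 40 GB of RAM

# A machine-aware guard: never request more workers than the host provides.
effective_cores = min(cores, os.cpu_count() or 1)
```
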
To run all experiments:

```shell
chmod +x run.sh
./run.sh
```
The experiment data are saved in `data`. When the experiments complete, draw the plots with `plots.ipynb`.
The available agents (selected with `-a`) are:

- ppo
- ppolstm
- dqn
- firstfit
- firstfitmd
- bestfit
- bestfitmd
- random

where "md" means multi-discrete action space.
- `exp_performance`: performance evaluation of the proposed approach against the baselines.
- `exp_reward`: evaluation of the reward functions.
- `exp_var`: evaluation of target variance.
- `exp_suspension`: evaluation of service length and system load.
- `exp_training`: the episodic returns.
- `exp_vm_size`: evaluation of VM size.
environment:
- pms: number of PMs
- vms: number of VMs
- var: target variance
- service_length: mean service length for VMs
- arrival_rate: VM arrival rate; 100% system load corresponds to pms / distribution expectation / service rate
- training_steps: number of steps in an episode in training
- eval_steps: number of steps in an episode during evaluation
- seed: integer to ensure reproducibility
- reward_function:
- reward 1 in the paper
- reward 2 in the paper
- reward 3 in the paper
- cap_target_util: cap the upper limit of target utilisation to 100%
- sequence:
- "uniform": VM size follows Unif(0.1,1)
- "lowuniform": VM size follows Unif(0.1,0.65)
- "highuniform": VM size follows Unif(0.25,1)