Multi Robot Task Allocation

Given 6 trucks and 52 cities, can you figure out which truck should go to which city and in what order, so that the team completes the deliveries while spending the least amount of fuel?

This is a very famous combinatorial optimization problem known as the Multi Agent Travelling Salesman Problem

Generally a heuristic is hand crafted to decide which truck can be sent to which city.

In this project we ask the question, "Can a neural network be used to learn a good heuristic instead?"

Problem statement

Given 3 Robots, 6 tasks and current allocation of robot to task, the network should suggest best possible allocation of robots in the next step such that the total distance travelled by the robots is minimised.

Problem Representation

The task and robot states are given as input to the network. Each task has a binary ID and the network outputs the ID of the task as an output.

Task State Representation

Sample Task Distributions

Robot State Representation

Network Architecture

Training - using Particle Swarm Optimization

Training data

The heuristic learned should be invariant to the distribution of the task locations. Thus the network is trained to minimise the total distance over a wide variety of task distributions. A batch of 1000 randomly distributed task scenarios was used for the same.

Cost Function

The sum of euclidean distances travelled by all the robots is taken as the cost function to be minimized.

Constraints

Already visited task shouldn't be revisited.
More than one robots should not be directed to the same location.
Tasks need to be completed within specific number if interations. For this case 2 iterations.

These events are counted and multiplied by scaling factors as costs to be added to the total cost function.

Optimization scheme

The output of the network does not directly give the cost function, thus a "look up" needs to be performed. This makes the function non differentiable and thus can not be trained by gradient descent based schemes.

Gradient free optimization scheme Particle Swarm Optimization is used instead.

PSO is applied with the weights of the network acting as the variables for optimization.

Hyperparameters

'Num_Networks': 50, 'num_iterations': 500, 'alpha': 0.8, 'BetaLocal': 2, 'BetaGlobal': 2, 'StagnationPenalty': 5, 'InternalIterations': 10

Weight clipping

The weights of the neural network are clipped to be within -1000 and 1000. This constricts the variable space and helps improve convergence.

Demo

Network performance before training.

BeforeTraining.mp4

Network performance after training

AfterTraining.mp4

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.vscode		.vscode
GenericPSO-BackpropHybrid		GenericPSO-BackpropHybrid
mTSP_NNPSO		mTSP_NNPSO
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi Robot Task Allocation

Problem statement

Problem Representation

Task State Representation

Sample Task Distributions

Robot State Representation

Network Architecture

Training - using Particle Swarm Optimization

Training data

Cost Function

Constraints

Optimization scheme

Hyperparameters

Weight clipping

Demo

Results

Limitations

About

Releases

Packages

Languages

SupreetKurdekar/NeuralPSO

Folders and files

Latest commit

History

Repository files navigation

Multi Robot Task Allocation

Problem statement

Problem Representation

Task State Representation

Sample Task Distributions

Robot State Representation

Network Architecture

Training - using Particle Swarm Optimization

Training data

Cost Function

Constraints

Optimization scheme

Hyperparameters

Weight clipping

Demo

Results

Limitations

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages