We have developed DIPPM, a Deep Learning (DL) Inference Performance Predictive Model that predicts the inference latency, energy consumption, and memory usage of a given input DL model on the NVIDIA A100 GPU. We also devised an algorithm that suggests an appropriate A100 Multi-Instance GPU (MIG) profile from the DIPPM output.

For more details, see our Euro-Par 2023 paper: https://doi.org/10.1007/978-3-031-39698-4_1
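The MIG suggestion step can be pictured with a small sketch: pick the smallest A100 40 GB MIG profile whose memory slice can hold the predicted usage. The profile table below is the standard A100 40 GB MIG lineup; the `suggest_mig_profile` helper and its first-fit logic are illustrative assumptions, not the exact algorithm from the paper.

```python
# Hypothetical illustration of the MIG-suggestion idea: choose the smallest
# A100 40 GB MIG profile whose memory slice fits the predicted usage.
# The profile table is the standard A100 40 GB MIG lineup; the selection
# logic is a sketch, not the algorithm from the paper.
A100_MIG_PROFILES = [  # (profile name, memory in MB)
    ("1g.5gb", 5 * 1024),
    ("2g.10gb", 10 * 1024),
    ("3g.20gb", 20 * 1024),
    ("4g.20gb", 20 * 1024),
    ("7g.40gb", 40 * 1024),
]

def suggest_mig_profile(predicted_memory_mb: float) -> str:
    """Return the smallest MIG profile that fits the predicted memory."""
    for name, capacity_mb in A100_MIG_PROFILES:
        if predicted_memory_mb <= capacity_mb:
            return name
    return "7g.40gb"  # fall back to the full GPU
```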
# Installation

Prerequisites: CUDA 11.7 and Python 3.10 (the PyG wheels below are cp310 builds).

```bash
pip install torch==1.13.1 torchvision==0.14.1
pip install torch-geometric==2.2.0
pip install https://data.pyg.org/whl/torch-1.13.0%2Bcu117/torch_cluster-1.6.0%2Bpt113cu117-cp310-cp310-linux_x86_64.whl
pip install https://data.pyg.org/whl/torch-1.13.0%2Bcu117/torch_scatter-2.1.0%2Bpt113cu117-cp310-cp310-linux_x86_64.whl
pip install https://data.pyg.org/whl/torch-1.13.0%2Bcu117/torch_sparse-0.6.16%2Bpt113cu117-cp310-cp310-linux_x86_64.whl
pip install pytorch_lightning==1.9.0
pip install networkx apache-tvm
```
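After installing, a quick optional sanity check can confirm that the pinned versions and the CUDA 11.7 runtime are visible; the expected values in the comments follow from the pins above.

```python
# Quick environment sanity check: confirms the pinned versions installed
# above and that the CUDA 11.7 runtime is visible to PyTorch.
import torch
import torch_geometric

print("torch:", torch.__version__)                       # expect 1.13.1
print("torch_geometric:", torch_geometric.__version__)   # expect 2.2.0
print("CUDA available:", torch.cuda.is_available())
print("CUDA version:", torch.version.cuda)               # expect 11.7
```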
# Training

```bash
git clone https://github.com/karthickai/deeplearning_inference
cd deeplearning_inference
sh dataset.sh        # fetch the dataset
python train.py --model_type GraphSAGE --epoch 10
```
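For intuition about the `--model_type GraphSAGE` flag: DIPPM's predictor is a graph neural network over the input DL model's graph, and GraphSAGE is one of the selectable architectures. The snippet below is only a generic sketch of a GraphSAGE graph-level regressor in PyTorch Geometric; the layer widths and three-target head (memory, energy, latency) are assumptions, not the repository's actual `train.py` model.

```python
import torch
from torch_geometric.nn import SAGEConv, global_mean_pool

class SAGERegressor(torch.nn.Module):
    """Generic GraphSAGE regressor over a model graph (illustrative only)."""
    def __init__(self, in_dim, hidden=64, targets=3):  # memory, energy, latency
        super().__init__()
        self.conv1 = SAGEConv(in_dim, hidden)
        self.conv2 = SAGEConv(hidden, hidden)
        self.head = torch.nn.Linear(hidden, targets)

    def forward(self, x, edge_index, batch):
        x = self.conv1(x, edge_index).relu()
        x = self.conv2(x, edge_index).relu()
        x = global_mean_pool(x, batch)  # one embedding per graph
        return self.head(x)
```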
# Usage

```python
import dippm
import torchvision

model = torchvision.models.vgg16(weights=torchvision.models.VGG16_Weights.DEFAULT)  # replaces deprecated pretrained=True
model.eval()

# DIPPM currently supports only the NVIDIA A100 GPU
out = dippm.predict(model, batch=8, input="3,224,224", device="A100")  # input = C,H,W; VGG16 expects 3x224x224
print("Predicted Memory {0} MB, Energy {1} J, Latency {2} ms, MIG {3}".format(*out))
```
# Citation

```bibtex
@InProceedings{10.1007/978-3-031-39698-4_1,
  author    = {Panner Selvam, Karthick and Brorsson, Mats},
  title     = {DIPPM: A Deep Learning Inference Performance Predictive Model Using Graph Neural Networks},
  booktitle = {Euro-Par 2023: Parallel Processing},
  year      = {2023},
  publisher = {Springer Nature Switzerland},
  address   = {Cham},
  pages     = {3--16},
  isbn      = {978-3-031-39698-4}
}
```