Skip to content

Lvhhhh/MQBench

 
 

Repository files navigation


Introduction

MQBench is an open-source model quantization toolkit based on PyTorch fx.

The envision of MQBench is to provide:

  • SOTA Algorithms. With MQBench, the hardware vendors and researchers can benefit from the latest research progress in academia.
  • Powerful Toolkits. With the toolkit, quantization node can be inserted to the original PyTorch module automatically with respect to the specific hardware. After training, the quantized model can be smoothly converted to the format that can inference on the real device.

Installation


git clone [email protected]:ModelTC/MQBench.git
cd MQBench
python setup.py install

Documentation

MQBench aims to support (1) various deployable quantization algorithms and (2) hardware backend libraries to facilitate the development of the community.

For the detailed information, please refer to mqbench documentation.

Citation

If you use this toolkit or benchmark in your research, please cite this project.

@article{MQBench,
  title   = {MQBench: Towards Reproducible and Deployable Model Quantization Benchmark},
  author  = {Yuhang Li* and Mingzhu Shen* and Jian Ma* and Yan Ren* and Mingxin Zhao* and
             Qi Zhang* and Ruihao Gong* and Fengwei Yu and Junjie Yan},
  journal= {Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks},
  year={2021}
}

License

This project is released under the Apache 2.0 license.

About

Model Quantization Benchmark

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 98.1%
  • Dockerfile 1.7%
  • Shell 0.2%