Run neural networks on the NationalChip NPU processor.
The NPU is purpose-built for AIoT (artificial intelligence of things): it accelerates neural network computation and addresses the inefficiency of traditional chips at this workload.
The LEO series (GX8010, GX8009, GX8008) and the GRUS series (GX8002) include a high-performance, low-power NPU processor. The NPU contains submodules for matrix multiplication, convolution, general computation, data copy, and decompression.
The NPU compiler, `gxnpuc`, compiles a user-supplied TensorFlow model into instructions that the NPU hardware can execute. The compiler runs on Linux.
After the NPU instruction file has been generated, use the matching version of the NPU API to load the model, feed it input data, run it, and read back the output data.
This repository provides the NPU toolchain and API for the LEO and GRUS versions. The basic workflow is:
- Generate a TensorFlow frozen PB (a minimal freezing sketch follows this list).
- Edit the model configuration file and use `gxnpuc` to compile the PB file into an instruction file that the NPU can load and execute.
- Call the NPU API to load the model and input data, run the model, and get the output data.
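As a sketch of the first step, the following freezes a TF 1.x checkpoint into a PB file (the checkpoint paths and the output node name `output` are illustrative placeholders, not part of this toolchain):

```python
import tensorflow as tf

# Hedged sketch (TF 1.x API): restore a checkpoint and freeze it
# into a single PB. "model.ckpt*" and "output" are placeholders.
with tf.Session() as sess:
    saver = tf.train.import_meta_graph("model.ckpt.meta")
    saver.restore(sess, "model.ckpt")
    frozen = tf.graph_util.convert_variables_to_constants(
        sess, sess.graph_def, ["output"])  # names of the output nodes
    with tf.gfile.GFile("model.pb", "wb") as f:
        f.write(frozen.SerializeToString())
```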
```
pip install npu_compiler            # install the compiler
pip install --upgrade npu_compiler  # upgrade to the latest version
gxnpuc --version                    # check the installed version
```
`gxnpuc` compiles a TF model file into an NPU file that can run on the NPU.
```
usage: gxnpuc [-h] [-V] [-L] [-v] [-m] [-c CMD [CMD ...]] [-w]
              [config_filename]

NPU Compiler

positional arguments:
  config_filename       config file

optional arguments:
  -h, --help            show this help message and exit
  -V, --version         show program's version number and exit
  -L, --list            list supported ops
  -v, --verbose         verbosely list the processed ops
  -m, --meminfo         verbosely list memory info of ops
  -c CMD [CMD ...], --cmd CMD [CMD ...]
                        use command line configuration
  -w, --weights         print compressed weights (GRUS only)
```
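For example, compiling with a configuration file (the file name `config.yaml` is illustrative; the supported configuration items are listed in the tables below):

```
gxnpuc --list          # show the OPs supported by the compiler
gxnpuc -v config.yaml  # compile the model, verbosely listing processed ops
```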
Configuration file options for GRUS:

ITEM | Configuration | Description |
---|---|---|
CORENAME | GRUS | Chip name. |
PB_FILE | | TensorFlow frozen PB file. |
OUTPUT_FILE | | Output file name. |
NPU_UNIT | NPU32 | NPU MAC number. (Must be NPU32 on GRUS) |
COMPRESS | true / false | Whether to compress FC weights. |
CONV2D_COMPRESS | true / false | Whether to compress Conv2D weights. |
OUTPUT_TYPE | c_code | Output type. (Must be c_code on GRUS) |
INPUT_OPS | op_name: [shape] ... | Input OP names and shapes. |
OUTPUT_OPS | [out_op_names, ...] | Output OP names. |
FP16_OUT_OPS | [out_op_names, ...] | Output OPs in fp16 format. (Others use fp32) |
FUSE_BN | true / false | Whether to fuse batch normalization. |
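Putting these items together, a GRUS configuration might look like the sketch below (the YAML layout, file names, and the op names `Feats`/`Output` are assumptions for illustration; consult the toolchain documentation for the exact syntax):

```yaml
CORENAME: GRUS
PB_FILE: model.pb          # assumed path to the frozen PB
OUTPUT_FILE: model.h       # generated C code
NPU_UNIT: NPU32            # must be NPU32 on GRUS
COMPRESS: true             # compress FC weights
CONV2D_COMPRESS: false
OUTPUT_TYPE: c_code        # must be c_code on GRUS
FUSE_BN: true
INPUT_OPS:
    Feats: [1, 16]         # illustrative input op and shape
OUTPUT_OPS: [Output]
FP16_OUT_OPS: []
```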
Configuration file options for LEO:

ITEM | Configuration | Description |
---|---|---|
CORENAME | LEO | Chip name. |
PB_FILE | | TensorFlow frozen PB file. |
OUTPUT_FILE | | Output file name. |
NPU_UNIT | NPU32 / NPU64 | NPU device type. (NPU32: SNPU, NPU64: NPU) |
COMPRESS | true / false | Whether to compress weights. |
COMPRESS_QUANT_BITS | 4 / 5 / 6 / 7 / 8 | Quantization compression bits. |
COMPRESS_TYPE | LINEAR / GAUSSIAN | LINEAR is more accurate; GAUSSIAN gives a higher compression rate. |
INPUT_OPS | op_name: [shape] ... | Input OP names and shapes. |
OUTPUT_OPS | [out_op_names, ...] | Output OP names. |
INPUT_DATA | op_name: [data] ... | Fixed input data; specify when an input's data is constant. |
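Likewise, a LEO configuration sketch (again, the YAML layout and all names are assumptions for illustration):

```yaml
CORENAME: LEO
PB_FILE: model.pb          # assumed path to the frozen PB
OUTPUT_FILE: model.npu     # assumed output name
NPU_UNIT: NPU32            # SNPU; use NPU64 for the NPU device
COMPRESS: true
COMPRESS_QUANT_BITS: 8
COMPRESS_TYPE: LINEAR      # more accurate; GAUSSIAN compresses more
INPUT_OPS:
    Feats: [1, 16]         # illustrative input op and shape
OUTPUT_OPS: [Output]
# INPUT_DATA is only needed when an input's data is constant
```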
Q: What is the Python version supported by the NPU Compiler?
A: Python 2.7 and Python 3.6 are supported.
Q: What is the TensorFlow version supported by the NPU Compiler?
A: All versions above TensorFlow 1.10 are supported.
Q: How do I view the list of OPs supported by the NPU Compiler?
A: Use the command `gxnpuc --list`. For the GX8002 chip, see the GRUS_OPS list.
Q: How do I view the version of the NPU Compiler?
A: Use the command `gxnpuc --version`.
Q: "Some OP types are not supported when compiling the model!" What if this error is encountered while compiling the model?
A: Consult the engineers of NationalChip to see if they can add the printed unsupported OP.
Q: Is the dynamic RNN model supported?
A: No. The model structure needs to be modified and implemented with a for loop; a sketch follows below.
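As an illustration of that rewrite (cell size, shapes, and names are made up; the point is that the loop length is fixed at graph-construction time):

```python
import tensorflow as tf

# Hedged sketch (TF 1.x): replace tf.nn.dynamic_rnn with a loop
# unrolled over a fixed, compile-time-known number of steps.
TIME_STEPS = 16
cell = tf.nn.rnn_cell.GRUCell(64)
inputs = tf.placeholder(tf.float32, [1, TIME_STEPS, 40])
state = cell.zero_state(batch_size=1, dtype=tf.float32)
outputs = []
for t in range(TIME_STEPS):
    out, state = cell(inputs[:, t, :], state)  # one unrolled step
    outputs.append(out)
```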