Go framework for DL model inference and API deployment

Deep learning models are usually deployed in the cloud and the inference services are provided through APIs. This framework provides the basic architectural components for deploying APIs and achieves several goals:

The API processing module is decoupled from the model inference module to reduce the risk of network and computing blocking caused by high concurrency.
The API processing module and model inference module can be deployed as a distributed architecture, and can achieve horizontal expansion.
The framework implemented using Go language to achieve execution efficiency and simplify deployment and maintenance.
Custom logic is implemented using callback, hiding common logic. Developers only need to focus on custom logic

Other features:

Yaml is used for server-side configuration, which can be configured separately during distributed deployment.
API signature supports SHA256 and SM2 algorithms.
Model examples:

Name		Name	Last commit message	Last commit date
Latest commit History 87 Commits
cli		cli
config		config
doc		doc
examples		examples
helper		helper
http		http
server		server
types		types
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README_zh.md		README_zh.md
go.mod		go.mod
infer_test.go		infer_test.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Go framework for DL model inference and API deployment

Distributed deployment architecture

Development documentation

Star History

About

Releases

Packages

Languages

License

jack139/go-infer

Folders and files

Latest commit

History

Repository files navigation

Go framework for DL model inference and API deployment

Distributed deployment architecture

Development documentation

Star History

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages