Skip to content

Latest commit

 

History

History
26 lines (19 loc) · 796 Bytes

README.md

File metadata and controls

26 lines (19 loc) · 796 Bytes

logo

Overview

Sukima is a ready-to-deploy container that implements a REST API for Language Models designed with the specific purpose of easy deployment and scalability.

Curent API Functions

  • models : Fetch a list of ready-to-use Language Models for inference.
  • load : Allocate a Language Model.
  • generate : Use a Language Model to generate tokens.
  • classify : Use a Language Model to classify tokens and retrieve scores.

To view more information for API Usage, see /docs endpoint.

Setup

Setup Guide

Usage Guide

Todo

  • Autoscaling
  • HTTPS Support
  • Rate Limiting
  • Support for other Language Modeling tasks such as Sentiment Analysis and Named Entity Recognition.

License

GPL-2.0