pip install charactr-api-sdk
Before you use the SDK, you have to initialize it first with your account's credentials. To get them, please login to the Gemelo Platform.
from charactr_api import CharactrAPISDK, Credentials
sdk = CharactrAPISDK(Credentials(client_key="xxx", api_key="yyy"))
tts_voices = sdk.tts.get_voices()
You can also browse the voices more conveniently in the Gemelo Platform.
result = sdk.tts.convert(tts_voices[0]["id"], "Hello world")
Simplex streaming allows you to send the text to the server and receive the converted audio response as a stream through WebSockets. It greatly reduces the latency compared to the traditional REST endpoint and it's not limited by the text's length.
To use it, call sdk.tts.start_simplex_stream
method:
def on_data(data: bytes) -> None:
# do anything with the incoming binary data
def on_close(code: int, message: str) -> None:
if code != 1000:
# stream was closed with an error
else:
# stream was ended successfully
stream = sdk.tts.start_simplex_stream(
voice_id, text, on_data=on_data, on_close=on_close
)
This more advanced technique allows you to start a two-way, WebSockets-based stream, where you can asynchronously send text and receive the converted audio. You control when the stream ends.
To use, it, call the sdk.tts.start_duplex_stream
method. You may pass callbacks to know when the stream ends/errors and to receive data:
start_duplex_stream(voice_id: int,
on_data: (bytes) -> None | None = None,
on_close: (int, str) -> None | None = None) -> TTSStreamDuplex
The method also returns a set of functions to control the stream:
def convert(text: str) -> None:
"""Convert text to speech and receive audio response as stream,
using the `on_data` callback."""
def wait() -> None:
"""Blocks execution until there was 5 seconds of stream inactivity."""
def terminate() -> None:
"""Ends the stream immediately. In most cases we advise to use `close()` instead."""
def close() -> None:
"""Requests the server to close the connection gracefully."""
Example usage:
def on_data(data: bytes) -> None:
# do anything with the incoming binary data
def on_close(code: int, message: str) -> None:
if code != 1000:
# the stream was closed because of an error
else:
# the stream ended successfully
stream = sdk.tts.start_duplex_stream(
voice_id, on_data=on_data, on_close=on_close
)
stream.convert(text1)
stream.convert(text2)
stream.close()
tts_voices = sdk.vc.get_voices()
You can also browse the voices more conveniently in the Gemelo Platform.
result = sdk.vc.convert(vc_voices[0]["id"], audio_input)
Voice cloning API allows you to create voice clones from audio, list all created clones, change their name and delete them, as well as using them with tts & vc endpoints.
The only notable difference is, that cloned_voice option needs to be set to True when doing the convert request:
{ "cloned_voice": True }
For more, see the relevant code example: Full Voice Cloning Example
Our SDK comes with a bunch of examples to help you start working with it. Please, refer to the SDK's README page if you wish to run them.