Story Maker - Text To Speech OpenAI API

The Text To Speech OpenAI (TTS) API allows you to convert multiple text inputs into high-quality, natural-sounding speech. You can use this API to create engaging audio stories, generate voiceovers for multimedia content, or narrate interactive experiences.

Text To Speech

POST https://api.ttsopenai.com/uapi/v1/story-maker

This endpoint allows you to convert multiple text inputs into speech. You can customize the voice, speed, and model used for the conversion.

Example Request

curl -X POST https://api.ttsopenai.com/uapi/v1/story-maker \
  -H "Content-Type: application/json" \
  -H "x-api-key: <your api key>" \
  -d '{
        "name": "Name of the story",
        "blocks": [
          {
            "name": "Name of the block",
            "input": "Text to be converted into speech",
            "silence_before": 2,
            "voice_id": "OA001",
            "emotion": "neutral",
            "model": "tts-1",
            "speed": 1,
            "duration": 0
          }
        ]
      }'

Request Attributes

name string

The name of the story.

blocks array

An array of text blocks to be converted into speech.

blocks.name string

The name of the block.

blocks.input string

The text to be converted into speech.

blocks.silence_before integer

The duration of silence before the speech starts, in seconds.

blocks.voice_id string

The voice used for the conversion.

blocks.emotion string

The emotion of the speech. (Coming soon)

blocks.model string

The model used for the conversion.

blocks.speed float

The speed of the speech.

blocks.duration integer

The duration of the speech, in seconds.

Example Response

Response

{
  "success": true,
  "result": {
    "uuid": "cfc26b24-ae61-11ef-9913-9e4d64684f7d",
    "voice_id": "",
    "speed": 1,
    "model": "tts-1",
    "tts_input": "Name of the story",
    "estimated_credit": 0,
    "used_credit": 0,
    "status": 1,
    "status_percentage": 1,
    "error_message": "",
    "speaker_name": null,
    "created_at": "2024-11-29T14:54:19",
    "updated_at": "2024-11-29T14:54:19",
    "file_size": 0
  }
}

Response Attributes

success boolean

Indicates whether the request was successful.

result object

The result of the story-maker conversion.

result.uuid string

The unique identifier for the conversion.

result.voice_id string

The voice used for the conversion.

result.speed float

The speed of the speech.

result.model string

The model used for the conversion.

result.tts_input string

The text that was converted into speech.

result.estimated_credit integer

The estimated number of credits used for the conversion.

result.used_credit integer

The actual number of credits used for the conversion.

result.status integer

The status of the conversion. Possible values are:

1: Converting
2: Completed
3: Error
11: Reworking
12: Joining Audio
13: Merging Audio
14: Downloading Audio

result.status_percentage integer

The percentage of the conversion that has been completed.

result.error_message string

The error message, if any.

result.speaker_name string

The name of the speaker.

result.created_at string

The date and time when the conversion was created.

result.updated_at string

The date and time when the conversion was last updated.

result.file_size integer

The size of the audio file, in bytes.

Document

Generate lifelike speech from pdf, docx, pptx, and other document formats.

Emotion

Generate expressive speech with emotional context and custom vibes.