What audio formats does it output?

It supports MP3, Opus, AAC, FLAC, WAV, and PCM. MP3 is the default and works for most use cases. Choose Opus for low-latency streaming, or FLAC if you need lossless quality.

Does it require an OpenAI API key?

Yes. You need a valid OpenAI API key with access to the TTS endpoints. The skill handles the API interaction, but you must supply the key yourself, either through the `OPENAI_API_KEY` environment variable or the skill's config entry.

OpenAI TTS

pors·Jan 6, 2026

Media & Creative

Summary

TL;DR: Converts text to natural-sounding speech using OpenAI's TTS models, giving your AI agent a voice.

OpenAI TTS is an OpenClaw skill that provides text-to-speech via the OpenAI Audio Speech API.

Created by pors, this skill has been downloaded about 4.6k times on ClawHub. Install it with one command and your AI agent gains these capabilities right away.

Use cases

Add voice responses to a chatbot or virtual assistant application
Generate audio versions of written content like articles, documentation, or newsletters
Create accessible versions of text-heavy interfaces for visually impaired users
Produce voiceovers for presentations or demo videos without recording equipment

Installation

Run this command to install the skill on your OpenClaw agent:

npx clawhub@latest install openai-tts

Downloads: 4.6k

Updated: Jan 6, 2026

Security scan

VirusTotal: Benign

OpenClaw: Benign (high confidence)

This skill is a straightforward OpenAI text-to-speech helper that sends chosen text to OpenAI and writes the returned audio where the user asks.

SKILL.md

---
name: openai-tts
description: Text-to-speech via OpenAI Audio Speech API.
homepage: https://platform.openai.com/docs/guides/text-to-speech
metadata: {"clawdbot":{"emoji":"🔊","requires":{"bins":["curl"],"env":["OPENAI_API_KEY"]},"primaryEnv":"OPENAI_API_KEY"}}
---

# OpenAI TTS (curl)

Generate speech from text via OpenAI's `/v1/audio/speech` endpoint.

## Quick start

```bash
{baseDir}/scripts/speak.sh "Hello, world!"
{baseDir}/scripts/speak.sh "Hello, world!" --out /tmp/hello.mp3
```

Defaults:
- Model: `tts-1` (fast); `tts-1-hd` is available for higher quality
- Voice: `alloy` (neutral), also: `echo`, `fable`, `onyx`, `nova`, `shimmer`
- Format: `mp3`

## Voices

| Voice | Description |
|-------|-------------|
| alloy | Neutral, balanced |
| echo | Male, warm |
| fable | British, expressive |
| onyx | Deep, authoritative |
| nova | Female, friendly |
| shimmer | Female, soft |
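
To audition the voices side by side, a dry-run loop along these lines may help. It is only a sketch: `voice_sample_commands` and the `sample-<voice>.mp3` file names are hypothetical, and it prints the commands rather than running them, using only the `--voice` and `--out` flags documented for this skill.

```bash
# Hypothetical helper: print one speak.sh command per voice (dry run).
# The sample text is inserted as-is, so avoid double quotes in it.
voice_sample_commands() {
  for v in alloy echo fable onyx nova shimmer; do
    printf '{baseDir}/scripts/speak.sh "%s" --voice %s --out sample-%s.mp3\n' "$1" "$v" "$v"
  done
}

voice_sample_commands "The quick brown fox jumps over the lazy dog."
```

Review the printed commands, then run the ones you want to compare.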

## Flags

```bash
{baseDir}/scripts/speak.sh "Text" --voice nova --model tts-1-hd --out speech.mp3
{baseDir}/scripts/speak.sh "Text" --format opus --speed 1.2
```

Options:
- `--voice <name>`: alloy|echo|fable|onyx|nova|shimmer (default: alloy)
- `--model <name>`: tts-1|tts-1-hd (default: tts-1)
- `--format <fmt>`: mp3|opus|aac|flac|wav|pcm (default: mp3)
- `--speed <n>`: 0.25-4.0 (default: 1.0)
- `--out <path>`: output file (default: stdout or auto-named)
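
As a rough illustration of how these options could map onto a request to `/v1/audio/speech`, the sketch below builds the JSON body and sends it with curl. It is not taken from `speak.sh`: the helper names (`json_escape`, `build_payload`, `speak`) are hypothetical, and the field names (`model`, `input`, `voice`, `response_format`, `speed`) are assumed from OpenAI's public API reference.

```bash
# Hypothetical helper: escape backslashes and double quotes for a JSON string.
# Newlines and other control characters are not handled in this sketch.
json_escape() {
  printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g'
}

# Build the request body. Args: text [voice] [model] [format] [speed].
# Defaults mirror the Options list: alloy, tts-1, mp3, 1.0.
build_payload() {
  printf '{"model":"%s","input":"%s","voice":"%s","response_format":"%s","speed":%s}' \
    "${3:-tts-1}" "$(json_escape "$1")" "${2:-alloy}" "${4:-mp3}" "${5:-1.0}"
}

# Send the request and write the audio to a file.
# Args: text [output path]. Needs OPENAI_API_KEY and network access.
speak() {
  out=${2:-speech.mp3}
  curl -sS --fail https://api.openai.com/v1/audio/speech \
    -H "Authorization: Bearer ${OPENAI_API_KEY:?OPENAI_API_KEY is not set}" \
    -H "Content-Type: application/json" \
    -d "$(build_payload "$1")" \
    -o "$out"
}
```

Calling `build_payload "Text" nova tts-1-hd opus 1.2` prints the body without touching the network, which is handy for checking what would be sent before calling `speak`.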

## API key

Set the `OPENAI_API_KEY` environment variable, or configure the key in `~/.clawdbot/clawdbot.json`:

```json5
{
  skills: {
    entries: {
      "openai-tts": {
        apiKey: "sk-..."
      }
    }
  }
}
```
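
If you rely on the environment variable, a small pre-flight check can catch a missing key before any request is made. This is a sketch only: `require_api_key` is a hypothetical helper, and it does not read the JSON config shown above.

```bash
# Hypothetical helper: fail early when OPENAI_API_KEY is empty or unset.
require_api_key() {
  if [ -z "${OPENAI_API_KEY:-}" ]; then
    echo "OPENAI_API_KEY is not set" >&2
    return 1
  fi
}
```

A wrapper script could call `require_api_key || exit 1` before invoking the skill.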

## Pricing

- tts-1: ~$0.015 per 1K characters
- tts-1-hd: ~$0.030 per 1K characters

At these rates, a 200-character response costs roughly $0.003 with tts-1 and $0.006 with tts-1-hd.
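
To make the per-character rates concrete, a small estimator can be sketched as follows. `estimate_cost` is a hypothetical helper, and it assumes the approximate rates listed above, which may change.

```bash
# Hypothetical helper: estimate the cost in USD of synthesizing a text.
# Args: text [model]. Rates are the approximate per-1K-character figures above.
estimate_cost() {
  case "${2:-tts-1}" in
    tts-1)    rate=0.015 ;;
    tts-1-hd) rate=0.030 ;;
    *) echo "unknown model: $2" >&2; return 1 ;;
  esac
  # ${#1} is the length of the text (characters in the current locale).
  awk -v n="${#1}" -v r="$rate" 'BEGIN { printf "%.6f\n", n / 1000 * r }'
}

estimate_cost "Hello, world!"            # 13 characters at the tts-1 rate
estimate_cost "Hello, world!" tts-1-hd   # same text at the tts-1-hd rate
```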

Version history

v1.0.0 (Latest)

Jan 6, 2026

Initial release - text-to-speech via OpenAI API with 6 voice options

Frequently asked questions

What voices are available?

The skill exposes six OpenAI TTS voices: alloy, echo, fable, onyx, nova, and shimmer. Each has a distinct tone and character.

Skill info

Version: v1.0.0

Author: pors

Category: Media & Creative

Updated: Jan 6, 2026

Files

SKILL.md (1.6 KB)

Skill data sourced from ClawHub