text-to-audio

Rp. 2.000

Elevenlabs Eleven-V3 Timing

ElevenLabs Eleven-V3 Timing converts text to natural speech and returns alignment metadata (character and word timestamps in JSON) for precise subtitles, karaoke effects, and lip-sync. It supports voice_id, similarity and stability settings, and optional Speaker Boost. Priced at $0.10 per 1,000 characters. Available as a ready-to-use REST inference API with no cold starts.
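As a rough illustration of how character-level alignment could be turned into word-level timings for subtitles, here is a minimal sketch. The response shape is not documented on this page, so the function takes three parallel lists (characters, start times, end times in seconds) as an assumption; the name `words_from_chars` is hypothetical, and the real response fields may be named and structured differently.

```python
def words_from_chars(chars, starts, ends):
    """Group per-character timings into (word, start, end) tuples.

    Assumes three parallel lists of equal length; whitespace characters
    separate words and are not included in any word's time span.
    """
    words = []
    current, w_start, w_end = [], None, None
    for ch, s, e in zip(chars, starts, ends):
        if ch.isspace():
            # A whitespace character closes the word collected so far.
            if current:
                words.append(("".join(current), w_start, w_end))
                current = []
            continue
        if not current:
            w_start = s  # first character of a new word
        current.append(ch)
        w_end = e  # keep extending the word's end time
    if current:
        words.append(("".join(current), w_start, w_end))
    return words


sample = words_from_chars(
    list("Hi you"),
    [0.0, 0.1, 0.2, 0.3, 0.4, 0.5],
    [0.1, 0.2, 0.3, 0.4, 0.5, 0.6],
)
```

Each resulting tuple can then be mapped to a subtitle cue or a karaoke highlight interval.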

Fields: 5

Required: 2

Prompt & Text

Enter the main instruction for the model to process.

required, string

Text to convert to speech. Each character counts as 1 token. Maximum 10,000 characters. Use <#x#> between words to control pause duration, where x is the pause in seconds (0.01-99.99).
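The pause syntax and length limit above can be sketched as a small helper. The helper names (`pause`, `join_with_pauses`) are hypothetical. Two details are assumptions the page does not state: the marker is surrounded by single spaces, and marker characters count toward the 10,000-character limit.

```python
MAX_CHARS = 10_000  # limit stated in the field description


def pause(seconds: float) -> str:
    """Return a pause marker such as <#0.50#> for 0.01-99.99 seconds."""
    if not 0.01 <= seconds <= 99.99:
        raise ValueError("pause duration must be between 0.01 and 99.99 seconds")
    return f"<#{seconds:.2f}#>"


def join_with_pauses(parts, seconds: float) -> str:
    """Join text fragments with the same pause marker between them."""
    # Assumption: one space on each side of the marker.
    text = f" {pause(seconds)} ".join(p.strip() for p in parts)
    # Assumption: markers count toward the character limit (conservative).
    if len(text) > MAX_CHARS:
        raise ValueError(f"text is {len(text)} characters; maximum is {MAX_CHARS}")
    return text


example = join_with_pauses(["Hello", "world"], 1.25)
```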

Configuration

Set Voice Id, Similarity, Stability, Use Speaker Boost, and other parameters according to the model schema.

required, string

The voice to use for speech generation.

number, range 0 - 1

High enhancement boosts overall voice clarity and target speaker similarity. Very high values can cause artifacts, so adjusting this setting to find the optimal value is encouraged.

number, range 0 - 1

Voice stability. Default value: 0.5.

boolean

This parameter supports English text normalization, which improves performance in number-reading scenarios.
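The field constraints described above (two required strings, two numbers in the range 0 to 1, one boolean) can be checked before a request is sent. The sketch below only builds and validates a request body. The JSON key names `text`, `voice_id`, `similarity`, `stability`, and `use_speaker_boost` are assumptions derived from the field labels on this page, and `build_payload` is a hypothetical helper, not part of any published client.

```python
MAX_CHARS = 10_000  # limit stated for the text field


def _check_unit_range(name, value):
    """Raise if a numeric setting falls outside the documented 0-1 range."""
    if not 0 <= value <= 1:
        raise ValueError(f"{name} must be between 0 and 1, got {value}")


def build_payload(text, voice_id, similarity=None, stability=0.5,
                  use_speaker_boost=None):
    """Validate inputs and return a dict ready to be serialized as JSON."""
    if not text:
        raise ValueError("text is required")
    if len(text) > MAX_CHARS:
        raise ValueError(f"text exceeds {MAX_CHARS} characters")
    if not voice_id:
        raise ValueError("voice_id is required")
    _check_unit_range("stability", stability)  # 0.5 is the stated default
    payload = {"text": text, "voice_id": voice_id, "stability": stability}
    # Optional fields are omitted when unset, since their defaults
    # are not stated on this page.
    if similarity is not None:
        _check_unit_range("similarity", similarity)
        payload["similarity"] = similarity
    if use_speaker_boost is not None:
        payload["use_speaker_boost"] = bool(use_speaker_boost)
    return payload


payload = build_payload("Hello", "voice-123", similarity=0.8)
```

Validating locally avoids paying for a request that the service would reject; the endpoint URL and authentication scheme are left out because this page does not state them.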

Summary

Final Price

Calculate the exact price before creating a transaction to find out the required cost.
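For a rough sense of cost before using the calculator, the per-character rate quoted in the model description ($0.10 per 1,000 characters, with each character counted as 1 token) can be applied directly. This is only a sketch: `estimate_usd` is a hypothetical helper, the relationship between that USD rate and the Rp. price shown on this page is not stated, and the platform's own calculation remains authoritative.

```python
from decimal import Decimal

# Rate taken from the model description; assumed to apply per character.
USD_PER_1000_CHARS = Decimal("0.10")


def estimate_usd(text: str) -> Decimal:
    """Estimate the cost in USD from the character count of the input text."""
    return Decimal(len(text)) * USD_PER_1000_CHARS / Decimal(1000)


estimate = estimate_usd("a" * 2500)
```

Decimal is used instead of float so that small per-character amounts do not accumulate rounding error.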

Model

Elevenlabs Eleven-V3 Timing

Rp. 2.000

No price estimate yet

Fill in the inputs, then calculate the price before creating a payment.

Elevenlabs Eleven V3 Timing - Pay with QRIS