Elevenlabs Eleven-V3 Timing
ElevenLabs Eleven-V3 Timing converts text to natural speech and returns alignment metadata—character/word timestamps in JSON—for precise subtitles, karaoke effects, and lip-sync. Supports voice_id, similarity/stability, and optional Speaker Boost. Priced at $0.10 per 1,000 characters. Ready-to-use REST inference API, best performance, no cold starts, affordable pricing.
Field
5
Wajib
2
Prompt & Teks
Masukkan instruksi utama yang akan diproses model.
Text to convert to speech. Every character is 1 token. Maximum 10000 characters. Use <#x#> between words to control pause duration (0.01-99.99s).
Konfigurasi
Atur Voice Id, Similarity, Stability, Use Speaker Boost, dan parameter lain sesuai schema model.
The voice to use for speech generation
High enhancement boosts overall voice clarity and target speaker similarity. Very high values can cause artifacts, so adjusting this setting to find the optimal value is encouraged.
Voice stability (0-1) Default value: 0.5
This parameter supports English text normalization, which improves performance in number-reading scenarios.
Ringkasan
Harga Final
Hitung harga pasti sebelum membuat transaksi untuk mengetahui biaya yang dibutuhkan.
Model
Elevenlabs Eleven-V3 Timing
Rp. 2.000
Belum ada estimasi harga
Isi input, lalu hitung harga sebelum membuat payment.
