video-to-text

Rp. 100

Wavespeed-Ai Molmo2 Video-Understanding

Molmo2-4B Video Understanding: Analyze videos with specialized tasks (general, summary, analysis, counting, scene description). Open-source vision-language model with temporal understanding capabilities. Ready-to-use REST API, no cold starts, duration-based pricing.

Fields: 3

Required: 1

Prompt & Text

Enter the main instruction for the model to process.

string

Optional guidance or specific instructions for the understanding task (e.g., 'Focus on the people' or 'Count the number of cars').

Media

Upload the file directly in the field provided.

required, string

Input video URL for understanding. Supports common video formats (MP4, MOV, WebM). Maximum 2 minutes.

Choose a file from your device
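The constraints stated for the video input (MP4, MOV, or WebM, at most 2 minutes) can be checked on the client before submitting. The following is a minimal sketch, not part of the service: the function name `check_video_input` is hypothetical, the http/https requirement is an assumption, and the extension check assumes the format can be read from the URL path, which may not hold for every host. The page says "common video formats", so other formats might also be accepted.

```python
import os
from urllib.parse import urlparse

# Formats named on the page; the service may accept others (assumption: these three only).
SUPPORTED_EXTENSIONS = {".mp4", ".mov", ".webm"}
MAX_DURATION_SECONDS = 120  # "Maximum 2 minutes"


def check_video_input(url, duration_seconds=None):
    """Return a list of problems found; an empty list means the input looks acceptable."""
    problems = []
    parsed = urlparse(url)
    # Assumption: the video must be reachable over HTTP(S).
    if parsed.scheme not in ("http", "https"):
        problems.append("URL must start with http:// or https://")
    # Read the extension from the URL path, ignoring any query string.
    ext = os.path.splitext(parsed.path)[1].lower()
    if ext not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported extension: {ext or '(none)'}")
    # The duration is only checked when the caller already knows it.
    if duration_seconds is not None and duration_seconds > MAX_DURATION_SECONDS:
        problems.append("video is longer than 2 minutes")
    return problems
```

For example, `check_video_input("https://example.com/clip.mp4", 90)` returns an empty list, while a 150-second `.avi` file yields two problems.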

Configuration

Set the Task and other parameters according to the model schema.

string

Type of understanding task. General: overall understanding. Summary: brief overview. Analysis: detailed breakdown. Counting: count objects/actions. Scene_description: describe scenes in sequence.
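The three fields described above (an optional prompt, a required video URL, and a task type) can be assembled into a request body along these lines. This is a sketch under stated assumptions: the key names `video`, `task`, and `prompt`, the lowercase task values, and the default task `general` are guesses based on the field labels on this page, not a confirmed schema. Check the model's actual schema before use.

```python
# Task names taken from the field description; the exact spelling is an assumption.
TASKS = ("general", "summary", "analysis", "counting", "scene_description")


def build_request_body(video_url, task="general", prompt=None):
    """Build a request body as a dict. Key names are hypothetical."""
    if not video_url:
        raise ValueError("video URL is required")
    if task not in TASKS:
        raise ValueError(f"task must be one of {', '.join(TASKS)}")
    body = {"video": video_url, "task": task}
    # The prompt is optional guidance, so it is included only when provided.
    if prompt:
        body["prompt"] = prompt
    return body


body = build_request_body(
    "https://example.com/traffic.mp4",
    task="counting",
    prompt="Count the number of cars",
)
```

Validating the task locally avoids paying for a request that the service would reject.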

Summary

Final Price

Calculate the exact price before creating a transaction so you know the cost.

Model

Wavespeed-Ai Molmo2 Video-Understanding

Rp. 100

No price estimate yet

Fill in the inputs, then calculate the price before creating a payment.

Wavespeed Ai Molmo2 Video Understanding - Pay with QRIS