Provide a remote audio or video URL for transcription. Synchronously-oriented endpoint that may return a completed result or pending status for longer jobs.
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Transcription request. Must provide audio_url for URL-based transcription.
Remote audio/video URL (social media or cloud storage).
(Optional) Language hint (BCP-47 code, e.g., 'en-US', 'es-ES'). Defaults to 'auto' for automatic detection.
(Optional) Output formats to generate. Available: doc, pdf, txt, markdown, srt, vtt, csv, json. Defaults to ["json"] if not provided.
doc, pdf, txt, markdown, srt, vtt, csv, json (Optional) Webhook URL for async processing. If provided, the job runs asynchronously and results are sent to this URL when complete. Without callback, the endpoint processes synchronously.
(Optional) Enable speaker diarization to identify different speakers.
(Optional) Team identifier. If not provided, uses your default workspace.
(Optional) Folder identifier. If not provided, file will be added to the root of the workspace.
(Optional) Configuration for output formatting.
Transcription response. Returns one of two response types:
Option 1 - Completed (status: "completed"): Returned when no callback URL is provided (synchronous processing). Response includes status, id, and data with all requested formats (defaults to json if not specified).
Option 2 - Pending (status: "pending"): Only returned when callback URL is provided (asynchronous processing). Response includes status, id, and poll_url to check progress. Results will be sent to the callback URL when complete.