Upload an audio or video file for transcription. Synchronously-oriented endpoint that may return a completed result or pending status for longer jobs.
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Audio or video file to transcribe.
(Optional) Language hint (BCP-47 code). Defaults to 'auto'.
(Optional) Output formats to generate. Available: doc, pdf, txt, markdown, srt, vtt, csv, json. Defaults to ["json"] if not provided.
doc, pdf, txt, markdown, srt, vtt, csv, json (Optional) Webhook URL for async processing. If provided, the job runs asynchronously and results are sent to this URL when complete. Without callback, the endpoint processes synchronously.
(Optional) Enable speaker diarization.
(Optional) Team identifier. If not provided, uses your default workspace.
(Optional) Folder identifier. If not provided, file will be added to the root of the workspace.
(Optional) Output configuration settings.
Transcription response. Returns one of two response types:
Option 1 - Completed (status: "completed"): Returned when no callback URL is provided (synchronous processing). Response includes status, id, and data with all requested formats (defaults to json if not specified).
Option 2 - Pending (status: "pending"): Only returned when callback URL is provided (asynchronous processing). Response includes status, id, and poll_url to check progress. Results will be sent to the callback URL when complete.