Transcribe audio - /api/transcribe-audio
Audio File:
JWT Token:
JWT token must contain customerId and agentId fields with purpose 'audio'. Will be sent as Authorization header.
Language (optional):
ISO language code. If not specified, Whisper will auto-detect the language.
Prompt (optional):
A text to guide the model and provide context for the transcription.
Transcribe File
Extract text from any file - /api/file-to-text
Loading supported file types...
Document File:
JWT Token:
JWT token must contain customerId and agentId fields with purpose 'file'. Will be sent as Authorization header.
Extract Text
Extract text from PDF pages (image-based) - /api/pdf-page-to-text
PDF File:
Page Numbers:
Comma-separated page numbers to extract using vision analysis
Model:
Vision model (e.g., claude-sonnet-latest, claude-opus-latest, claude-opus-4-6, claude-opus-4-7, gpt-5.1)
Prompt (optional):
Override the default prompt. E.g., "Extract only the text" or "Summarize this page"
JWT Token:
JWT token must contain customerId and agentId fields with purpose 'file'. Will be sent as Authorization header.
Extract PDF Pages
Extract text from GCS file - /api/gcs-file-to-text
Loading supported file types...
JWT Token:
JWT token must contain filePath with purpose 'gcs'. Will be sent as Authorization header.
Extract Text from GCS
Extract geometry from CAD files - /api/gcs-to-geojson
JWT Token:
JWT token must contain filePath with purpose 'gcs'. Will be sent as Authorization header.
Extract Geometry
Extract text from URL - /api/url-to-text
Loading supported file types...
URL:
Direct URL to the file to extract text from
JWT Token:
JWT token must contain customerId and agentId fields with purpose 'url'. Will be sent as Authorization header.
Extract Text from URL