
Ai_Cf_Openai_Whisper_Large_V3_Turbo_Output

Defined in: .pnpm-home/store/v11/links/@cloudflare/workers-types/4.20251118.0/2891147a4db443bf13ca098dcb7381fb6bac5793758b7fd187ebaf5bc0deb969/node_modules/@cloudflare/workers-types/index.ts:4315

optional segments: object[]

Defined in: .pnpm-home/store/v11/links/@cloudflare/workers-types/4.20251118.0/2891147a4db443bf13ca098dcb7381fb6bac5793758b7fd187ebaf5bc0deb969/node_modules/@cloudflare/workers-types/index.ts:4342

optional avg_logprob: number

The average log probability of the predictions for the words in this segment, indicating overall confidence.

optional compression_ratio: number

The compression ratio of the segment's transcribed text, a measure of how repetitive the output is. Unusually high values often indicate that the model has produced repeated text.

optional end: number

The ending time of the segment within the audio, in seconds.

optional no_speech_prob: number

The probability that the segment contains no speech, represented as a decimal between 0 and 1.

optional start: number

The starting time of the segment within the audio, in seconds.

optional temperature: number

The temperature used in the decoding process, controlling randomness in predictions. Lower values result in more deterministic outputs.

optional text: string

The transcription of the segment.

optional words: object[]
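Because every segment field is optional, code that reads them needs explicit fallbacks. The sketch below filters out segments that look unreliable, using `avg_logprob` and `no_speech_prob`. The `WhisperSegment` interface, the `confidentSegments` helper, and both thresholds are illustrative assumptions and are not part of `@cloudflare/workers-types`; tune the thresholds for your own audio.

```typescript
// Local structural type mirroring a subset of the documented segment fields.
// Hypothetical name: the generated types describe segments as object[].
interface WhisperSegment {
  start?: number;
  end?: number;
  text?: string;
  avg_logprob?: number;
  no_speech_prob?: number;
}

// Assumed thresholds, chosen for illustration only.
const MIN_AVG_LOGPROB = -1.0;
const MAX_NO_SPEECH_PROB = 0.6;

// Keeps segments whose confidence fields pass both thresholds.
// A missing field is treated as "unknown" and does not exclude the segment.
function confidentSegments(segments: WhisperSegment[] = []): WhisperSegment[] {
  return segments.filter((segment) => {
    const logprobOk =
      segment.avg_logprob === undefined || segment.avg_logprob >= MIN_AVG_LOGPROB;
    const speechOk =
      segment.no_speech_prob === undefined ||
      segment.no_speech_prob <= MAX_NO_SPEECH_PROB;
    return logprobOk && speechOk;
  });
}
```

Passing `output.segments` directly works even when the property is absent, since the parameter defaults to an empty array.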


text: string

Defined in: .pnpm-home/store/v11/links/@cloudflare/workers-types/4.20251118.0/2891147a4db443bf13ca098dcb7381fb6bac5793758b7fd187ebaf5bc0deb969/node_modules/@cloudflare/workers-types/index.ts:4337

The complete transcription of the audio.


optional transcription_info: object

Defined in: .pnpm-home/store/v11/links/@cloudflare/workers-types/4.20251118.0/2891147a4db443bf13ca098dcb7381fb6bac5793758b7fd187ebaf5bc0deb969/node_modules/@cloudflare/workers-types/index.ts:4316

optional duration: number

The total duration of the original audio file, in seconds.

optional duration_after_vad: number

The duration of the audio after applying Voice Activity Detection (VAD) to remove silent or irrelevant sections, in seconds.

optional language: string

The language of the audio being transcribed or translated.

optional language_probability: number

The probability that the detected language is correct, represented as a decimal between 0 and 1.
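As an illustration of how `duration` and `duration_after_vad` relate, the following sketch computes the fraction of the audio that remained after VAD. The `TranscriptionInfo` interface and the `speechRatio` helper are hypothetical names introduced here, not exports of the types package.

```typescript
// Local structural type mirroring the documented transcription_info fields.
interface TranscriptionInfo {
  language?: string;
  language_probability?: number;
  duration?: number;
  duration_after_vad?: number;
}

// Returns the share of the original audio kept after VAD (0 to 1),
// or undefined when either duration is missing or the total is not positive.
function speechRatio(info?: TranscriptionInfo): number | undefined {
  if (
    info === undefined ||
    info.duration === undefined ||
    info.duration_after_vad === undefined ||
    info.duration <= 0
  ) {
    return undefined;
  }
  return info.duration_after_vad / info.duration;
}
```

A low ratio suggests that most of the input was silence or non-speech, which may be worth checking before trusting a short transcription.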


optional vtt: string

Defined in: .pnpm-home/store/v11/links/@cloudflare/workers-types/4.20251118.0/2891147a4db443bf13ca098dcb7381fb6bac5793758b7fd187ebaf5bc0deb969/node_modules/@cloudflare/workers-types/index.ts:4389

The transcription in WebVTT format, which includes timing and text information for use in subtitles.
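Since `vtt` is optional, a caller may want a fallback. The sketch below returns `vtt` when it is present and otherwise builds a minimal WebVTT document from the segment `start`, `end`, and `text` fields. `toVttTimestamp`, `vttFromOutput`, and the local types are assumptions made for this example; the fallback output is a simple approximation and may not match what the model would have returned.

```typescript
// Local structural types mirroring the documented fields used here.
interface CueSegment {
  start?: number;
  end?: number;
  text?: string;
}

interface VttSource {
  vtt?: string;
  segments?: CueSegment[];
}

// Formats a time in seconds as a WebVTT timestamp (HH:MM:SS.mmm).
function toVttTimestamp(seconds: number): string {
  const totalMs = Math.round(seconds * 1000);
  const ms = totalMs % 1000;
  const totalSec = Math.floor(totalMs / 1000);
  const s = totalSec % 60;
  const m = Math.floor(totalSec / 60) % 60;
  const h = Math.floor(totalSec / 3600);
  const pad = (n: number, width: number): string => String(n).padStart(width, "0");
  return `${pad(h, 2)}:${pad(m, 2)}:${pad(s, 2)}.${pad(ms, 3)}`;
}

// Prefers the model-provided vtt; otherwise builds cues from complete segments.
function vttFromOutput(output: VttSource): string {
  if (output.vtt) {
    return output.vtt;
  }
  const blocks: string[] = ["WEBVTT"];
  for (const segment of output.segments ?? []) {
    const { start, end, text } = segment;
    // Skip segments that lack timing or text, since all fields are optional.
    if (start === undefined || end === undefined || !text) {
      continue;
    }
    blocks.push(`${toVttTimestamp(start)} --> ${toVttTimestamp(end)}\n${text.trim()}`);
  }
  return blocks.join("\n\n") + "\n";
}
```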


optional word_count: number

Defined in: .pnpm-home/store/v11/links/@cloudflare/workers-types/4.20251118.0/2891147a4db443bf13ca098dcb7381fb6bac5793758b7fd187ebaf5bc0deb969/node_modules/@cloudflare/workers-types/index.ts:4341

The total number of words in the transcription.
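`word_count` is optional while `text` is always present, so a count can be approximated when the field is missing. This is a minimal sketch: the `wordCount` helper and the `CountSource` type are hypothetical, and the whitespace-based fallback is an assumption that may not match how the model counts words, particularly for languages that do not separate words with spaces.

```typescript
// Local structural type mirroring the two documented fields used here.
interface CountSource {
  text: string;
  word_count?: number;
}

// Returns the reported word_count, or a whitespace-based estimate from text.
function wordCount(output: CountSource): number {
  if (output.word_count !== undefined) {
    return output.word_count;
  }
  const trimmed = output.text.trim();
  return trimmed === "" ? 0 : trimmed.split(/\s+/).length;
}
```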