Ai_Cf_Openai_Whisper_Large_V3_Turbo_Output

Defined in: .pnpm-home/store/v11/links/@cloudflare/workers-types/4.20251118.0/2891147a4db443bf13ca098dcb7381fb6bac5793758b7fd187ebaf5bc0deb969/node_modules/@cloudflare/workers-types/index.ts:4315

Properties

segments?

optional segments: object[]

Defined in: .pnpm-home/store/v11/links/@cloudflare/workers-types/4.20251118.0/2891147a4db443bf13ca098dcb7381fb6bac5793758b7fd187ebaf5bc0deb969/node_modules/@cloudflare/workers-types/index.ts:4342

avg_logprob?

optional avg_logprob: number

The average log probability of the predictions for the words in this segment, indicating overall confidence.

compression_ratio?

optional compression_ratio: number

The compression ratio of the input to the output, measuring how much the text was compressed during the transcription process.

end?

optional end: number

The ending time of the segment within the audio, in seconds.

no_speech_prob?

optional no_speech_prob: number

The probability that the segment contains no speech, represented as a decimal between 0 and 1.

start?

optional start: number

The starting time of the segment within the audio, in seconds.

temperature?

optional temperature: number

The temperature used in the decoding process, controlling randomness in predictions. Lower values result in more deterministic outputs.

text?

optional text: string

The transcription of the segment.

words?

optional words: object[]

text

text: string

Defined in: .pnpm-home/store/v11/links/@cloudflare/workers-types/4.20251118.0/2891147a4db443bf13ca098dcb7381fb6bac5793758b7fd187ebaf5bc0deb969/node_modules/@cloudflare/workers-types/index.ts:4337

The complete transcription of the audio.

transcription_info?

optional transcription_info: object

Defined in: .pnpm-home/store/v11/links/@cloudflare/workers-types/4.20251118.0/2891147a4db443bf13ca098dcb7381fb6bac5793758b7fd187ebaf5bc0deb969/node_modules/@cloudflare/workers-types/index.ts:4316

duration?

optional duration: number

The total duration of the original audio file, in seconds.

duration_after_vad?

optional duration_after_vad: number

The duration of the audio after applying Voice Activity Detection (VAD) to remove silent or irrelevant sections, in seconds.

language?

optional language: string

The language of the audio being transcribed or translated.

language_probability?

optional language_probability: number

The confidence level or probability of the detected language being accurate, represented as a decimal between 0 and 1.

vtt?

optional vtt: string

Defined in: .pnpm-home/store/v11/links/@cloudflare/workers-types/4.20251118.0/2891147a4db443bf13ca098dcb7381fb6bac5793758b7fd187ebaf5bc0deb969/node_modules/@cloudflare/workers-types/index.ts:4389

The transcription in WebVTT format, which includes timing and text information for use in subtitles.

word_count?

optional word_count: number

Defined in: .pnpm-home/store/v11/links/@cloudflare/workers-types/4.20251118.0/2891147a4db443bf13ca098dcb7381fb6bac5793758b7fd187ebaf5bc0deb969/node_modules/@cloudflare/workers-types/index.ts:4341

The total number of words in the transcription.