OpenAI: GPT-4o Audio

The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs are currently not supported. Audio tokens are priced at $40 per million input audio tokens.

Description

The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs are currently not supported. Audio tokens are priced at $40 per million input audio tokens.

ArchitectureАрхитектура

Modality:
text->text
InputModalities:
audio, text
OutputModalities:
text
Tokenizer:
GPT

ContextAndLimits

ContextLength:
128000 Tokens
MaxResponseTokens:
16384 Tokens
Moderation:
Enabled

PricingRUB

Request:
Image:
WebSearch:
InternalReasoning:
Prompt1KTokens:
Completion1KTokens:

DefaultParameters

Temperature:
0
StartChatWith OpenAI: GPT-4o Audio

UserComments