New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

OpenAI Audio Support #589

Open

willbakst opened this issue Oct 2, 2024 · 0 comments

Labels

Feature Request

Milestone

Contributor

willbakst commented Oct 2, 2024

Description

This is blocked until they actually release the model

We already have audio support for Gemini, we should add support for the upcoming gpt-4o-audio-preview release

Get it working with BaseMessageParam and string templates
Add additional support for Messages and shorthand to handle Python audio types (make sure this works for both Gemini and OpenAI)
Ensure good handling of audio outputs

It's important that the library supports multimodality natively as this is clearly the direction of these new multimodal models.

The text was updated successfully, but these errors were encountered:

willbakst added the Feature Request label

willbakst modified the milestone: v1.4

willbakst added this to the v1.6 milestone

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment