Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

OpenAI Audio Support #589

Open
willbakst opened this issue Oct 2, 2024 · 0 comments
Open

OpenAI Audio Support #589

willbakst opened this issue Oct 2, 2024 · 0 comments
Labels
Feature Request New feature or request
Milestone

Comments

@willbakst
Copy link
Contributor

Description

This is blocked until they actually release the model

We already have audio support for Gemini, we should add support for the upcoming gpt-4o-audio-preview release

  1. Get it working with BaseMessageParam and string templates
  2. Add additional support for Messages and shorthand to handle Python audio types (make sure this works for both Gemini and OpenAI)
  3. Ensure good handling of audio outputs

It's important that the library supports multimodality natively as this is clearly the direction of these new multimodal models.

@willbakst willbakst added the Feature Request New feature or request label Oct 2, 2024
@willbakst willbakst modified the milestone: v1.4 Oct 2, 2024
@willbakst willbakst added this to the v1.6 milestone Oct 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Feature Request New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant