Audio Support
Here we show how irouter
makes audio queries as simple as text, image and PDF queries.
There are a limited amount of models that support audio input. Make sure to use an LLM that supports audio input, like google/gemini-2.5-flash
.
To check which LLMs support audio input, check the OpenRouter Model Overview.
Local Audio Files
The simplest way to work with audio files in irouter
is to pass the filepath and an instruction as a list of strings. Both .mp3
and .wav
files are supported.
from irouter import Call
model_name = "google/gemini-2.5-flash"
audio_path = "../assets/bottles.mp3"
c = Call(model_name)
c([audio_path, "What do you hear?"], temperature=0)
# 'I hear the sound of a glass bottle being opened and closed, along with the distinct clinking of glass.'
Chat with Audio
Use Chat
for conversation tracking and token usage monitoring: