Add real-time voice to the AI gateway

Adds streaming voice models to the Hercules AI gateway so apps can hold low-latency, two-way voice conversations with users.