Add voice generation to the AI gateway

Adds text-to-speech models to the Hercules AI gateway so apps can generate spoken audio from text through one unified interface.