According to Jinshi Data News on January 22, Zhipu AI has officially launched GLM-4-Voice, its first end-to-end voice model, on its open platform. The model can directly understand and generate Chinese and English speech, hold real-time voice conversations, and flexibly adjust the emotion, tone, speed, and dialect of its speech according to user instructions, making voice interaction more natural and vivid.
Zhipu BigModel launches its first end-to-end voice API