GLM-4-Voice
text audio
open-source
Zhipu AI's GLM-4-Voice (2024) is an open-source end-to-end speech model supporting Chinese and English.
Version: 4.0
Released: 1y 7d ago on 10/25/2024
Pricing:
- details: free
Architecture
- family: GLM
- parameters: Unknown
- training_data: Chinese and English speech and text
- context_length: 131072
- inference_type: local
Capabilities
- speech recognition
- speech generation
- text-generation
- translation
Languages Supported
enzh
Benchmarks
Safety
- content filtering
- Designed for safe enterprise use
Deployment
- regions: self-hosted
- hosting: local, Hugging Face Spaces
- integrations: Text Generation WebUI
Tags
open-weightmultimodalspeech