MusicLM

audio

research

MusicLM (2023) is a Google research model for generating music from text descriptions. It uses a hierarchical sequence-t...

Version: 1.0

Released: 08/01/2023

Architecture

parameters: Proprietary (undisclosed)
context_length: Generates several minutes of audio at 24 kHz
training_data: Trained on a large proprietary music dataset
inference: Hierarchical Transformer (text-to-audio)
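
To give a sense of scale for "several minutes of audio" at 24 kHz, the short sketch below counts the raw audio samples in clips of a few lengths. Only the 24 kHz rate comes from this card. The clip durations, the mono 16-bit PCM storage format, and the function names are illustrative assumptions, not details of MusicLM itself.

```python
SAMPLE_RATE_HZ = 24_000  # output sample rate stated on this card (24 kHz)
BYTES_PER_SAMPLE = 2     # assumption: mono 16-bit PCM, used only for sizing


def samples_for(duration_seconds: float, sample_rate_hz: int = SAMPLE_RATE_HZ) -> int:
    """Return the number of mono samples in a clip of the given length."""
    return round(duration_seconds * sample_rate_hz)


def pcm16_megabytes(duration_seconds: float) -> float:
    """Return the uncompressed size in MB (10^6 bytes) under the PCM assumption."""
    return samples_for(duration_seconds) * BYTES_PER_SAMPLE / 1_000_000


if __name__ == "__main__":
    # Hypothetical clip lengths chosen to illustrate "several minutes".
    for minutes in (1, 3, 5):
        seconds = minutes * 60
        print(f"{minutes} min: {samples_for(seconds):,} samples, "
              f"{pcm16_megabytes(seconds):.1f} MB as 16-bit mono PCM")
```

A five-minute clip at this rate already contains several million samples.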

Capabilities

Generates high-fidelity music from text descriptions
Produces multi-minute compositions with complex structures
Can condition on melodies (e.g. humming) to create stylized music

Benchmarks

Quality: Reported to outperform prior text-to-music systems in both audio quality and adherence to the text description

Safety

Trained on licensed music data
Generated content may raise copyright considerations

Deployment

regions: global
hosting: Google Cloud
integrations: Not deployed; results reported in research publication only.

Tags

audio-generation, music, transformer, google-research
