Scaling Speech Technology to 1,000+ Languages (MMS)Product21/100 via “controllable music generation with style and instrumentation control”
* ⏫ 06/2023: [Simple and Controllable Music Generation (MusicGen)](https://arxiv.org/abs/2306.05284)
Unique: Implements controllable music generation through explicit control tokens for musical attributes (style, instrumentation, tempo, mood) rather than relying solely on text description semantics. Enables both unconditional generation and fine-grained parameter control within a single generative model.
vs others: Provides more granular control over musical characteristics compared to pure text-to-music models, and generates full compositions rather than just audio samples, though may sacrifice some naturalness or coherence compared to human-composed music or specialized music synthesis systems.