Google AI
The Times Australia
The Times World News

.

AI can now generate entire songs on demand. What does this mean for music as we know it?

  • Written by Oliver Bown, Associate Professor, UNSW Sydney
AI can now generate entire songs on demand. What does this mean for music as we know it?

In March, we saw the launch of a “ChatGPT for music[1]” called Suno[2], which uses generative AI to produce realistic songs on demand from short text prompts. A few weeks later, a similar competitor – Udio[3]arrived on the scene[4].

I’ve been working with various creative computational tools for the past 15 years, both as a researcher and a producer, and the recent pace of change has floored me. As I’ve argued elsewhere[5], the view that AI systems will never make “real” music like humans do should be understood more as a claim about social context than technical capability.

The argument “sure, it can make expressive, complex-structured, natural-sounding, virtuosic, original music which can stir human emotions, but AI can’t make proper music” can easily begin to sound like something from a Monty Python sketch.

After playing with Suno and Udio, I’ve been thinking about what it is exactly they change – and what they might mean not only for the way professionals and amateur artists create music, but the way all of us consume it.

Expressing emotion without feeling it

Generating audio from text prompts in itself is nothing new[6]. However, Suno and Udio have made an obvious development: from a simple text prompt, they generate song lyrics (using a ChatGPT-like text generator), feed them into a generative voice model, and integrate the “vocals” with generated music to produce a coherent song segment.

This integration is a small but remarkable feat. The systems are very good at making up coherent songs that sound expressively “sung” (there I go anthropomorphising).

The effect can be uncanny. I know it’s AI, but the voice can still cut through with emotional impact. When the music performs a perfectly executed end-of-bar pirouette into a new section, my brain gets some of those little sparks of pattern-processing joy that I might get listening to a great band.

To me this highlights something sometimes missed[7] about musical expression: AI doesn’t need to experience emotions and life events to successfully express them in music that resonates with people.

Music as an everyday language

Like other generative AI products, Suno and Udio were trained on vast amounts of existing work by real humans – and there is much debate about those humans’ intellectual property rights[8].

Nevertheless, these tools may mark the dawn of mainstream AI music culture. They offer new forms of musical engagement that people will just want to use, to explore, to play with and actually listen to for their own enjoyment.

AI capable of “end to end” music creation is arguably not technology for makers of music, but for consumers of music. For now it remains unclear whether users of Udio and Suno are creators or consumers – or whether the distinction is even useful.

A long-observed phenomenon in creative technologies is that as something becomes easier and cheaper to produce, it is used for more casual expression. As a result, the medium goes from an exclusive high art form to more of an everyday language – think what smartphones have done to photography.

So imagine you could send your father a professionally produced song all about him for his birthday, with minimal cost and effort, in a style of his preference – a modern-day birthday card. Researchers have long considered this eventuality, and now we can do it. Happy birthday, dad!

Can you create without control?

Whatever these systems have achieved and may achieve in the near future, they face a glaring limitation: the lack of control.

Text prompts are often not much good as precise instructions, especially in music. So these tools are fit for blind search – a kind of wandering through the space of possibilities – but not for accurate control. (That’s not to diminish their value. Blind search can be a powerful creative force.)

Viewing these tools as a practising music producer, things look very different. Although Udio’s about page[9] says “anyone with a tune, some lyrics, or a funny idea can now express themselves in music”, I don’t feel I have enough control to express myself with these tools.

I can see them being useful to seed raw materials for manipulation, much like samples and field recordings. But when I’m seeking to express myself, I need control.

Using Suno, I had some fun finding the most gnarly dark techno grooves I could get out of it. The result was something I would absolutely use in a track.

But I found I could also just gladly listen. I felt no compulsion to add anything or manipulate the result to add my mark.

And many jurisdictions have declared[10] that you won’t be awarded copyright for something just because you prompted it into existence with AI.

For a start, the output depends just as much on everything that went into the AI – including the creative work of millions of other artists. Arguably, you didn’t do the work of creation. You simply requested it.

New musical experiences in the no-man’s land between production and consumption

So Udio’s declaration that anyone can express themselves in music is an interesting provocation. The people who use tools like Suno and Udio may be considered more consumers of music AI experiences than creators of music AI works, or as with many technological impacts, we may need to come up with new concepts for what they’re doing.

A shift to generative music may draw attention away from current forms of musical culture, just as the era of recorded music saw the diminishing (but not death) of orchestral music, which was once the only way to hear complex, timbrally rich and loud music. If engagement in these new types of music culture and exchange explodes, we may see reduced engagement in the traditional music consumption of artists, bands, radio and playlists.

While it is too early to tell what the impact will be, we should be attentive. The effort to defend existing creators’ intellectual property protections, a significant moral rights issue, is part of this equation.

But even if it succeeds I believe it won’t fundamentally address this potentially explosive shift in culture, and claims that such music might be inferior also have had little effect in halting cultural change historically, as with techno or even jazz, long ago. Government AI policies may need to look beyond these issues to understand how music works socially and to ensure that our musical cultures are vibrant, sustainable, enriching and meaningful for both individuals and communities.

References

  1. ^ ChatGPT for music (www.rollingstone.com)
  2. ^ Suno (suno.com)
  3. ^ Udio (www.udio.com)
  4. ^ arrived on the scene (www.rollingstone.com)
  5. ^ argued elsewhere (mitpress.mit.edu)
  6. ^ nothing new (openai.com)
  7. ^ sometimes missed (www.humanartistrycampaign.com)
  8. ^ intellectual property rights (news.sky.com)
  9. ^ about page (www.udio.com)
  10. ^ declared (ia.acs.org.au)

Read more https://theconversation.com/ai-can-now-generate-entire-songs-on-demand-what-does-this-mean-for-music-as-we-know-it-228937

Times Magazine

Adobe Ushers in a New Era of Creativity with New Creative Agent and Generative AI Innovations in Adobe Firefly

Adobe (Nasdaq: ADBE) — the global technology leader that unleashes creativity, productivity and ...

CRO Tech Stack: A Technical Guide to Conversion Rate Optimization Tools

The fascinating thing is that the value of this website lies in the fact that creating a high-cali...

How Decentralised Applications Are Reshaping Enterprise Software in Australia

Australian businesses are experiencing a quiet revolution in how they manage data, execute agreeme...

Bambu Lab P2S 3D Printer Review: High-End Performance Meets Everyday Usability

After a full month of hands-on testing, the Bambu Lab P2S 3D printer has proven itself to be one...

Nearly Half of Disadvantaged Australian Schools Run Libraries on Less Than $1000 a Year

A new national snapshot from Dymocks Children’s Charities reveals outdated books, no librarians ...

Growing EV popularity is leading to queues at fast chargers. Could a kerbside charger network help?

The war on Iran has made crystal clear how shaky our reliance on fossil fuels is. It’s no surpri...

The Times Features

Mother’s Day, The Lodge Dining Room

Her Day, The Lodge Way This Mother’s Day, The Lodge Dining Room presents a refined take on high...

The Albanese Government’s plan to impose a retrospectiv…

LABOR’S RETROSPECTIVE TAX GRAB RISKS 3 MILLION JOBS The Albanese Government’s plan to impose a retr...

Court outcome reinforces wildlife trafficking will not …

A 20-year-old man has been fined close to $50,000 and ordered to pay costs after pleading guilty t...

Businesses tap UOW PhD researchers to accelerate innova…

Industry internship program connects businesses with research talent to fast-track innovation an...

Olivia Colman, Kate Box to join an exclusive Live Q…

Photo credit : Photo Credit Mark De BlokFresh out of cinemas, JIMPA - the new film by acclaimed di...

Rental growth reaccelerates as cost to tenants reaches …

Australian renters are spending a record share of their gross median household income on housing c...

Worried about feeding your baby solid foods? Here’s wha…

When you have a baby, mealtimes can be messy and stressful. If you’re a new parent you may be...

Key Nutrients to Consider Before Pregnancy

Preparing for pregnancy often begins well before conception. Nutrition plays an important role durin...

When AI starts shopping for you, fashion may be enterin…

Fashion has always been a bit different to other industries. Consumers do not just buy because...