Old 11-03-2019, 09:01 PM   #1
serr
Human being with feelings
 
Join Date: Sep 2010
Posts: 12,627
Default Deep fake vocal plugin?

Is this a thing yet? "Deep faking" someone's voice with a simple plugin. Like autotune but changing the voice instead of the tuning. Looking to put someone else's voice in a song and get sued.
serr is online now   Reply With Quote
Old 11-04-2019, 01:12 AM   #2
ivansc
Human being with feelings
 
Join Date: Aug 2007
Location: Near Cambridge UK and Near Questembert, France
Posts: 22,754
Default

Go visit Antares. They have stuff that can do some pretty radical voice alterations...
__________________
Ici on parles Franglais
ivansc is offline   Reply With Quote
Old 11-04-2019, 06:47 AM   #3
chip mcdonald
Human being with feelings
 
chip mcdonald's Avatar
 
Join Date: May 2006
Location: NA - North Augusta South Carolina
Posts: 4,294
Default

It will be.

As well as a.i. enabled plugins to make your output sound like whatever you train it to. Audio wise and performance wise.

There will be a day when you'll have perfect guitar simulation, perfect e.q. duplication, and performance execution tweaked to match anyone.

I'm really surprised something like this hasn't surface in a VST already. I certainly wouldn't invest $$$$ in a guitar sim right now, at some point using a GAN to match the preamp/eq/output/speaker/mic stages of a guitar amp will happen and the analog/digital argument will be over.

Vocalists will be in for a shock soon. Within 5 years you should be able to have "anyone" sing on your music.

Guitar playing is going to get messed up, as not just tuning but timing, vibrato and phrasing becomes tweakable. Fret buzz will be eliminated.

Personally I'm really looking forward to an a.i. trainable midi drum editor.

A.i. trainable eq. Mastering. As long as you have the right thing to train it on.

It will be the Really Big Deal in audio by 2022. And unfortunately allow more charlatans to appear to have talent. The hidden danger to humans: a.i. assisted activity masking true ability, spread across a few generations could have irreparable effects on the genome.

/ my glass is nearly empty
__________________
]]] guitar lessons - www.chipmcdonald.com [[[
WEAR A FRAKKING MASK!!!!
chip mcdonald is offline   Reply With Quote
Old 11-04-2019, 07:32 AM   #4
Softsynth
Human being with feelings
 
Join Date: Jun 2015
Posts: 8,703
Default

The computational power required to do what you are suggesting in real time in VST form just isn't there. I don't believe you are going to hear anything like that in VST form for a long time, at least not anything convincing.

Considering Deepfake even the best deep fake still looks odd and well, fake! - but to me the best are still comfortably better in some regards than the no expense hollywood CGI (see the jarringly odd Rachel FX in the otherwise visually excellent Blade Runner 2049, and the possibly worse video game style Grand Moff Tarkin in Rogue One).

Given a bit of time and effort the deep fake technique could have done a better job than than those premium efforts.
Softsynth is offline   Reply With Quote
Old 11-04-2019, 08:21 AM   #5
serr
Human being with feelings
 
Join Date: Sep 2010
Posts: 12,627
Default

Oh, I don't need real time. No live use planned. Just have a song that would work well for Bill Cosby to sing.
serr is online now   Reply With Quote
Old 11-04-2019, 09:18 AM   #6
Reason
Human being with feelings
 
Join Date: Apr 2015
Posts: 976
Default

Quote:
Originally Posted by serr View Post
Oh, I don't need real time. No live use planned. Just have a song that would work well for Bill Cosby to sing.
"Baby It's Cold Outside"? "Blurred Lines"? Don't leave us hanging, man!
Reason is offline   Reply With Quote
Old 11-05-2019, 05:49 AM   #7
chip mcdonald
Human being with feelings
 
chip mcdonald's Avatar
 
Join Date: May 2006
Location: NA - North Augusta South Carolina
Posts: 4,294
Default

Quote:
Originally Posted by Softsynth View Post
The computational power required to do what you are suggesting in real time in VST form just isn't there.
Initially it will be like Melodyne.

Quote:

I don't believe you are going to hear anything like that in VST form for a long time, at least not anything convincing.
I wouldn't be surprised if it happens before the end of next year. Max 3 years.

Quote:
Considering Deepfake even the best deep fake still looks odd and well, fake!
They're not all the same or trying to do the same thing. And it's not just about Deep Fake; there are a multitude of approaches happening simultaneously. What I'm referencing are the applications of applying styles via GANs and the generation methods that are effectively indistinguishable. The a.i. generated faces are well out of the uncanny valley. The Dadabots project should give everyone pause.


Quote:
no expense hollywood CGI (see the jarringly odd Rachel FX in the otherwise visually excellent Blade Runner 2049, and the possibly worse video game style Grand Moff Tarkin in Rogue One).
Tarkin was horrible but at least Rogue One was better than the abomination that was Last Jedi...

..but Deep Fake is just one basic use of techniques that are accelerating faster than Moore's law. Effectively my argument: you're able to do on your desktop over night what required millions of $ and months in Hollywood just a year ago.

It's coming faster than you think.
__________________
]]] guitar lessons - www.chipmcdonald.com [[[
WEAR A FRAKKING MASK!!!!
chip mcdonald is offline   Reply With Quote
Old 11-05-2019, 06:16 AM   #8
Softsynth
Human being with feelings
 
Join Date: Jun 2015
Posts: 8,703
Default

Quote:
Originally Posted by chip mcdonald
...
Tarkin was horrible but at least Rogue One was better than the abomination that was Last Jedi...
Yes, I didn't like Rogue One but it was much, MUCH better than the Last Jedi.

Quote:
Originally Posted by chip mcdonald
..but Deep Fake is just one basic use of techniques that are accelerating faster than Moore's law. Effectively my argument: you're able to do on your desktop over night what required millions of $ and months in Hollywood just a year ago.

It's coming faster than you think.
The face mapping technology in Deep Fake is a neat trick, completely unrelated to audio techniques.
I think you are making some wild assumptions about how far technology will get in such a short time frame. Even the de-aging techniques used in The Irishman are pretty ropey in parts, looking like camp make up at times according to reviewers, and that's 2019 cost no object stuff.


We shall see, if we can tell!
Softsynth is offline   Reply With Quote
Old 11-06-2019, 12:11 PM   #9
cyrano
Human being with feelings
 
cyrano's Avatar
 
Join Date: Jun 2011
Location: Belgium
Posts: 5,246
Default

Quote:
Originally Posted by Softsynth View Post
The computational power required to do what you are suggesting in real time in VST form just isn't there. I don't believe you are going to hear anything like that in VST form for a long time, at least not anything convincing.
Oh, yes, it's available allright. Just not in VST form, but simple apps on several appstores. Most notably on a Chinese appstore. The one on the western side of things was removed after public outcry.

Quote:
Considering Deepfake even the best deep fake still looks odd and well, fake! - but to me the best are still comfortably better in some regards than the no expense hollywood CGI (see the jarringly odd Rachel FX in the otherwise visually excellent Blade Runner 2049, and the possibly worse video game style Grand Moff Tarkin in Rogue One).

Given a bit of time and effort the deep fake technique could have done a better job than than those premium efforts.
I've seen and heard pretty convincing examples from researchers. There's at least one site I remember that specialises in these: deepfake.com.

Mostly pics and video, not audio.
__________________
In a time of deceit telling the truth is a revolutionary act.
George Orwell
cyrano is offline   Reply With Quote
Old 11-06-2019, 12:21 PM   #10
serr
Human being with feelings
 
Join Date: Sep 2010
Posts: 12,627
Default

links?
serr is online now   Reply With Quote
Old 11-07-2019, 11:25 AM   #11
Glennbo
Human being with feelings
 
Glennbo's Avatar
 
Join Date: Mar 2008
Location: Planet Earth
Posts: 9,097
Default

Quote:
Originally Posted by serr View Post
Is this a thing yet? "Deep faking" someone's voice with a simple plugin. Like autotune but changing the voice instead of the tuning. Looking to put someone else's voice in a song and get sued.
Not really a deep fake like you are referring to, but around 2000 I had a plugin called "Voice Tweaker" that could do a Barry White fake pretty well using it's "Maleify" preset. It had another preset called "Womanize" that did the opposite and would prolly need a name change for today's market.
__________________
Glennbo
Hear My Music - Click Me!!!
--
Glennbo is offline   Reply With Quote
Old 11-07-2019, 11:57 AM   #12
nait
Human being with feelings
 
nait's Avatar
 
Join Date: Jun 2018
Location: Edmonton, AB, Canada
Posts: 1,391
Default

Apparently they're making a new movie with James Dean in it. I mean... CGI James Dean or deep faked James Dean.. not James Dean's rotted corpse The voice will be some voice actor doing his best James Dean impersonation, I presume.

I'd have to guess that we aren't far from technology existing where you could completely and convincingly fake video/audio of anyone, and that tech being used for even greater evils. How are we going to be able to verify authenticity of some "recorded" video? Say someone admitting to a crime, or a high ranking political official doing something that could incite a war, or implicate them (not that we need deep fakes for that!)
__________________
My Rig (also serves as my gaming PC): MSI Mag X570 Tomahawk Mobo, Ryzen R9 3900X, 32GB RAM, Samsung 960 Evo 500gb NVMe, Crucial 1TB NVMe, NVidia RTX 2080 Super, Arturia Minifuse 2, Nektar Impact LX25+ MIDI Controller Keyboard.
nait is offline   Reply With Quote
Old 11-07-2019, 05:44 PM   #13
Held
Human being with feelings
 
Held's Avatar
 
Join Date: Nov 2019
Posts: 540
Default

Here's a fun example of what you can do with deep fake vocal software. I don't think it's on the market for purchase, though.

A Voice Deepfake Was Used To Scam A CEO Out Of $243,000

Edit: And here's a video of Joe Rogan's voiced being faked (it's linked in the article). Pretty impressive.
Held is offline   Reply With Quote
Old 11-07-2019, 07:28 PM   #14
cassembler
Human being with feelings
 
cassembler's Avatar
 
Join Date: Aug 2017
Location: Dallas, TX
Posts: 348
Default

To expand this a bit further, how long until Pandora (or Spotify, whatever) is simply streaming us AI-generated content, that's better than what humans are creating?

I'm putting the over/under at 2030 for music.

2033 for video game content (i.e. not just random maps; whole epic story lines, new enemies, new graphic textures, new voice audio, all created on-the-fly).

2040 for AI-generated TV and movies.

My only real question is will humanity make it that long Sleep tight!
__________________
It helps if the hitter thinks you're a little crazy
- Nolan Ryan
cassembler is offline   Reply With Quote
Old 11-08-2019, 04:36 AM   #15
Held
Human being with feelings
 
Held's Avatar
 
Join Date: Nov 2019
Posts: 540
Default

Quote:
Originally Posted by cassembler View Post
To expand this a bit further, how long until Pandora (or Spotify, whatever) is simply streaming us AI-generated content, that's better than what humans are creating?
You're saying that like it's a bad thing. If AI's produce better music than people, I'm happy to listen to it. We'll just net universal basic income to make up for the lost jobs. Yang 2020!
Held is offline   Reply With Quote
Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT -7. The time now is 02:29 PM.


Powered by vBulletin® Version 3.8.11
Copyright ©2000 - 2024, vBulletin Solutions Inc.