We've sent a verification link by email
Didn't receive the email? Check your Spam folder, it may have been caught by a filter. If you still don't see it, you can resend the verification email.
Started December 1st, 2011 · 1 reply · Latest reply by dx3_1979 12 years, 11 months ago
Supposing I wanted to mimic someones voice -- say Laurence Fishburne. I could take a voice sample of, say, 1 minute, and grab the frequency spectrum. Grab the largest 4 or 5 peaks. This is the basis for his voice. Do the same for my voice.
Now, I speak a specific sentence, paragraph, or speech. Every instant in time, my voice will deviate from the basis. If I superimpose that deviation (percentage-wise) on top of Laurence Fishburne's basis -- will I wind up with Laurence Fishburne's voice, but the speech I made?
I was kind of thinking like a moving modulation filter. It would be crazy CPU intensive, but for each sample, grab the FFT of the surrounding 50 ms, divide it by the basis spectrum, then multiply the target basis spectrum
my voice is c(t) -->> C(f) (the basis frequency spectrum, an FFT of perhaps 5 minutes of random talking)
Reference voice (Laurence Fishburne or whatever)is r(t) -->> R(f) (the refererence spectrum, 5 mins of him doing random talking)
Then I say my line "Jump the moon!"
this is x(t) (or x(k) since we are talking digital)
for each sample in my line (x(t))
FFT{ x(t-50ms) to x(t+50ms)} --->>> X(f)
modulation function M(f) == X(f)/C(f)
output Y(f) == M(f)*R(f)
output sample y(t) == IFFT{Y(f)}
end for
Anyone think this is feasable, and/or any software that is ready made to do this?