Speech re-synthesis from spectrogram image through sinusoidal modelling

dc.contributor.author: Singhal, Rahul
dc.date.accessioned: 2023-03-07T09:30:26Z
dc.date.available: 2023-03-07T09:30:26Z
dc.date.issued: 2014
dc.description.abstract: This paper presents a novel method for extracting the parameters needed for intelligible speech synthesis, namely frequencies and their bandwidths. The parameters are extracted from spectrogram images of pre-recorded male and female voice samples and are used to re-synthesize speech from sinusoidal signals. Phase continuity is preserved by quantizing the time scale and identifying the phase at the temporal boundaries for each frequency. The amplitudes of the sinusoids follow a Gaussian distribution, and frequency overlap is used to extend the bandwidth from 4 kHz to 6 kHz, which improves the clarity of the synthesized speech. The synthesized speech is then passed through a weighting filter to improve the envelope of the re-synthesized time-domain signal. The result sounds synthetic but is clearly intelligible. [en_US]
dc.identifier.uri: https://ieeexplore.ieee.org/document/6968501
dc.identifier.uri: http://dspace.bits-pilani.ac.in:8080/xmlui/handle/123456789/9572
dc.language.iso: en [en_US]
dc.publisher: IEEE [en_US]
dc.subject: EEE [en_US]
dc.subject: Parameter extraction [en_US]
dc.subject: Intelligible speech synthesis [en_US]
dc.subject: Sinusoidal synthesis [en_US]
dc.subject: Synthetic speech [en_US]
dc.subject: Gaussian filter [en_US]
dc.title: Speech re-synthesis from spectrogram image through sinusoidal modelling [en_US]
dc.type: Article [en_US]
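
The abstract names two ideas that a short sketch may make more concrete: summing sinusoids while keeping each one's phase continuous across frame boundaries, and shaping amplitudes with a Gaussian around a centre frequency. The Python below is only an illustration under stated assumptions, not the paper's implementation. The function names, the frame-based layout, and the choice of sigma as half the bandwidth are all hypothetical, and the weighting filter and the 4 kHz to 6 kHz frequency overlap are not modelled.

```python
import numpy as np


def synthesize_frames(freqs_hz, amps, frame_len, sample_rate):
    """Sum sinusoids frame by frame, carrying each track's phase across frames.

    freqs_hz and amps have shape (n_frames, n_tracks). This layout is an
    assumption made for illustration; the abstract does not specify one.
    """
    freqs_hz = np.asarray(freqs_hz, dtype=float)
    amps = np.asarray(amps, dtype=float)
    n_frames, n_tracks = freqs_hz.shape
    phase = np.zeros(n_tracks)              # running phase per track, in radians
    n = np.arange(frame_len)                # sample index within one frame
    out = np.zeros(n_frames * frame_len)
    for i in range(n_frames):
        step = 2.0 * np.pi * freqs_hz[i] / sample_rate   # phase advance per sample
        # Rows are samples, columns are tracks; each track starts at its carried phase.
        frame = amps[i] * np.sin(phase + np.outer(n, step))
        out[i * frame_len:(i + 1) * frame_len] = frame.sum(axis=1)
        # The phase at the end of this frame becomes the start phase of the next.
        phase = (phase + frame_len * step) % (2.0 * np.pi)
    return out


def gaussian_weights(freqs_hz, centre_hz, bandwidth_hz):
    """Gaussian amplitude weights around centre_hz.

    Assumption: sigma is taken as half the bandwidth; the abstract gives no value.
    """
    sigma = bandwidth_hz / 2.0
    freqs_hz = np.asarray(freqs_hz, dtype=float)
    return np.exp(-0.5 * ((freqs_hz - centre_hz) / sigma) ** 2)
```

With a single track held at a constant frequency, the frame-wise output should match one uninterrupted sine wave, which is a simple way to check that no phase jump is introduced at the frame boundaries.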