DSpace logo

Please use this identifier to cite or link to this item: http://dspace.bits-pilani.ac.in:8080/jspui/xmlui/handle/123456789/9572
Full metadata record
DC FieldValueLanguage
dc.contributor.authorSinghal, Rahul-
dc.date.accessioned2023-03-07T09:30:26Z-
dc.date.available2023-03-07T09:30:26Z-
dc.date.issued2014-
dc.identifier.urihttps://ieeexplore.ieee.org/document/6968501-
dc.identifier.urihttp://dspace.bits-pilani.ac.in:8080/xmlui/handle/123456789/9572-
dc.description.abstractA novel method to extract parameters i.e. frequencies and their bandwidth for intelligible speech synthesis is presented in the paper. The parameters are extracted from the spectrogram image of the pre-recorded male and female voice samples and used to re-synthesize speech by employing sinusoidal signals. The phase continuity is preserved by quantifying time-scale and identifying phase at temporal boundaries for a given frequency. The amplitude distribution of the sinusoidals follow Gaussian distribution and use frequency overlap to extend the bandwidth from 4 kHz to 6 kHz for the improvement in clarity of synthesized speech. The synthesized speech is further passed through a weighting filter to improve the envelope of re-synthesized time-domain signal. The synthesized speech is synthetic but noticeably intelligible.en_US
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.subjectEEEen_US
dc.subjectParameter extractionen_US
dc.subjectIntelligible speech synthesisen_US
dc.subjectSinusoidal synthesisen_US
dc.subjectSynthetic speechen_US
dc.subjectGaussian filteren_US
dc.titleSpeech re-synthesis from spectrogram image through sinusoidal modellingen_US
dc.typeArticleen_US
Appears in Collections:Department of Electrical and Electronics Engineering

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.