Text-independent speaker identification using Radon and discrete cosine transforms based features from speech spectrogram

Ajmera, Pawan K.

dc.contributor.author	Ajmera, Pawan K.
dc.date.accessioned	2023-03-14T06:41:23Z
dc.date.available	2023-03-14T06:41:23Z
dc.date.issued	2011-11
dc.identifier.uri	https://www.sciencedirect.com/science/article/pii/S0031320311001671
dc.identifier.uri	http://dspace.bits-pilani.ac.in:8080/xmlui/handle/123456789/9690
dc.description.abstract	This paper presents a new feature extraction technique for speaker recognition using the Radon transform (RT) and the discrete cosine transform (DCT). The spectrogram is a compact and efficient representation that carries information about acoustic features in the form of a pattern. In the proposed method, speaker-specific features are extracted by applying image processing techniques to the pattern available in the spectrogram. The Radon transform is used to derive effective acoustic features from the speech spectrogram: it adds up the pixel values in the given image along a straight line in a particular direction and at a specific displacement. The proposed technique computes Radon projections for seven orientations and captures the acoustic characteristics of the spectrogram. Applying the DCT to the Radon projections yields a low-dimensional feature vector. The technique is computationally efficient, text-independent, robust to session variations and insensitive to additive noise. The performance of the proposed algorithm has been evaluated using the Texas Instruments and Massachusetts Institute of Technology (TIMIT) database and the authors' own Shri Guru Gobind Singhji (SGGS) database. The recognition rate of the proposed algorithm is 96.69% on the TIMIT database (630 speakers) and 98.41% on the SGGS database (151 speakers). These results highlight the superiority of the proposed method over some of the existing algorithms.	en_US
dc.language.iso	en	en_US
dc.publisher	Elsevier	en_US
dc.subject	EEE	en_US
dc.subject	Speaker recognition	en_US
dc.subject	Spectrogram	en_US
dc.subject	Feature extraction	en_US
dc.subject	Radon transform	en_US
dc.subject	Discrete cosine transform	en_US
dc.title	Text-independent speaker identification using Radon and discrete cosine transforms based features from speech spectrogram	en_US
dc.type	Article	en_US
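
The feature pipeline described in the abstract (Radon projections of a spectrogram image at seven orientations, followed by a DCT that keeps a few coefficients per projection) can be sketched as follows. This is a minimal, dependency-free illustration and not the authors' implementation. The abstract does not state which seven angles are used, how many DCT coefficients are kept, or how the projections are interpolated, so the evenly spaced angles in `SEVEN_ANGLES`, the `n_coeffs` parameter, the nearest-bin accumulation, and all function names are assumptions made for this example. The input is any 2-D list of non-negative values standing in for a spectrogram.

```python
import math

# Assumption: seven evenly spaced orientations in [0, 180) degrees.
# The abstract only says "seven orientations", not which ones.
SEVEN_ANGLES = [i * 180.0 / 7 for i in range(7)]


def radon_projection(image, theta_deg):
    """Sum pixel values along parallel straight lines at one orientation.

    Each pixel is added to the bin of its signed displacement from the
    image centre (nearest-bin accumulation, no interpolation).
    """
    rows, cols = len(image), len(image[0])
    cy, cx = (rows - 1) / 2.0, (cols - 1) / 2.0
    half = int(math.ceil(math.hypot(rows, cols) / 2.0))
    bins = [0.0] * (2 * half + 1)
    c = math.cos(math.radians(theta_deg))
    s = math.sin(math.radians(theta_deg))
    for y in range(rows):
        for x in range(cols):
            rho = (x - cx) * c + (y - cy) * s  # displacement of this pixel
            bins[int(math.floor(rho + 0.5)) + half] += image[y][x]
    return bins


def dct_ii(signal):
    """Unnormalised DCT-II of a 1-D sequence (direct O(n^2) evaluation)."""
    n = len(signal)
    return [
        sum(v * math.cos(math.pi * (i + 0.5) * k / n) for i, v in enumerate(signal))
        for k in range(n)
    ]


def radon_dct_features(image, angles=SEVEN_ANGLES, n_coeffs=3):
    """Concatenate the first n_coeffs DCT coefficients of each projection."""
    features = []
    for theta in angles:
        features.extend(dct_ii(radon_projection(image, theta))[:n_coeffs])
    return features


if __name__ == "__main__":
    # Toy 4x6 "spectrogram"; a real one would come from a short-time Fourier transform.
    toy = [[float(r * 6 + c) for c in range(6)] for r in range(4)]
    print(len(radon_dct_features(toy)))
```

Keeping only the leading DCT coefficients of each projection is what makes the feature vector low-dimensional: with seven projections and `n_coeffs` coefficients each, the vector has `7 * n_coeffs` entries regardless of the spectrogram size.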

This item appears in the following Collection(s)

Department of Electrical and Electronics Engineering [2014]
