The present work presents a statistical method to translate human voices across age groups,based on commonalities in voices of blood *** age-translated voices have been naturalized extracting the blood relation featur...
详细信息
The present work presents a statistical method to translate human voices across age groups,based on commonalities in voices of blood *** age-translated voices have been naturalized extracting the blood relation features e.g.,pitch,duration,energy,using Mel Frequency Cepstrum Coefficients(MFCC),for social compatibility of the *** system has been demonstrated using standard English and an Indian *** voice samples for resynthesis were derived from 12 families,with member ages ranging from 8–80 *** voice-age translation,performed using the Pitch synchronous overlap and add(PSOLA)approach,by modulation of extracted voice features,was validated by perception *** translated and resynthesized voices were correlated using Linde,Buzo,Gray(LBG),and Kekre’s Fast Codebook generation(kfcg)*** translated voice targets,a strong(θ>∼93%andθ>∼96%)correlation was found with blood relatives,whereas,a weak(θ<∼78%andθ<∼80%)correlation range was found between different families and different gender from same *** study further subcategorized the sampling and synthesis of the voices into similar or dissimilar gender groups,using a support vector machine(SVM)choosing between available voice ***,∼96%,∼93%,and∼94%accuracies were obtained in the identification of the gender of the voice sample,the age group samples,and the correlation between the original and converted voice samples,*** results obtained were close to the natural voice sample features and are envisaged to facilitate a near-natural voice for speech-impaired easily.
暂无评论