TY - GEN
T1 - Efficient speech enhancement technique by exploiting the harmonic structure of voiced segments
AU - Selvi, Thamarai
AU - Pragatheeswaran, Jayanthi
PY - 2011
Y1 - 2011
N2 - Speech enhancement is one of the most significant applications of speech processing. Several classical algorithms have been proposed, but they share one serious drawback: they perform poorly when enhancing speech at very low signal-to-noise ratios. In this paper, enhancement under such conditions is accomplished by exploiting the harmonic structure of voiced segments. The method improves speech quality by suppressing noise in the frequency domain using a spectral weighting function. Two design parameters are included in the suppression gain, namely the frequency-dependent noise flooring parameter (FDNFP) and the gain factor. Enhanced harmonic structures are incorporated into the FDNFP by time-domain processing of the linear prediction residuals of voiced speech. Further enhancement of the harmonics is achieved by adaptive comb filtering with a peak-picking algorithm. Simulation results show that the resulting speech distortion remains at an admissible level, comparable to that caused by conventional noise reduction techniques. The performance of the enhancement method was evaluated using the modified Bark spectral distance (MBSD), ITU Perceptual Evaluation of Speech Quality (PESQ) scores, composite objective measures, and listening tests.
AB - Speech enhancement is one of the most significant applications of speech processing. Several classical algorithms have been proposed, but they share one serious drawback: they perform poorly when enhancing speech at very low signal-to-noise ratios. In this paper, enhancement under such conditions is accomplished by exploiting the harmonic structure of voiced segments. The method improves speech quality by suppressing noise in the frequency domain using a spectral weighting function. Two design parameters are included in the suppression gain, namely the frequency-dependent noise flooring parameter (FDNFP) and the gain factor. Enhanced harmonic structures are incorporated into the FDNFP by time-domain processing of the linear prediction residuals of voiced speech. Further enhancement of the harmonics is achieved by adaptive comb filtering with a peak-picking algorithm. Simulation results show that the resulting speech distortion remains at an admissible level, comparable to that caused by conventional noise reduction techniques. The performance of the enhancement method was evaluated using the modified Bark spectral distance (MBSD), ITU Perceptual Evaluation of Speech Quality (PESQ) scores, composite objective measures, and listening tests.
KW - comb filtering
KW - Enhanced harmonic structures
KW - FDNFP
KW - HMM
KW - MBSD
KW - PESQ
UR - http://www.scopus.com/inward/record.url?scp=80052210158&partnerID=8YFLogxK
U2 - 10.1109/ICRTIT.2011.5972261
DO - 10.1109/ICRTIT.2011.5972261
M3 - Conference contribution
AN - SCOPUS:80052210158
SN - 9781457705885
T3 - International Conference on Recent Trends in Information Technology, ICRTIT 2011
SP - 764
EP - 769
BT - International Conference on Recent Trends in Information Technology, ICRTIT 2011
T2 - International Conference on Recent Trends in Information Technology, ICRTIT 2011
Y2 - 3 June 2011 through 5 June 2011
ER -