Auto-Segmentation based Partitioning and Clustering Approach to Robust Endpointing
- Yu Shi
- Frank Soong
- Jianlai Zhou
ICASSP 2006
An auto-segmentation based partitioning and clustering approach to robust voice activity detection (VAD) is proposed. It works in two successive steps: homogeneous frame partitioning and segment clustering. Because the first step segments the signal automatically, it needs no noise model and applies to different noise types and SNRs. The algorithm is a dynamic programming based procedure and performs gracefully in finding segmentation thresholds. Multiple parameters, such as energy, pitch, and voicing information, can be easily incorporated into the procedure. The algorithm is evaluated on the test sets of the Aurora2 database. It is robust in low-SNR operating environments, and the endpoint estimation errors are shown to have small variance.
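The two steps can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the cost function, the features, or the clustering rule, so everything below is an assumption made for illustration. The sketch uses a single per-frame energy value, a fixed number of segments (`num_segments`), a within-segment squared-error cost for the dynamic programming step, and a two-class split on segment mean energy. The function names `partition_frames`, `cluster_segments`, and `find_endpoints` are hypothetical.

```python
from typing import List, Optional, Tuple


def partition_frames(energy: List[float], num_segments: int) -> List[Tuple[int, int]]:
    """Split frames into contiguous segments by dynamic programming.

    Assumed cost: sum of squared deviations from each segment's mean energy.
    Returns half-open (start, end) frame index pairs.
    """
    n = len(energy)
    if not 1 <= num_segments <= n:
        raise ValueError("num_segments must be between 1 and the frame count")

    # Prefix sums let us compute any segment's cost in constant time.
    s1 = [0.0] * (n + 1)
    s2 = [0.0] * (n + 1)
    for i, e in enumerate(energy):
        s1[i + 1] = s1[i] + e
        s2[i + 1] = s2[i] + e * e

    def cost(i: int, j: int) -> float:
        total = s1[j] - s1[i]
        return (s2[j] - s2[i]) - total * total / (j - i)

    inf = float("inf")
    # best[k][j]: minimum cost of covering the first j frames with k segments.
    best = [[inf] * (n + 1) for _ in range(num_segments + 1)]
    back = [[0] * (n + 1) for _ in range(num_segments + 1)]
    best[0][0] = 0.0
    for k in range(1, num_segments + 1):
        for j in range(k, n + 1):
            for i in range(k - 1, j):
                c = best[k - 1][i] + cost(i, j)
                if c < best[k][j]:
                    best[k][j] = c
                    back[k][j] = i

    # Backtrack from the last frame to recover the segment boundaries.
    segments = []
    j = n
    for k in range(num_segments, 0, -1):
        i = back[k][j]
        segments.append((i, j))
        j = i
    segments.reverse()
    return segments


def cluster_segments(energy: List[float], segments: List[Tuple[int, int]],
                     iterations: int = 10) -> List[bool]:
    """Label each segment as speech (True) or noise (False).

    Assumed rule: one-dimensional two-means on segment mean energy.
    """
    means = [sum(energy[a:b]) / (b - a) for a, b in segments]
    lo, hi = min(means), max(means)
    labels = [False] * len(means)
    for _ in range(iterations):
        labels = [abs(m - hi) < abs(m - lo) for m in means]
        speech = [m for m, s in zip(means, labels) if s]
        noise = [m for m, s in zip(means, labels) if not s]
        if speech:
            hi = sum(speech) / len(speech)
        if noise:
            lo = sum(noise) / len(noise)
    return labels


def find_endpoints(energy: List[float], num_segments: int) -> Optional[Tuple[int, int]]:
    """Return (first speech frame, one past last speech frame), or None."""
    segments = partition_frames(energy, num_segments)
    labels = cluster_segments(energy, segments)
    speech = [seg for seg, is_speech in zip(segments, labels) if is_speech]
    if not speech:
        return None
    return speech[0][0], speech[-1][1]
```

For a toy energy contour such as `[1, 1, 1, 9, 9, 9, 9, 1, 1]` with three segments, the sketch places boundaries at the energy jumps and marks the middle segment as speech. Additional features such as pitch or voicing could enter through the segment cost, but how the paper combines them is not stated in the abstract.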