Gauvain J L and Adda G. Transcribing broadcast news: the LIMSI Nov96 Hub4 System[J].Proc. ARPA Speech Recognition Workshop, Chantilly, Virginia, 1997: 56-63.[2]Cook G and Robinson T. Transcribing broadcast news with the 1997 ABBOT system. Proc. IEEE International Conference on Acoustic, Speech and Signal Processing, Seattle, 1998: 917-920.Chen S Shaobing and Gopalakrishnan P S. Speaker, environment and channel change detection and clustering via the Bayesian information criterion. Proc. DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne, VA, 1998: 127-132.[3]Delacourt P, Kryze D, and Wellekens C J. Speaker-based segmentation for audio data indexing. Proc. ESCA-ETRW workshop on Accessing Information in Spoken Audio, Cambridge, UK, 1999: 1195-1198.[4]Solomonoff A, Mielke A, Schmidt M, and Gish H. Clustering speakers by their voices. Proc. IEEE International Conference on Acoustic, Speech and Signal Processing, Seattle, 1998: 757-760.Reynolds D A.[J].Singer E, Carlson B A, Orsquo;Leary G C, Mclaughlin J J, and Zissman M A. Blind clustering of speech utterances based on speaker and language characteristics. Proc. the International Conference on Speech and Language Processing, Sydney.1998,:-[5]杨行峻,迟惠生. 语音信号数字处理. 第一版. 北京:电子工业出版社,1995,第4章.[6]Ajmera J and McCowan I. Robust speaker change detection. IEEE Signal Processing Letters.2004, 11(8):649-651.[7]Couvreur L and Boite J M. Speaker tracking in broadcast audio material in the framework of the THISL project. Proc. ESCA-ETRW Workshop on Accessing Information in Spoken Audio, Cambridge, UK, 1999: 84-89.[8]Iurgel U and Meermeier R. New approaches to audic-visual segmentation of TV news for automatic topic retrieval. Proc. IEEE International Conference on Acoustic, Speech and Signal Processing, Salt Lake City, Utah, 2001: 1397-1400.
|