Term Clouds as Surrogates for User Generated Speech

Publication Type  Conference Paper
Author  Tsagkias E., Larson M., de Rijke M.
Year of Publication  2008
Conference Name  31st Annual International ACM SIGIR Conference (SIGIR 2008)
Pagination  773-774
Month Published  July
Publisher  ACM
Conference Location  Singapore
Abstract  

User generated spoken audio remains a challenge for Automatic Speech Recognition (ASR) technology and content-based audio surrogates derived from ASR-transcripts must be error robust. An investigation of the use of term clouds as surrogates for podcasts demonstrates that ASR term clouds closely approximate term clouds derived from human-generated transcripts across a range of cloud sizes. A user study confirms the conclusion that ASR-clouds are viable surrogates for depicting the content of
podcasts.

Citation Key  tsag:term08
Export  BibTex
Full paper  PDF (115.46 KB)
AttachmentSize
pp772-tsagias.pdf115.46 KB