TY - GEN
T1 - Designing and Implementing a Real-Time Speech Summarizer System
AU - Cheng, Ding-Yuan
AU - Chen, Chi-Hua
AU - Wu, Yu-Rou
AU - Lo, Chi-Chun
AU - Lin, Hui-Fei
PY - 2014
Y1 - 2014
N2 - As the number of speech and video documents increases on the Internet and portable devices proliferate, speech summarization becomes increasingly essential. Relevant research in this domain has typically focused on broadcasts and news; however, the automatic summarization methods used in the past may not apply to other speech domains (e.g., speech in lectures). Therefore, this study explores the lecture speech domain. The features used in previous research were analyzed and suitable features were selected following experimentation; subsequently, a three-phase teal-time speech summarizer (RTSS) was proposed. Phase One involved selecting independent features (e.g. centrality, resemblance to the title, sentence length, term frequency, and thematic words) and calculating the independent feature scores; Phase Two involved calculating the dependent features, such as the position compared with the independent feature scores; and Phase Three involved comparing these feature scores to obtain weighted averages of the function-scores, determine the highest-scoring sentence, and provide a summary.
AB - As the number of speech and video documents increases on the Internet and portable devices proliferate, speech summarization becomes increasingly essential. Relevant research in this domain has typically focused on broadcasts and news; however, the automatic summarization methods used in the past may not apply to other speech domains (e.g., speech in lectures). Therefore, this study explores the lecture speech domain. The features used in previous research were analyzed and suitable features were selected following experimentation; subsequently, a three-phase teal-time speech summarizer (RTSS) was proposed. Phase One involved selecting independent features (e.g. centrality, resemblance to the title, sentence length, term frequency, and thematic words) and calculating the independent feature scores; Phase Two involved calculating the dependent features, such as the position compared with the independent feature scores; and Phase Three involved comparing these feature scores to obtain weighted averages of the function-scores, determine the highest-scoring sentence, and provide a summary.
KW - feature selection; information retrieval; speech summarization; text mining
U2 - 10.1109/IS3C.2014.193
DO - 10.1109/IS3C.2014.193
M3 - Conference contribution
SN - 978-1-4799-5277-9
T3 - 2014 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2014)
SP - 725
EP - 728
BT - 2014 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2014)
PB - IEEE
Y2 - 10 June 2014 through 12 June 2014
ER -