Semantic analysis for automatic event recognition and segmentation of wedding ceremony videos

Wen-Huang Cheng*, Yung Yu Chuang, Yin Tzu Lin, Chi Chang Hsieh, Shao Yen Fang, Bing Yu Chen, Ja Ling Wu

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

10 Scopus citations

Abstract

Wedding is one of the most important ceremonies in our lives. It symbolizes the birth and creation of a new family. In this paper, we present a system for automatically segmenting a wedding ceremony video Into a sequence of recognizable wedding events, e.g., the couple's wedding kiss. Our goal is to develop an automatic tool that helps users to efficiently organize, search, and retrieve his/her treasured wedding memories. Furthermore, the obtained event descriptions could benefit and complement the current research in semantic video understanding. Based on the knowledge of wedding customs, a set of audiovisual features, relating to the wedding contexts of speech/music types, applause activities, picture-taking activities, and leading roles, are exploited to build statistical models for each wedding event. Thirteen wedding events are then recognized by a hidden Markov model, which takes into account both the fitness of observed features and the temporal rationality of event ordering to improve the segmentation accuracy. We conducted experiments on a collection of wedding videos and the promising results demonstrate the effectiveness of our approach. Comparisons with conditional random fields show that the proposed approach is more effective in this application domain.

Original languageEnglish
Article number4633636
Pages (from-to)1639-1650
Number of pages12
JournalIEEE Transactions on Circuits and Systems for Video Technology
Volume18
Issue number11
DOIs
StatePublished - 1 Nov 2008

Keywords

  • Event detection
  • Home videos
  • Semantic content analysis
  • Video segmentation
  • Wedding ceremonies

Fingerprint Dive into the research topics of 'Semantic analysis for automatic event recognition and segmentation of wedding ceremony videos'. Together they form a unique fingerprint.

Cite this