An embedded audio-visual tracking and speech purification system on a dual-core processor platform

Jwu-Sheng Hu, Ming Tang Lee*, Chia Hsing Yang

*Corresponding author for this work

Research output: Contribution to journalArticle

1 Scopus citations

Abstract

Design of an embedded audio-visual tracking and speech purification system is described in this paper. The system is able to perform human face tracking, voice activity detection, sound source direction estimation, and speech enhancement in real-time. Estimating the sound source directions helps to initialize the human face tracking module when the target changes the direction. The implementation architecture is based on an embedded dual-core processor, Texas Instruments DM6446 platform (Davinci), which contains an ARM core and a DSP core. For speech signal processing, an eight-channel digital microphone array is developed and the associated pre-processing and interfacing features are designed using the Altera Cyclone II FPGA. All the experiments are conducted in a real environment and the experimental results show that this system can execute all the audition and vision functions in real-time.

Original languageEnglish
Pages (from-to)274-284
Number of pages11
JournalMicroprocessors and Microsystems
Volume34
Issue number7-8
DOIs
StatePublished - 1 Jan 2010

Keywords

  • Audio-visual tracking
  • Dual-core
  • Embedded processor
  • Microphone array
  • Speech enhancement

Fingerprint Dive into the research topics of 'An embedded audio-visual tracking and speech purification system on a dual-core processor platform'. Together they form a unique fingerprint.

  • Cite this