Weighted delay-and-sum beamforming guided by visual tracking for human-robot interaction

06/17/2019
by   José Novoa, et al.
0

This paper describes the integration of weighted delay-and-sum beamforming with speech source localization using image processing and robot head visual servoing for source tracking. We take into consideration the fact that the directivity gain provided by the beamforming depends on the angular distance between its main lobe and the main response axis of the microphone array. A visual servoing scheme is used to reduce the angular distance between the center of the video frame of a robot camera and a target object. Additionally, the beamforming strategy presented combines two information sources: the direction of the target object obtained with image processing and the audio signals provided by a microphone array. These sources of information were integrated by making use of a weighted delay-and-sum beamforming method. Experiments were carried out with a real mobile robotic testbed built with a PR2 robot. Static and dynamic robot head as well as the use of one and two external noise sources were considered. The results presented here show that the appropriate integration of visual source tracking with visual servoing and a beamforming method can lead to a reduction in WER as high as 34 beamforming alone.

READ FULL TEXT
research
12/04/2018

Localization and Tracking of an Acoustic Source using a Diagonal Unloading Beamforming and a Kalman Filter

We present the signal processing framework and some results for the IEEE...
research
05/07/2022

Mask-based Neural Beamforming for Moving Speakers with Self-Attention-based Tracking

Beamforming is a powerful tool designed to enhance speech signals from t...
research
11/06/2013

Vision-Guided Robot Hearing

Natural human-robot interaction in complex and unpredictable environment...
research
12/01/2018

Lightweight and Optimized Sound Source Localization and Tracking Methods for Open and Closed Microphone Array Configurations

Human-robot interaction in natural settings requires filtering out the d...
research
10/06/2021

Lightweight Speech Enhancement in Unseen Noisy and Reverberant Conditions using KISS-GEV Beamforming

This paper introduces a new method referred to as KISS-GEV (for Keep It ...
research
11/25/2022

Enhanced Tracking and Beamforming Codebook Design for Wideband Terahertz Massive MIMO System

True-time-delay (TTD) lines are recently applied inside Terahertz (THz) ...
research
06/27/2023

Geometric Ultrasound Localization Microscopy

Contrast-Enhanced Ultra-Sound (CEUS) has become a viable method for non-...

Please sign up or login with your details

Forgot password? Click here to reset