University of Surrey


Acoustic Reflector Localization: Novel Image Source Reversion and Direct Localization Methods

Remaggi, Luca, Jackson, Philip, Coleman, Philip and Wang, Wenwu (2017) Acoustic Reflector Localization: Novel Image Source Reversion and Direct Localization Methods. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 25 (2). pp. 296-309.

Text: Remaggi_etal_IEEE_TASLP.pdf - Accepted version Manuscript (5MB)
Available under License : See the attached licence file.

Text (licence): SRI_deposit_agreement.pdf (33kB)
Available under License : See the attached licence file.

Abstract

Acoustic reflector localization is an important issue in audio signal processing, with direct applications in spatial audio, scene reconstruction, and source separation. Several methods have recently been proposed to estimate the 3D positions of acoustic reflectors given room impulse responses (RIRs). In this article, we categorize these methods as “image-source reversion”, which localizes the image source before finding the reflector position, and “direct localization”, which localizes the reflector without intermediate steps. We present five new contributions. First, an onset detector, called the clustered dynamic programming projected phase-slope algorithm, is proposed to automatically extract the time of arrival for early reflections within the RIRs of a compact microphone array. Second, we propose an image-source reversion method that uses the RIRs from a single loudspeaker. It combines an image source locator, the image source direction and range (ISDAR) algorithm, with a reflector locator, the loudspeaker-image bisection (LIB) algorithm. Third, we propose two variants of this method that exploit multiple loudspeakers. Fourth, we present a direct localization method, the ellipsoid tangent sample consensus (ETSAC), which exploits ellipsoid properties to localize the reflector. Finally, systematic experiments on simulated and measured RIRs are presented, comparing the proposed methods with the state of the art. Across our datasets, ETSAC yields lower errors than the alternative methods compared. Nevertheless, the ISDAR-LIB combination performs well and runs 200 times faster than ETSAC.
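
The two categories named in the abstract rest on simple geometry, which the following minimal sketch illustrates. It is not the paper's ISDAR, LIB or ETSAC implementation: the function names, the example positions and the planar-wall assumption are all hypothetical and chosen only for illustration. Under the image-source model, a planar reflector perpendicularly bisects the segment between the loudspeaker and its image source, and the first-order reflection point lies on an ellipsoid whose foci are the loudspeaker and the microphone.

```python
import numpy as np

def bisecting_plane(source, image):
    """Plane that perpendicularly bisects the segment source -> image.

    Returns (n, d) with unit normal n, the plane being {x : n . x = d}.
    Illustrative only; not the paper's LIB algorithm.
    """
    source = np.asarray(source, dtype=float)
    image = np.asarray(image, dtype=float)
    diff = image - source
    dist = np.linalg.norm(diff)
    if dist == 0.0:
        raise ValueError("source and image source coincide")
    n = diff / dist
    midpoint = 0.5 * (source + image)
    return n, float(n @ midpoint)

def reflection_point(image, mic, n, d):
    """Intersection of the straight line image -> mic with the plane n . x = d."""
    image = np.asarray(image, dtype=float)
    mic = np.asarray(mic, dtype=float)
    direction = mic - image
    denom = float(n @ direction)
    if abs(denom) < 1e-12:
        raise ValueError("line is parallel to the plane")
    t = (d - float(n @ image)) / denom
    return image + t * direction

# Hypothetical setup: a wall on the plane x = 0 (assumed for this example).
loudspeaker = np.array([1.0, 2.0, 1.5])
image_source = np.array([-1.0, 2.0, 1.5])   # loudspeaker mirrored in x = 0
microphone = np.array([2.0, 1.0, 1.0])

n, d = bisecting_plane(loudspeaker, image_source)
p = reflection_point(image_source, microphone, n, d)

# Reflected path length; it equals the image-to-microphone distance, so p lies
# on the ellipsoid with foci at the loudspeaker and the microphone.
path = np.linalg.norm(loudspeaker - p) + np.linalg.norm(p - microphone)
```

In a measured setting the image source position and the path length would have to be estimated from times of arrival in the RIRs; here they are taken as known so that only the geometry is shown.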

Item Type: Article
Subjects : Electronic Engineering
Divisions : Faculty of Engineering and Physical Sciences > Electronic Engineering
Authors :
Name | Email | ORCID
Remaggi, Luca | l.remaggi@surrey.ac.uk | UNSPECIFIED
Jackson, Philip | P.Jackson@surrey.ac.uk | UNSPECIFIED
Coleman, Philip | p.d.coleman@surrey.ac.uk | UNSPECIFIED
Wang, Wenwu | W.Wang@surrey.ac.uk | UNSPECIFIED
Date : February 2017
Identification Number : 10.1109/TASLP.2016.2633802
Copyright Disclaimer : Copyright 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.
Uncontrolled Keywords : Ellipsoids, image sources, geometry reconstruction, room impulse responses, reflectors, acoustic scene analysis.
Depositing User : Symplectic Elements
Date Deposited : 20 Dec 2016 09:09
Last Modified : 07 Jul 2017 11:28
URI: http://epubs.surrey.ac.uk/id/eprint/813148

