University of Surrey

Test tubes in the lab Research in the ATI Dance Research

Semantic audio tools for radio production.

Baume, Chris (2018) Semantic audio tools for radio production. Doctoral thesis, University of Surrey.

[img]
Preview
Text
baume-phd-final.pdf - Version of Record
Available under License Creative Commons Attribution.

Download (9MB) | Preview

Abstract

Radio production is a creative pursuit that uses sound to inform, educate and entertain an audience. Radio producers use audio editing tools to visually select, re-arrange and assemble sound recordings into programmes. However, current tools represent audio using waveform visualizations that display limited information about the sound. Semantic audio analysis can be used to extract useful information from audio recordings, including when people are speaking and what they are saying. This thesis investigates how such information can be applied to create semantic audio tools that improve the radio production process. An initial ethnographic study of radio production at the BBC reveals that producers use textual representations and paper transcripts to interact with audio, and waveforms to edit programmes. Based on these findings, three methods for improving radio production are developed and evaluated, which form the primary contribution of this thesis. Audio visualizations can be enhanced by mapping semantic audio features to colour, but this approach had not been formally tested. We show that with an enhanced audio waveform, a typical radio production task can be completed faster, with less effort and with greater accuracy than a normal waveform. Speech recordings can be represented and edited using transcripts, but this approach had not been formally evaluated for radio production. By developing and testing a semantic speech editor, we show that automatically-generated transcripts can be used to semantically edit speech in a professional radio production context, and identify requirements for annotation, collaboration, portability and listening. Finally, we present a novel approach for editing audio on paper that combines semantic speech editing with a digital pen interface. Through a user study with radio producers, we compare the relative benefits of semantic speech editing using paper and screen interfaces. We find that paper is better for simple edits of familiar audio with accurate transcripts.

Item Type: Thesis (Doctoral)
Divisions : Theses
Authors :
NameEmailORCID
Baume, Chris0000-0002-9799-8596
Date : 30 April 2018
Funders : British Broadcasting Corporation
Contributors :
ContributionNameEmailORCID
http://www.loc.gov/loc.terms/relators/THSPlumbley, Markm.plumbley@surrey.ac.uk0000-0002-9708-1075
Depositing User : Chris Baume
Date Deposited : 01 May 2018 10:30
Last Modified : 15 Jun 2018 14:07
URI: http://epubs.surrey.ac.uk/id/eprint/846200

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year


Information about this web site

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800