University of Surrey

Adapting for Subject-Specific Term Length using Topic Cost in Author Verification - Notebook for PAN at CLEF 2015

Gillam, L and Vartapetiance, A (2015) Adapting for Subject-Specific Term Length using Topic Cost in Author Verification - Notebook for PAN at CLEF 2015. In: CLEF 2015, 2015-09-08 - 2015-09-11, Toulouse, France.

Full text not available from this repository.

Abstract

Previous PAN workshops have offered us the opportunity to explore three different approaches to author verification using basic statistics of stopword pairs. In this PAN, we were able to select our ‘best’ approach and explore the question of how authors writing about different subjects would necessarily adapt to term lengths specific to the subject. The adaptation required is, essentially, a redistribution of frequency: where longer terms occur. We introduce the notion of a ‘topic cost’, which increases the propensity for matching. Results show AUC and C1 scores of 0.51, 0.46 and 0.59 for Dutch, Greek and Spanish respectively. The English results are not yet available, as the evaluation system was unable to run the approach for reasons as yet unknown.

Item Type: Conference or Workshop Item (UNSPECIFIED)
Subjects : Computing
Authors :
Name | Email | ORCID
Gillam, L | l.gillam@surrey.ac.uk | UNSPECIFIED
Vartapetiance, A | UNSPECIFIED | UNSPECIFIED
Date : 2015
Copyright Disclaimer : The final publication is available at Springer via http://dx.doi.org
Contributors :
Contribution | Name | Email | ORCID
publisher | Springer | UNSPECIFIED | UNSPECIFIED
Related URLs :
Depositing User : Symplectic Elements
Date Deposited : 17 May 2017 13:52
Last Modified : 18 May 2017 12:53
URI: http://epubs.surrey.ac.uk/id/eprint/840627
