Adapting for Subject-Specific Term Length using Topic Cost in Author Verification - Notebook for PAN at CLEF 2015
Vartapetiance, Anna and Gillam, Lee (2015) Adapting for Subject-Specific Term Length using Topic Cost in Author Verification - Notebook for PAN at CLEF 2015 In: Working Notes of CLEF 2015 - Conference and Labs of the Evaluation forum, Toulouse, France, September 8-11, 2015., 2015-09-08-2015-09-11.
Download (87kB) | Preview
Previous PAN workshops have offered us the opportunity to explore three different approaches using basic statistics of stopword pairs for author verification. In this PAN, we were able to select our ‘best’ approach and explore the question of how authors writing about different subjects would necessarily adapt to term lengths specific to the subject. The adaptation required is, essentially, a redistribution of frequency: where longer terms occur. We introduce the notion of a ‘topic cost’ which increases the propensity for matching. Results show AUC and C1 scores of 0.51, 0.46 and 0.59 for Dutch, Greek and Spanish respectively. The English results are not yet available, as the evaluation system was unable to run the approach due to as yet unknown reasons.
|Item Type:||Conference or Workshop Item (Conference Paper)|
|Subjects :||Author Verification|
|Divisions :||Faculty of Engineering and Physical Sciences > Computing Science|
|Copyright Disclaimer :||The final publication is available at Springer via http://dx.doi.org|
|Depositing User :||Lee Gillam|
|Date Deposited :||07 Sep 2016 11:53|
|Last Modified :||07 Sep 2016 11:53|
Actions (login required)
Downloads per month over past year