A Big Increase in Known Unknowns: from Author Verification to Author Clustering - Notebook for PAN at CLEF 2016
Vartapetiance, Anna and Gillam, Lee (2016) A Big Increase in Known Unknowns: from Author Verification to Author Clustering - Notebook for PAN at CLEF 2016 In: Working Notes of CLEF 2016 - Conference and Labs of the Evaluation forum, \'vora, Portugal, 5-8 September, 2016., 2016-09-05-2016-09-08, Evora, Portugal.
Download (545kB) | Preview
Previous PAN workshops have afforded evaluation of our approaches to author verification/identification based on stopword cooccurrence patterns. Problems have tended to involve comparing one document to a small set of documents (n<=5) of known authorship. This paper discusses the adaptation of one of our approaches to a PAN 2016 problem of author clustering, which involves generating clusters within larger sets of documents (n<=100) for an unknown number of distinct authors, where each set is in English, Dutch or Greek. We describe our previous approaches as the background to the approach taken to this task and briefly overview the results that were achieved, which are not expected to be particularly remarkable due to substantial limitations on our time around the task.
|Item Type:||Conference or Workshop Item (Conference Paper)|
|Subjects :||Author Verification, Author Clustering|
|Divisions :||Faculty of Engineering and Physical Sciences > Computing Science|
|Copyright Disclaimer :||The final publication will be available at http://www.springer.com/gb/computer-science/lncs|
|Depositing User :||Lee Gillam|
|Date Deposited :||06 Sep 2016 15:18|
|Last Modified :||06 Sep 2016 15:18|
Actions (login required)
Downloads per month over past year