Result Disambiguation in Web People Search
|Title||Result Disambiguation in Web People Search|
|Publication Type||Conference Paper|
|Year of Publication||2012|
|Authors||Berendsen R, Kovachev B, Nastou E, de Rijke M, Weerkamp W|
|Conference Name||ECIR 2012: 34th European Conference on Information Retrieval|
We study the problem of disambiguating the results of a web people search engine: given a query consisting of a person name plus the result pages for this query, find correct referents for all mentions by clustering the pages according to the different people sharing the name. While the problem has been studied extensively, we discover that the increasing availability of results retrieved from social media platforms causes state-of-the-art methods to break down. We analyze the problem and propose a dual strategy where we distinguish between results obtained from social media platforms and those obtained from other sources. In our dual strategy, the two types of documents are disambiguated separately, using different strategies, and their results are then merged. We study several instantiations for the different stages in our proposed strategy and manage to achieve state-of-the-art performance.