Adding Semantics to Microblog Posts
| Publication Type | Conference Paper | |
| Author | Meij E., Weerkamp W., de Rijke M. | |
| Year of Publication | 2012 | |
| Conference Name | WSDM 2012: Fifth ACM International Conference on Web Search and Data Mining | |
| Month Published | February | |
| Publisher | ACM | |
| Conference Location | Seattle | |
| Abstract | Microblogs have become an important source of information for the purpose of marketing, intelligence, and reputation management. Streams of microblogs are of great value because of their direct and real-time nature. Determining what an individual microblog post is about, however, can be non-trivial because of creative language usage, the highly contextualized and informal nature of microblog posts, and the limited length of this form of communication. We propose a solution to the problem of determining what a microblog post is about through semantic linking: we add seman- tics to posts by automatically identifying concepts that are semantically related to it and generating links to the corresponding Wikipedia articles. The identified concepts can subsequently be used for, e.g., social media mining, thereby reducing the need for manual inspection and selection. Using a purpose-built test collection of tweets, we show that recently proposed approaches for semantic linking do not perform well, mainly due to the idiosyncratic nature of microblog posts. We propose a novel method based on machine learning with a set of innovative features and show that it is able to achieve significant improvements over all other methods, especially in terms of precision. | |
| Export | BibTex | |
| Full paper | PDF (278.87 KB) |
| Attachment | Size |
|---|---|
| wsdm248-meij.pdf | 278.87 KB |