This is the suggested topic for the course Language Technology Project 2007.
Word of the day is an application which tracks topics in text feeds, for example news or blogs. It requires the basic task of counting phrases but includes many challenges which we will address in this project:
Develop a word of the day application for a text feed that is supplied by the teachers of the course. You may decide what language to work with (prefably Dutch or English texts). You will also make the main design decisions. You will use available data streams from news and blogs. The system to be developed will be put online and continue to track live text feeds. There are plans to link it to existing online demos of our department, like MoodSignals and the Dutch election news search.
Students will work in seperate groups on different modules. Groups will consist of two students. Some modules can be developed by an individual.
For this course, relevant background knowledge consists of knowledge of natural language processing plus general programming experience. Some experience with programming web services or visualization software is useful for the related modules.
The final system will be evaluated by comparing it with other available word of the day applications. Does the new application offer the same views as the earlier applications? Does it offer interesting other views? What are the response times for online queries?