![]() |
Project: Multi-lingual information retrievalDescriptionMulti-lingual IR provides the searcher with the possibility of searching in one language (e.g., one’s own) while retrieving information in multiple languages in multi-lingual collections (like the web). One’s competence may be sufficient for reading more than one language but insufficient for specifying information requests (orthography, special terminology) effectively in them. These problems become more pronounced in truly multilingual environments (like the web). To support the searcher, single-language access to multilingual environments should be provided. This line of research was begun in mid-90s and has produced several academic degrees, articles and projects on external funding (e.g. the EU 5th Framework Project Clarity, http://dis.shef.ac.uk/mark/clarity). We have shown that structured queries based on dictionary translation greatly improve performance in cross-language IR at least in news article collections (between several language pairs). Our research interests for the five-year period include: structured query formulation in varying collections over various language pairs; transitive translation through intermediate languages when direct translation is not possible; expansion of CLIR queries; corpus-based translation; fusion of translation methods; handling of proper names, other out-of-vocabulary words, and phrases; cross-cultural IR; as well as Finnish and cross-language question answering. Again, an underlying theme is the evaluation of the effectiveness of each method or tool. A special theme in evaluation is evaluation by the quality of the retrieved documents, especially highly relevant documents.
Finnish question answering
presents novel problems due to the complexity of Finnish language.
Given a natural language question, the determination of an answer
pattern is more complex in Finnish. In the cross-language case,
the answer patterns must be inferred from questions in another language,
possibly structurally very different. Duration2003 - 2009 Researchers
See at individual projects. PublicationsThe publications are mainly listed under individual projects. Here are some early / general ones:
Relevant links
The project RelFB –
simulated and pseudo relevance feedback
Updated 29.12.2005 Responsibility for updating: KJ |