Recent Research in Cross-language Document Search
Fredric Gey
Abstract
Cross-language document search research has been underway for more than 10 years, and while much progress has been made, certain research challenges remain. This talk will review recent research in cross-language information retrieval, including the 2004 evaluation workshops:
NTCIR for Asian language retrieval in
· Language-specific processing (stemming, segmentation, stop-words)
· Word decompounding for German
· Translation disambiguation for bilingual dictionaries
· Parallel corpora induced lexicons
· Web corpora usage for out-of-vocabulary translation
· Special retrieval tasks (Patent Retrieval, Cross-language question answering)
· Geographic information retrieval
· Challenges of less-commonly taught languages
· The road ahead in cross-language information retrieval research
Presenter: Dr. Fredric Gey has been doing research in cross-language information retrieval since 1998. He and his associates have participated in every cross-language information retrieval evaluation in the