Place: CS Conference Room in MUDD
Abstract:
In this talk, I will focus on the problem of sentence alignment for monolingual corpora. Automatically aligning large comparable corpora would provide a valuable resource for learning text-to-text rewriting rules. I will present a method that combines a weak sentence similarity measure with contextual information, taking advantage of the topical structure of the texts.
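
As a rough illustration of the general idea only, the sketch below pairs a weak lexical similarity with a simple form of context. The abstract does not specify the measure or how context is used, so everything here is an assumption: bag-of-words cosine stands in for the weak similarity, context is the average similarity of the neighboring sentence pairs at the same offset, and the names `similarity_matrix`, `contextual_scores`, `align`, `window`, `context_weight`, and `threshold` are hypothetical. Topical structure is not modeled.

```python
import math
import re
from collections import Counter


def bag_of_words(sentence):
    """Lowercased word counts; a deliberately weak representation."""
    return Counter(re.findall(r"[a-z0-9]+", sentence.lower()))


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def similarity_matrix(src, tgt):
    """Weak lexical similarity for every (source, target) sentence pair."""
    src_bags = [bag_of_words(s) for s in src]
    tgt_bags = [bag_of_words(t) for t in tgt]
    return [[cosine(a, b) for b in tgt_bags] for a in src_bags]


def contextual_scores(sim, window=1, context_weight=0.5):
    """Blend each pair's similarity with that of its neighboring pairs.

    Assumption: if sentences i and j align, sentences i+d and j+d
    are likely to be similar as well.
    """
    n = len(sim)
    m = len(sim[0]) if sim else 0
    scores = [[0.0] * m for _ in range(n)]
    for i in range(n):
        for j in range(m):
            neighbors = [
                sim[i + d][j + d]
                for d in range(-window, window + 1)
                if d != 0 and 0 <= i + d < n and 0 <= j + d < m
            ]
            context = sum(neighbors) / len(neighbors) if neighbors else 0.0
            scores[i][j] = (1 - context_weight) * sim[i][j] + context_weight * context
    return scores


def align(src, tgt, threshold=0.3):
    """Return (i, j) pairs: the best target for each source sentence,
    kept only if its contextual score reaches the threshold."""
    scores = contextual_scores(similarity_matrix(src, tgt))
    pairs = []
    for i, row in enumerate(scores):
        if not row:
            continue
        j = max(range(len(row)), key=row.__getitem__)
        if row[j] >= threshold:
            pairs.append((i, j))
    return pairs
```

Calling `align(src, tgt)` on two lists of sentences returns index pairs into those lists. The threshold and weight values are arbitrary defaults and would need tuning on real comparable corpora.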