Author : Hani Safadi
Publisher : Lulu.com
ISBN 13 : 0557448093
Total Pages : 74 pages
Book Rating : 4.5/5 (574 download)
Book Synopsis Crosslingual Implementation of Linguistic Taggers Using Parallel Corpora by : Hani Safadi
Download or read book Crosslingual Implementation of Linguistic Taggers Using Parallel Corpora written by Hani Safadi and published by Lulu.com. This book was released on 2010-04-27 with total page 74 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book addresses the problem of creating linguistic taggers for resource-poor languages using existing taggers in resource rich languages. Linguistic taggers are classifiers that map individual words or phrases from a sentence to a set of tags. Linguistic taggers are usually trained using supervised learning algorithms.The proposed approach does not require that the input sentence be translated into the source language. Instead, projection of linguistic tags is accomplished through the use of a parallel corpus, which is a collection of texts that are available in a source language and a target language. The correspondence between words of the source and target language allows to project tags from source to target language words.A parallel corpus of the source and target languages might not be readily available for many language pairs. To deal with this problem, we describe a system for automatic acquisition of aligned, bilingual corpora from pre-specified domains on the World Wide Web.