Automatic Generation of Parallel Treebanks

Automatic Generation of Parallel Treebanks

An Efficient Unsupervised System

LAP Lambert Academic Publishing ( 14.09.2010 )

€ 59,00

MoreBooks! sitesinden satın al

The need for syntactically annotated data for use in natural language processing has increased dramatically in recent years. This is true especially for parallel treebanks, of which very few exist. The ones that exist are mainly hand-crafted and too small for reliable use in data- oriented applications. This work is targeted at the developers and users of Machine Translation technology. It introduces a novel open-source platform for the fast and robust automatic generation of parallel treebanks through sub-tree alignment, using a limited amount of external resources. The intrinsic and extrinsic evaluations that were undertaken demonstrate that this system is a feasible alternative to the manual annotation of parallel treebanks. Therefore, the presented platform is expected to help boost research in the field of syntax- augmented machine translation and lead to advancements in other fields where parallel treebanks can be employed.

Kitap detayları:

ISBN-13:

978-3-8383-2795-2

ISBN-10:

3838327950

EAN:

9783838327952

Kitabın dili:

English

Yazar:

Ventsislav Zhechev

Sayfa sayısı:

148

Yayın tarihi:

14.09.2010

Kategori:

Genel ve karşılaştırmalı dilbilim