Automatic Generation of Parallel Treebanks

Automatic Generation of Parallel Treebanks

An Efficient Unsupervised System

LAP Lambert Academic Publishing ( 2010-09-14 )

€ 59,00

Buy at the MoreBooks! Shop

The need for syntactically annotated data for use in natural language processing has increased dramatically in recent years. This is true especially for parallel treebanks, of which very few exist. The ones that exist are mainly hand-crafted and too small for reliable use in data- oriented applications. This work is targeted at the developers and users of Machine Translation technology. It introduces a novel open-source platform for the fast and robust automatic generation of parallel treebanks through sub-tree alignment, using a limited amount of external resources. The intrinsic and extrinsic evaluations that were undertaken demonstrate that this system is a feasible alternative to the manual annotation of parallel treebanks. Therefore, the presented platform is expected to help boost research in the field of syntax- augmented machine translation and lead to advancements in other fields where parallel treebanks can be employed.

Book Details:

ISBN-13:

978-3-8383-2795-2

ISBN-10:

3838327950

EAN:

9783838327952

Book language:

English

By (author) :

Ventsislav Zhechev

Number of pages:

148

Published on:

2010-09-14

Category:

General and comparative linguistics