ARK Machine Translation Research

This page is the home for machine translation research conducted by members of Noah's ARK in the Language Technologies Institute at Carnegie Mellon University. Machine translation is an active research area that offers great promise to revolutionize the way we communicate. Our goal is to make machine translation systems better and faster, and also to develop techniques that can be useful for other areas of natural language processing and machine learning.

Members and Collaborators



Rampion (Gimpel and Smith, 2012) is an algorithm for training statistical machine translation models based on minimizing structured ramp loss. The code provided here can be used with the Moses decoder or any other decoder that supports the same formats for configuration files and k-best lists. There is also an implementation of Rampion in cdec.

Version 0.2 also includes implementations of PRO and risk minimization as well as several additional forms of ramp loss from Gimpel (2012). Also included are the improved sentence-level BLEU approximations from Nakov et al. (2012), which are recommended for single-reference training.

Version 0.1 released 6/6/2012: rampion-v0.1.tar.gz
New! Version 0.2 released 12/28/2012: rampion-v0.2.tar.gz


cdec (Dyer et al., 2010) is a flexible and efficient software framework for machine translation and other structured prediction tasks. It was used for our German-English submission to the WMT11 shared task (Dyer et al., 2011b), for recent work on feature-rich modeling for unsupervised word alignment (Dyer et al., 2011a), as wel as for transliteration (Ammar et al., 2012). It implements training and decoding algorithms for several commonly-used models in machine translation.

Inference for Monolingual and Bilingual Gappy Pattern Models

Below is a link to code that implements the models described by Gimpel and Smith (2011a). These models can discover gappy patterns in either monolingual or bilingual (word-aligned) text. Sample data files and execution scripts are provided.

Version 0.1 released 7/20/2011: gaplm-v0.1.tar.gz   |   sample patterns

Code for Statistical Significance Testing for MT Evaluation Metrics

The links below contain software to perform paired bootstrap resampling (Koehn, 2004) for the BLEU metric (Papineni et al., 2002) as computed using the mteval-v13a script provided by NIST (

The code is available in the following tar.gz file: paired_bootstrap_v13a.tar.gz

A previous release for use with mteval-v11b is archived here.

You also may be interested in code by Jon Clark for performing bootstrap resampling and approximate randomization with BLEU, METEOR, and TER.



This research has been supported in part by the National Science Foundation under grants IIS-0844507 and IIS-1054319, the U. S. Army Research Laboratory and the U. S. Army Research Office under contract/grant number W911NF-10-1-0533, grants from Google, and by Sandia National Laboratories (fellowship to K. Gimpel).