After downloading the trunk code from http://code.google.com/p/berkeleyaligner/ , I added the project into my

Question

0

Asked: May 27, 20262026-05-27T20:51:04+00:00 2026-05-27T20:51:04+00:00

After downloading the trunk code from http://code.google.com/p/berkeleyaligner/ , I added the project into my

0

After downloading the trunk code from http://code.google.com/p/berkeleyaligner/, I added the project into my build path on Eclipse. Then with the code below i can extract the alignments for each sentence pair that i’ve read from the sourceFile and targetFile.
After the alignment, how to read the Alignment type from the BerkeleyAligner?

import edu.berkeley.nlp.wa.mt.Alignment;
import edu.berkeley.nlp.wa.mt.SentencePair;
import edu.berkeley.nlp.wordAlignment.combine.WordAlignerCombined;
public static void main(String[] args) {
BufferedReader brSrc = new BufferedReader(new FileReader ("sourceFile"));
BufferedReader brTrg = new BufferedReader(new FileReader ("targetFile"));
while ((currentSrcLine = brSrc.readLine()) !=null) {
    String currentTrgLine = brTrg.readline();
    // Reads into BerkeleyAligner SentencePair format.
    SentencePair src2trg = new SentencePair(sentCounter, params.get("source"),
        Arrays.asList(srcLine.split(" ")), Arrays.asList(trgLine.split(" ")));
    // Generate Alignment type from SentencePair
    WordAlignerCombined aligner;
    Alignment alignedPair = aligner.alignSentencePair(src2trg);
    // How do i print out the Alignment???
    }
}

e.g. sourceFile:

this is the first line in the textfile.
that is the second line.
foo bar likes to eat bar foo.

e.g. targetFile:

Dies ist die erste Textzeile in der Datei.
das ist die zweite Zeile.
foo bar gerne bar foo essen.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-27T20:51:05+00:00

Editorial Team

2026-05-27T20:51:05+00:00Added an answer on May 27, 2026 at 8:51 pm

Print the GIZA. Alignment has a method for that:

public void writeGIZA(PrintWriter out, int idx)

GIZA is:

"# sentence pair (%d) source length %d target length %d alignment score : 0\n"
"NULL ({ %s })"
" %s ({ %s })" (englishSentence.get(i), StrUtils.join(alignments))

idx is just the sentence pair id.

out is just where you want to print it.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

After downloading the trunk code from http://code.google.com/p/berkeleyaligner/ , I added the project into my

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply