Sentence Alignment Setup » History » Version 5
« Previous -
Version 5/12
(diff) -
Next » -
Current version
Prokopis Prokopidis, 2014-08-19 12:52 PM
h1. Sentence Alignment Setup (Linux only)
In order to get sentence alignments as the output of bilingual crawls, an external aligner is required. For the current version of ILSP-FC,
* download the hunalign-1.1 source code from http://mokk.bme.hu/en/resources/hunalign/
* follow the instructions on the hunalign page for building hunalign
* put the hunalign directory containing the hunalign executable next to the runnable ilsp-fc jar.
For example, if you run ilsp-fc from:
~/ilsp-fc/ilsp-fc-2.2-jar-with-dependencies.jar
you should have a hunalign dir
~/ilsp-fc/hunalign-1.1/
with the suggested hunalign directory structure, including
~/ilsp-fc/hunalign-1.1/dict/
~/ilsp-fc/hunalign-1.1/linux/src/hunalign/hunalign
Now, you are ready to produce TMX files from bilingual crawled data using the -align, -dict, -oft and -ofth options described in the Getting Started part of the documentation.