Version 2 (Prokopis Prokopidis, 2016-02-16 12:34 PM) → Version 3/4 (Vassilis Papavassiliou, 2016-05-31 05:30 PM)
# Segment Alignment
This step uses the maligna aligner to identify segment pairs in each detected document pair. It generates one TMX file for each cesAlign file (e.g. eng-12_ell-18_x.tmx for eng-12_ell-18_x.xml).
```
java -Dlog4j.configuration=file:/opt/ilsp-fc/log4j.xml \
-jar /opt/ilsp-fc/ilsp-fc-2.2.2-jar-with-dependencies.jar \
-align -lang "eng;lv" -xslt -oxslt \
-i (fullpath of dir with the generated cesAlign) \
-bs (fullpath and basename on which all files for easier content navigation will be generated) \
-oft (fullpath of file with paths of generated TMX) \
-ofth (fullpath of file with paths of generated transformed TMX) \
&>"/var/www/tests/eng-ita/log-align_www_esteri_it_eng-ita"
```
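The naming convention described above (one `.tmx` per cesAlign `.xml`, same basename) can be checked after a run with a small shell helper. This is only a sketch: `missing_tmx` is a hypothetical helper, not part of ilsp-fc, and it assumes that each TMX file is written next to its cesAlign file and that cesAlign file names contain an underscore (as in `eng-12_ell-18_x.xml`).

```shell
# Hypothetical helper, not part of ilsp-fc.
# Prints every cesAlign file in a directory that has no TMX file with the same basename.
# Assumptions: TMX files sit next to the cesAlign files, and cesAlign
# file names contain an underscore (e.g. eng-12_ell-18_x.xml).
missing_tmx() {
  dir="$1"
  for ces in "$dir"/*_*.xml; do
    # Skip the unexpanded pattern when the directory holds no matching files.
    [ -e "$ces" ] || continue
    # eng-12_ell-18_x.xml -> eng-12_ell-18_x.tmx
    tmx="${ces%.xml}.tmx"
    [ -e "$tmx" ] || printf '%s\n' "$ces"
  done
}
```

Calling `missing_tmx` with the directory passed to `-i` lists the document pairs for which no TMX was found; empty output means every cesAlign file has a counterpart.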
## Options
```
-align : for segment alignment
-i     : crawlpath up to the auto-generated dir by the crawl module
-lang  : two or three letter ISO code(s) of target language(s),
         e.g. el (for a monolingual crawl for Greek content) or en;el (for a bilingual crawl).
         CesDoc files will be generated only for crawled web documents that are in the targeted language(s)
-bs    : basename to be used in generating all files for easier content navigation
-oft   : fullpath of text file containing a list with fullpaths of the generated TMX files
-oxslt : export crawl results with the help of an xslt file for better examination of results
-ofth  : fullpath of HTML file containing a list of links pointing to generated transformed TMX files
```
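As a quick sanity check on the output, the list file written via `-oft` can be used to count the translation units in each generated TMX. The sketch below assumes the list file holds one TMX path per line, which the description suggests but does not state; `count_tus` is a hypothetical helper, not part of ilsp-fc. It relies only on the fact that TMX stores each aligned segment pair in a `<tu>` element.

```shell
# Hypothetical helper, not part of ilsp-fc.
# For each TMX path in the list file, print "<path><TAB><number of <tu> elements>".
# Assumption: the -oft list file contains one TMX path per line.
count_tus() {
  list_file="$1"
  while IFS= read -r tmx; do
    # Ignore blank lines in the list.
    [ -n "$tmx" ] || continue
    # '<tu[ >]' matches <tu> and <tu ...> but not <tuv ...>.
    n=$(grep -o '<tu[ >]' "$tmx" | wc -l | tr -d ' ')
    printf '%s\t%s\n' "$tmx" "$n"
  done < "$list_file"
}
```

A TMX with a count of 0 most likely points to a document pair for which the aligner produced no segment pairs.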