Seed examples » History » Version 2
« Previous -
Version 2/3
(diff) -
Next » -
Current version
Vassilis Papavassiliou, 2014-08-15 02:55 PM
#This file contains examples of seed URLs that used for constructing bilingual collections from multilingual web sites.
#In each example there are two or more URLs but the user could use one or some or all of them.
#Text lines starting with # are considered comments. So, in a list of seed URLs, the active text lines (i.e. URLs) should not start with #
#Example 1. EN-HR
#http://www.kvarner.hr/turizam/otkrijte_kvarner/o_kvarneru
#http://www.kvarner.hr/en/tourism/discover_kvarner/about_kvarner
#Example 2. DE-IT
#http://www.suva.ch/startseite-suva.htm
#http://www.suva.ch/it/startseite-suva.htm
#Example 3. EN-EL
#http://www.dunlop.eu/dunlop_euen/
#http://www.dunlop.eu/dunlop_grel/
#Example 4. EN-FR (The use of parameter filter with argument ".agriculture." could be used to force crawler stay only in this part of the web site).
#http://europa.eu/legislation_summaries/agriculture/environment/l28117_en.htm
#http://europa.eu/legislation_summaries/agriculture/environment/l28117_fr.htm
#http://europa.eu/legislation_summaries/agriculture/index_fr.htm
#http://europa.eu/legislation_summaries/agriculture/general_framework/index_fr.htm
#Example 5. EN-FR from 2 web sites (the use of parameter filter with argument ".(nrcan|rncan)."is required)
#http://www.nrcan.gc.ca/home
#http://www.rncan.gc.ca/accueil
#Example 6. EN-ES from "uefa" web site (the use of parameter filter with argument ".uefa." is required)
#http://www.uefa.com/worldcup/news/newsid=2114740.html
#http://es.uefa.com/worldcup/news/newsid=2116290.html