2 # A sample config file for the language models
3 # provided with Gertjan van Noords language guesser
4 # (http://odur.let.rug.nl/~vannoord/TextCat/)
7 # - You may consider eliminating a couple of small languages from this
8 # list because they cause false positives with big languages and are
9 # bad for performance. (Do you really want to recognize Drents?)
10 # - Putting the most probable languages at the top of the list
11 # improves performance, because this will raise the threshold for
12 # likely candidates more quickly.
15 # this file have been modified (to OOo by Jocelyn MERAND joc.mer@gmail.com) to include country and encoding
16 # guess strings are made as following : language-country-encoding
20 amharic_utf.lm am--utf8
27 chinese_simplified.lm zh-CN-utf8
28 chinese_traditional.lm zh-TW-utf8
46 indonesian.lm id--utf8
47 irish_gaelic.lm ga--utf8
53 lithuanian.lm lt--utf8
54 luxembourgish.lm lb--utf8
56 manx_gaelic.lm gv--utf8
58 mongolian_cyrillic.lm mn--utf8
60 norwegian.lm nb--utf8 # Norwegian (Bokmal)
61 persian.lm fa--utf8 # Farsi
63 portuguese.lm pt-PT-utf8
70 scots_gaelic.lm gd--utf8
71 serbian_ascii.lm sh-YU-utf8
72 slovak_ascii.lm sk-SK-utf8
82 vietnamese.lm vi--utf8
84 yiddish_utf.lm yi--utf8