Available Transliterators

Running FileBot from the console, Groovy scripting, shell scripts, etc
Post Reply
User avatar
rednoah
The Source
Posts: 22999
Joined: 16 Nov 2011, 08:59
Location: Taipei
Contact:

Available Transliterators

Post by rednoah »

Here's a reference for the all available Transliterator IDs that can be used for String.transliterate() to transliterate between scripts if necessary.

The String.ascii() method for example uses the Any-Latin;Latin-ASCII;[:Diacritic:]remove transliterator chain.

Code: Select all

ASCII-Latin
Accents-Any
Amharic-Latin/BGN
Any-Accents
Any-Publishing
Arabic-Latin
Arabic-Latin/BGN
Armenian-Latin
Armenian-Latin/BGN
Azerbaijani-Latin/BGN
Belarusian-Latin/BGN
Bengali-Devanagari
Bengali-Gujarati
Bengali-Gurmukhi
Bengali-Kannada
Bengali-Latin
Bengali-Malayalam
Bengali-Oriya
Bengali-Tamil
Bengali-Telugu
Bopomofo-Latin
Bulgarian-Latin/BGN
Cyrillic-Latin
Devanagari-Bengali
Devanagari-Gujarati
Devanagari-Gurmukhi
Devanagari-Kannada
Devanagari-Latin
Devanagari-Malayalam
Devanagari-Oriya
Devanagari-Tamil
Devanagari-Telugu
Digit-Tone
Fullwidth-Halfwidth
Georgian-Latin
Georgian-Latin/BGN
Greek-Latin
Greek-Latin/BGN
Greek-Latin/UNGEGN
Gujarati-Bengali
Gujarati-Devanagari
Gujarati-Gurmukhi
Gujarati-Kannada
Gujarati-Latin
Gujarati-Malayalam
Gujarati-Oriya
Gujarati-Tamil
Gujarati-Telugu
Gurmukhi-Bengali
Gurmukhi-Devanagari
Gurmukhi-Gujarati
Gurmukhi-Kannada
Gurmukhi-Latin
Gurmukhi-Malayalam
Gurmukhi-Oriya
Gurmukhi-Tamil
Gurmukhi-Telugu
Halfwidth-Fullwidth
Han-Latin
Han-Latin/Names
Hangul-Latin
Hans-Hant
Hant-Hans
Hebrew-Latin
Hebrew-Latin/BGN
Hiragana-Katakana
Hiragana-Latin
IPA-XSampa
Jamo-Latin
Kannada-Bengali
Kannada-Devanagari
Kannada-Gujarati
Kannada-Gurmukhi
Kannada-Latin
Kannada-Malayalam
Kannada-Oriya
Kannada-Tamil
Kannada-Telugu
Katakana-Hiragana
Katakana-Latin
Katakana-Latin/BGN
Kazakh-Latin/BGN
Kirghiz-Latin/BGN
Korean-Latin/BGN
Latin-ASCII
Latin-Arabic
Latin-Armenian
Latin-Bengali
Latin-Bopomofo
Latin-Cyrillic
Latin-Devanagari
Latin-Georgian
Latin-Greek
Latin-Greek/UNGEGN
Latin-Gujarati
Latin-Gurmukhi
Latin-Hangul
Latin-Hebrew
Latin-Hiragana
Latin-Jamo
Latin-Kannada
Latin-Katakana
Latin-Malayalam
Latin-NumericPinyin
Latin-Oriya
Latin-Syriac
Latin-Tamil
Latin-Telugu
Latin-Thaana
Latin-Thai
Macedonian-Latin/BGN
Malayalam-Bengali
Malayalam-Devanagari
Malayalam-Gujarati
Malayalam-Gurmukhi
Malayalam-Kannada
Malayalam-Latin
Malayalam-Oriya
Malayalam-Tamil
Malayalam-Telugu
Maldivian-Latin/BGN
Mongolian-Latin/BGN
NumericPinyin-Latin
NumericPinyin-Pinyin
Oriya-Bengali
Oriya-Devanagari
Oriya-Gujarati
Oriya-Gurmukhi
Oriya-Kannada
Oriya-Latin
Oriya-Malayalam
Oriya-Tamil
Oriya-Telugu
Pashto-Latin/BGN
Persian-Latin/BGN
Pinyin-NumericPinyin
Publishing-Any
Russian-Latin/BGN
Serbian-Latin/BGN
Simplified-Traditional
Syriac-Latin
Tamil-Bengali
Tamil-Devanagari
Tamil-Gujarati
Tamil-Gurmukhi
Tamil-Kannada
Tamil-Latin
Tamil-Malayalam
Tamil-Oriya
Tamil-Telugu
Telugu-Bengali
Telugu-Devanagari
Telugu-Gujarati
Telugu-Gurmukhi
Telugu-Kannada
Telugu-Latin
Telugu-Malayalam
Telugu-Oriya
Telugu-Tamil
Thaana-Latin
Thai-Latin
Tone-Digit
Traditional-Simplified
Turkmen-Latin/BGN
Ukrainian-Latin/BGN
Uzbek-Latin/BGN
XSampa-IPA
az-Lower
az-Title
az-Upper
cs-cs_FONIPA
cs-ja
cs-ko
cs_FONIPA-ja
cs_FONIPA-ko
el-Lower
el-Title
el-Upper
es-am
es-es_FONIPA
es-ja
es-zh
es_419-ja
es_419-zh
es_FONIPA-am
es_FONIPA-es_419_FONIPA
es_FONIPA-ja
es_FONIPA-zh
it-am
it-ja
ja_Latn-ko
ja_Latn-ru
lt-Lower
lt-Title
lt-Upper
nl-Title
pl-ja
pl-pl_FONIPA
pl_FONIPA-ja
ro-ja
ro-ro_FONIPA
ro_FONIPA-ja
ru-ja
ru-zh
sk-ja
sk-sk_FONIPA
sk_FONIPA-ja
tr-Lower
tr-Title
tr-Upper
uz_Cyrl-uz_Latn
uz_Latn-uz_Cyrl
zh_Latn_PINYIN-ru
Any-Null
Any-Remove
Any-Hex/Unicode
Any-Hex/Java
Any-Hex/C
Any-Hex/XML
Any-Hex/XML10
Any-Hex/Perl
Any-Hex/Plain
Any-Hex
Hex-Any/Unicode
Hex-Any/Java
Hex-Any/C
Hex-Any/XML
Hex-Any/XML10
Hex-Any/Perl
Hex-Any
Any-Lower
Any-Upper
Any-Title
Any-CaseFold
Any-Name
Name-Any
Any-NFC
Any-NFD
Any-NFKC
Any-NFKD
Any-FCD
Any-FCC
Any-Latin
Any-Latin/Names
Any-Latin/BGN
Any-uz_Cyrl
Any-zh
Any-am
Any-es_419_FONIPA
Any-ja
Any-Katakana
Any-ru
Any-sk_FONIPA
Any-cs_FONIPA
Any-ko
Any-Telugu
Any-Oriya
Any-Gurmukhi
Any-Devanagari
Any-Malayalam
Any-Bengali
Any-Tamil
Any-Kannada
Any-pl_FONIPA
Any-Hant
Any-Hans
Any-Hiragana
Any-ro_FONIPA
Any-Gujarati
Any-Latin/UNGEGN
Any-uz_Latn
Any-Hangul
Any-Syriac
Any-Arabic
Any-Georgian
Any-Armenian
Any-Greek
Any-Greek/UNGEGN
Any-Hebrew
Any-Thai
Any-Bopomofo
Any-Thaana
Any-Cyrillic
Any-es_FONIPA
Here's the code to list the available transliterators yourself:

Code: Select all

com.ibm.icu.text.Transliterator.availableIDs.each{
	println it
}
:idea: Please read the FAQ and How to Request Help.
Post Reply