GitHub - alexsergivan/transliterator: Golang text Transliterator (i.e München -> Muenchen) (original) (raw)
Golang text Transliterator
Golang Transliterator provides one-way string transliteration. It takes Unicode text and converts to ASCII characters. Example use-case: transliterate cyrilic city name to be able to use it in the url ("Київ" ==> "Куiv").
For now, only these languages have specific transliteration rules: DE, DA, EO, RU, BG, SV, HU, HR, SL, SR, NB, UK, MK, CA, BS. For other languages, general ASCII transliteration rules will be applied. Also, this package supports adding custom transliteration rules for your specific use-case. Please check the examples section below.
Installation
go get -u github.com/alexsergivan/transliterator
Language specific transliteration example
package main
import ( "fmt" "github.com/alexsergivan/transliterator" )
func main() { trans := transliterator.NewTransliterator(nil) text := "München" // Langcode should be provided accrding to ISO 639-1. fmt.Println(trans.Transliterate(text, "de")) // Result: Muenchen fmt.Println(trans.Transliterate(text, "en")) // Result: Munchen
anotherText := "你好"
fmt.Println(trans.Transliterate(anotherText, "")) // Result: Ni Hao
oneMoreText := "Київ"
fmt.Println(trans.Transliterate(oneMoreText, "uk")) // Result: Kyiv
fmt.Println(trans.Transliterate(oneMoreText, "en")) // Result: Kiyiv
fmt.Println(trans.Transliterate(oneMoreText, "")) // Result: Kiyiv}
Adding of custom Language translitartion rules
package main
import ( "fmt" "github.com/alexsergivan/transliterator" )
func main() { customLanguageOverrites := make(map[string]map[rune]string)
customLanguageOverrites["myLangcode"] = map[rune]string{
// Ї
0x407: "CU",
// и
0x438: "y",
}
trans := transliterator.NewTransliterator(&customLanguageOverrites)
text := "КиЇв"
fmt.Println(trans.Transliterate(text, "myLangcode")) // Result: KyCUv}