GitHub - alexsergivan/transliterator: Golang text Transliterator (i.e München -> Muenchen) (original) (raw)

Golang text Transliterator

Build Status Coverage Status Go Report Card Codacy Badge

Golang Transliterator provides one-way string transliteration. It takes Unicode text and converts to ASCII characters. Example use-case: transliterate cyrilic city name to be able to use it in the url ("Київ" ==> "Куiv").

For now, only these languages have specific transliteration rules: DE, DA, EO, RU, BG, SV, HU, HR, SL, SR, NB, UK, MK, CA, BS. For other languages, general ASCII transliteration rules will be applied. Also, this package supports adding custom transliteration rules for your specific use-case. Please check the examples section below.

Installation

go get -u github.com/alexsergivan/transliterator

Language specific transliteration example

package main

import ( "fmt" "github.com/alexsergivan/transliterator" )

func main() { trans := transliterator.NewTransliterator(nil) text := "München" // Langcode should be provided accrding to ISO 639-1. fmt.Println(trans.Transliterate(text, "de")) // Result: Muenchen fmt.Println(trans.Transliterate(text, "en")) // Result: Munchen

anotherText := "你好"
fmt.Println(trans.Transliterate(anotherText, "")) // Result: Ni Hao

oneMoreText := "Київ"
fmt.Println(trans.Transliterate(oneMoreText, "uk")) // Result: Kyiv
fmt.Println(trans.Transliterate(oneMoreText, "en")) // Result: Kiyiv
fmt.Println(trans.Transliterate(oneMoreText, "")) // Result: Kiyiv

}

Adding of custom Language translitartion rules

package main

import ( "fmt" "github.com/alexsergivan/transliterator" )

func main() { customLanguageOverrites := make(map[string]map[rune]string)

customLanguageOverrites["myLangcode"] = map[rune]string{
    // Ї
    0x407: "CU",
    // и
    0x438: "y",
}
trans := transliterator.NewTransliterator(&customLanguageOverrites)
text := "КиЇв"
fmt.Println(trans.Transliterate(text, "myLangcode")) // Result: KyCUv

}