indic-transliteration/indic_transliteration_py: Python package for Indic script transliteration

For users

Installation or upgrade:

Usage

    # Import the necessary modules from indic_transliteration
    from indic_transliteration import sanscript
    from indic_transliteration.sanscript import SchemeMap, SCHEMES, transliterate

    # Input data for transliteration
    data = 'idam adbhutam'

    # Transliterate from Harvard-Kyoto (HK) to Telugu
    print(transliterate(data, sanscript.HK, sanscript.TELUGU))  # Output: ఇదమ్ అద్భుతమ్

    # Transliterate from ITRANS to Devanagari
    print(transliterate(data, sanscript.ITRANS, sanscript.DEVANAGARI))  # Output: इदम् अद्भुतम्

    # Define a scheme map for transliteration from Velthuis to Telugu
    scheme_map = SchemeMap(SCHEMES[sanscript.VELTHUIS], SCHEMES[sanscript.TELUGU])

    # Transliterate using the scheme map
    print(transliterate(data, scheme_map=scheme_map))  # Output: ఇదమ్ అద్భుతమ్

For a full list of supported schemes, please see files under indic_transliteration/sanscript/schemes/data .
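
To give a rough feel for what a scheme map does, here is a minimal standard-library sketch. It is not the package's implementation: the names `HK_SUBSET`, `IAST_SUBSET`, `build_toy_map` and `toy_transliterate` are hypothetical, the tables cover only a handful of Harvard-Kyoto letters, and the sketch sticks to roman-to-roman conversion so that it can ignore the inherent-vowel handling that Brahmic scripts need.

```python
# Toy illustration only; the real SchemeMap in indic_transliteration is richer.
# Two parallel lists: the same sounds, in the same order, in two schemes.
HK_SUBSET = ["RR", "A", "I", "U", "R", "M", "H", "G", "J", "T", "D", "N", "z", "S"]
IAST_SUBSET = ["ṝ", "ā", "ī", "ū", "ṛ", "ṃ", "ḥ", "ṅ", "ñ", "ṭ", "ḍ", "ṇ", "ś", "ṣ"]


def build_toy_map(source, target):
    """Pair each source token with the target token at the same position."""
    return dict(zip(source, target))


def toy_transliterate(text, mapping):
    """Greedy longest-match replacement; unmapped characters pass through."""
    max_len = max(len(token) for token in mapping)
    out = []
    i = 0
    while i < len(text):
        # Try the longest candidate first so that "RR" wins over "R".
        for size in range(max_len, 0, -1):
            chunk = text[i:i + size]
            if chunk in mapping:
                out.append(mapping[chunk])
                i += len(chunk)
                break
        else:
            out.append(text[i])
            i += 1
    return "".join(out)


toy_map = build_toy_map(HK_SUBSET, IAST_SUBSET)
print(toy_transliterate("rAmAyaNa", toy_map))
print(toy_transliterate("pitRRn", toy_map))
```

Trying longer tokens first is what keeps multi-character letters such as `RR` from being split into two shorter ones.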

Lazy anusvaara-s

    assert sanscript.SCHEMES[sanscript.ITRANS].fix_lazy_anusvaara("shaMkara") == "sha~Nkara"
    assert sanscript.SCHEMES[sanscript.ITRANS].fix_lazy_anusvaara("saMchara") == "sa~nchara"
    assert sanscript.SCHEMES[sanscript.ITRANS].fix_lazy_anusvaara("saMvara") == "sav.Nvara"
    assert sanscript.SCHEMES[sanscript.ITRANS].fix_lazy_anusvaara("saMyukta") == "say.Nyukta"
    assert sanscript.SCHEMES[sanscript.ITRANS].fix_lazy_anusvaara("saMlagna") == "sal.Nlagna"
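
The idea behind the first two assertions can be sketched without the package: an anusvaara written lazily as `M` before a stop consonant is replaced by the nasal of that consonant's class. The table and the function name `toy_fix_lazy_anusvaara` below are assumptions made for illustration, not the library's code, and the sketch deliberately leaves `M` untouched before semivowels such as `v`, `y` and `l`.

```python
# Toy sketch, NOT the package's fix_lazy_anusvaara.
# First letter of an ITRANS stop consonant -> nasal of the same class.
CLASS_NASAL = {
    "k": "~N", "g": "~N",             # velar
    "c": "~n", "C": "~n", "j": "~n",  # palatal
    "T": "N", "D": "N",               # retroflex
    "t": "n", "d": "n",               # dental
    "p": "m", "b": "m",               # labial
}


def toy_fix_lazy_anusvaara(text):
    """Replace 'M' with the class nasal when a stop consonant follows."""
    out = []
    for i, ch in enumerate(text):
        following = text[i + 1] if i + 1 < len(text) else ""
        if ch == "M" and following in CLASS_NASAL:
            out.append(CLASS_NASAL[following])
        else:
            out.append(ch)
    return "".join(out)


print(toy_fix_lazy_anusvaara("shaMkara"))
print(toy_fix_lazy_anusvaara("saMchara"))
```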

Lazy visarga-s

    assert sanscript.SCHEMES[sanscript.DEVANAGARI].fix_lazy_visarga("अन्तः पश्य") == "अन्तᳶ पश्य"
    assert sanscript.SCHEMES[sanscript.DEVANAGARI].fix_lazy_visarga("अन्तः कुरु") == "अन्तᳵ कुरु"
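
The two assertions above suggest a simple rule: a visarga before क/ख becomes the jihvamuliya sign and before प/फ the upadhmaniya sign. A minimal regex sketch of that rule follows. It assumes the signs are U+1CF5 and U+1CF6, and `toy_fix_lazy_visarga` is a hypothetical name rather than the library's method.

```python
import re

# Toy sketch, NOT the package's fix_lazy_visarga.
VISARGA = "\u0903"      # ः
JIHVAMULIYA = "\u1CF5"  # used before क (U+0915) and ख (U+0916)
UPADHMANIYA = "\u1CF6"  # used before प (U+092A) and फ (U+092B)


def toy_fix_lazy_visarga(text):
    """Swap a visarga for the sign matching the consonant that follows it."""
    # Optional whitespace is captured with the consonant and written back.
    text = re.sub(VISARGA + r"(\s*[\u0915\u0916])", JIHVAMULIYA + r"\1", text)
    text = re.sub(VISARGA + r"(\s*[\u092A\u092B])", UPADHMANIYA + r"\1", text)
    return text


print(toy_fix_lazy_visarga("अन्त\u0903 कुरु"))
print(toy_fix_lazy_visarga("अन्त\u0903 पश्य"))
```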

Lay Indian search terms

    assert sanscript.SCHEMES[sanscript.OPTITRANS].to_lay_indian("taM jitvA") == "tam jitva"
    assert sanscript.SCHEMES[sanscript.OPTITRANS].to_lay_indian("kRShNa") == "krishna"

Modifying/ customizing schemes/ maps

    slp_scheme = sanscript.SCHEMES[sanscript.roman.SLP1_ACCENTED]
    slp_scheme['accents']['꣡'] = '%'

Dravidian language extensions

    # Import the necessary modules from indic_transliteration
    from indic_transliteration import sanscript

    # Input data for transliteration
    data = 'असय औषधिः ग्रन्थः। ऎ ऒ यॆक्ककॊ?'

    # Transliterate from Devanagari to the OPTITRANS_DRAVIDIAN scheme
    print(sanscript.transliterate(data, sanscript.DEVANAGARI, sanscript.OPTITRANS_DRAVIDIAN))

Font converters

    converter = tech_hindi.DVTTVedicConverter()
    text_in = "    +<=hÉ *1* +EòÉ®úÉä Ê´É´ÉÞiÉ ={ÉÊnù¹]õ& |ÉÉÊGòªÉÉnù¶ÉɪÉÉÆ SÉäiªÉjÉ \"+ +' (ºÉÚ.8-4-68)  "
    output = converter.convert(text_in)

CLI

Installing the package with pip also installs a console script, sanscript, which can be used to transliterate files, standard input, or input strings from the command line.

Usage

Note: Refer to sanscript --help for the latest information.

$ sanscript [OPTIONS] [INPUT_STRING]

Arguments:

Options:

Enabling auto-completion:

Help:

Examples

Input options

Output options

For contributors

Contact

Have a problem or question? Please head to GitHub.

Packaging

python setup.py bdist_wheel
twine upload dist/* --skip-existing

Build documentation

Testing

Run pytest in the root directory.

Auxiliary tools