TURKISH TEXT DEASCIIFIER

Pek çok kişi çeşitli sebeplerle, Türkçe yazı yazarken Türkçe karakterleri kullanmaktan kaçınıyor. Gökhan Tür tarafından 2000 yılı Ocak ayında yazılan bu program, Türkçe karakterler kullanılmadan yazılmış bir metni, Latin-5 formatında normal bir Türkçe yazıya dönüştürüyor. Program, beşer harflik öbeklerin istatistik yöntemleriyle analiz edilmesi esasına dayanıyor. 100.000 civarında kelimeden oluşan büyük metinler üzerinde yapılan çalışmalar, kelimelerin neredeyse üçte birinin bu şekilde düzeltilmesi gerektiğini ortaya çıkarıyor. Bu program, bazı örneklerde bir hayli belirsizlik yaratabilen bu gibi durumlarda %99 başarı sağlıyor.

Program kullanılarak düzeltilmiş örnek bir cümlenin ilk halini ve düzeltilmiş halini aşağıda bulabilirsiniz. Türkçe karakterler kullanılmadan yazılmış metinlerinizi Türkçeleştirmek için, metni aşağıdaki kutuya yazın veya yapıştırın ve "Metni Gönder" tuşuna basın. Sonuçları kopyalayarak dilediğiniz programda kullanabilirsiniz.

Türkçe harfler kullanılarak yazılmış metinleri Türkçe harfleri tanımayan bilgisayarlara gönderirken yaşadığınız sorunlardan kurtulmak için de, buradaki işlemin tersini yapan Türkçe Karakter Temizleme Programı sayfamızı ziyaret edin.

Due to encoding problems, lack of a Turkish keyboard, or mere habit, Turkish text is often typed using only the ASCII character set, with each non-ASCII Turkish character replaced by its nearest ASCII equivalent, e.g., 'ü' is entered as 'u', 'ğ' as 'g', etc.
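The substitution described above can be sketched in a few lines of Python. This is only an illustration: the table `ASCII_MAP` and the function `asciify` are hypothetical names, and the mapping is the conventional nearest-letter one, not code taken from the program described on this page.

```python
# Hypothetical sketch of the "nearest ASCII equivalent" convention.
# Each non-ASCII Turkish letter maps to exactly one ASCII letter.
ASCII_MAP = str.maketrans("çğıöşüÇĞİÖŞÜ", "cgiosuCGIOSU")

def asciify(text: str) -> str:
    """Replace every non-ASCII Turkish letter with its ASCII look-alike."""
    return text.translate(ASCII_MAP)

print(asciify("Türkçe yazı"))  # Turkce yazi

# The mapping is many-to-one, so information is lost:
# "açı" (angle) and "acı" (pain) both collapse to "aci".
print(asciify("açı"), asciify("acı"))  # aci aci
```

Because two different Turkish words can collapse to the same ASCII string, reversing the substitution cannot be done with a lookup table alone; some model of the language is needed to choose among the candidates.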

This program, developed by Gökhan Tür in January 2000, takes such text as input and converts it to Latin-5 (ISO-8859-9) Turkish text using a statistical letter 5-gram language model. Experiments on texts totaling about 100,000 words indicate that nearly a third of the words need such fixing, and that the program corrects about 99% of those cases, some of which are quite ambiguous.
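To give a feel for how a letter n-gram model can pick between candidate spellings, here is a toy sketch. It is not the program's actual implementation: the tiny training list, the count-based score, the brute-force candidate enumeration, and all names (`ALTERNATIVES`, `train`, `score`, `deasciify`) are assumptions made for illustration, and only lowercase letters are handled.

```python
from collections import Counter
from itertools import product
import math

N = 5  # n-gram order, matching the 5-gram model mentioned in the text

# Each ambiguous ASCII letter and the Turkish letters it may stand for.
ALTERNATIVES = {"c": "cç", "g": "gğ", "i": "iı", "o": "oö", "s": "sş", "u": "uü"}

def ngrams(word: str, n: int = N) -> list[str]:
    """Letter n-grams of a word, padded so that word edges are modeled too."""
    padded = "^" * (n - 1) + word + "$"
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]

def train(words: list[str]) -> Counter:
    """Count letter n-grams in a list of correctly spelled Turkish words."""
    counts = Counter()
    for word in words:
        counts.update(ngrams(word))
    return counts

def score(word: str, counts: Counter) -> float:
    """Crude score: sum of log(count + 1) over the word's n-grams."""
    return sum(math.log(counts[g] + 1) for g in ngrams(word))

def deasciify_word(word: str, counts: Counter) -> str:
    """Try every combination of alternatives and keep the best-scoring one."""
    options = [ALTERNATIVES.get(ch, ch) for ch in word]
    candidates = ("".join(letters) for letters in product(*options))
    return max(candidates, key=lambda cand: score(cand, counts))

def deasciify(text: str, counts: Counter) -> str:
    return " ".join(deasciify_word(w, counts) for w in text.split())

# Toy training data; a real model would be trained on a large corpus.
counts = train(["türkçe", "yazı", "çok", "kişi", "güzel", "program"])
result = deasciify("turkce yazi cok guzel", counts)
print(result)  # türkçe yazı çok güzel

# Latin-5 is available in Python as the "iso-8859-9" codec (one byte per letter).
encoded = result.encode("iso-8859-9")
```

All candidates for a word have the same length, so comparing summed n-gram scores is fair in this toy setting. Enumerating every combination grows exponentially with the number of ambiguous letters, so a practical system would need a smarter search and proper probability smoothing.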

Below is a sample original sentence and the corresponding output. You can type or copy and paste your own text into the textbox below and "deasciify" or "Turkify" it by clicking "Submit Text". You can then copy the resulting text and reuse it in any application you want.

To send Turkish text to computer systems that do not recognize Turkish characters, visit the Turkish Text Asciifier, which does just the opposite.