Word Formation System of Suwawa Language Using Computer Program (original) (raw)
Related papers
Indian Linguistics, 2022
This paper describes word formation processes in Tiwa, a Tibeto-Burman language, spoken mostly in Assam and Meghalaya. Tiwa roots are usually monosyllabic in nature but as agglutination is employed in the language, multimorphemic words which are also polysyllabic are used much more frequently than monosyllabic words. The formation of new words in this language involves different word formation processes such as affixation, compounding, reduplication and onomatopoeia or echo word formation. All of these processes are prominent with compounding being the most productive among them.
VEA Model in Word Formation Process of Maithili MT
Morphological analysis is the most remarkable stage for the development of Maithili-English-Hindi MT system under NLP. This paper is motivated to design a morph analyzer for Maithili language and as add-on of Maithili MT system for appropriate analysis at the morphological level. The research is contributing through derivational process of analyzing word attached with affixes. This paper has reviewed most of the methods of MA at different level. Among all the development of MA, suffix striping and FSA are commonly practiced. Some of them are using lemma based approach for analyzing morph (Nikhil et.al. 2012). There are several linguists and computer scientists have discussed the statistical approach and Hybrid approach with FSA, probability based model (Rinju et. al. 2013) and several other approaches of machine learning to develop Maithili Morphological Analyzer (MMA). But rule based approach is friendly enough to use with all the machine learning models. VEA is one of the emerging modal in word formation process which is somehow adequate with Maithili language. Overall linguistics approach is core of MA in Maithili for developing MT system. This research is introducing linguistically a friendly and bit new with machine learning and proficient model for analyzing words and generating multiple words on the basis. The discussion also covered the concatenation with root word to suffix and prefix. Maithili MA is demonstrating a small concept with rule based model and we are designing it with hybrid modal including corpus based approach. This design is incorporating the lexicon tables, suffix list, prefix list and the Vowel Ending Approach (VEA) to justify that how does concatenation take place. In the above table POS category of Noun, Adjective and Verb are shifting to another category of POS after concatenation of suffixes. And it's also focused that how the words end with their vowels and how does suffix connects on the basis of its vowel ending mechanism.
Derivational Process of Wawonii Language
2020
This research focused on derivational process in Wawonii language at Wawonii regency. The main objective of this research was to find out derivational process of affixation in Wawonii language. This research used qualitative analysis, the data sources in this research are oral data and written data by applying some techniques of collecting data as follows: record, note taking, translation, and introspection. After the data collected the writer analyzed them through the following steps namely: making the table of the gathered data that indicate the derivational affixation in Wawonii language, the table of derivational process, making formulation and following by examples. The result of this research show that Wawonii language has derivational process of affixation consists of four affixation namely: prefix consist of four prefixes namely mon-, me-, po-, mong-, suffix consist of three suffixes namely -no, -omo, -io, confix consist of three confixes namely mo-i, pe-no, and infix con...
Sandhi: The Rule Based Word Formation in Hindi
Natural Language processing (NLP) helps a machine to understand the human language. Due to various reasons human language identification and analysis is a very tedious task. One of them is meaning of the words. In NLP, to derive meaning from a sentence, words are treated as data. Therefore, the formation of words is important for NLP. Out of 447 languages, 22 are official languages in India. Hindi being the most popular and used, became the target choice for computerization. Sandhi is a process through which two or more independent words are joined to produce a new meaningful word. In this paper we present an algorithm that performs Sandhi and does Sandhi-Vichchhed (splitting compound words). The algorithm has been tested on 887 unique Hindi words that are compound i.e. Sandhi-Vichchhed can be applied to them.
NOMINAL WORD FORMATIONS IN TOBA BATAK LANGUAGE: A STUDY OF GENERATIVE MORPHOLOGY
The objective of this paper is to explore nominal word formations in Toba Batak language. The theory applied in this study is generative morphology proposed by Halle (1973). The basic principle in generative morphology is that the process of word formations can generate actual words and potential words. According to generative morphology the mechanism of word formations will be postulated in list of morphemes, word formation rules, filter, and dictionary. The method of this study is qualitative descriptive; it is a method of study which describes language phenomena naturally without any exception. The results show that nominal word formations in Toba Batak language are distinguished in 3 main ways, they are: [1] by attaching affixations, [2] by inserting premodifier ni between adjectival bases and nominal bases, and [3] by moving the stress of free adjectival bases from the first syllable to the second syllable. There are 14 affixations that can form nouns in Toba Batak language, they are: (i) six prefixes (par-, na-, sa-, sanha-, hina-, ha-), (ii) two infixes (-ar- ,-al-), (iii) one suffix (-na), (iv) four multiple affixations (ha-…-on, pa-…-an, pa-…-on, par-…-an), and (v) double affixations (par-in-). Nominal word formations derive from various free word bases, such as, free adjectival bases, free verbal bases, free nominal bases, free numeric bases, and free adverbial bases. The results of these affixations can be inflectional or derivational. Some complex words have to be put into filter to be processed morphophonologically before they are put into dictionary.
A Suffix Based Morphological Analysis of Assamese Word Formation
International Journal on Recent and Innovation Trends in Computing and Communication, 2017
Languages have several important features such as part-of-speech, tenses, prefixes and suffixes etc. which play major roles to solve the purpose of the language. In Assamese language suffixation is a very sensitive and unavoidable factor in the formation of Assamese words. Suffixes are letters or group of letters placed right after the nouns, pronouns, adjectives, verbs and adverbs etc to intensify the meaning contextually of the newly formed words due to suffixation. Because of the inflectional nature of suffixation, it often creates new words differing in part-of-speech and meaning from the original words, it is attached with. Hence suffixation is morphodyanmic process through which new words are generated from old words changing their forms, function and meaning thus increasing the lexical inventory of Assamese language. This particular study can create a theoretical base about the nature of lexical generativity of suffixes in the formation of Assamese words.
The Process of Japanese Compound Word Formation
Proceedings of the Unima International Conference on Social Sciences and Humanities (UNICSSH 2022), 2023
The process of forming a Japanese word is called gokeisei. Gokeisei consists of 4 kinds, namely haseigo (派生語), karikomi or shouryaku (刈り込み• 省略), toujigo (頭字語) and fukugougo/gouseigo (複合語• 合成語). This study aims to identify changes in phonemes in the process of gouseigo formation. In this study, the data were analyzed by identifying vocabulary in the form of gouseigo, making a list of gouseigo, interpreting the data according to theory, then discussing the results of data processing. This study uses a qualitative descriptive method, namely data on japanese compound words (gouseigo) are analyzed and presented according to existing circumstances or phenomena as they are. The data collection technique in this study was carried out by collecting data and information sourced from literature books related to gouseigo. Based on the results of this study, it is hoped that it can be useful for Japanese language learners so that they can understand and form Japanese compound words correctly. The formation of the Japanese compound word (gouseigo) can be grouped into: 1)meishi + meishi, 2)meishi + doushi, 3)doushi + meishi, 4)doushi + doushi. Gouseigo vocabulary can be categorized according to the changes in onso that occur in the process of forming the word.
Computational Morphological Analysis of Yorùbá Language Words
IAES International Journal of Artificial Intelligence (IJ-AI)
Nigeria official languages are English, Yorùbá, Igbo and Hausa. The focus of the study reported in this paper is to develop learning tool that can assist learners to learn the Yorùbá language using its alphabets. The study is critical to Yorùbá language, because of its endangerment. There is need to introduce different learning tools that can mitigate its extinction. A Yorùbá word perfect system was developed to assist people in learning the Yorùbá language. English and Yorùbá words formation are experimented using computational morphological approach (word formation). The theoretical framework considered Finite state automata (FSA) to realise different ways of combining the consonants and vowels to form word. Two to five letter words were considered. The system was designed and implemented using UML tools and python programming language.The system will teach the users on how the words are formed, and the number of syllables in each word. The user need not to know how to tone mark ...