Nelda Kote | Polytechnic University of Tirana (original) (raw)
Papers by Nelda Kote
Lecture notes on data engineering and communications technologies, 2024
International Journal of Advanced Computer Science and Applications
Lecture notes in networks and systems, Oct 18, 2022
Advances in Internet, Data & Web Technologies, 2018
Text mining and natural language processing are gaining significant role in our daily life as inf... more Text mining and natural language processing are gaining significant role in our daily life as information volumes increase steadily. Most of the digital information is unstructured in the form of raw text. While for several languages there is extensive research on mining and language processing, much less work has been performed for other languages. In this paper we aim to evaluate the performance of some of the most important text classification algorithms over a corpus composed of Albanian texts. After applying natural language preprocessing steps, we apply several algorithms such as Simple Logistics, Naive Bayes, k-Nearest Neighbor, Decision Trees, Random Forest, Support Vector Machines and Neural Networks. The experiments show that Naive Bayes and Support Vector Machines perform best in classifying Albanian corpuses. Furthermore, Simple Logistics algorithm also shows good results.
Advances in Internet, Data & Web Technologies, 2018
Nowadays, analysis of opinions in online media such as newspapers, social media, forums, blogs, p... more Nowadays, analysis of opinions in online media such as newspapers, social media, forums, blogs, product review sites, has a key role in the human life. In this context, opinion mining is one of the fastest growing research areas in natural language processing that aims to extract and organize opinions from users. Machine Learning techniques represent a powerful instrument to analyze and understand correctly text data. In this paper we present a thorough experimental evaluation of machine learning algorithms used for opinion mining in Albanian language. The experimental results are interpreted with respect to various evaluation criteria for the different algorithms showing interesting features on the performance of each algorithm.
Opinion mining is an important tool to find out what others think about something. Most of method... more Opinion mining is an important tool to find out what others think about something. Most of methods used for opinion mining are based on machine learning. In this paper we present an experimental evaluation of machine learning algorithms used for opinion mining in a multi-domain corpus in Albanian language. We have created 11 multi-domains corpuses combining the opinions from 5 different topics. The opinions are classified as positive or negative. All the corpuses are used to train and test for opinion mining the performance of 50 classification algorithms. Out of these, there are seven best performing algorithms out of which three are based on Naive Bayes.
Over the last decade, Bayesian Networks (BNs) have become an increasingly popular Artificial Inte... more Over the last decade, Bayesian Networks (BNs) have become an increasingly popular Artificial Intelligence approach. BNs are a widely used method in the modelling of uncertain knowledge. There have been many important new developments in this field. This paper presents a review and classification scheme for recent researches on Bayesian Networks. This is achieved by reviewing relevant articles published in the recent years. The articles are classified based on a scheme that consists of three main Bayesian Networks topics: Bayesian Networks Structure Learning, Advanced Application of Bayesian Networks and Bayesian Network Classifiers. This review provides a reference source and classification scheme for researchers interested in BNs, and indicates under-researched areas as well as future directions.
Global journal of computer science and technology, 2011
ArXiv, 2019
In this paper, we present the first publicly available part-of-speech and morphologically tagged ... more In this paper, we present the first publicly available part-of-speech and morphologically tagged corpus for the Albanian language, as well as a neural morphological tagger and lemmatizer trained on it. There is currently a lack of available NLP resources for Albanian, and its complex grammar and morphology present challenges to their development. We have created an Albanian part-of-speech corpus based on the Universal Dependencies schema for morphological annotation, containing about 118,000 tokens of naturally occuring text collected from different text sources, with an addition of 67,000 tokens of artificially created simple sentences used only in training. On this corpus, we subsequently train and evaluate segmentation, morphological tagging and lemmatization models, using the Turku Neural Parser Pipeline. On the held-out evaluation set, the model achieves 92.74% accuracy on part-of-speech tagging, 85.31% on morphological tagging, and 89.95% on lemmatization. The manually annotat...
Lecture notes on data engineering and communications technologies, 2024
International Journal of Advanced Computer Science and Applications
Lecture notes in networks and systems, Oct 18, 2022
Advances in Internet, Data & Web Technologies, 2018
Text mining and natural language processing are gaining significant role in our daily life as inf... more Text mining and natural language processing are gaining significant role in our daily life as information volumes increase steadily. Most of the digital information is unstructured in the form of raw text. While for several languages there is extensive research on mining and language processing, much less work has been performed for other languages. In this paper we aim to evaluate the performance of some of the most important text classification algorithms over a corpus composed of Albanian texts. After applying natural language preprocessing steps, we apply several algorithms such as Simple Logistics, Naive Bayes, k-Nearest Neighbor, Decision Trees, Random Forest, Support Vector Machines and Neural Networks. The experiments show that Naive Bayes and Support Vector Machines perform best in classifying Albanian corpuses. Furthermore, Simple Logistics algorithm also shows good results.
Advances in Internet, Data & Web Technologies, 2018
Nowadays, analysis of opinions in online media such as newspapers, social media, forums, blogs, p... more Nowadays, analysis of opinions in online media such as newspapers, social media, forums, blogs, product review sites, has a key role in the human life. In this context, opinion mining is one of the fastest growing research areas in natural language processing that aims to extract and organize opinions from users. Machine Learning techniques represent a powerful instrument to analyze and understand correctly text data. In this paper we present a thorough experimental evaluation of machine learning algorithms used for opinion mining in Albanian language. The experimental results are interpreted with respect to various evaluation criteria for the different algorithms showing interesting features on the performance of each algorithm.
Opinion mining is an important tool to find out what others think about something. Most of method... more Opinion mining is an important tool to find out what others think about something. Most of methods used for opinion mining are based on machine learning. In this paper we present an experimental evaluation of machine learning algorithms used for opinion mining in a multi-domain corpus in Albanian language. We have created 11 multi-domains corpuses combining the opinions from 5 different topics. The opinions are classified as positive or negative. All the corpuses are used to train and test for opinion mining the performance of 50 classification algorithms. Out of these, there are seven best performing algorithms out of which three are based on Naive Bayes.
Over the last decade, Bayesian Networks (BNs) have become an increasingly popular Artificial Inte... more Over the last decade, Bayesian Networks (BNs) have become an increasingly popular Artificial Intelligence approach. BNs are a widely used method in the modelling of uncertain knowledge. There have been many important new developments in this field. This paper presents a review and classification scheme for recent researches on Bayesian Networks. This is achieved by reviewing relevant articles published in the recent years. The articles are classified based on a scheme that consists of three main Bayesian Networks topics: Bayesian Networks Structure Learning, Advanced Application of Bayesian Networks and Bayesian Network Classifiers. This review provides a reference source and classification scheme for researchers interested in BNs, and indicates under-researched areas as well as future directions.
Global journal of computer science and technology, 2011
ArXiv, 2019
In this paper, we present the first publicly available part-of-speech and morphologically tagged ... more In this paper, we present the first publicly available part-of-speech and morphologically tagged corpus for the Albanian language, as well as a neural morphological tagger and lemmatizer trained on it. There is currently a lack of available NLP resources for Albanian, and its complex grammar and morphology present challenges to their development. We have created an Albanian part-of-speech corpus based on the Universal Dependencies schema for morphological annotation, containing about 118,000 tokens of naturally occuring text collected from different text sources, with an addition of 67,000 tokens of artificially created simple sentences used only in training. On this corpus, we subsequently train and evaluate segmentation, morphological tagging and lemmatization models, using the Turku Neural Parser Pipeline. On the held-out evaluation set, the model achieves 92.74% accuracy on part-of-speech tagging, 85.31% on morphological tagging, and 89.95% on lemmatization. The manually annotat...