Mohamed Osman Hegazi | Prince Sattam Bin Abdulaziz University (original) (raw)
Uploads
Papers by Mohamed Osman Hegazi
Extracting knowledge from text documents has become one of the main hot topics in the field of Na... more Extracting knowledge from text documents has become one of the main hot topics in the field of Natural Language Processing (NLP) in the era of information explosion. Arabic NLP is considered immature due to several reasons including the low available resources. On the other hand, automatically extracting reliable knowledge from specialized data sources as holy books is considered ultimately a challenging task but of great benefit to all humans. In this context, this paper provides a comprehensive Quranic Dataset as a first part (foundation) of an ongoing research that attempts to lay grounds for approaches and applications to explore the holy Quran. The paper presents the algorithms and approaches that have been designed to extract an aggregative data from massive Arabic text sources including the holy Quran and tightly associated books. Holy Quran text is transferred into structured multi-dimensional data records starting from the chapter level, the word level and then the character level. All these are linked with interpretations and meanings, parsing, translations, intonation roots and stems of words, all from authentic and reliable sources. The final dataset is represented in excel sheets and database records format. Also, the paper presents models of the dataset at all levels. The Quranic dataset presented in this paper was designed to be appropriate for: database, data mining, text mining and Artificial Intelligence applications; it is also designed to serve as a comprehensive encyclopedia of holy Quran and the Quranic Science books.
International Journal of Advanced Computer Science and Applications, 2015
ABSTRACT The Holy Quran is the reference book for more than 1.6 billion of Muslims all around the... more ABSTRACT The Holy Quran is the reference book for more than 1.6 billion of Muslims all around the world Extracting information and knowledge from the Holy Quran is of high benefit for both specialized people in Islamic studies as well as non-specialized people. This paper initiates a series of research studies that aim to serve the Holy Quran and provide helpful and accurate information and knowledge to the all human beings. Also, the planned research studies aim to lay out a framework that will be used by researchers in the field of Arabic natural language processing by providing a ”Golden Dataset” along with useful techniques and information that will advance this field further. The aim of this paper is to find an approach for analyzing Arabic text and then providing statistical information which might be helpful for the people in this research area. In this paper the holly Quran text is preprocessed and then different text mining operations are applied to it to reveal simple facts about the terms of the holy Quran. The results show a variety of characteristics of the Holy Quran such as its most important words, its wordcloud and chapters with high term frequencies. All these results are based on term frequencies that are calculated using both Term Frequency (TF) and Term Frequency-Inverse Document Frequency (TF-IDF) methods.
2013 Fourth International Conference on e-Learning "Best Practices in Management, Design and Development of e-Courses: Standards of Excellence and Creativity", 2013
ABSTRACT This paper provides an enhanced model for m-learning, the model works as a framework and... more ABSTRACT This paper provides an enhanced model for m-learning, the model works as a framework and methodology to provide Arabic mobile learning engine based on the open source Mobile Learning Engine (MLE). This model is based on two phases, the first phase is for constructing the general framework of the Arabic engine inside the MLE. The second phase is for building the Arabic engine that adapts MLE for Arabic contents. Therefore our proposed model is called Arabic Mobile Learning Engine -AMLE.
This paper provide the aspects of mobile learning by presenting the differences techniques and as... more This paper provide the aspects of mobile learning by presenting the differences techniques and aspects of using mobile devices in education according to the perspective of researches and studies in this area, it offers a comparison between e-learning and m-learning, a classification of education by mobile devices, and shows the categories of mobile learning. The paper also provides the readiness of mobile learning by presenting the results of study conducted at faculty of Computer Science and Information Technology at Al- Zaiem Al-Azahri University in Sudan.
Replication can be a success factor in database systems as well as perhaps being one of the needs... more Replication can be a success factor in database systems as well as perhaps being one of the needs of proliferation, expansion, and the rapid progress of databases and distributed technology, despite there being a strong belief among database designers that most existing solutions are not feasible due to their complexity, poor performance and lack of scalability. This paper provides an approach that can help designers in implementing eager and lazy replication mechanisms. The proposed approach contains two phases: In the first phase, the database is designed to have indicator fields that can carry the update status, and to consider the replication concepts by classifying, categorizing and determining the kinds and locations of data objects; in the second phase, the updating methodology is provided to make the implementation of eager and lazy replication mechanisms easier and reliable.
This study proposes a database designing model that works as a foundation for database designing ... more This study proposes a database designing model that works as a foundation for database designing phases
and as integration for the database systems. The model starts by fragment the system from top to down and
then integrate the different parts of the system using the bottom up approach. The proposed model is a
graphic model, where it presents the fragmentations of the system in indexed binary tree and then the
integration process follows the concept of tree traversing. The proposed model can be a scientific approach
for solving the difficulties of database design in understanding the structure and the behaviors of the
application and in integrating of the system parts
The relations between the constituent elements of the predominant activities may cannot be measur... more The relations between the constituent elements of the predominant activities may cannot be measured using the accurate measurement, the reason may be on the conflict or the effect of such rules with other factor, accordingly uncertainty measurement is needed. This paper presents a measurement and predictive fuzzy model that can be fit to work on such rules. The proposed model measures and predicting the impacts of such rules using fuzzy logic, it designed based on the concept of fuzzy logic, fuzzy set and the production rule. The model transforms if-then business rule to weighted fuzzy production rule, and then used this production rule for predicting and measuring the impact of the business rule. The model is tested using real data and provide considerable results.
Extracting knowledge from text documents has become one of the main hot topics in the field of Na... more Extracting knowledge from text documents has become one of the main hot topics in the field of Natural Language Processing (NLP) in the era of information explosion. Arabic NLP is considered immature due to several reasons including the low available resources. On the other hand, automatically extracting reliable knowledge from specialized data sources as holy books is considered ultimately a challenging task but of great benefit to all humans. In this context, this paper provides a comprehensive Quranic Dataset as a first part (foundation) of an ongoing research that attempts to lay grounds for approaches and applications to explore the holy Quran. The paper presents the algorithms and approaches that have been designed to extract an aggregative data from massive Arabic text sources including the holy Quran and tightly associated books. Holy Quran text is transferred into structured multi-dimensional data records starting from the chapter level, the word level and then the character level. All these are linked with interpretations and meanings, parsing, translations, intonation roots and stems of words, all from authentic and reliable sources. The final dataset is represented in excel sheets and database records format. Also, the paper presents models of the dataset at all levels. The Quranic dataset presented in this paper was designed to be appropriate for: database, data mining, text mining and Artificial Intelligence applications; it is also designed to serve as a comprehensive encyclopedia of holy Quran and the Quranic Science books.
International Journal of Advanced Computer Science and Applications, 2015
ABSTRACT The Holy Quran is the reference book for more than 1.6 billion of Muslims all around the... more ABSTRACT The Holy Quran is the reference book for more than 1.6 billion of Muslims all around the world Extracting information and knowledge from the Holy Quran is of high benefit for both specialized people in Islamic studies as well as non-specialized people. This paper initiates a series of research studies that aim to serve the Holy Quran and provide helpful and accurate information and knowledge to the all human beings. Also, the planned research studies aim to lay out a framework that will be used by researchers in the field of Arabic natural language processing by providing a ”Golden Dataset” along with useful techniques and information that will advance this field further. The aim of this paper is to find an approach for analyzing Arabic text and then providing statistical information which might be helpful for the people in this research area. In this paper the holly Quran text is preprocessed and then different text mining operations are applied to it to reveal simple facts about the terms of the holy Quran. The results show a variety of characteristics of the Holy Quran such as its most important words, its wordcloud and chapters with high term frequencies. All these results are based on term frequencies that are calculated using both Term Frequency (TF) and Term Frequency-Inverse Document Frequency (TF-IDF) methods.
2013 Fourth International Conference on e-Learning "Best Practices in Management, Design and Development of e-Courses: Standards of Excellence and Creativity", 2013
ABSTRACT This paper provides an enhanced model for m-learning, the model works as a framework and... more ABSTRACT This paper provides an enhanced model for m-learning, the model works as a framework and methodology to provide Arabic mobile learning engine based on the open source Mobile Learning Engine (MLE). This model is based on two phases, the first phase is for constructing the general framework of the Arabic engine inside the MLE. The second phase is for building the Arabic engine that adapts MLE for Arabic contents. Therefore our proposed model is called Arabic Mobile Learning Engine -AMLE.
This paper provide the aspects of mobile learning by presenting the differences techniques and as... more This paper provide the aspects of mobile learning by presenting the differences techniques and aspects of using mobile devices in education according to the perspective of researches and studies in this area, it offers a comparison between e-learning and m-learning, a classification of education by mobile devices, and shows the categories of mobile learning. The paper also provides the readiness of mobile learning by presenting the results of study conducted at faculty of Computer Science and Information Technology at Al- Zaiem Al-Azahri University in Sudan.
Replication can be a success factor in database systems as well as perhaps being one of the needs... more Replication can be a success factor in database systems as well as perhaps being one of the needs of proliferation, expansion, and the rapid progress of databases and distributed technology, despite there being a strong belief among database designers that most existing solutions are not feasible due to their complexity, poor performance and lack of scalability. This paper provides an approach that can help designers in implementing eager and lazy replication mechanisms. The proposed approach contains two phases: In the first phase, the database is designed to have indicator fields that can carry the update status, and to consider the replication concepts by classifying, categorizing and determining the kinds and locations of data objects; in the second phase, the updating methodology is provided to make the implementation of eager and lazy replication mechanisms easier and reliable.
This study proposes a database designing model that works as a foundation for database designing ... more This study proposes a database designing model that works as a foundation for database designing phases
and as integration for the database systems. The model starts by fragment the system from top to down and
then integrate the different parts of the system using the bottom up approach. The proposed model is a
graphic model, where it presents the fragmentations of the system in indexed binary tree and then the
integration process follows the concept of tree traversing. The proposed model can be a scientific approach
for solving the difficulties of database design in understanding the structure and the behaviors of the
application and in integrating of the system parts
The relations between the constituent elements of the predominant activities may cannot be measur... more The relations between the constituent elements of the predominant activities may cannot be measured using the accurate measurement, the reason may be on the conflict or the effect of such rules with other factor, accordingly uncertainty measurement is needed. This paper presents a measurement and predictive fuzzy model that can be fit to work on such rules. The proposed model measures and predicting the impacts of such rules using fuzzy logic, it designed based on the concept of fuzzy logic, fuzzy set and the production rule. The model transforms if-then business rule to weighted fuzzy production rule, and then used this production rule for predicting and measuring the impact of the business rule. The model is tested using real data and provide considerable results.