Waseem Javaid Soomro | Sindh University Jamshoro (original) (raw)
Uploads
Papers by Waseem Javaid Soomro
The research article focuses on the Image Compression techniques such as. Discrete Cosine Transfo... more The research article focuses on the Image Compression techniques such as. Discrete Cosine Transform (DCT) and Fast Fourier Transform (FFT). These techniques are chosen because of their vast use in image processing field, JPEG (Joint Photographic Experts Group) is one of the examples of compression technique which uses DCT. The Research compares the two compression techniques based on DCT and FFT and compare their results using MATLAB software, Graphical User Interface (GUI). These results are based on two compression techniques with different rates of compression i.e. Compression rates are 90%, 60%, 30% and 5%. The technique allows compressing any picture format to JPG format. The result shows that DCT is better technique than FFT; however the compression results are same as that of 30% compression to 5% compression reflecting not significant change in visual results excepting the file size varying to small fraction. The compression technique works fine with the images having little...
The aim of this project is to design the system that can detect P-wave before the first S-wave sp... more The aim of this project is to design the system that can detect P-wave before the first S-wave spike. Typically, P-wave travel 1.68 to 1.75 times faster than S-wave. Our proposed designed device consists of a pendulum type earthquake detection device which is interconnected with fault point finder, wireless alarm, GSM kit and automatic turn off system. when P- wave strike the pendulum it activates relay and send the pulse to stimulate the wireless alarm which can be install at any place as it detects the P-waves and can save human lives as they will be aware of how to deal with this situation.
Sindh University Research Journal, 2016
Recent advancements in Computer Technologies have rapidly revolutionized the world. These advance... more Recent advancements in Computer Technologies have rapidly revolutionized the world. These advancements have immensely increased the need of localization of computer technologies in regional languages and for convenient natural language processing. In this paper, the problem of design and development of Unicode based digital thesaurus is discussed for Sindhi language. Sindhi is one of the oldest and richest languages of the world with a very rich linguistics and literary text. The development of digital Sindhi Thesaurus application is done on Java platform, using hash table structure to act as a database for storing word repository. The hash table structure provides a convenient and easy to implement data structure with multiple advantages of speed and ease of use. The words data is saved as a java bean object in the hash table element with the primary Sindhi word as key. The object is then retrieved and displayed on a user interface of thesaurus.
Optical character recognition is popular field for researchers during last decade of research, wh... more Optical character recognition is popular field for researchers during last decade of research, which is able to successfully recognize the scanned English image into editable text form. However, optical character systems for other regional languages such as Urdu, Arabic, and Sindhi, still presents a huge challenge and implementation problems. Thus, in this paper various techniques of optical character recognition system for such low level regional languages have been discussed and analyzed. This survey paper consolidates all such techniques and presents an overview to aid researcher understand the methodology of performing and implementing OCR system for Sindhi language.
This paper presents a novel combinational phonetic algorithm for Sindhi Language, to be used in d... more This paper presents a novel combinational phonetic algorithm for Sindhi Language, to be used in developing Sindhi Spell Checker which has yet not been developed prior to this work. The compound textual forms and glyphs of Sindhi language presents a substantial challenge for developing Sindhi spell checker system and generating similar suggestion list for misspelled words. In order to implement such a system, phonetic based Sindhi language rules and patterns must be considered into account for increasing the accuracy and efficiency. The proposed system is developed with a blend between Phonetic based SoundEx algorithm and ShapeEx algorithm for pattern or glyph matching, generating accurate and efficient suggestion list for incorrect or misspelled Sindhi words. A table of phonetically similar sounding Sindhi characters for SoundEx algorithm is also generated along with another table containing similar glyph or shape based character groups for ShapeEx algorithm. Both these are first ever attempt of any such type of categorization and representation for Sindhi Language.
Pakistan is currently facing huge hurdles to maintain round the clock supply of electric power in... more Pakistan is currently facing huge hurdles to maintain round the clock supply of electric power in the major areas. This makes more annoying when a short fall increase by the failure of the power transmissions and even this increases more in summer season when due to overload and high-level environmental effects of heat power transformers failure rate increases numerously. The transformers being damaged due to the over heat and high load across one or two of its three phases. Mismanagement of the power distribution causes most of the problems when a heavy power load observed on single phase whereas other phases were not equally loaded. The system design will provide a systematic solution to protect transformer and fault detection using PLC, phase monitoring and temperature sensing with power management of particular distribution and give notification through GSM and information sharing with control room/Grid station through Internet of Things (IoT). In case of overloading and heating...
This paper presents a novel combinational phonetic algorithm for Sindhi Language, to be used in d... more This paper presents a novel combinational phonetic algorithm for Sindhi Language, to be used in developing Sindhi Spell Checker which has yet not been developed prior to this work. The compound textual forms and glyphs of Sindhi language presents a substantial challenge for developing Sindhi spell checker system and generating similar suggestion list for misspelled words. In order to implement such a system, phonetic based Sindhi language rules and patterns must be considered into account for increasing the accuracy and efficiency. The proposed system is developed with a blend between Phonetic based SoundEx algorithm and ShapeEx algorithm for pattern or glyph matching, generating accurate and efficient suggestion list for incorrect or misspelled Sindhi words. A table of phonetically similar sounding Sindhi characters for SoundEx algorithm is also generated along with another table containing similar glyph or shape based character groups for ShapeEx algorithm. Both these are first ev...
Through this research the problem of Sindhi Word Segmentation has been addressed and various tech... more Through this research the problem of Sindhi Word Segmentation has been addressed and various techniques have been discussed to solve this problem. Word Segmentation is the preliminary phase involved in any tool based on Natural Language Processing (NLP). For any system to understand the written text, it needs to be able to break it into individual tokens for processing. Sindhi being a cursive ligature based Persio-Arabic script, is quite complex and rich having large number of characters in its script with all characters having multiple glyph's based on its position in the text. In this paper Sindhi word Tokenization model has been proposed implementing various algorithms showing the process of tokenizing Sindhi text into individual words for corpus building and creating word repository for Sindhi Spell, grammar checker and other NLP applications. The problem of tokenization is resolved by first identifying the sentence boundaries and extracting each sentence into isolated list form, where each list element is a complete sentence. Then the segregated sentences are broken down into words with hard space character used as word boundaries and soft spaces are considered as part of word and thus ignored from segmenting. Finally each word is again filtered to remove special characters and then each word is converted and saved as token after validation.
The research article focuses on the Image Compression techniques such as. Discrete Cosine Transfo... more The research article focuses on the Image Compression techniques such as. Discrete Cosine Transform (DCT) and Fast Fourier Transform (FFT). These techniques are chosen because of their vast use in image processing field, JPEG (Joint Photographic Experts Group) is one of the examples of compression technique which uses DCT. The Research compares the two compression techniques based on DCT and FFT and compare their results using MATLAB software, Graphical User Interface (GUI). These results are based on two compression techniques with different rates of compression i.e. Compression rates are 90%, 60%, 30% and 5%. The technique allows compressing any picture format to JPG format. The result shows that DCT is better technique than FFT; however the compression results are same as that of 30% compression to 5% compression reflecting not significant change in visual results excepting the file size varying to small fraction. The compression technique works fine with the images having little...
The aim of this project is to design the system that can detect P-wave before the first S-wave sp... more The aim of this project is to design the system that can detect P-wave before the first S-wave spike. Typically, P-wave travel 1.68 to 1.75 times faster than S-wave. Our proposed designed device consists of a pendulum type earthquake detection device which is interconnected with fault point finder, wireless alarm, GSM kit and automatic turn off system. when P- wave strike the pendulum it activates relay and send the pulse to stimulate the wireless alarm which can be install at any place as it detects the P-waves and can save human lives as they will be aware of how to deal with this situation.
Sindh University Research Journal, 2016
Recent advancements in Computer Technologies have rapidly revolutionized the world. These advance... more Recent advancements in Computer Technologies have rapidly revolutionized the world. These advancements have immensely increased the need of localization of computer technologies in regional languages and for convenient natural language processing. In this paper, the problem of design and development of Unicode based digital thesaurus is discussed for Sindhi language. Sindhi is one of the oldest and richest languages of the world with a very rich linguistics and literary text. The development of digital Sindhi Thesaurus application is done on Java platform, using hash table structure to act as a database for storing word repository. The hash table structure provides a convenient and easy to implement data structure with multiple advantages of speed and ease of use. The words data is saved as a java bean object in the hash table element with the primary Sindhi word as key. The object is then retrieved and displayed on a user interface of thesaurus.
Optical character recognition is popular field for researchers during last decade of research, wh... more Optical character recognition is popular field for researchers during last decade of research, which is able to successfully recognize the scanned English image into editable text form. However, optical character systems for other regional languages such as Urdu, Arabic, and Sindhi, still presents a huge challenge and implementation problems. Thus, in this paper various techniques of optical character recognition system for such low level regional languages have been discussed and analyzed. This survey paper consolidates all such techniques and presents an overview to aid researcher understand the methodology of performing and implementing OCR system for Sindhi language.
This paper presents a novel combinational phonetic algorithm for Sindhi Language, to be used in d... more This paper presents a novel combinational phonetic algorithm for Sindhi Language, to be used in developing Sindhi Spell Checker which has yet not been developed prior to this work. The compound textual forms and glyphs of Sindhi language presents a substantial challenge for developing Sindhi spell checker system and generating similar suggestion list for misspelled words. In order to implement such a system, phonetic based Sindhi language rules and patterns must be considered into account for increasing the accuracy and efficiency. The proposed system is developed with a blend between Phonetic based SoundEx algorithm and ShapeEx algorithm for pattern or glyph matching, generating accurate and efficient suggestion list for incorrect or misspelled Sindhi words. A table of phonetically similar sounding Sindhi characters for SoundEx algorithm is also generated along with another table containing similar glyph or shape based character groups for ShapeEx algorithm. Both these are first ever attempt of any such type of categorization and representation for Sindhi Language.
Pakistan is currently facing huge hurdles to maintain round the clock supply of electric power in... more Pakistan is currently facing huge hurdles to maintain round the clock supply of electric power in the major areas. This makes more annoying when a short fall increase by the failure of the power transmissions and even this increases more in summer season when due to overload and high-level environmental effects of heat power transformers failure rate increases numerously. The transformers being damaged due to the over heat and high load across one or two of its three phases. Mismanagement of the power distribution causes most of the problems when a heavy power load observed on single phase whereas other phases were not equally loaded. The system design will provide a systematic solution to protect transformer and fault detection using PLC, phase monitoring and temperature sensing with power management of particular distribution and give notification through GSM and information sharing with control room/Grid station through Internet of Things (IoT). In case of overloading and heating...
This paper presents a novel combinational phonetic algorithm for Sindhi Language, to be used in d... more This paper presents a novel combinational phonetic algorithm for Sindhi Language, to be used in developing Sindhi Spell Checker which has yet not been developed prior to this work. The compound textual forms and glyphs of Sindhi language presents a substantial challenge for developing Sindhi spell checker system and generating similar suggestion list for misspelled words. In order to implement such a system, phonetic based Sindhi language rules and patterns must be considered into account for increasing the accuracy and efficiency. The proposed system is developed with a blend between Phonetic based SoundEx algorithm and ShapeEx algorithm for pattern or glyph matching, generating accurate and efficient suggestion list for incorrect or misspelled Sindhi words. A table of phonetically similar sounding Sindhi characters for SoundEx algorithm is also generated along with another table containing similar glyph or shape based character groups for ShapeEx algorithm. Both these are first ev...
Through this research the problem of Sindhi Word Segmentation has been addressed and various tech... more Through this research the problem of Sindhi Word Segmentation has been addressed and various techniques have been discussed to solve this problem. Word Segmentation is the preliminary phase involved in any tool based on Natural Language Processing (NLP). For any system to understand the written text, it needs to be able to break it into individual tokens for processing. Sindhi being a cursive ligature based Persio-Arabic script, is quite complex and rich having large number of characters in its script with all characters having multiple glyph's based on its position in the text. In this paper Sindhi word Tokenization model has been proposed implementing various algorithms showing the process of tokenizing Sindhi text into individual words for corpus building and creating word repository for Sindhi Spell, grammar checker and other NLP applications. The problem of tokenization is resolved by first identifying the sentence boundaries and extracting each sentence into isolated list form, where each list element is a complete sentence. Then the segregated sentences are broken down into words with hard space character used as word boundaries and soft spaces are considered as part of word and thus ignored from segmenting. Finally each word is again filtered to remove special characters and then each word is converted and saved as token after validation.