Debasis Majhi | Banaras Hindu University, Varanasi (original) (raw)

Papers by Debasis Majhi

Research paper thumbnail of Scholarly Communication and Machine-Generated Text: Is it Finally AI vs AI in Plagiarism Detection?

Journal of Information and Knowledge, 2023

This study utilizes GPT (Generative Pre-Trained Transformer) language model-based AI writing tool... more This study utilizes GPT (Generative Pre-Trained Transformer) language model-based AI writing tools to create a set of 80 academic writing samples based on the eight themes of the experiential sessions of the LTC 2023. These samples, each between 2000 and 2500 words long, are then analyzed using both conventional plagiarism detection tools and selected AI detection tools. The study finds that traditional syntactic similarity-based anti-plagiarism tools struggle to detect AI-generated text due to the differences in syntax and structure between machine-generated and human-written text. However, the researchers discovered that AI detector tools can be used to catch AI-generated content based on specific characteristics that are typical of machine-generated text. The paper concludes by posing the question of whether we are entering an era in which AI detectors will be used to prevent AI-generated content from entering the scholarly communication process. This research sheds light on the challenges associated with AI-generated content in the academic research literature and offers a potential solution for detecting and preventing plagiarism in this context.

Research paper thumbnail of Identifying research fronts in NLP applications in library and information science using meta-analysis approaches

Digital Library Perspectives

Purpose The purpose of this study is to identify the research fronts by analysing highly cited co... more Purpose The purpose of this study is to identify the research fronts by analysing highly cited core papers adjusted with the age of a paper in library and information science (LIS) where natural language processing (NLP) is being applied significantly. Design/methodology/approach By excavating international databases, 3,087 core papers that received at least 5% of the total citations have been identified. By calculating the average mean years of these core papers, and total citations received, a CPT (citation/publication/time) value was calculated in all 20 fronts to understand how a front is relatively receiving greater attention among peers within a course of time. One theme article has been finally identified from each of these 20 fronts. Findings Bidirectional encoder representations from transformers with CPT value 1.608 followed by sentiment analysis with CPT 1.292 received highest attention in NLP research. Columbia University New York, in terms of University, Journal of the ...

Research paper thumbnail of Comparing research trends through author-provided keywords with machine extracted terms: A ML algorithm approach using publications data on neurological disorders

Iberoamerican Journal of Science Measurement and Communication

Objective. This study aimed to identify the primary research areas, countries, and organizational... more Objective. This study aimed to identify the primary research areas, countries, and organizational involvement in publications on neurological disorders through an analysis of human-assigned keywords. These results were then compared with unsupervised and machine-algorithm-based extracted terms from the title and abstract of the publications to gain knowledge about deficiencies of both techniques. This has enabled us to understand how far machine-derived terms through titles and abstracts can be a substitute for human-assigned keywords of scientific research articles. Design/Methodology/Approach. While significant research areas on neurological disorders were identified from the author-provided keywords of downloaded publications of Web of Science and PubMed, these results were compared by the terms extracted from titles and abstracts through unsupervised based models like VOSviewer and machine-algorithm-based techniques like YAKE and CounterVectorizer. Results/Discussion. We observe...

Research paper thumbnail of Indian Research Output on Scientometric Literature as Indexed in Scopus: a Scientometric Exploration

Library Philosophy and Practice (e-journal), May 15, 2021

In recent decay, the scientometric study is one of the major research areas in scholarly communic... more In recent decay, the scientometric study is one of the major research areas in scholarly communication. Researchers have conducted their research in the scientometric field from different core subject areas. Using bibliographic records on a scientometric field from the SCOPUS database, this paper tries to give a complete view of the evaluation of Indian research in the domain of scientometric. From 2010-2019 researchers have published 41462 publications out of the 334 number publications belongs to the scientometric domain of Indian research. Researchers have critically analyzed the collected data on various aspects like year-wise publication, author collaboration, authorship pattern, degree of collaboration, collaborative coefficient (CC), leading authors, productive journal, state-wise production in India, and mostly used keyword. The finding of the study disclosed that the maximum number of articles (97) published in the year 2019 with 222 citations. In the year 2015 got the highest number of citations (355) from only 31 publications. The highest number of articles are two-authored (140) followed by three-authored (89) and single-authored (54) respectively, and the average number of authors per article is 2.13. In respect of state-wise production, New Delhi has stood the first position with 191 publications. The word "scientometric” is the most used keyword and the top productive journal is Library Philosophy and Practice (114).

Research paper thumbnail of DDC Wise Subject Mapping of Ndli Indexed Multileveled Multilingual Resources a Special Reference with Indian Languages

The study helped with the subject mapping of the resources on 22 languages included in Indian con... more The study helped with the subject mapping of the resources on 22 languages included in Indian constitution that were indexed in the National Digital Library of India from 2018 and covering the different fields of arts and humanities, social sciences, science, engineering, technology, and history. The subject mapping is based on the Dewey Decimal Classification (DDC) 22nd edition. The relevant/required data was collected from the official website of the National Digital Library of India (https://ndl.iitkgp.ac.in/) on 4th April 2021, Sunday by browsing subject-wise. (https://ndl.iitkgp.ac.in/result?q={"t":"subject","b":{"browse":"subject","filters":[]}}) and filtering one by one of 22 officially recognized Indian languages in the Indian constitution. A total number of 398793 documents on different subjects indexed in NDLI till 4th April 2021, The maximum number of documents indexed in Computer science, Information & general works (000) subject [88.98%] and the larger number of documents indexed in the Bengali language [40.71%].

Research paper thumbnail of DDC-wise subject mapping of NDLI indexed multileveled, multilingual resources: A special reference with Indian Languages

Library Philosophy and Practice (e-journal), 2021

The study helped with the subject mapping of the resources on 22 languages included in Indian con... more The study helped with the subject mapping of the resources on 22 languages included in Indian constitution that were indexed in the National Digital Library of India from 2018 and covering the different fields of arts and humanities, social sciences, science, engineering, technology, and history. The subject mapping is based on the Dewey Decimal Classification (DDC) 22nd edition. The relevant/required data was collected from the official website of the National Digital Library of India (https://ndl.iitkgp.ac.in/) on 4th April 2021, Sunday by browsing subject-wise. (https://ndl.iitkgp.ac.in/result?q={"t":"subject","b":{"browse":"subject","filters":[]}}) and filtering one by one of 22 officially recognized Indian languages in the Indian constitution. A total number of 398793 documents on different subjects indexed in NDLI till 4th April 2021, The maximum number of documents indexed in Computer science, Information & general works (000) subject [88.98%] and the larger number of documents indexed in the Bengali language [40.71%].

Research paper thumbnail of Indian Research Output on Scientometric Literature as Indexed in Scopus: a Scientometric Exploration

Library Philosophy and Practice (e-journal), 2021

In recent decay, the scientometric study is one of the major research areas in scholarly communic... more In recent decay, the scientometric study is one of the major research areas in scholarly communication. Researchers have conducted their research in the scientometric field from different core subject areas. Using bibliographic records on a scientometric field from the SCOPUS database, this paper tries to give a complete view of the evaluation of Indian research in the domain of scientometric. From 2010-2019 researchers have published 41462 publications out of the 334 number publications belongs to the scientometric domain of Indian research. Researchers have critically analyzed the collected data on various aspects like year-wise publication, author collaboration, authorship pattern, degree of collaboration, collaborative coefficient (CC), leading authors, productive journal, state-wise production in India, and mostly used keyword. The finding of the study disclosed that the maximum number of articles (97) published in the year 2019 with 222 citations. In the year 2015 got the highest number of citations (355) from only 31 publications. The highest number of articles are two-authored (140) followed by three-authored (89) and single-authored (54) respectively, and the average number of authors per article is 2.13. In respect of state-wise production, New Delhi has stood the first position with 191 publications. The word "scientometric" is the most used keyword and the top productive journal is Library Philosophy and Practice (114).

Research paper thumbnail of Bibliometric analysis of LIS journals published from India during 2013 to 2017: A comparative Study

International Journal of Advanced and Innovative Research, 2019

The current study is conceived to assess the selected top periodical publishing scenario of India... more The current study is conceived to assess the selected top periodical publishing scenario of India in library and information science for the period of 2013-2017 based on the SJR. Various quality aspects of the 746 articles published in the period were studied. The other areas covered under the study include: the annual growth of articles, distribution of periodicals across the different journals, references per documents, distribution of citations, and range of citations per article. The maximum Articles were published from India of the journal "DESIDOC Journal of Library and Information Technology". Various bibliometric techniques used to determine the articles during the period under the study.

Research paper thumbnail of Scholarly Communication and Machine-Generated Text: Is it Finally AI vs AI in Plagiarism Detection?

Journal of Information and Knowledge, 2023

This study utilizes GPT (Generative Pre-Trained Transformer) language model-based AI writing tool... more This study utilizes GPT (Generative Pre-Trained Transformer) language model-based AI writing tools to create a set of 80 academic writing samples based on the eight themes of the experiential sessions of the LTC 2023. These samples, each between 2000 and 2500 words long, are then analyzed using both conventional plagiarism detection tools and selected AI detection tools. The study finds that traditional syntactic similarity-based anti-plagiarism tools struggle to detect AI-generated text due to the differences in syntax and structure between machine-generated and human-written text. However, the researchers discovered that AI detector tools can be used to catch AI-generated content based on specific characteristics that are typical of machine-generated text. The paper concludes by posing the question of whether we are entering an era in which AI detectors will be used to prevent AI-generated content from entering the scholarly communication process. This research sheds light on the challenges associated with AI-generated content in the academic research literature and offers a potential solution for detecting and preventing plagiarism in this context.

Research paper thumbnail of Identifying research fronts in NLP applications in library and information science using meta-analysis approaches

Digital Library Perspectives

Purpose The purpose of this study is to identify the research fronts by analysing highly cited co... more Purpose The purpose of this study is to identify the research fronts by analysing highly cited core papers adjusted with the age of a paper in library and information science (LIS) where natural language processing (NLP) is being applied significantly. Design/methodology/approach By excavating international databases, 3,087 core papers that received at least 5% of the total citations have been identified. By calculating the average mean years of these core papers, and total citations received, a CPT (citation/publication/time) value was calculated in all 20 fronts to understand how a front is relatively receiving greater attention among peers within a course of time. One theme article has been finally identified from each of these 20 fronts. Findings Bidirectional encoder representations from transformers with CPT value 1.608 followed by sentiment analysis with CPT 1.292 received highest attention in NLP research. Columbia University New York, in terms of University, Journal of the ...

Research paper thumbnail of Comparing research trends through author-provided keywords with machine extracted terms: A ML algorithm approach using publications data on neurological disorders

Iberoamerican Journal of Science Measurement and Communication

Objective. This study aimed to identify the primary research areas, countries, and organizational... more Objective. This study aimed to identify the primary research areas, countries, and organizational involvement in publications on neurological disorders through an analysis of human-assigned keywords. These results were then compared with unsupervised and machine-algorithm-based extracted terms from the title and abstract of the publications to gain knowledge about deficiencies of both techniques. This has enabled us to understand how far machine-derived terms through titles and abstracts can be a substitute for human-assigned keywords of scientific research articles. Design/Methodology/Approach. While significant research areas on neurological disorders were identified from the author-provided keywords of downloaded publications of Web of Science and PubMed, these results were compared by the terms extracted from titles and abstracts through unsupervised based models like VOSviewer and machine-algorithm-based techniques like YAKE and CounterVectorizer. Results/Discussion. We observe...

Research paper thumbnail of Indian Research Output on Scientometric Literature as Indexed in Scopus: a Scientometric Exploration

Library Philosophy and Practice (e-journal), May 15, 2021

In recent decay, the scientometric study is one of the major research areas in scholarly communic... more In recent decay, the scientometric study is one of the major research areas in scholarly communication. Researchers have conducted their research in the scientometric field from different core subject areas. Using bibliographic records on a scientometric field from the SCOPUS database, this paper tries to give a complete view of the evaluation of Indian research in the domain of scientometric. From 2010-2019 researchers have published 41462 publications out of the 334 number publications belongs to the scientometric domain of Indian research. Researchers have critically analyzed the collected data on various aspects like year-wise publication, author collaboration, authorship pattern, degree of collaboration, collaborative coefficient (CC), leading authors, productive journal, state-wise production in India, and mostly used keyword. The finding of the study disclosed that the maximum number of articles (97) published in the year 2019 with 222 citations. In the year 2015 got the highest number of citations (355) from only 31 publications. The highest number of articles are two-authored (140) followed by three-authored (89) and single-authored (54) respectively, and the average number of authors per article is 2.13. In respect of state-wise production, New Delhi has stood the first position with 191 publications. The word "scientometric” is the most used keyword and the top productive journal is Library Philosophy and Practice (114).

Research paper thumbnail of DDC Wise Subject Mapping of Ndli Indexed Multileveled Multilingual Resources a Special Reference with Indian Languages

The study helped with the subject mapping of the resources on 22 languages included in Indian con... more The study helped with the subject mapping of the resources on 22 languages included in Indian constitution that were indexed in the National Digital Library of India from 2018 and covering the different fields of arts and humanities, social sciences, science, engineering, technology, and history. The subject mapping is based on the Dewey Decimal Classification (DDC) 22nd edition. The relevant/required data was collected from the official website of the National Digital Library of India (https://ndl.iitkgp.ac.in/) on 4th April 2021, Sunday by browsing subject-wise. (https://ndl.iitkgp.ac.in/result?q={"t":"subject","b":{"browse":"subject","filters":[]}}) and filtering one by one of 22 officially recognized Indian languages in the Indian constitution. A total number of 398793 documents on different subjects indexed in NDLI till 4th April 2021, The maximum number of documents indexed in Computer science, Information & general works (000) subject [88.98%] and the larger number of documents indexed in the Bengali language [40.71%].

Research paper thumbnail of DDC-wise subject mapping of NDLI indexed multileveled, multilingual resources: A special reference with Indian Languages

Library Philosophy and Practice (e-journal), 2021

The study helped with the subject mapping of the resources on 22 languages included in Indian con... more The study helped with the subject mapping of the resources on 22 languages included in Indian constitution that were indexed in the National Digital Library of India from 2018 and covering the different fields of arts and humanities, social sciences, science, engineering, technology, and history. The subject mapping is based on the Dewey Decimal Classification (DDC) 22nd edition. The relevant/required data was collected from the official website of the National Digital Library of India (https://ndl.iitkgp.ac.in/) on 4th April 2021, Sunday by browsing subject-wise. (https://ndl.iitkgp.ac.in/result?q={"t":"subject","b":{"browse":"subject","filters":[]}}) and filtering one by one of 22 officially recognized Indian languages in the Indian constitution. A total number of 398793 documents on different subjects indexed in NDLI till 4th April 2021, The maximum number of documents indexed in Computer science, Information & general works (000) subject [88.98%] and the larger number of documents indexed in the Bengali language [40.71%].

Research paper thumbnail of Indian Research Output on Scientometric Literature as Indexed in Scopus: a Scientometric Exploration

Library Philosophy and Practice (e-journal), 2021

In recent decay, the scientometric study is one of the major research areas in scholarly communic... more In recent decay, the scientometric study is one of the major research areas in scholarly communication. Researchers have conducted their research in the scientometric field from different core subject areas. Using bibliographic records on a scientometric field from the SCOPUS database, this paper tries to give a complete view of the evaluation of Indian research in the domain of scientometric. From 2010-2019 researchers have published 41462 publications out of the 334 number publications belongs to the scientometric domain of Indian research. Researchers have critically analyzed the collected data on various aspects like year-wise publication, author collaboration, authorship pattern, degree of collaboration, collaborative coefficient (CC), leading authors, productive journal, state-wise production in India, and mostly used keyword. The finding of the study disclosed that the maximum number of articles (97) published in the year 2019 with 222 citations. In the year 2015 got the highest number of citations (355) from only 31 publications. The highest number of articles are two-authored (140) followed by three-authored (89) and single-authored (54) respectively, and the average number of authors per article is 2.13. In respect of state-wise production, New Delhi has stood the first position with 191 publications. The word "scientometric" is the most used keyword and the top productive journal is Library Philosophy and Practice (114).

Research paper thumbnail of Bibliometric analysis of LIS journals published from India during 2013 to 2017: A comparative Study

International Journal of Advanced and Innovative Research, 2019

The current study is conceived to assess the selected top periodical publishing scenario of India... more The current study is conceived to assess the selected top periodical publishing scenario of India in library and information science for the period of 2013-2017 based on the SJR. Various quality aspects of the 746 articles published in the period were studied. The other areas covered under the study include: the annual growth of articles, distribution of periodicals across the different journals, references per documents, distribution of citations, and range of citations per article. The maximum Articles were published from India of the journal "DESIDOC Journal of Library and Information Technology". Various bibliometric techniques used to determine the articles during the period under the study.