Big Data for Public Domain: A bibliometric and visualized study of the scientific discourse during 2000–2020 (original) (raw)

SOCIAL SCIENCES AND HUMANITIES ON BIG DATA: A BIBLIOMETRIC ANALYSIS

Journal of Information Systems and Technology Management, 2022

The purpose of this paper is to provide a comprehensive bibliometric review of social science, psychology, and humanities literature focusing on big data. Methods: Production and authorship trends, topics and areas as well as citations were analyzed by means of conducting a bibliometric analysis of a corpus of 5,500 Scopus articles published from 2010 to 2020. Findings: Analysis revealed similarities and differences among social science, psychology, and humanities literature in terms of publication, framing, and referencing trends as compared with the general big data literature: both fields show a steady increase, although the increase rate slowed down as from 2015; text production of both specific and general fields is led by just a few countries, with the USA and China being on top of the ranking; single authorship has been decreasing in both fields; the specificity of big data framing, in social sciences and humanities, has been identified with a critical view that surpass the ethical considerations, to include the social construction of datasets, the political and ideological uses of big data, and the discussion of its philosophical and epistemological foundations. Value: To the best of our knowledge, this is the first study to provide a comprehensive view on social sciences and humanities big data bibliometrics while providing context to compare results.

BIG DATA Output in Scopus during 2012 to 2016: A Bibliometric Analysis

The present study discusses the "Big Data" as reflected in Scopus for the period from 2012-2016 and investigates the highly productive authors, document types and h-index. The study also aims to find out the top contributing Indian institutions, the preferred sources for publications by Geographical distribution by country, Subject area, Source Type, Affiliation, and Language etc. The result indicates that there were total 9191 documents with 54129 citations on Big Data during 2012 to 2016. The result shows that China and USA are the most active countries in the area of Big Data Analytics. Study shows publication trends in the subjects of computer science and engineering.

A Bibliometric Analysis of Big Data Research, 2014 -2023: Study based on Web of Science Database

ILIS Journal of Librarianship and Informatics, 2023

The paper gives an analysis of the bibliographic records covered in the Web of Science database in the domain of Big Data for the period 2014-2023. Chronological production of the literature, distribution by types of documents, authorship pattern, author productivity, citation scores, top ranking journals which published the contributions, country-wise distribution of contributions, institutions from which more number of publications were emanated, and language-wise distribution are examined in the study. The total number of records for the period of study is 805. A boost in the number is observed from 2019 onwards. Single author contributions account for about 24% of the total. More than 30% of the records are from USA and they have got the highest citation score as well. The highest number of contributions emanated from Wuhan University, China.

Analysis of Global Published Literature on Library and Information Science: an Empirical Study through Big Data Approach (1920-2020)

Library Philosophy and Practice (e-journal); paper no. 6778, 2021

The purpose of this paper is to investigate the extent of LIS (Library and Information Science) literature referring to and being referred by patents in the form of scholarly citations they attribute and/or receive. The analysis of the present study was done based on a total data-set of 237820 research publications of the LIS discipline ranging from April 1920 to April 2020 (100 years) as mined from the LENS database. LIS research corpus received 22.75 scholarly citations and 6.87 patent citations per paper while utilizing and citing 18667 patents and thus establishing the correlation between the citing-cited link in the research ecosystem. In addition, this paper also evaluates the decadal Relative Growth Rate (RGR) and Doubling Time (Dt) of LIS publications ranging from May 1920 to April 2020. The paper reflects the value of LIS research in patents and vice versa depicting the cross-disciplinary nature and decadal growth rate of LIS in general and information science in particular.

Global Research Publications on Open Data: A Scientometric Analysis

CERN European Organization for Nuclear Research - Zenodo, 2022

Introduction-Open Data is a new phenomenon in the research field, which is the next level of the open-access movement. In the current information age, data holds enormous value in any research to validate research findings. Purpose-The purpose of the present study is to analyze the global research trends and patterns in the "Open Data" research using various Scientometric parameters. Research problem-The number of bibliometric and scientometric studies on Open Data and relevant research topics are very limited and these studies are based on the WoS records. Hence, the authors planned to investigate the present study using the Open Data research literature from Scopus Data. Objective-The key objective of this study is to analyze the research productivity and to identify the different patterns in the literature on "Open Data" research since from first publication as indexed in Scopus database. Methodology-The authors analyzed 3252 bibliographic records indexed in the Scopus database up to 2021. The study used various scientometrics indicators and mathematical formulae to analyze these records to find parameters like year-wise publication and citation trends, degree of collaboration, collaborative coefficient , collaboration index, most prolific authors, institutes involved in the research etc. The collected then fed into computer using CSV file format. Further, these records were analyzed using MS-Excel and VoSViewer. Findings-The study found collaborative trends among the researchers and the highest number of publications as conference papers. The USA has contributed the highest number of papers. The Delft University of Technology, Netherlands is the most productive organization, and the top two contributors are affiliated with the same university.

Diffusion of Big Data in Indian Scientific Literature: Study of Research Productivity and Scientific Collaboration

Purpose: Big data, a buzzword of the present time, is a term used for extremly large data sets generated from the digital process which is not possible to analyze by traditional methods. These data sets are produced by digital devices such as smart phones, remote sensing, camera, microphones, RFID etc. The literature on big data is growing exponentially since 2011. Big data is tending to establish as a very important research field. This paper aims to explore the evolution, growth and scientific collaboration of the Indian publications in the field of big data. Design/methodology/approach: A survey approach is used in the study while data for the study is collected from Scopus database for the year 2001 to 2015. Bibliometric analysis, visualization and mapping software are used to present the current status, growth trends and collaboration in big data research to examine its diffusion in Indian scientific literature. Findings: We found that the big data research in India is gaining momentum and its diffusion and adoption is increasing tremendously. Conference and seminars are used to do social connect and interaction within the research community. The collaboration at institution level is found usual while collaboration at international level is low. Application of big data in health sciences and life sciences is yet to be explored in comparison to the social sciences and physical sciences. Originality/ Value: This paper presents the growth, trends and collaboration in big data literature by the use of sophisticated bibliometric software and visualization software.

Who is who in Big Social Data? : A Bibliographic Network Analysis Study

2017

The aim of the study is to investigate who are advancing the knowledge on Big Social Data and the related concept of Social Big Data, ‘who’ are these people citing and building their work on, and what are the topics and outlets where the discussion takes place. For that purpose, data was extracted from Thomson Reuters Web of Science with the search term “Big Social Data” and “Social Big Data” spanning the years from 2012 to 2016. The search resulted in 58 articles in 39 different outlets. In order to go into the depth of Big Social Data and Social Big Data, co-author bibliographic network analysis was performed on the extracted data. The co-author network analysis revealed 149 nodes (authors), and 308 edges (co-authoring relationships) between the authors. Betweenness centrality were calculated for the nodes to demonstrate who are the central authorities and their domain on the topic of Big Social Data and Social Big Data. The visualisation based on co-author network analysis provid...

Scientometric mapping of research on ‘Big Data’

Springer, 2015

This paper presents a scientometric analysis of research work done on the emerging area of ‘Big Data’ during the recent years. Research on ‘Big Data’ started during last few years and within a short span of time has gained tremendous momentum. It is now considered one of the most important emerging areas of research in computational sciences and related disciplines. We have analyzed the research output data on ‘Big Data’ during 2010–2014 indexed in both, the Web of Knowledge and Scopus. The analysis maps comprehensively the parameters of total output, growth of output, authorship and country-level collaboration patterns, major contributors (countries, institutions and individuals), top publication sources, thematic trends and emerging themes in the field. The paper presents an elaborate and one of its kind scientometric mapping of research on ‘Big Data’.