Rasch Analysis Research Papers - Academia.edu
2025, Journal of Education and Learning
This research aims to construct and validate progress maps of digital technology for diagnosing the multidimensional mathematical proficiency (MP) in Number and Algebra for Grade 7 students utilizing the Construct Modeling Approach. Researchers employed four building blocks as follows. Firstly, researchers developed the progress maps as an assessment framework of multidimensional MP. This is followed by creating the test for diagnosing MP. Next, researchers assigned scoring criteria and created the transition points of students’ MP levels. Finally, researchers validated the quality of the progress maps through empirical evidence. A total sample of 1,500 Grade 7 students was used to support the validity and reliability evidence of the progress maps through the Wright Map using Multidimensional Random Coefficients Multinomial Logit Model. Results revealed that there were two dimensions of progress maps, namely mathematical procedures (MAP) and structure of learning outcome (SLO), and the...
2025, Research Square (Research Square)
Globally, the leading cause of years lived with disability is low back pain (LBP). Chronic low back pain (CLBP) is responsible for most of the cost and disability associated with LBP. This is more devastating in low-income countries, particularly in rural Nigeria with one of the greatest global burdens of LBP. No Igbo back pain-specific measure captures remunerative or non-remunerative work outcomes. Disability measurement using these tools may not fully explain work-related disability and community participation, a limitation not evident in the World Health Organisation Disability Assessment Schedule (WHODAS 2.0). This study aimed to cross-culturally adapt the WHODAS 2.0 and validate it in rural and urban Nigerian populations with CLBP. Translation, cultural adaptation, test-retest, and cross-sectional psychometric testing were performed. WHODAS 2.0 was forward and back translated by clinical/non-clinical translators. An expert review committee evaluated the translations. Twelve people with CLBP in a rural Nigerian community piloted/pre-tested the questionnaire. Cronbach's alpha assessing internal consistency; intraclass correlation coefficient and Bland-Altman plots assessing test-retest reliability; and minimal detectable change were investigated in a convenience sample of 50 adults with CLBP in rural and urban Nigeria. Construct validity was examined using Spearman's correlation analyses with the back-performance scale, Igbo Roland Morris Disability Questionnaire and eleven-point box scale; and exploratory factor analysis in a random sample of 200 adults with CLBP in rural Nigeria. Ceiling and floor effects were investigated in both samples. Patient instructions were also translated. 'Waist pain/lower back pain' was added to 'illness(es)' to make the measure relevant for this study whilst allowing for future studies involving other conditions. The Igbo phrase for 'family and friends' was used to better represent 'people close to you' in item D4.3.
The Igbo-WHODAS had good internal consistency (α = 0.75-0.97); intraclass correlation coefficients (ICC = 0.81-0.93); standard errors of measurement (5.05-11.10) and minimal detectable change (13.99-30.77). Igbo-WHODAS correlated moderately with performance-based disability, self-reported back pain-specific disability and pain intensity, with a seven-factor structure and no floor and ceiling effects. Igbo-WHODAS appears psychometrically sound. Its research and clinical utility require further testing.
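The reported minimal detectable change range can be related to the reported standard errors of measurement (SEM). The sketch below assumes the conventional 95% formula, MDC95 = 1.96 × √2 × SEM; the abstract does not say which formula the authors used, and `mdc95` is a hypothetical helper name.

```python
import math


def mdc95(sem: float) -> float:
    """Minimal detectable change at 95% confidence for a given SEM.

    Assumption: the conventional formula MDC95 = 1.96 * sqrt(2) * SEM.
    The abstract does not state the formula actually used.
    """
    return 1.96 * math.sqrt(2) * sem


# SEM endpoints as reported in the abstract
for sem in (5.05, 11.10):
    print(f"SEM = {sem:.2f} -> MDC95 = {mdc95(sem):.2f}")
```

Under this assumption the two endpoints come out at about 14.0 and 30.8, which agrees with the reported range of 13.99-30.77 to within rounding of the SEM values.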
2025, The Journal of rheumatology
Discussion and endorsement of the OMERACT total joint replacement (TJR) core domain set for total hip replacement (THR) and total knee replacement (TKR) for end-stage arthritis; and next steps for selection of instruments. The OMERACT TJR working group met at the 2016 meeting at Whistler, British Columbia, Canada. We summarized the previous systematic reviews, the preliminary OMERACT TJR core domain set and results from previous surveys. We discussed preliminary core domains for TJR clinical trials, made modifications, and identified challenges with domain measurement. Working group participants (n = 26) reviewed, clarified, and endorsed each of the inner and middle circle domains and added a range of motion domain to the research agenda. TJR were limited to THR and TKR but included all end-stage hip and knee arthritis refractory to medical treatment. Participants overwhelmingly endorsed identification and evaluation of top instruments mapping to the core domains (100%) and use of sub...
2025, US neurology
The publication of this article was supported by Grifols. The views and opinions expressed in the article are those of the authors and not necessarily those of Grifols. Chronic inflammatory demyelinating polyneuropathy (CIDP) is an acquired immune-mediated disease that evolves in a progressive or relapsing pattern over months to years. Although "typical" CIDP is characterized by symmetric proximal and distal motor and sensory deficits, it is now recognized that multifocal (asymmetric), distally predominant, pure sensory, and pure motor variants also fall within the CIDP spectrum. First-line treatment options for CIDP include corticosteroids, intravenous immunoglobulin (IVIG), and plasmapheresis (plasma exchange). For patients refractory to first-line options or those chronically dependent on high-dose first-line therapy, no evidence-based treatment recommendations exist. Cytotoxic
2025, Health and Quality of Life Outcomes
Background: To develop and validate an item bank to measure mobility in older people in primary care and to analyse differential item functioning (DIF) and differential bundle functioning (DBF) by sex. Methods: A pool of 48 mobility items was administered by interview to 593 older people attending primary health care practices. The pool contained four domains based on the International Classification of Functioning: changing and maintaining body position; carrying, lifting and pushing; walking; and going up and down stairs. The Late Life Mobility item bank consisted of 35 items, and measured with a reliability of 0.90 or more across the full spectrum of mobility, except at the higher end of better functioning. No evidence was found of non-uniform DIF but uniform DIF was observed, mainly for items in the changing and maintaining body position and the carrying, lifting and pushing domains. The walking domain did not display DBF, but the other three domains did, principally the carrying, lifting and pushing items. Conclusions: During the design and validation of an item bank to measure mobility in older people, we found that strength (carrying, lifting and pushing) items formed a secondary dimension that produced DBF. More research is needed to determine how best to include strength items in a mobility measure, or whether it would be more appropriate to design separate measures for each construct.
2025, Assessment & Evaluation in Higher Education
Over recent years UK medical schools have moved to more integrated summative examinations. This paper analyses data from the written assessment of undergraduate medical students to investigate two key psychometric aspects of this type of high stakes assessment. Firstly, the strength of the relationship between examiner predictions of item performance (as required under the Ebel standard setting method employed) and actual item performance ('facility') in the examination is explored. It is found that there is a systematic pattern of difference between these two measures, with examiners tending to under-estimate the difficulty of items classified as relatively easy, and over-estimating that of items classified as harder. The implications of these differences for standard setting are considered. Secondly, the integration of the assessment raises the question as to whether the student total score in the exam can provide a single meaningful measure of student performance across a broad range of medical specialties. Therefore Rasch
2025, Journal of Technology and Science Education
This study explores the evolving interaction between Generative Artificial Intelligence (AI) and education, focusing on how technologies such as Natural Language Processing and specific models like OpenAI's ChatGPT can be used on high-stakes examinations. The main objective is to evaluate the ability of ChatGPT version 4.0 to generate written language assessment items and compare them to those created by human experts. The pilot items were developed for the Higher Education Entrance Examination (ExIES, according to its Spanish initials) administered at the Autonomous University of Baja California. Item Response Theory (IRT) analyses were performed on responses from 2,263 test-takers. Results show that although ChatGPT-generated items tend to be more challenging, both sets exhibit a comparable Rasch model fit and discriminatory power across varying levels of student ability. This finding suggests that Generative AI can effectively complement exam developers in creating large-scale assessments. Furthermore, ChatGPT 4.0 demonstrates a slightly higher capacity to differentiate among students of varying skill levels. In conclusion, the study underscores the importance of continually exploring AI-driven item generation as a potential means to enhance educational assessment practices and improve pedagogical outcomes.
2025
This step-by-step guide to conducting a Rasch analysis using the jMetrik software package describes how to format and import data, conduct a simple classical test theory item analysis, and then conduct a simple Rasch analysis. Some basic graphical outputs are also described. After completing the steps described in this guide, novice users should be able to conduct a basic analysis unaided and be able to explore the more advanced features of jMetrik by referring to the manual.
2025, Journal of Advanced Nursing
A study examining the appropriateness of a self-rated alcohol-related clinical confidence tool as a method of measurement among registered hospital nurses using Rasch analysis.
2025, Journal of Rehabilitation Medicine
Objective: To examine the impact of home modifications on self-rated ability in everyday life from various aspects for people ageing with disabilities. Methods: The study sample was recruited from an agency providing home modification services in Sweden and comprised 73 subjects whose referrals had been approved and who were scheduled to receive home modifications (intervention group) and 41 subjects waiting for their applications to be assessed for approval (comparison group). The subjects rated their ability in everyday life using the Client-Clinician Assessment Protocol Part I on 2 occasions: at baseline and follow-up. The Client-Clinician Assessment Protocol Part I provides data on the clients' self-rated independence, difficulty and safety in everyday life. The data were first subjected to Rasch analysis in order to convert the raw scores into interval measures. Further analyses to investigate changes in self-rated ability were conducted with parametric statistics. Results: Subjects who had received home modifications reported a statistically significant improvement in their self-rated ability in everyday life compared with those in the comparison group. Subjects who had received home modifications reported less difficulty and increased safety, especially in tasks related to self-care in the bathroom and transfers, such as getting in and out of the home. Conclusion: Home modifications have a positive impact on self-rated ability in everyday life, especially on decreasing the level of difficulty and increasing safety.
2025
Introduction: Problematic internet use (PIU) can present itself in a variety of online activities. Despite the increasing prevalence of PIU among young adults, there is a dearth of comprehensive assessment tools to characterize various PIU in Malaysia. The 11‑item Assessment of Criteria for Specific Internet‑use Disorders (ACSID‑11) assesses specific PIU including online gaming, online buying‑shopping, online pornography use, social networking use, and online gambling. The present study investigated the psychometric properties of the Malay ACSID‑11. Methods: A cross‑sectional online survey was used for data collection. The sample comprised 610 young adults with a mean age of 22.55 years (standard deviation ± 3.49). Participants were recruited from July 2023 to September 2023 using convenience sampling. Results: The confirmatory factor analysis findings supported the four‑factor structure of the Malay ACSID‑11 across gender, ethnicity, and academic achievement with good fit statistics: comparative fit index (CFI) ≥ 0.968, Tucker‑Lewis index (TLI) ≥ 0.949, root mean square error of approximation (RMSEA) ≤ 0.057, standardized root mean square residual (SRMR) ≤ 0.028 (frequency response); CFI ≥ 0.968, TLI ≥ 0.958, RMSEA ≤ 0.079, SRMR ≤ 0.033 (intensity response). The different online subscales (except for some of the ACSID‑11 online gambling subscales) showed good internal consistency (Cronbach’s α and McDonald’s ω between 0.58 and 0.90 for frequency responses; Cronbach’s α and McDonald’s ω between 0.61 and 0.93 for intensity responses). Conclusion: The Malay ACSID‑11 is a valid and reliable instrument for assessing various specific PIU among Malaysian young adults. However, caution is required when using the ACSID‑11 to assess online gambling because some of its subscales had low internal consistency.
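For readers unfamiliar with the internal-consistency index reported here, Cronbach's α can be computed from item variances and the variance of the total score. The sketch below uses the textbook formula with sample variances; the function name and the response data are hypothetical and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a respondents-by-items score table.

    rows: list of per-respondent lists, one score per item.
    Uses alpha = k/(k-1) * (1 - sum(item variances) / variance(total score)).
    """
    k = len(rows[0])
    item_vars = [variance([row[j] for row in rows]) for j in range(k)]
    total_var = variance([sum(row) for row in rows])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical responses: 5 respondents, 3 items
scores = [[1, 2, 2], [2, 3, 3], [3, 3, 4], [4, 5, 5], [2, 2, 3]]
print(round(cronbach_alpha(scores), 3))
```

Items that rise and fall together across respondents push α towards 1, while items that vary independently pull it towards 0.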
2025, International Education Studies
This study explored the psychometric properties of a locally developed information skills test for youth students in Malaysia using Rasch analysis. The test was a combination of 24 structured and multiple choice items with a 4-point grading scale. The test was administered to 72 technical college students and 139 secondary school students. The data from the test were fitted to the Rasch partial credit model using the Winsteps program, in which the unidimensionality, reliability and person-item distribution map of the test were examined. The analysis showed that all 24 items met the Rasch model expectations and thus have potential for assessing the information skills of youth students in Malaysia. The findings showed that Rasch analysis could help researchers to refine the developed test in a systematic and informed manner.
2025, Health and quality of life outcomes
Background: There is no widely accepted framework to guide the development of condition-specific preference-based instruments (CSPBIs) that covers development both de novo and from existing non-preference-based instruments. The purpose of this study was to address this gap by reviewing the published literature on CSPBIs, with particular attention to the application of item response theory (IRT) and Rasch analysis in their development. Methods: A scoping review of the literature covering the concepts of all phases of CSPBI development and evaluation was performed from MEDLINE, Embase, PsycINFO, CINAHL, and the Cochrane Library, from inception to December 30, 2022. The titles and abstracts of 1,967 unique references were reviewed. After retrieving and reviewing 154 full-text articles, data were extracted from 109 articles, representing 41 CSPBIs covering 21 diseases or conditions. The development of CSPBIs was conceptualized as a 15-step framework, covering four phases: 1) develop initial questionnaire items (when no suitable non-preference-based instrument exists), 2) establish the dimensional structure, 3) reduce items per dimension, 4) value and model health state utilities. Thirty-nine instruments used a type of Rasch model and two instruments used IRT models in phase 3. We present an expanded framework that outlines the development of CSPBIs, both from existing non-preference-based instruments and de novo when no suitable non-preference-based instrument exists, using IRT
2025, AnV Publication
The nursing profession is frequently described as both a calling and a career. The modern workplace, however, is increasingly impacted by the mentality of "hustle culture", a mindset that prioritises unrelenting production, overwork, and a never-ending chase of success. This culture offers unique challenges in nursing, which is a naturally difficult profession owing to the high risks involved. While hustle culture can foster professional growth and resiliency, it can also exacerbate fatigue, mental health issues, and subpar patient care. This article investigates the consequences of hustle culture in nursing, assesses its benefits and drawbacks, and suggests solutions for mitigating its negative effects.
2025, International Journal of Evaluation and Research in Education (IJERE)
Every society dreams of true peace. To achieve true peace, humans need to start with inner peace. The importance of peace becomes one of the bases for developing a measure of peace for designing peace-building programs. This research answered the need for these measuring tools by developing and validating a peace measuring instrument called the Indonesian peace of mind scale (IPoMS). This instrument consists of seven items in two aspects: the internal state of peacefulness and harmony. This study used Rasch analysis to test the construct validity of IPoMS. The construct validity test involved 202 vocational high school students in Yogyakarta, Indonesia. Data analysis using Winsteps software provides information about the quality of respondents and instruments, items that are easy and difficult for respondents to agree on, fit order items, and unidimensionality. The results of the application of Rasch analysis showed that IPoMS is good, precise, and has item conformity with the mode...
2025, Journal of College Teaching & Learning (TLC)
The purpose of this article is to provide insight into an elementary school whose climate issues appear to plague and impact its performance as measured by its Adequate Yearly Progress (AYP). The Northwest Georgia elementary school is located in a rural school system approximately 50 miles northwest of Atlanta, Georgia. A review of the literature suggests school climate can affect many areas and people within schools. It further suggests that positive interpersonal relationships and optimal learning opportunities in all demographic environments can increase school achievement levels and reduce maladaptive behaviors (McEvoy & Welker, 2000). Providing a positive and supportive work environment and climate for faculty and staff, more often than not, improves faculty, staff and student performance (Freiberg, 1998). An in-depth analysis of the environment of the school in question suggests a lack of faculty and staff respect for administration, a hostile work environment, and o...
2025, School Science and Mathematics
Developing an understanding procedures observation rubric for mathematics intervention teachers. School Science and Mathematics, 120(3), 153-164.
2025, Value in Health
This paper discusses recent advances that have been made in the field of psychometrics, specifically, the application of Rasch analysis to the instrument development process. It emphasizes the importance of assessing the fundamental scaling properties of an instrument prior to consideration of traditional psychometric indicators. The paper introduces Rasch analysis and shows how it has been applied in the development of needs-based measures in order to ensure that they provide unidimensional measurement. By ensuring that scales are based on the same measurement model and that they fit the Rasch model it is possible for QoL scores to be compared across diseases by means of cocalibration and item banking.
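For reference, the dichotomous Rasch model is usually written as shown below. The symbols $\theta_n$ (measure of person $n$) and $b_i$ (difficulty of item $i$) are notation chosen here for illustration and are not taken from the abstract.

```latex
P(X_{ni} = 1 \mid \theta_n, b_i) = \frac{\exp(\theta_n - b_i)}{1 + \exp(\theta_n - b_i)}
```

Only the difference $\theta_n - b_i$ enters the model, so persons and items are located on a single logit scale. That shared scale is what makes co-calibration and item banking across instruments possible when the data fit the model.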
2025
This study investigated the impact of graphic design skills on the employability of business education students in Nigerian colleges of education. Employing a true experimental pre-test and post-test within-subjects design, the study involved 40 randomly selected students from the Federal College of Education (Technical), Bichi. Baseline assessments were conducted to measure the participants' graphic design competencies and perceptions of employability. The intervention consisted of a six-week Canva-based training program focused on designing flyers and invitation cards. Internal validity was ensured through randomization, control of confounding variables, and consistent implementation procedures, while external validity was strengthened through representative sampling, real-world application, and potential for replication. Data were collected through structured surveys and analyzed using descriptive statistics and paired t-tests. The findings revealed statistically significant improvements in both graphic design skills and employability perceptions following the intervention. Consequently, the null hypotheses were rejected, affirming that graphic design training positively influences employability. Notable challenges included limited access to Canva tools, difficulties in balancing academic workload, and potential threats to validity arising from the study's unique context. The study recommends the integration of graphic design competencies into the business education curriculum to enhance graduate employability. Despite its limitations, the study underscores the value of graphic design training in equipping business education students for the competitive Nigerian job market.
2025, Health and Quality of Life Outcomes
Background: Existing instruments for measuring mobility are inadequate for accurately assessing older people across the broad spectrum of abilities. Like other indices that monitor critical aspects of health such as blood pressure tests, a mobility test for all older acute medical patients provides essential health data. We have developed and validated an instrument that captures essential information about the mobility status of older acute medical patients. Methods: Items suitable for a new mobility instrument were generated from existing scales, patient interviews and focus groups with experts. 51 items were pilot tested on older acute medical inpatients. An interval-level unidimensional mobility measure was constructed using Rasch analysis. The final item set required minimal equipment and was quick and simple to administer. The de Morton Mobility Index (DEMMI) was validated on an independent sample of older acute medical inpatients and its clinimetric properties confirmed. The DEMMI is a 15-item unidimensional measure of mobility. Reliability (MDC90), validity and the minimally clinically important difference (MCID) of the DEMMI were consistent across independent samples. The MDC90 and MCID were 9 and 10 points respectively (on the 100-point Rasch converted interval DEMMI scale). The DEMMI provides clinicians and researchers with a valid interval-level method for accurately measuring and monitoring mobility levels of older acute medical patients. DEMMI validation studies are underway in other clinical settings and in the community. Given the ageing population and the importance of mobility for health and community participation, there has never been a greater need for this instrument.
2025, Archives of Physical Medicine and Rehabilitation
Objective: To investigate the validity of item score summation for the original and modified versions of the Barthel Index. Design: Rasch analysis of Barthel Index data. Setting: General medical wards at 2 acute care hospitals in Australia. Participants: Consecutive older medical patients (N=396). Interventions: Not applicable. Main Outcome Measures: Activity limitation was assessed by using the Barthel Index at hospital admission and discharge. At 1 hospital site, the original Barthel Index was used, and at the other hospital site the Modified Barthel Index (MBI) was used. Results: More than half of the items showed misfit to the Rasch model for both versions of the Barthel Index. The continence items appear to measure a different construct to the other items. After the removal of the continence items, data for the remaining items still did not fit the Rasch model. Neither the original nor the MBI are unidimensional scales. An exception to this occurred when the original Barthel Index was rescored and only then for discharge and not for admission Barthel Index data. Conclusions: Because clinicians do not typically rescore outcomes obtained by using the Barthel Index, these findings, combined with unacceptable ceiling effects, render the Barthel Index an assessment tool with limited validity for measuring and monitoring the health of older medical patients.
2025
This paper briefly looks into the role and extent of mathematical modelling in the design and analysis of measurement systems, especially measurement subsystems in the form of instruments and instrument elements. It also examines the role and use of mathematical modelling in the area of soft measurement (non-physical measurement). Based on a number of examples it demonstrates the use of modern modelling techniques in the design and analysis of sub-systems in measurement technology. In doing so, it will focus on the scope and importance of physical modelling at a sub-system level which ultimately contributes to modelling activities at a global systems level.
2025
Much of the data presented by politicians and the media is multivariate in its nature. However, in the UK at least, the general public has little training to deal with such information. It is reasonable to explore the school curriculum to determine the nature and extent of students' preparation for dealing with multivariate data. In the UK,
2025, Osteoarthritis and Cartilage
Objectives: Use Rasch analysis to examine the psychometric properties of the Oxford Knee Score (OKS), particularly in respect to unidimensionality, and consistency of item functioning before and after total knee replacement and across age and gender groups. The 12-item OKS was administered to 1,712 patients before the surgery, and 1,322 and 855 patients were administered the instrument repeatedly at the 6-month and 2-year postoperative assessments, respectively. Data were fitted to the Rasch partial credit model with the Winsteps program. Differential item functioning (DIF) analysis was performed, and fit statistics in combination with principal components analysis of the residuals were used to test the unidimensionality assumption. The fit criteria were set at 1.5 and 2.0 for infit mean-square (MNSQ) and outfit MNSQ, respectively. Results: At baseline, item difficulty ranged from −1.86 to 1.78 logits, and person measures had a mean ± SD of −0.01 ± 0.89. Misfit items were "limping" and "night pain" in preoperative data and "limping" and "kneeling" in postoperative data. After removing items limping and kneeling and recoding item night pain, none of the items misfit at each of the time points and there was stability of item difficulty ordering across time. In the modified OKS set, five items displayed DIF by age and three by gender. The original OKS had adequate targeting and good coverage of knee severity levels in preoperative patients. The modified 10-item OKS data fit the Rasch model and had stable item difficulty ordering over time.
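The fit criteria in the abstract above (infit MNSQ above 1.5, outfit MNSQ above 2.0) can be illustrated with a minimal sketch. It is not the Winsteps procedure and, for brevity, assumes the dichotomous Rasch model rather than the partial credit model the study fitted. The function names `rasch_prob`, `item_fit`, and `flag_misfit` are hypothetical, and person measures and item difficulty are assumed to be already estimated.

```python
import math


def rasch_prob(theta, b):
    """Probability of a score of 1 under the dichotomous Rasch model."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def item_fit(responses, thetas, b):
    """Return (infit_mnsq, outfit_mnsq) for a single item.

    responses: observed 0/1 scores, one per person
    thetas:    person measures in logits (assumed already estimated)
    b:         item difficulty in logits (assumed already estimated)
    """
    resid2_sum = 0.0  # sum of squared raw residuals
    var_sum = 0.0     # sum of model variances
    z2_sum = 0.0      # sum of squared standardized residuals
    for x, theta in zip(responses, thetas):
        p = rasch_prob(theta, b)
        var = p * (1.0 - p)
        resid2 = (x - p) ** 2
        resid2_sum += resid2
        var_sum += var
        z2_sum += resid2 / var
    infit = resid2_sum / var_sum      # information-weighted mean square
    outfit = z2_sum / len(responses)  # unweighted mean square
    return infit, outfit


def flag_misfit(infit, outfit, infit_cut=1.5, outfit_cut=2.0):
    """Apply the cutoffs quoted in the abstract (1.5 infit, 2.0 outfit)."""
    return infit > infit_cut or outfit > outfit_cut
```

Both statistics have an expected value near 1 when the data fit the model. Outfit weights every person equally and so reacts more to unexpected responses from persons far from the item's difficulty, which may be why the abstract sets a looser outfit cutoff.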
2025, Quality of Life Research
Objective: To set out the methodological process for using Rasch analysis alongside traditional psychometric methods in the development of a health state classification that is amenable to valuation. The overactive bladder questionnaire is used to illustrate a four step process for deriving a reduced health state classification from an existing nonpreference based health related quality of life instrument. Step I excludes items that do not meet the initial validation process and step II uses criteria based on Rasch analysis and psychometric testing to select the final items for the health state classification. In step III, item levels are examined and Rasch analysis is used to explore the possibility of reducing the number of item levels. Step IV repeats steps I to III on alternative data sets in order to validate the selection of items for the health state classification. The techniques described enable the construction of a health state classification amenable for valuation exercises that will allow the derivation of preference weights. Thus, the health related quality of life of patients with conditions, like overactive bladder, can be valued and quality adjustment weights such as quality adjusted life years derived.
2025, MPRA Paper
Objective: To set out the methodological process for using Rasch analysis alongside traditional psychometric methods in the development of a health state classification that is amenable to valuation. The overactive bladder questionnaire is used to illustrate a four step process for deriving a reduced health state classification from an existing nonpreference based health related quality of life instrument. Step I excludes items that do not meet the initial validation process and step II uses criteria based on Rasch analysis and psychometric testing to select the final items for the health state classification. In step III, item levels are examined and Rasch analysis is used to explore the possibility of reducing the number of item levels. Step IV repeats steps I to III on alternative data sets in order to validate the selection of items for the health state classification. The techniques described enable the construction of a health state classification amenable for valuation exercises that will allow the derivation of preference weights. Thus, the health related quality of life of patients with conditions, like overactive bladder, can be valued and quality adjustment weights such as quality adjusted life years derived.
2025, International Journal of Evaluation and Research in Education (IJERE)
This study aimed to test the validity, reliability, and difficulty level of items developed based on the Frayer model and detect conceptual understanding of high school students in biology evolution. The test method evaluated 35 multiple-choice questions on evolution for 55 high school students. Rasch analysis was performed to assess the validity, reliability, difficulty level of items, and students’ ability level. Two experts empirically tested and analyzed the validity of the items. The assessment developed was discovered to be valid based on expert and empirical analyses. Furthermore, the construct validity test indicated that only two of the 35 questions were deemed invalid. The assessment exhibited reliability with an item reliability score of 0.92. The item difficulty levels were equally spread across the normal curve, encompassing questions ranging from very difficult to very easy categories, as depicted in the variable map. After analyzing the map, it was observed that variations in students’ proficiency levels at answering questions were evident, indicating diverse levels of ability. Students performed well in handling formal and superordinate-subordinate level questions. However, their performance differed when dealing with identity and principle-level concepts.
2025
In rater-mediated assessments, the ratings awarded to language learners' written, or spoken, performances do not necessarily reflect their language abilities because a number of other construct-irrelevant factors may affect the knowledge they demonstrate. Rater subjectivity and rating scales are among the variables possibly influencing the final results. The purpose of the present study was to examine the extent to which university students' ratings on their essays mirrored the effect of these two factors. To that end, 150 Iranian EFL teachers rated ten five-paragraph essays BA students had written as their course requirements at Imam Khomeini International University. The raters used two rating scales to rate the essays on a number of assessment criteria. The study rested on a partial rating design, and the Rasch-based computer program, FACETS, was used to analyze the data. Results of Facets analyses showed raters differed considerably in the amounts of severity they exercised when rating the essays. The results also showed rater bias interactions with holistic rating scales. The implications of the findings for proposing procedures for reducing the effects of such extraneous variables are discussed.
2025, MTISD 2008. Methods, Models and Information Technologies for Decision Support Systems
Abstract: Multivariate Additive PLS Splines, in short MAPLSS, are Partial Least-Squares models that study the dependence of a set of responses on spline transformations of the predictor variables, which make it possible to capture nonlinear main effects and interactions additively. The aim of this paper is to present a way of selecting MAPLSS models through an adaptive incremental selection of training samples by a bootstrap procedure. This approach is attractive in the case of expensive data thus implying to construct efficient ...
2025
This document reviews the research related to students' and teachers' anxiety related to science and the teaching of science in order to better understand the relationships between the variables that can predict this phenomenon. The research reports reviewed used either the Science State Trait Anxiety Inventory or the Science Teaching State Trait Anxiety Inventory in gathering their data. These inventories allow the researcher to change title headings within the inventory to allow the researcher to examine particular situations. Findings for the report are presented according to titles on the state anxiety scale that measure for anxiety about science and teaching science; anxiety about specific tasks; and anxiety about different science courses. The summary of the findings discusses the following variables that emerged from the analysis: (1) attitude toward science; (2) anxiety about teaching; (3) achievement; (4) examination format; (5) content courses; (6) achievement on a specific task in a content course; (7) gender; (8) confidence; (9) self efficacy; (10) demographic variables; (11) long term effects; (12) impact on teacher classroom performance; and (13) children's anxiety. Conclusions discuss the need for a model to explain how anxiety and related variables affect learning in science and for continued research in this area. A list of 28 references is included. (MDH)
2025, Evaluation Review
In assessing criminality, researchers have used counts of crimes, arrests etc. because interval measures were not available. Additionally, crime seriousness varies depending on demographic factors. This study examined the Crime and Violence Scale (CVS) regarding: psychometric quality using item response theory (IRT); and invariance of the crime seriousness hierarchy for gender, age, and racial/ethnic groups on 7435 respondents. The CVS is a useful measure of criminality, though some items could be improved or dropped. Differential item functioning analysis revealed that crime seriousness varies by age and gender. IRT shows promise in assessing and adjusting for demographic variations in crime seriousness.
2025, International Journal of Evaluation and Research in Education (IJERE)
Self-regulated learning (SLR) is a condition in which students actively participate in the process of acquiring knowledge, and it closely relates to students’ metacognitive, motivational, and behavioral aspects. In order to measure this variable, an instrument was developed by referring to the Zimmerman cycle in the form of a questionnaire. Therefore, this study aims to analyze the construct validity of SLR questionnaires designed for high school students through Rasch model analysis. The method employed was descriptive quantitative research. The analyzed questionnaire consists of 50 positive statements, rated on a 4-point Likert scale and organized into forethought, performance, and self-reflection phases. Furthermore, construct validity was tested on 235 third grade (XII) high school students in Gunungsitoli City (Indonesia), with a gender distribution of 58.29% female and 41.70% male. The results showed that the questionnaire with a 4-point rating scale satisfied the criteria for validity, gender inclusiveness, and unidimensionality based on Rasch model analysis for 25 statements. The implication of this research is that the SLR questionnaire developed is valid and can be used in wider field research, especially in mathematics learning.
2025, International Journal of Evaluation and Research in Education (IJERE)
The rising rate of youth unemployment and its attendant consequences on the general populace in Nigeria has assumed a frightening dimension. Academia and other relevant stakeholders have gradually come to realize that the possession of academic qualifications alone cannot guarantee a good quality job. Nigerian higher institutions are now introducing entrepreneurship studies in their school curriculum without a clear framework. Research has shown that this does not guarantee total graduates unless we have an entrepreneurship skills framework that is functional and discipline-based. Noting that entrepreneurship is classified into two categories, entrepreneurship-specific and entrepreneurship mindset, this study intends to develop an entrepreneurship skills framework that will promote the employability of the students of Electrical Technology in Colleges of Education in Nigeria. Survey research employing a sequential exploratory mixed-methods design was used for the study. The population for this phase consists of entrepreneurs and academics in Nigeria. The use of the partial credit model in the Rasch analysis guaranteed the consensus of the experts on each of the items being measured. The outcome of the study will contribute to socio-economic peace and sustainability in Nigeria and to the body of knowledge in entrepreneurship regarding electrical technology.
2025, Journal of Sport Rehabilitation
Context: Previous research suggests that several knee-specific patient-reported outcome measures have poor measurement properties. The patient-reported outcomes knee assessment tool (PROKAT) was created to improve assessment of knee-specific function. Examination of the measurement properties of this new measure is critical to determine its clinical value. Objective: Examine the measurement properties of the PROKAT. Design: Cross-sectional study. Setting: Clinical athletic training setting. Patients or Other Participants: The pilot study included 32 student-athletes (mean age = 20.78 [1.01], males = 56.30%). The full study included 203 student-athletes (mean age = 21.46 [4.64], males = 54.70%) from 3 separate institutions. The participants were recruited for both the pilot and full study using face-to-face and electronic (eg, email and social media sites) communications. Intervention(s): Evaluation of the measurement properties of the PROKAT occurred using the Rasch partial-credit model. Main Outcome Measures: Infit and outfit statistics, item step difficulties, person ability parameters, category function, item and test information functions, and Cronbach alpha. An independent samples t test was used to evaluate the differences in injured and noninjured athletes' scores. Results: The Rasch partial-credit model analysis of pilot test items and qualitative participant feedback were used to modify the initial PROKAT. Evaluation of the revised PROKAT (32 items) indicated 27 items had acceptable model-data fit. The injured athletes scored significantly worse than the noninjured athletes (t(188) = 12.89; P < .01). The ceiling effects for the PROKAT were minimal (3.9%). Conclusions: A major advantage of this study was the use of the Rasch measurement and the targeted population.
Compared with alternative knee-specific patient-reported outcome measures (eg, Knee Injury Osteoarthritis Outcome Score, International Knee Documentation Committee Subjective Knee Form), the PROKAT has low ceiling effects in athletic populations. In addition, evidence suggests the measure may be capable of distinguishing between injured and noninjured athletes.
2025, Journal of Education and Learning (edisi elektronik)/Journal of Education and Learning
This research aimed to assess the readiness of Indonesian pre-service science teachers by examining their perceptions of impending challenges, desired competencies, and anticipated instructional methodologies in preparing for the challenges of Society 5.0. Using a quantitative survey research methodology, 884 pre-service science teachers from 16 universities were surveyed. The research instrument's validity and reliability were assessed utilizing the Rasch model. Inferential statistics were employed to assess the readiness of pre-service science teachers for future challenges, taking into account their backgrounds. The Cronbach's alpha coefficient reached 0.92, which is deemed "excellent" and affirms the instrument's high reliability. The findings of the study revealed that the majority of teachers share a consistent perception of future challenges and recognize the importance of mastering 21st-century skills, particularly critical thinking, creative thinking, communication, and collaboration. However, respondents believe that the current learning process does not adequately train these skills. They expressed a desire for an alternative learning method that can enhance their skills while accommodating their diverse backgrounds, including place of residence, economic background, and technological proficiency. Further research is needed to identify alternative teaching methods that can effectively foster these abilities in diverse backgrounds.
2025, International Journal of Evaluation and Research in Education (IJERE)
This study describes teachers’ perceptions of implementing education transformation after the Corona Virus Disease 2019 (COVID-19) pandemic. The research used a mixed-method approach, combining quantitative and qualitative methods. Quantitative data were collected using questionnaires and analyzed using the Rasch model. Qualitative data were analyzed through collection, reduction, and presentation. The respondents were 389 teachers in Indonesia. The questionnaire contains 21 statement items. Before being sent to teachers via Google Forms, the questionnaire passed a readability test by 10 teachers. This research found that the easiest education transformation to implement is related to the education ecosystem. However, the transformations that remain challenging to implement are related to pedagogy, especially differentiated learning. Therefore, this study implies that the educational ecosystem needs to be maintained. Meanwhile, pedagogy, especially as it relates to differentiated learning, must be improved through training on recognizing student learning styles, multiple intelligences, learning readiness, teacher socio-emotional factors, and student socio-cultural factors.
2025, Explanatory Models, Unit Standards, and Personalized Learning in Educational Measurement
Implicit in the idea of measurement is the concept of objectivity. When we measure the temperature using a thermometer, we assume that the measurement we obtain is not dependent on the conditions of measurement, such as which thermometer we use. Any functioning thermometer should give us the same reading of, for example, 75 °F. If one thermometer measured 40°, another 250°, and a third 150°, then the lack of objectivity would invalidate the very idea of accurately measuring temperature.
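The thermometer analogy can be made concrete with the standard dichotomous Rasch model, given here only as an illustration, since the excerpt above does not state a model. The symbols are assumptions: $\theta_n$ is the measure of person $n$, $b_i$ is the difficulty of item $i$, and $X_{ni}$ is the scored response.

```latex
% Dichotomous Rasch model (illustrative; notation assumed, not taken from the excerpt)
\[
  P(X_{ni} = 1 \mid \theta_n, b_i) = \frac{\exp(\theta_n - b_i)}{1 + \exp(\theta_n - b_i)}
  \quad\Longrightarrow\quad
  \ln\frac{P(X_{ni} = 1)}{P(X_{ni} = 0)} = \theta_n - b_i .
\]
% Comparing two persons n and m on the same item i, the item parameter cancels:
\[
  \ln\frac{P(X_{ni} = 1)}{P(X_{ni} = 0)} - \ln\frac{P(X_{mi} = 1)}{P(X_{mi} = 0)}
  = (\theta_n - b_i) - (\theta_m - b_i) = \theta_n - \theta_m .
\]
```

Under this model the comparison of two persons does not depend on which item is used, just as the difference between two temperatures should not depend on which thermometer is used.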
2025
Teachers use “extremely subtle pragmatic interpretive judgements [...] regularly in the course of mathematics teaching and learning...” (Pimm, 1987, p.167). The form of their discourse – the coherence, the structure and modality, characteristics of natural language in use – indicates the commitment of students to the truth-value of their statements. Hence, the listener might infer the extent of students’ confidence in their understanding. In this study, linguistic features were identified that could be aligned with the conceptual growth of students in the context of introductory algebra. The aim was to devise a model that provided explicit, objective evidence to support the subtle, interpretive judgements made by teachers. Secondary students in Years 8 and 9 (13-15 year olds) from three schools in a NSW regional centre (N=222) participated in the study. The study consisted of two phases of data collection. The first was the collection of quantitative data from students’ responses to...
2025, International Journal of Evaluation and Research in Education (IJERE)
Every human being has emotional turmoil, which will be a serious problem if they cannot control it. The importance of emotion regulation has become one of the bases for developing a measure of emotion regulation for students, the Indonesian emotion regulation scale (IERS). No research yet describes an instrument that measures students’ level of emotion regulation in Indonesia. IERS consists of nine items in two aspects: reappraisal and suppression. The content validity test involved two experts in the field of psychometrics, while the construct validity test involved 354 high school students in Yogyakarta, Indonesia. Data were analyzed using the inter-rater reliability (IRR) coefficient of Cohen’s kappa and Rasch analysis. Based on the IRR coefficient of Cohen’s kappa, the two experts agree on the acceptability of the IERS statement items. In addition, the results of the Rasch analysis show that IERS is good, precise, and conforms with the model. IERS is a reliable and valid tool to measure students’ level of emotion regulation accurately. This paper discusses the implications and recommendations for further research for the implementation of guidance and counseling containing the value of emotion regulation as a follow-up to the performance of IERS.
2025, International Journal of Evaluation and Research in Education (IJERE)
Every society dreams of true peace. To achieve true peace, humans need to start with inner peace. The importance of peace becomes one of the bases for developing a measure of peace for designing peace-building programs. This research answered the need for these measuring tools by developing and validating a peace measuring instrument called the Indonesian peace of mind scale (IPoMS). This instrument consists of seven items in two aspects: the internal state of peacefulness and harmony. This study used Rasch analysis to test the construct validity of IPoMS. The construct validity test involved 202 vocational high school students in Yogyakarta, Indonesia. Data analysis using Winsteps software provides information about the quality of respondents and instruments, items that are easy and difficult for respondents to agree on, fit order items, and unidimensionality. The results of the application of Rasch analysis showed that IPoMS is good, precise, and has item conformity with the model. IPoMS is a reliable and valid measuring tool to measure students’ level of peace accurately. This research discussed the implications and recommendations for further research for the implementation of guidance and counseling containing the value of peace as a follow-up to the performance of IPoMS.
2025, International Journal of Evaluation and Research in Education (IJERE)
The existence of item bias in a set of measuring instruments can threaten the instrument’s validity. Based on the Rasch model, this study evaluated item bias in the four-tier heat and temperature diagnostic test (4T-HTDT). This study used a cross-sectional quantitative survey method. There were 241 students selected using a stratified random sampling technique. The 4T-HTDT instrument consisted of 20 items grouped into five concept groups. Students’ conceptual understanding was grouped into five categories, namely scientific knowledge (Rating=5), false positive (Rating=4), false negative (Rating=3), misconceptions (Rating=2), and lack of knowledge (Rating=1). The differential item functioning (DIF) score was used to evaluate item bias in the 4T-HTDT. Bias was reviewed based on the respondent’s gender, class, and school. The item has DIF if the probability value is <5%. The results showed that 35% (7 out of 20 items) spread over five groups of heat and temperature concepts were biased. However, excluding seven DIF items from the measurement set would not significantly affect the composition and distribution of items. Thus, the 13 items in the 4T-HTDT instrument are free from bias and can be used to evaluate the conceptual understanding of high school students.
2025
This cross-sectional study was conducted among librarians from Ghana, Nigeria, and South Africa to establish the prevalence of occupational burnout in the library setting. A previously validated self-administered questionnaire [Maslach Burnout Inventory (MBI)] was used to collect information on the prevalence and pattern of job burnout among the respondents. Extreme burnout was found among the respondents, especially those working in the Electronic and Reference Services areas of their libraries. The findings of this research call for both professional and personal avenues to deal with or lessen this menace, as it is a difficult issue that affects the wellbeing of library staff and can prevent libraries from accomplishing their aims.
2025, Archives of Gerontology and Geriatrics
2025, Neurorehabilitation and Neural Repair
Background. Tools chosen to measure poststroke upper-extremity rehabilitation outcomes must match contemporary theoretical expectations of motor deficit and recovery because an assessment’s theoretical underpinning forms the conceptual basis for interpreting its score. Objective. The purpose of this study was to investigate the theoretical framework of the Wolf Motor Function Test (WMFT) by (1) determining whether all items measured a single underlying trait and (2) examining the congruency between the hypothesized and the empirically determined item difficulty orders. Methods. Confirmatory factor analysis (CFA) and Rasch analysis were applied to existing WMFT Functional Ability Rating Scale data from 189 participants in the EXCITE (Extremity Constraint-Induced Therapy Evaluation) trial. Fit of a 1-factor CFA model (all items) was compared with the fit of a 2-factor CFA model (factors defined according to item object-grasp requirements) with fit indices, model comparison test, and i...
2025, Acta IMEKO
Sustainable change initiatives are often short-circuited by failures in modelling. Unexamined assumptions about measurement and numbers push modelling into the background as a presupposition rarely articulated as an explicit operation. Even when models of system dynamics are planned components of a sustainable change effort, the key role of measurement is typically overlooked. The crux of the matter concerns the distinction between numeric counts and measured quantities. Mistaking the former for the latter confuses levels of complexity and fundamentally compromises communications. Reconceiving measurement as modelling multilevel distributed decision processes offers new alternatives aligned with historically successful efforts in creating sustainable change. Five conditions for successful sustainable change are contrasted from the perspectives of single-level vs multilevel modelling: vision, plans, skills, resources, and incentives. Omitting any one of these from efforts at creating change results, respectively, in confusion, treadmills, anxiety, frustration, and resistance. The shortcomings of typically implemented single-level approaches to measurement result in the widespread experience of these negative consequences. Results show that new potentials for creating sustainable change can be expected to follow from implementations of multilevel distributed decision processes that effectively counteract organizational amnesia by embedding new learning in an externally materialized knowledge infrastructure incorporating a shared cultural memory.
2025, Educational Design Research
Design-based research in education emphasizes the contexts in which learning takes place as essential to more fruitful dialogues with practice. Contextual issues of social dynamics, place, and applications have been extensively investigated globally in recent design-based research, but the informational aspects of infrastructural concern to experimentation in naturalistic settings have yet to be addressed. Design Thinking (DT) augments design research by offering paths toward coherently integrating assessment and instruction across multiple levels of complexity and different communities' varying epistemic practices. DT is nonlinear but encompasses elements of empathy, problem definition, ideation, prototyping, and testing that inform the development of boundary objects mediating developmental, horizontal, and vertical forms of coherence by simultaneously functioning across
2025, International Journal of Advances in Production Research
Prigogine's theory of dissipative structures provides a general account of entropy-driven self-organized transitions through hierarchies of structures separated by discontinuities. The theory encompasses a wide range of evolving systems throughout nature and culture. Possibilities for operationalizing a new collective rationality spanning physics and psychology emerge from Prigogine's emphases on two distinct senses of probability, on the concept of the sufficient statistic, and on the role and limitations of the Poisson distribution in formulating a "nonlinear master equation." Unnoted by Prigogine are correspondences of all three of these issues in the mathematical foundations of statistics and measurement established in the works of Ronald Fisher and his student, Georg Rasch. The three areas of correspondence inform models enabling specifically metrological approaches to quality-assured quantification across the sciences. Prigogine's sense of "deterministic chaos" is re-expressed in measurement terms as stochastic invariance, and the need for "a supplementary parameter" augmenting the Poisson distribution is related to a rating scale model of measurement. Considering these connections, this paper proposes that what Prigogine anticipates as a "new intelligibility" and a new science of "collective rationality" could be pragmatically operationalized in a new metrological infrastructure, one made coherent by the generality of entropy-driven nonequilibrium processes.