Kara Bogusz - Academia.edu
Papers by Kara Bogusz
The Social Security Administration (SSA) is challenged with determining work disability for escalating numbers of applicants. Determining work capacity for people who demonstrate maladaptive interpersonal behaviors is difficult since the relationship between symptoms and work performance is not always clear. Through an interagency agreement with the SSA, the National Institutes of Health (NIH) has collaborated with Boston University to apply Item Response Theory (IRT) and Computer Adaptive Testing (CAT) to develop functional assessment instruments that SSA may use during the disability evaluation process. The overall goal of this project is to enhance functional and behavioral assessment methods to improve precision and efficiency in assessing interpersonal interactions relevant to work. The aims of this project are to: 1) determine the feasibility of using IRT/CAT techniques in measuring Interpersonal Interactions in the context of work, 2) develop a comprehensive content-model for...
Disability and Health Journal, 2015
The Work Disability Functional Assessment Battery (WD-FAB), developed for potential use by the US Social Security Administration to assess work-related function, currently consists of five multi-item scales assessing physical function and four multi-item scales assessing behavioral health function; the WD-FAB scales are administered as Computerized Adaptive Tests (CATs). The goal of this study was to evaluate the test-retest reliability of the WD-FAB Physical Function and Behavioral Health CATs. We administered the WD-FAB scales twice, 7-10 days apart, to a sample of 376 working age adults and 316 adults with work-disability. Intraclass correlation coefficients (ICCs) were calculated to measure the consistency of the scores between the two administrations. Standard error of measurement (SEM) and minimal detectable change (MDC90) were also calculated to measure the scales' precision and sensitivity. For the Physical Function CAT scales, the ICCs ranged from 0.76 to 0.89 in the working age adult sample, and 0.77-0.86 in the sample of adults with work-disability. ICCs for the Behavioral Health CAT scales ranged from 0.66 to 0.70 in the working age adult sample, and 0.77-0.80 in the adults with work-disability. The SEM ranged from 3.25 to 4.55 for the Physical Function scales and 5.27-6.97 for the Behavioral Health function scales. For all scales in both samples, the MDC90 ranged from 7.58 to 16.27. Both the Physical Function and Behavioral Health CATs of the WD-FAB demonstrated good test-retest reliability in adults with work-disability and general adult samples, a critical requirement for assessing work-related functioning in disability applicants and in other contexts.
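The relationship between the ICC, SEM, and MDC90 figures in the abstract above can be illustrated with the conventional textbook formulas, SEM = SD * sqrt(1 - ICC) and MDC90 = z * sqrt(2) * SEM. This is a minimal sketch, not the authors' computation: the function names are hypothetical, and the multiplier z = 1.65 is an assumption (the abstract does not state which value was used).

```python
import math


def sem_from_icc(sd: float, icc: float) -> float:
    """Standard error of measurement from a score SD and a reliability (ICC)."""
    if icc > 1.0:
        raise ValueError("icc cannot exceed 1")
    return sd * math.sqrt(1.0 - icc)


def mdc(sem: float, z: float = 1.65) -> float:
    """Minimal detectable change for a test-retest difference.

    sqrt(2) accounts for measurement error in both administrations;
    z = 1.65 is an assumed multiplier for a 90% confidence level.
    """
    return z * math.sqrt(2.0) * sem


# Smallest and largest SEM values reported in the abstract.
print(round(mdc(3.25), 2), round(mdc(6.97), 2))
```

Under these assumptions the two printed values fall close to the reported MDC90 range of 7.58 to 16.27; small differences would be expected from rounding of the published SEM values.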
Journal of rehabilitation medicine, Jan 27, 2015
Objective: To develop a system to guide interpretation of scores generated from 2 new instruments measuring work-related physical and behavioral health functioning (Work Disability - Physical Function (WD-PF) and WD - Behavioral Function (WD-BH)). Design: Cross-sectional, secondary data from 3 independent samples to develop and validate the functional levels for physical and behavioral health functioning. Subjects: Physical group: 999 general adult subjects, 1,017 disability applicants and 497 work-disabled subjects. Behavioral health group: 1,000 general adult subjects, 1,015 disability applicants and 476 work-disabled subjects. Methods: A three-phase analytic approach, including item mapping, a modified-Delphi technique, and known-groups validation analysis, was used to develop and validate cut-points for functional levels within each of the WD-PF and WD-BH instrument's scales. Results: Four and 5 functional levels were developed for each of the scales in the WD-PF and WD-BH inst...
Due to increasing demand on the Social Security Administration (SSA) disability programs, novel assessment methodologies are needed. Through an interagency agreement with the SSA, the National Institutes of Health (NIH) is collaborating with Boston University to apply Item Response Theory (IRT) and Computer Adaptive Testing (CAT) to assess physical capabilities relevant to work. New approaches to comprehensively and systematically assess functioning may improve the precision and efficiency of SSA's disability evaluation process. Project objectives include: 1) assessing the feasibility of CAT within the SSA context; 2) developing a content model for Physical Capabilities sub-domains; and 3) building item pools for physical capabilities relevant to work addressing content in each sub-domain. A comprehensive literature review led to a conceptual framework and content model guiding the structure of the Physical Capabilities domain. Content experts reviewed and revised items in relat...
Archives of Physical Medicine and Rehabilitation, 2014
To assess the feasibility and psychometric properties of 8 scales covering 2 domains of the newly developed Work Disability Functional Assessment Battery (WD-FAB): physical function (PF) and behavioral health (BH) function. Cross-sectional study. Community. Adults (N=973) unable to work because of a physical (n=497) or a mental (n=476) disability. Not applicable. Each disability group responded to a survey consisting of the relevant WD-FAB scales and existing measures of established validity. The WD-FAB scales were evaluated with regard to data quality (score distribution, percentage of "I don't know" responses), efficiency of administration (number of items required to achieve reliability criterion, time required to complete the scale) by computerized adaptive testing (CAT), and measurement accuracy as tested by person fit. Construct validity was assessed by examining both convergent and discriminant correlations between the WD-FAB scales and scores on same-domain and cross-domain established measures. Data quality was good, and CAT efficiency was high across both WD-FAB domains. Measurement accuracy was very good for PF scales; BH scales demonstrated more variability. Construct validity correlations, both convergent and divergent, between all WD-FAB scales and established measures were in the expected direction and range of magnitude. The data quality, CAT efficiency, person fit, and construct validity of the WD-FAB scales were well supported and suggest that the WD-FAB could be used to assess PF and BH function related to work disability. Variation in scale performance suggests the need for future work on item replenishment and refinement, particularly with regard to the Self-Efficacy scale.
Archives of Physical Medicine and Rehabilitation, 2013
To develop and test an instrument to assess physical function for Social Security Administration (SSA) disability programs, the SSA-Physical Function (SSA-PF) instrument. Item response theory (IRT) analyses were used to (1) create a calibrated item bank for each of the factors identified in prior factor analyses, (2) assess the fit of the items within each scale, (3) develop separate computer-adaptive testing (CAT) instruments for each scale, and (4) conduct initial psychometric testing. Cross-sectional data collection; IRT analyses; CAT simulation. Telephone and Internet survey. Two samples: SSA claimants (n=1017) and adults from the U.S. general population (n=999). None. Model fit statistics, correlation, and reliability coefficients. IRT analyses resulted in 5 unidimensional SSA-PF scales: Changing & Maintaining Body Position, Whole Body Mobility, Upper Body Function, Upper Extremity Fine Motor, and Wheelchair Mobility for a total of 102 items. High CAT accuracy was demonstrated by strong correlations between simulated CAT scores and those from the full item banks. On comparing the simulated CATs with the full item banks, very little loss of reliability or precision was noted, except at the lower and upper ranges of each scale. No difference in response patterns by age or sex was noted. The distributions of claimant scores were shifted to the lower end of each scale compared with those of a sample of U.S. adults. The SSA-PF instrument contributes important new methodology for measuring the physical function of adults applying to the SSA disability programs. Initial evaluation revealed that the SSA-PF instrument achieved considerable breadth of coverage in each content domain and demonstrated noteworthy psychometric properties.
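The CAT simulation described in the abstract above can be sketched in a simplified form. Everything here is an assumption made for illustration: the abstract does not name the IRT model, the item-selection rule, or the scoring method, so this sketch uses a dichotomous two-parameter logistic (2PL) model, maximum-information item selection, and expected a posteriori (EAP) scoring with a standard normal prior. The item bank is randomly generated, not the SSA-PF bank, and all function names are hypothetical.

```python
import math
import random


def p_endorse(theta, a, b):
    """2PL probability of a positive response (assumed model)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def item_information(theta, a, b):
    """Fisher information of a 2PL item at ability theta."""
    p = p_endorse(theta, a, b)
    return a * a * p * (1.0 - p)


def eap_estimate(responses, grid):
    """EAP ability estimate over a grid, with a standard normal prior."""
    weights = []
    for t in grid:
        w = math.exp(-0.5 * t * t)  # unnormalized prior density
        for a, b, u in responses:
            p = p_endorse(t, a, b)
            w *= p if u else (1.0 - p)
        weights.append(w)
    return sum(t * w for t, w in zip(grid, weights)) / sum(weights)


def simulate_cat(true_theta, bank, n_items, rng):
    """Administer n_items adaptively to a simulated respondent."""
    grid = [g / 10.0 for g in range(-40, 41)]
    theta, used, responses = 0.0, set(), []
    for _ in range(n_items):
        # Choose the unused item that is most informative at the current estimate.
        idx = max((i for i in range(len(bank)) if i not in used),
                  key=lambda i: item_information(theta, *bank[i]))
        used.add(idx)
        a, b = bank[idx]
        u = rng.random() < p_endorse(true_theta, a, b)  # simulated response
        responses.append((a, b, u))
        theta = eap_estimate(responses, grid)
    return theta, sorted(used)


rng = random.Random(0)
# Hypothetical bank of 100 items: (discrimination, difficulty) pairs.
bank = [(rng.uniform(0.8, 2.0), rng.uniform(-3.0, 3.0)) for _ in range(100)]
theta_hat, administered = simulate_cat(1.0, bank, 10, rng)
```

Repeating such a run over many simulated respondents and correlating the short-form estimates with full-bank scores is one way the kind of CAT accuracy comparison mentioned in the abstract could be approximated.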
Archives of Physical Medicine and Rehabilitation, 2013
To develop a broad set of claimant-reported items to assess behavioral health functioning relevant to the Social Security disability determination processes, and to evaluate the underlying structure of behavioral health functioning for use in development of a new functional assessment instrument. Cross-sectional. Community. Item pools of behavioral health functioning were developed, refined, and field tested in a sample of persons applying for Social Security disability benefits (N=1015) who reported difficulties working because of mental or both mental and physical conditions. None. Social Security Administration Behavioral Health (SSA-BH) measurement instrument. Confirmatory factor analysis (CFA) specified that a 4-factor model (self-efficacy, mood and emotions, behavioral control, social interactions) had the optimal fit with the data and was also consistent with our hypothesized conceptual framework for characterizing behavioral health functioning. When the items within each of the 4 scales were tested in CFA, the fit statistics indicated adequate support for characterizing behavioral health as a unidimensional construct along these 4 distinct scales of function. This work represents a significant advance both conceptually and psychometrically in assessment methodologies for work-related behavioral health. The measurement of behavioral health functioning relevant to the context of work requires the assessment of multiple dimensions of behavioral health functioning. Specifically, we identified a 4-factor model solution that represented key domains of work-related behavioral health functioning. These results guided the development and scale formation of a new SSA-BH instrument.
Archives of Physical Medicine and Rehabilitation, 2013
Objectives: To build a comprehensive item pool representing work-relevant physical functioning and to test the factor structure of the item pool. These developmental steps represent initial outcomes of a broader project to develop instruments for the assessment of function within the context of Social Security Administration (SSA) disability programs. Design: Comprehensive literature review; gap analysis; item generation with expert panel input; stakeholder interviews; cognitive interviews; cross-sectional survey administration; and exploratory and confirmatory factor analyses to assess item pool structure. Setting: In-person and semistructured interviews and Internet and telephone surveys. Participants: Sample of SSA claimants (n=1017) and a normative sample of adults from the U.S. general population (n=999). Interventions: Not applicable. Main Outcome Measure: Model fit statistics. Results: The final item pool consisted of 139 items. Within the claimant sample, 58.7% were white; 31.8% were black; 46.6% were women; and the mean age was 49.7 years. Initial factor analyses revealed a 4-factor solution, which included more items and allowed separate characterization of: (1) changing and maintaining body position, (2) whole body mobility, (3) upper body function, and (4) upper extremity fine motor. The final 4-factor model included 91 items. Confirmatory factor analyses for the 4-factor models for the claimant and the normative samples demonstrated very good fit. Fit statistics for claimant and normative samples, respectively, were: Comparative Fit Index=.93 and .98; Tucker-Lewis Index=.92 and .98; and root mean square error approximation=.05 and .04. Conclusions: The factor structure of the physical function item pool closely resembled the hypothesized content model. The 4 scales relevant to work activities offer promise for providing reliable information about claimant physical functioning relevant to work disability.
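For readers unfamiliar with the three fit statistics reported in the abstract above, the sketch below shows their commonly used definitions in terms of the chi-square and degrees of freedom of the fitted model and of a baseline (independence) model. The function names and the numeric inputs are hypothetical; the abstract does not report chi-square values or the estimator used, and some software divides by N rather than N - 1 in the RMSEA.

```python
import math


def cfi(chi2_m, df_m, chi2_b, df_b):
    """Comparative Fit Index from model (m) and baseline (b) chi-squares."""
    d_m = max(chi2_m - df_m, 0.0)
    d_b = max(chi2_b - df_b, 0.0)
    denom = max(d_m, d_b)
    return 1.0 if denom == 0.0 else 1.0 - d_m / denom


def tli(chi2_m, df_m, chi2_b, df_b):
    """Tucker-Lewis Index (can fall outside the 0-1 range)."""
    ratio_b = chi2_b / df_b
    ratio_m = chi2_m / df_m
    return (ratio_b - ratio_m) / (ratio_b - 1.0)


def rmsea(chi2_m, df_m, n):
    """Root mean square error of approximation for sample size n."""
    return math.sqrt(max(chi2_m - df_m, 0.0) / (df_m * (n - 1)))


# Hypothetical inputs chosen only to make the arithmetic easy to follow.
print(cfi(200.0, 100, 2120.0, 120))
print(tli(200.0, 100, 2120.0, 120))
print(rmsea(200.0, 100, 1001))
```

Higher CFI and TLI values and lower RMSEA values indicate better fit, which is the direction of the claimant and normative results reported above.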
Archives of Physical Medicine and Rehabilitation, 2013
To use item response theory (IRT) data simulations to construct and perform initial psychometric testing of a newly developed instrument, the Social Security Administration Behavioral Health Function (SSA-BH) instrument, that aims to assess behavioral health functioning relevant to the context of work. Cross-sectional survey followed by IRT calibration data simulations. Community. Sample of individuals applying for Social Security Administration disability benefits: claimants (n=1015) and a normative comparative sample of U.S. adults (n=1000). None. SSA-BH measurement instrument. IRT analyses supported the unidimensionality of 4 SSA-BH scales: mood and emotions (35 items), self-efficacy (23 items), social interactions (6 items), and behavioral control (15 items). All SSA-BH scales demonstrated strong psychometric properties including reliability, accuracy, and breadth of coverage. High correlations of the simulated 5- or 10-item computer adaptive tests with the full item bank indicated robust ability of the computer adaptive testing approach to comprehensively characterize behavioral health function along 4 distinct dimensions. Initial testing and evaluation of the SSA-BH instrument demonstrated good accuracy, reliability, and content coverage along all 4 scales. Behavioral function profiles of Social Security Administration claimants were generated and compared with age- and sex-matched norms along 4 scales: mood and emotions, behavioral control, social interactions, and self-efficacy. Using the computer adaptive test-based approach offers the ability to collect standardized, comprehensive functional information about claimants in an efficient way, which may prove useful in the context of the Social Security Administration's work disability programs.
Archives of Physical Medicine and Rehabilitation, 2013
Physical and mental impairments represent the 2 largest health condition categories for which workers receive Social Security disability benefits. Comprehensive assessment of physical and mental impairments should include aspects beyond medical conditions such as a person's underlying capabilities as well as activity demands relevant to the context of work. The objective of this article is to describe the initial conceptual stages of developing new measurement instruments of behavioral health and physical functioning relevant for Social Security work disability evaluation purposes. To outline a clear conceptualization of the constructs to be measured, 2 content models were developed using structured and informal qualitative approaches. We performed a structured literature review focusing on work disability and incorporating aspects of the International Classification of Functioning, Disability and Health as a unifying taxonomy for framework development. Expert interviews provided advice and consultation to enhance face validity of the resulting content models. The content model for work-related behavioral health function identifies 5 major domains: (1) behavior control, (2) basic interactions, (3) temperament and personality, (4) adaptability, and (5) workplace behaviors. The content model describing physical functioning includes 3 domains: (1) changing and maintaining body position, (2) whole body mobility, and (3) carrying, moving, and handling objects. These content models informed subsequent measurement properties including item development and measurement scale construction, and provided conceptual coherence guiding future empirical inquiry. The proposed measurement approaches show promise to comprehensively and systematically assess physical and behavioral health functioning relevant to work.