Sex Differences in the Adult Human Brain: Evidence from 5216 UK Biobank Participants

Abstract

Sex differences in the human brain are of interest for many reasons: for example, there are sex differences in the observed prevalence of psychiatric disorders and in some psychological traits that brain differences might help to explain. We report the largest single-sample study of structural and functional sex differences in the human brain (2750 female, 2466 male participants; mean age 61.7 years, range 44–77 years). Males had higher raw volumes, raw surface areas, and white matter fractional anisotropy; females had higher raw cortical thickness and higher white matter tract complexity. There was considerable distributional overlap between the sexes. Subregional differences were not fully attributable to differences in total volume, total surface area, mean cortical thickness, or height. There was generally greater male variance across the raw structural measures. Functional connectome organization showed stronger connectivity for males in unimodal sensorimotor cortices, and stronger connectivity for females in the default mode network. This large-scale study provides a foundation for attempts to understand the causes and consequences of sex differences in adult brain structure and function.

Keywords: brain volume, cortical thickness, fMRI, sex differences, surface area

Introduction

Sex differences have been of enduring biological interest (Darwin 1871), but our knowledge about their relevance to the human brain is surprisingly sparse. It has been noted by several researchers that the potential influences of sex are under-explored in neuroscientific research (Beery and Zucker 2011; Cahill 2006, 2017; Karp et al. 2017). A fuller understanding of morphological and functional differences between the brains of the human sexes might provide insight into why the observed prevalence of some psychiatric disorders differs substantially by sex (Rutter et al. 2003), and would assist in explaining several behavioral sex differences (Zell et al. 2015; Gur and Gur 2017). As biomedical research moves closer to the ideals of precision medicine (e.g., Collins and Varmus 2015), it is even more pressing that we have a more nuanced understanding of similarities and differences in brain structure and function across the sexes. Here, we report a study that characterizes multimodal sex differences in the adult human brain in the largest sample to date.

It is of particular importance to gain a more detailed picture of how the brains of males and females differ, because several psychiatric disorders and conditions differ in their prevalence between the sexes. For instance, rates of Alzheimer’s disease are higher in females than males, prompting a recent call for the prioritization of biomedical research into sex differences in measures relevant to this disorder (Mazure and Swendsen 2016). Females also show a higher prevalence of major depressive disorder (Rutter et al. 2003; Gobinath et al. 2017), whereas males display higher rates of disorders such as autism spectrum disorder (Baron-Cohen et al. 2011), schizophrenia (Aleman et al. 2003) and dyslexia (Arnett et al. 2017). Improving therapeutic strategies for these conditions will almost certainly require accurate quantitative estimates of where and how the sexes differ normatively.

Moreover, although many psychological sex differences are small (consistent with the “gender similarities hypothesis”; Hyde 2014), some behaviors and traits do show reliable and substantial differences. For instance, performance on mental rotation tasks (Maeda and Yoon 2013) and physical aggression (Archer 2004) are on average higher in males, whereas self-reported interest in people versus things (Su et al. 2009) and the personality traits of neuroticism (Schmitt et al. 2008) and agreeableness (Costa et al. 2001) are on average higher in females. A full explanation of these cognitive and behavioral phenomena might benefit from a better understanding of brain sex differences.

Our understanding of brain sex differences has been hampered by low statistical power in previous studies. Small-sample research has become a considerable concern in neuroscience research (Button et al. 2013; Nord et al. 2017), and the concern no less applies to research on sex differences. To illustrate this point, in the most recent meta-analysis of macrostructural sex differences in brain subregions (Ruigrok et al. 2014)—which revealed a complex pattern of differences, with both males and females showing larger brain volume depending on the brain substructure in question—studies that examined sex differences in specific subregions of interest (rather than in broad, overall measures) had a mean sample size of 130 participants (range = 28–465). Since the publication of that meta-analysis, some larger macrostructural studies have appeared, though they are either in younger participants only (Gur and Gur 2016; Gennatas et al. 2017; Wierenga et al. 2017) or somewhat limited in the number of brain measures they report (Jäncke et al. 2015). Adult macrostructural studies with a large scale—both in terms of sample size and in terms of brain regions analyzed—are required.

Beyond macrostructural measures, there may also be robust sex differences in measures of the brain’s white matter microstructure. Studies that have attempted to quantify sex differences in white matter microstructure with diffusion tensor MRI—which uses information about the movement of water molecules through the brain’s white matter tracts to produce measures such as fractional anisotropy (FA), which has been linked to variation in cognitive and health-related traits (Sundgren et al. 2004)—are rare and, where they exist, small in sample size (Kanaan et al. 2012; Dunst et al. 2014). Newer and more rarely-studied neurite orientation dispersion and density imaging (NODDI) measures can offer novel information on the microstructural bases of sex differences. NODDI treats the diffusion-weighted signal as a combination of 3 compartments, each with characteristics that differentially shape the diffusion probability function of water molecules: fast isotropic diffusion (e.g., cerebrospinal fluid), anisotropic hindered diffusion (e.g., extracellular water), and restricted anisotropic diffusion (e.g., intra-axonal compartments; Zhang et al. 2012). This relatively more complex parameterization of the water diffusion signal may also lead to lower susceptibility to partial volume effects, and can potentially yield less ambiguous estimates of underlying microstructure than more traditional measures (for instance, FA cannot resolve these different underlying tissue properties).

In addition to the above structural brain imaging measures, it is also of interest to investigate sex differences in brain function. Examinations of sex differences in resting-state functional connectivity—the functional measure used in the present study, which indexes the temporal relations between activation in anatomically-separate brain regions while the brain is at rest (that is, not completing any experimenter-directed task; van den Heuvel and Hulshoff Pol 2010)—have also shown substantial differences, for example within the default mode network (where females show stronger connectivity) and within sensorimotor and visual cortices (where males show stronger connectivity; Biswal et al. 2010). As has been noted (Scheinost et al. 2015), a better characterization of broad patterns, including sex differences, in relatively novel measures such as functional connectivity (and in the NODDI parameters described above) is of importance to establish a “baseline” upon which future studies of normal versus abnormal function can rely.

There is more to sex differences than averages: there are physical and psychological traits that tend to be more variable in males than females. The best-studied human phenotype in this context has been cognitive ability: almost universally, studies have found that males show greater variance in this trait (Deary et al. 2007a; Johnson et al. 2008; Lakin 2013; though see Iliescu et al. 2016). This has also been found for academic achievement test results (themselves a potential consequence of cognitive differences, which are known to predict later educational achievement; Deary et al. 2007b; Machin and Pekkarinen 2008; Lehre et al. 2009a, 2009b), other psychological characteristics such as personality (Borkenau et al. 2013), and a range of physical traits such as athletic performance (Olds et al. 2006), and both birth and adult weight (Lehre et al. 2009a). To our knowledge, only two prior studies have explicitly examined sex differences in the variability of brain structure (Wierenga et al. 2017; Lange et al. 1997), and no studies have done so in individuals older than 20 years. Here, we addressed this gap in the literature by testing the “greater male variability” hypothesis in the adult brain.

The Present Study

To date, there exists no single, comprehensive, well-powered analysis of sex differences in mean and variance in the adult brain that covers structural, diffusion, and functional MRI measures. Here, we examine multimodal sex differences in adult human brain structural and functional organization in the largest and most definitive study to date, ensuring high levels of statistical power and reliability. We used data from UK Biobank (Allen et al. 2012), a biomedical study based in the United Kingdom. A subset of the full sample of 500,000 participants has contributed neuroimaging data (Miller et al. 2016); a portion of these data have been released for analysis while collection is ongoing, and this subset is analyzed here. We tested male–female differences (in mean and variance) in overall and subcortical brain volumes, mapped the magnitude of sex differences across the cortex with multiple measures (volume, surface area, and cortical thickness), and also examined sex differences in white matter microstructure derived from DT-MRI and NODDI. We tested the extent to which these differences were regionally-specific or brain-general, by adjusting them for the total brain size (or other relevant overall measurement; for instance, adjusting volume differences for total brain volume and cortical thickness differences for mean cortical thickness), and examining whether the differences found in the raw analyses were still present. We tested the extent to which these structural differences (in broad, regional, and white matter measures) mediated sex variation in scores on two cognitive tests, one tapping a mixture of fluid and crystallized reasoning skills (skills previously found to be linked to brain volumes; Pietschnig et al. 2015) and one testing processing speed (previously found to be linked to white matter microstructural differences; see Penke et al. 2012). 
At the functional level, we also examined large-scale organization of functional networks in the brain using resting-state fMRI functional connectivity data and data-driven network-based analyses.

Materials and Methods

Participants

UK Biobank (http://www.ukbiobank.ac.uk/) is a large, population-based biomedical study comprising around 500,000 participants recruited from across Great Britain (England, Scotland, and Wales) between 2006 and 2014 (Allen et al. 2012; Collins 2012; Miller et al., 2016). After an initial visit for the gathering of medical and other information, a subset of these participants began attending for head MRI scanning. MRI data from 5216 participants were available for the present study (mean age = 61.72 years, SD = 7.51, range = 44.23–77.12), collected at an average of around 4 years after the initial visit, and completed on an MRI scanner in Manchester, UK (that is, all data in this analysis were collected on the same scanner; see below for scanner details). There were 2750 females (mean age = 61.12 years, SD = 7.42, range = 44.64–77.12) and 2466 males (mean age = 62.39 years, SD = 7.56, range = 44.23–76.99). Further details regarding the demographics and representativeness of the sample are reported in the Supplemental Materials.

UK Biobank received ethical approval from the Research Ethics Committee (reference 11/NW/0382). The present analyses were conducted as part of UK Biobank application 10279. All participants provided informed consent to participate. Further information on the consent procedure can be found under the “Resources” tab at the following URL: http://biobank.ctsu.ox.ac.uk/crystal/field.cgi?id=200.

Brain Image Acquisition and Processing

MRI data for all participants were acquired on a single Siemens Skyra 3 T scanner, according to previously-reported procedures (Miller et al. 2016; Online Documentation: http://biobank.ctsu.ox.ac.uk/crystal/refer.cgi?id=2367; http://biobank.ctsu.ox.ac.uk/crystal/refer.cgi?id=1977). Briefly, the acquired 3D MPRAGE T1-weighted volumes were preprocessed and analyzed using FSL tools (http://www.fmrib.ox.ac.uk/fsl) by the UK Biobank brain imaging team. This included a raw, de-faced T1-weighted volume, a reduced field-of-view (FoV) T1-weighted volume, and further processing, which included skull stripping, bias field correction and gross tissue segmentation using FNIRT (Andersson et al. 2001, 2007a, 2007b) and FAST (Zhang et al. 2001), yielding cerebrospinal fluid (CSF), grey and white matter volumes. Where large, common artefacts, such as head movement, were identified during scanning, image acquisition was re-started. However, visual quality control was not systematically undertaken by the UK Biobank team; this would be unfeasible due to the very large sample size (Alfaro-Almagro et al. 2018). Subcortical segmentation was also conducted by the UK Biobank imaging team using FIRST (Patenaude et al. 2011) to provide the volumes of 15 structures (atlas in Fig. S1). These data are made available to researchers registered with UK Biobank as a downloadable dataset of Imaging Derived Phenotypes (IDPs). No significant changes were made to scanner hardware or software during the period of MRI data acquisition; full details on protocol phases and relevant upgrades are available at the following URL: http://biobank.ctsu.ox.ac.uk/crystal/docs/brain_mri.pdf.

Subregional Analyses

In addition, we used the FoV-reduced T1-weighted volumes from the first release of UK Biobank MRI data to reconstruct and segment the cortical mantle using default parameters in FreeSurfer v5.3 (http://surfer.nmr.mgh.harvard.edu/; Fischl and Dale 2000; Fischl et al. 2004; Ségonne et al. 2007), according to the Desikan–Killiany atlas (Desikan et al. 2006). Visual checking of each segmentation (including tissue identification and boundary positioning errors) was performed at the University of Edinburgh according to the pipeline suggested by the ENIGMA consortium (https://drive.google.com/file/d/0Bw8Acd03pdRSU1pNR05kdEVWeXM/view) and implemented in our previous work (e.g., Cox et al. 2018); we removed severe failures (including global tissue identification errors, for which cases were removed listwise), and also regional parcellation errors (such as minor skull-stripping or boundary-positioning errors, for which ROIs were removed casewise). The volume, thickness, and surface area of all 68 cortical regions of interest (see atlas in Fig. S3) were extracted for 3875 participants. The magnitudes of sex differences across the cortical surface were visualized using the freely-available Liewald-Cox Heatmapper tool (http://www.ccace.ed.ac.uk/research/resources/software-resources/software). We also registered the vertices of each participant’s cortical model to the FreeSurfer average pial surface, smoothed at 20 mm full width half maximum. Vertex-wise regression analyses were then conducted across each aligned cortical vertex for volume, surface area, and thickness using the SurfStat MATLAB toolbox (http://www.math.mcgill.ca/keith/surfstat) for Matrix Laboratory R2014a (The MathWorks Inc., Natick, MA).

White Matter Microstructure

Details of the diffusion MRI (dMRI) acquisitions are openly available from the UK Biobank website in the form of a Protocol (http://biobank.ctsu.ox.ac.uk/crystal/refer.cgi?id=2367), Brain Imaging Documentation (http://biobank.ctsu.ox.ac.uk/crystal/refer.cgi?id=1977), and in Miller et al. (2016). Following gradient distortion correction, and further correction for head movement and eddy currents, BEDPOSTx was used to model within-voxel multi-fiber tract orientation, followed by probabilistic tractography (with crossing fiber modeling) using PROBTRACKx (Behrens et al. 2003, 2007; Jbabdi et al. 2012). The AutoPtx plugin for FSL (de Groot et al. 2013) was used to map 27 major white matter tracts from which tract-average fractional anisotropy was derived. On the basis of the factor analyses described by Cox et al. (2016), we selected 22 of the white matter tracts for inclusion in the present study. Neurite orientation dispersion and density imaging (NODDI) modeling was conducted using the AMICO tool (https://github.com/daducci/AMICO; Daducci et al. 2015), and the resultant orientation dispersion (OD) maps were registered with the AutoPtx tract masks to yield an average OD value per tract. These measures were also derived by the UK Biobank imaging team and were available as IDPs. An atlas of the selected white matter tracts is provided in Figure S4.

Note that the mean sex differences in the white matter microstructural parameters studied here were already reported by Cox et al. (2016). Here, we add the analyses of variance differences, and the mediation models with diffusion properties as the mediator of the sex difference in cognitive abilities (see below).

Resting-state fMRI (rsfMRI)

To analyze resting-state connectivity, we used bulk data from network matrices generated by UK Biobank. As described in the Online Methods section of Miller et al. (2016), participants lay in the scanner and were instructed to “keep their eyes fixated on a crosshair, relax, and ‘think of nothing in particular’”. Data preprocessing, group-Independent Components Analysis (ICA) parcellation, and connectivity estimation were carried out by UK Biobank using FSL packages (http://biobank.ctsu.ox.ac.uk/crystal/refer.cgi?id=1977). The following preprocessing procedures were applied: motion correction using MCFLIRT (Jenkinson et al. 2002), grand-mean intensity normalization using a single multiplicative factor, high-pass temporal filtering with a Gaussian-weighted least-squares straight line fitting (sigma was set as 50.0 s), EPI unwarping using a field map scanned before data collection, gradient distortion correction (GDC) unwarping, and removal of structural artefacts using an ICA-based X-noiseifier (Beckmann and Smith 2004). Gross preprocessing failures were identified by visual checking, and the affected data were excluded (Miller et al. 2016). Group-ICA parcellation was conducted on 4162 participants. The preprocessed EPI images were fed into the MELODIC tool in FSL to generate 100 distinct ICA components (45 were removed as noise after visual quality-control, leaving 55 components for analysis). The spatial maps for the components are available at the following URL: http://www.fmrib.ox.ac.uk/datasets/ukbiobank/index.html. Details of the preprocessing steps can be found on pages 12, 15 and 16 of the Brain Imaging Documentation (version 1.3) on the UK Biobank data showcase website: https://biobank.ctsu.ox.ac.uk/crystal/docs/brain_mri.pdf.

Time series data from the 55 components were used for connectivity analysis, with each component as a node. Two 55 × 55 matrices of fully-normalized temporal correlations and partial temporal correlations were derived for each participant. A larger absolute number indicates stronger temporal connectivity, and the valence represents whether the connection is positive or negative. Partial temporal correlation matrices were used for analysis, as they represent direct connections better than full temporal correlations. Estimation of the partial correlation coefficients was conducted using the FSLNets package in FSL (https://fsl.fmrib.ox.ac.uk/fsl/fslwiki/FSLNets). To produce a sparser partial correlation matrix, L2 regularization was applied by setting rho to 0.5 in the Ridge Regression “netmats” option. A description of the settings for the estimations is available at the following URL: http://biobank.ctsu.ox.ac.uk/crystal/refer.cgi?id=9028. To better illustrate the group-average network matrix, the nodes were clustered into 5 categories based on the full-correlation matrices (Miller et al. 2016). The group-average network matrix is shown in Figure S13.

Before analysis of sex differences, we multiplied the strength of each connection by the sign of its group-mean (Smith et al. 2015). For example, where the time series data from two ICA components were positively correlated, but the valence of the connection at the level of the group was negative, the valence for that individual was determined to be negative; that is, individual valences were determined by the valence of that connection at the level of the group. In this way, the valence of the majority of participants’ connections for each node was positive, allowing us to investigate the degree to which temporal connectivity differed by sex without combining positive and negative effects and losing information on the absolute magnitude. We then tested the association of sex with the strength of connections, using the glm function in R. As in the other analyses, age and ethnicity were controlled by using them as covariates. Any participant without age or ethnicity information was excluded, leaving 4004 participants in this part of the analysis (mean age = 61.63, SD = 7.56; 47.65% male). To assess the importance of the nodes, we generated the weighted degree for a node by calculating the mean strength of its connections with all 54 other nodes. Full results for connection strength (partial and full correlations) and for weighted degree are provided as 3 separate tabs in Table S14. In that table, Cohen’s _d_-values are provided as standardized effect sizes of the sex difference in the strength of connectivity: as for the other analyses, a negative effect size means the strength of the connection was higher in males, and a positive effect size means it was higher in females.
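The sign alignment and weighted-degree steps described above can be sketched as follows. This is a minimal illustration in plain Python, not the authors' R code: the function names (`align_to_group_sign`, `weighted_degree`) and the toy 3-node matrices are hypothetical, and real data would use 55 × 55 partial correlation matrices.

```python
from statistics import mean


def sign(x):
    """Return -1, 0, or 1 according to the sign of x."""
    return (x > 0) - (x < 0)


def align_to_group_sign(netmats):
    """Multiply each participant's connection by the sign of its group mean.

    netmats: list of square matrices (lists of lists), one per participant.
    """
    n = len(netmats[0])
    group_sign = [
        [sign(mean(m[i][j] for m in netmats)) for j in range(n)]
        for i in range(n)
    ]
    return [
        [[m[i][j] * group_sign[i][j] for j in range(n)] for i in range(n)]
        for m in netmats
    ]


def weighted_degree(matrix):
    """Mean strength of each node's connections with all other nodes."""
    n = len(matrix)
    return [mean(matrix[i][j] for j in range(n) if j != i) for i in range(n)]


# Toy data (hypothetical): two participants, three nodes.
p1 = [[0.0, 0.4, -0.3], [0.4, 0.0, 0.1], [-0.3, 0.1, 0.0]]
p2 = [[0.0, 0.2, -0.5], [0.2, 0.0, -0.3], [-0.5, -0.3, 0.0]]

aligned = align_to_group_sign([p1, p2])
degrees_p1 = weighted_degree(aligned[0])
print(aligned[0])
print(degrees_p1)
```

In the toy data, the connection between the second and third nodes is positive for the first participant but negative on average across the group, so its aligned value for that participant becomes negative, which mirrors the example given in the text.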

Cognitive Testing

Cognitive testing took place at the same visit as the MRI scan. Two tests were analyzed here: “fluid intelligence” (henceforth called “verbal-numerical reasoning”), and reaction time. These are described in detail in the Supplemental Materials.

Statistical Analysis

This section provides a summary of the statistical analyses we ran; a fuller technical description is provided in the Supplemental Materials.

We first adjusted all variables for age and ethnicity (both of which may have been associated with differences in brain measures; Cox et al. 2016; Isamah et al. 2010; Tang et al. 2010). In some analyses, as described below, we adjusted for total brain volume and height. The adjustment techniques are described in the Supplemental Materials.

Welch’s _t_-test was used for the mean comparisons, and a variance ratio test (_F_-test) was used to assess differences in the variance between the sexes. To calculate the associated Cohen’s _d_-value for each _t_-test, we multiplied the _t_-value by 2 and divided it by the square root of the degrees of freedom. The difference between correlations for each sex was calculated using Fisher’s _r_-to-_z_ transformation and a _z_-test (using the r.test function in the psych package for R; Revelle 2016). _P_-values were adjusted, within each analysis and within each brain measure, with the False Discovery Rate correction (Benjamini and Hochberg 1995; for example, the _P_-values for all the sex comparisons on volume were corrected separately from the _P_-values for all the sex comparisons on surface area) using the p.adjust function (with the “fdr” correction) for R. We used an alpha level of 0.05 to denote statistical significance. In an additional Bayesian analysis of the mean difference, we used the BayesFactor package for R (Morey and Rouder 2015) to compute BF10 values from a Bayesian _t_-test (using the ttestBF function; see Supplemental Materials).
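As a rough illustration of the arithmetic in this paragraph, the sketch below computes Welch’s _t_, the Welch–Satterthwaite degrees of freedom, Cohen’s _d_ as 2_t_/√df, and the variance ratio from summary statistics. It is written in plain Python rather than the R used in the study, the function name `welch_summary` is hypothetical, and _P_-values are omitted. The inputs are the total brain volume means and SDs from Table 1; because the reported tests were run on age- and ethnicity-adjusted variables, the output should not be expected to reproduce the published _t_ and _d_ exactly.

```python
import math


def welch_summary(m1, sd1, n1, m2, sd2, n2):
    """Welch's t, its degrees of freedom, Cohen's d, and the variance ratio.

    Group 1 is female and group 2 is male, so negative t and d mean higher
    male values, and a variance ratio below 1 means greater male variance.
    """
    v1, v2 = sd1 ** 2 / n1, sd2 ** 2 / n2
    t = (m1 - m2) / math.sqrt(v1 + v2)
    # Welch-Satterthwaite approximation to the degrees of freedom.
    df = (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))
    d = 2 * t / math.sqrt(df)  # conversion described in the text
    vr = sd1 ** 2 / sd2 ** 2   # female variance over male variance
    return t, df, d, vr


# Total brain volume (cm3), descriptive values taken from Table 1.
t, df, d, vr = welch_summary(1115.76, 89.68, 2750, 1233.58, 98.31, 2466)
print(f"t = {t:.2f}, df = {df:.1f}, d = {d:.2f}, VR = {vr:.2f}")
```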

We used cross-sectional mediation models (in a structural equation modeling framework) to test whether the brain variables (total brain volume, grey matter volume, white matter volume, total surface area, mean cortical thickness, general fractional anisotropy, and general orientation dispersion—the latter two estimated as latent variables—each in separate models, as well as specific brain regions) were significant mediators of the relation between sex and cognitive ability (either verbal-numerical reasoning score or reaction time, in separate models). We also ran multiple-mediator models that used individual brain subregions as mediators of the sex-cognitive relation, instead of overall measures. All methods for running the mediation analyses, along with the equation used to calculate a “percentage of mediation” for each brain variable, are described in the Supplemental Materials.

Results

Sex Differences in Overall and Subcortical Brain Volumes

The subcortical structures examined were the hippocampus, the nucleus accumbens, the amygdala, the caudate nucleus, the dorsal pallidum, the putamen, and the thalamus (Fig. S1). Raw volumetric sex differences are illustrated in Figure 1. The male distributions were further to the right, indicating higher means, and wider, indicating greater variance. This was confirmed by computing shift functions (Rousselet et al. 2017) for each overall and subcortical brain structure, illustrated in Figure S2a for the raw values and Figure S2b for the values corrected for total brain volume (TBV). There was a substantial degree of overlap between the sexes on all measures.

Figure 1.

Density plots of sex differences in overall brain volumes (left section) and subcortical structures (right section). d = Cohen’s d (mean difference); VR = Variance Ratio (variance difference). All mean differences were statistically significant at _P_ < 3.0 × 10^−25, all variance differences were significant at _P_ < .003, after correction for multiple comparisons (see Table 1).

We first tested for mean sex differences in overall cortical and subcortical brain volumes, adjusting each measure for age and ethnicity (analyses adjusted for TBV were performed separately and are described below). We examined differences in total as well as grey and white matter volumes separately. Differences are shown in Table 1. We observed statistically significant sex differences (adjusted for multiple comparisons using the False Discovery Rate correction), all showing larger volume in males. Note that, in what follows, negative effect sizes indicate higher values for males, and positive effect sizes indicate higher values for females. The effect sizes ranged from small to large; for example, Cohen’s d = −0.39 and −0.31 for the left and right nucleus accumbens volume, respectively; −1.41, −1.28, and −1.49 for total, grey matter, and white matter volumes respectively. The average difference for the 14 subcortical volumes was d = −0.70. A set of Bayesian _t_-tests (see Supplemental Materials and Table 1) confirmed that the mean sex differences were very large, with extremely strong evidence in favor of the hypothesis that males differed from females on every overall and subcortical volume. We also tested for age-by-sex interactions, assessing whether brain measures were more strongly associated with age in males or females. This was not the case for the overall measures (adjusted _P_-values > 0.8). However, all of the subcortical measures except the amygdala and the caudate showed significant interactions, indicating that the age association was stronger (and the implied age trend steeper) for males. Note that the reported effect sizes come from _t_-tests on variables adjusted for age and sex, but not their interaction.

Table 1.

Descriptive statistics with mean and variance comparisons for overall volumes, subcortical volumes, and cognitive tests

| Measure type | Measure | Female (n = 2750), M (SD) | Male (n = 2466), M (SD) | t | P (mean difference) | d | BF10 | VR | P (variance ratio) |
|---|---|---|---|---|---|---|---|---|---|
| Overall volumes (cm³) | Total brain volume | 1115.76 (89.68) | 1233.58 (98.31) | −48.91 | ~0.00 | −1.41 | 9.57 × 10^426 | 0.82 | 6.46 × 10^−06 |
| | Grey matter volume | 597.02 (47.78) | 643.45 (52.08) | −38.97 | 1.75 × 10^−287 | −1.28 | 1.62 × 10^289 | 0.81 | 3.60 × 10^−06 |
| | White matter volume | 518.85 (47.89) | 589.59 (52.69) | −51.53 | ~0.00 | −1.49 | 1.47 × 10^465 | 0.82 | 7.31 × 10^−06 |
| Subcortical volumes (cm³) | Left hippocampus^a | 3.73 (0.42) | 3.94 (0.46) | −18.91 | 2.69 × 10^−76 | −0.55 | 1.09 × 10^74 | 0.86 | 3.83 × 10^−04 |
| | Right hippocampus^a | 3.82 (0.42) | 4.04 (0.48) | −18.43 | 1.16 × 10^−72 | −0.54 | 7.97 × 10^70 | 0.77 | 1.16 × 10^−09 |
| | Left accumbens^a | 0.49 (0.11) | 0.53 (0.12) | −13.42 | 5.19 × 10^−39† | −0.39 | 2.13 × 10^36 | 0.81 | 2.95 × 10^−06 |
| | Right accumbens^a | 0.40 (0.10) | 0.42 (0.11) | −10.64 | 3.82 × 10^−26† | −0.31 | 1.04 × 10^23 | 0.83 | 4.46 × 10^−05 |
| | Left amygdala | 1.21 (0.22) | 1.35 (0.25) | −20.04 | 5.23 × 10^−85† | −0.59 | 4.73 × 10^83 | 0.74 | 5.89 × 10^−12 |
| | Right amygdala | 1.18 (0.24) | 1.31 (0.27) | −17.55 | 2.16 × 10^−66† | −0.51 | 1.60 × 10^64 | 0.79 | 1.54 × 10^−07 |
| | Left caudate | 3.28 (0.38) | 3.54 (0.41) | −23.00 | 3.04 × 10^−110 | −0.66 | 2.70 × 10^108 | 0.85 | 2.38 × 10^−04 |
| | Right caudate | 3.45 (0.40) | 3.72 (0.44) | −22.67 | 2.37 × 10^−107 | −0.65 | 4.08 × 10^105 | 0.84 | 4.46 × 10^−05 |
| | Left pallidum^a | 1.69 (0.21) | 1.85 (0.22) | −26.64 | 4.87 × 10^−145† | −0.77 | 2.19 × 10^143 | 0.88 | .002 |
| | Right pallidum^a | 1.74 (0.20) | 1.89 (0.22) | −26.96 | 3.82 × 10^−148† | −0.78 | 8.59 × 10^146 | 0.84 | 1.03 × 10^−04 |
| | Left putamen^a | 4.61 (0.50) | 5.07 (0.56) | −34.72 | 1.73 × 10^−234† | −1.01 | 1.29 × 10^235 | 0.83 | 1.46 × 10^−05 |
| | Right putamen^a | 4.64 (0.49) | 5.13 (0.55) | −37.13 | 4.76 × 10^−264† | −1.08 | 3.02 × 10^265 | 0.81 | 1.98 × 10^−06 |
| | Left thalamus^a | 7.54 (0.64) | 8.11 (0.72) | −33.73 | 7.76 × 10^−223 | −0.98 | 1.50 × 10^223 | 0.82 | 1.34 × 10^−05 |
| | Right thalamus^a | 7.34 (0.62) | 7.92 (0.69) | −35.76 | 2.42 × 10^−247 | −1.03 | 6.62 × 10^247 | 0.83 | 4.46 × 10^−05 |
| Cognitive tests | Verbal-numerical reasoning (max. score 13) | 6.80 (2.10) | 7.14 (2.13) | −6.21 | 5.77 × 10^−10 | −0.18 | 6.94 × 10^6 | 0.97 | .451 |
| | Reaction time (ms) | 590.37 (98.04) | 574.71 (100.71) | −7.63 | 2.71 × 10^−14 | −0.21 | 1.30 × 10^11 | 0.92 | .033 |

We tested whether sex differences in the subcortical measures were accounted for by the substantial difference in total brain volume. We regressed each subcortical variable on total brain volume, testing these adjusted residuals for sex differences. After this adjustment, there were no longer statistically significant differences in the hippocampus, caudate nucleus, or thalamus (all _P_adj-values > 0.60, absolute _d_-values < 0.03; Table S1). There remained differences in each of the other measures, albeit with attenuated effect sizes (average d for significant differences after adjustment = 0.17). Females had greater nucleus accumbens volume after adjustment for total brain volume (d = 0.08, _P_adj = 0.07 for left accumbens; d = 0.10, _P_adj = 0.003 for right). Overall, the majority of the sex differences in specific subcortical structures appeared to be linked to the difference in total brain size (average pre-adjustment attenuation of _d_-values for subcortical structures = 85.0%). We also ran analyses adjusting for height, since overall body size may have influenced these differences (as expected, males were substantially taller on average: d = −2.15). This attenuated all of the _d_-values (average attenuation across global and subcortical measures = 71.3%), but males still showed significantly larger volumes for all subcortical regions except the nucleus accumbens (Table S1). For example, post-adjustment _d_-values were −0.42 for total brain volume, −0.31 for grey matter volume, and −0.47 for white matter volume. The Bayesian analyses, also shown in Table S1, again confirmed these results: for instance, showing more evidence for the null hypothesis of no difference for the bilateral nucleus accumbens after adjustment for height, but still showing substantially more evidence for the hypothesis of a difference for all other regions.
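The adjustment step (regressing a regional measure on total brain volume and comparing the sexes on the residuals) can be sketched as below. This is a minimal illustration under stated assumptions: it uses a single-predictor ordinary least squares fit in plain Python, the function name `residualize` is hypothetical, and the six data points are invented toy values in which regional volume is almost entirely a function of total brain volume. The exact adjustment procedure used in the study is described in its Supplemental Materials and may differ.

```python
from statistics import mean


def residualize(y, x):
    """Residuals of y after a simple least-squares regression on x."""
    mx, my = mean(x), mean(y)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = my - slope * mx
    return [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]


# Invented toy data: three females followed by three males.
sex = ["F", "F", "F", "M", "M", "M"]
tbv = [1000.0, 1100.0, 1200.0, 1150.0, 1250.0, 1350.0]
noise = [0.02, -0.01, -0.01, 0.01, -0.02, 0.01]
region = [0.003 * v + e for v, e in zip(tbv, noise)]


def female_minus_male(values):
    """Difference in group means (negative means higher in males)."""
    f = mean(v for v, s in zip(values, sex) if s == "F")
    m = mean(v for v, s in zip(values, sex) if s == "M")
    return f - m


raw_diff = female_minus_male(region)
adj = residualize(region, tbv)
adj_diff = female_minus_male(adj)
print(f"raw difference = {raw_diff:.3f}, adjusted difference = {adj_diff:.4f}")
```

In this toy case the raw female-minus-male difference is clearly negative, while the difference in the residuals is close to zero, which is the pattern the text reports for the hippocampus, caudate nucleus, and thalamus.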

As shown in Table 1, there were statistically significant variance differences in all overall cortical and subcortical brain volumes, with males showing greater variance; the average variance ratio for overall volumes and subcortical volumes was 0.82 (variance ratios <1.00 indicate greater male variance). After adjusting for total brain volume or height, the variance differences reported in Table 1 remained relatively unchanged (see Table S1).
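As a small illustration of how the variance ratio reads, the sketch below assumes that the ratio is female variance divided by male variance, which is consistent with the statement that values below 1.00 indicate greater male variance. The function name and the toy numbers are hypothetical, and no significance test is included.

```python
import statistics


def variance_ratio(female, male):
    """Female sample variance divided by male sample variance.

    Under this (assumed) definition, values below 1.00 indicate
    greater male variance.
    """
    return statistics.variance(female) / statistics.variance(male)


# Toy values with equal means but a wider male spread
female = [9.0, 10.0, 11.0, 10.0, 10.0]
male = [8.0, 10.0, 12.0, 10.0, 10.0]
vr = variance_ratio(female, male)
print(f"VR = {vr:.2f}")
```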

Sex Differences in Subregional Brain Volume, Surface Area, and Cortical Thickness

Using FreeSurfer to parcellate cortical regions according to the Desikan–Killiany neuroanatomical atlas (Desikan et al. 2006; Figure S3), we tested for sex differences in (raw) volume, surface area, and cortical thickness across 68 cortical subregions. As with the analyses above, we adjusted all subregions for age and ethnicity; P-values were also adjusted within each measure type using the False Discovery Rate correction. The results are illustrated in Figure 2A (see also Table S2 for means, standard deviations, and difference tests for volume, surface area, and cortical thickness across all cortical regions).
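The False Discovery Rate correction mentioned above can be illustrated with the Benjamini-Hochberg step-up procedure. This section does not state which FDR procedure was used, so Benjamini-Hochberg is an assumption here, and the function name `fdr_adjust` and the example P-values are hypothetical. The function would be applied separately to each family of tests (for example, the 68 volume comparisons).

```python
def fdr_adjust(p_values):
    """Benjamini-Hochberg adjusted P-values, returned in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P-value down to the smallest, so that each
    # adjusted value is never larger than the one ranked above it.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


p_raw = [0.001, 0.04, 0.03, 0.20]
p_adj = fdr_adjust(p_raw)
print([round(p, 4) for p in p_adj])
```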

Figure 2.

Sex differences across the subregions in volume, surface area, and cortical thickness. Shown are (A) mean differences, (B) mean differences adjusted for total brain volume, total surface area, and mean cortical thickness (respectively by column); and (C) variance differences. Adjusted variance differences were near-identical to those shown in (C); see Figure S5. See Figure S3 for subregional atlas.

Males showed larger brain volume across all cortical subregions. The sex difference was statistically significant in every subregion, ranging in size from small (d = −0.24 in the right temporal pole) to large (d = −1.03 in the right insula). The mean d-value across all subregions was −0.67 (P_adj-values < 9.00 × 10^−13). Even larger differences, all favouring males, were observed for surface area; these ranged from moderate (d = −0.43 in the left caudal anterior cingulate) to large (d = −1.20 in the left superior frontal region). The mean d-value across all subregions was −0.83 (all P_adj-values < 2.00 × 10^−36).

Cortical thickness displayed a different pattern. In contrast to the results for volume and surface area, females had thicker cortex across almost the entire brain. The only area where males showed a statistically significantly thicker cortex was the right (but not left) insula, and the difference was small (d = 0.14). In all other areas, there was either no significant thickness difference (20/68 areas), or a statistically significant difference favouring females. The mean d-value in the 47 areas that reached statistical significance after multiple-comparisons correction was 0.22, ranging from d = 0.07 in the right rostral middle frontal region to d = 0.45 in the left inferior parietal region. Overall, higher female cortical thickness was generally not found in the temporal lobe (except the parahippocampal gyrus) or in the medial orbitofrontal regions. In some regions there appeared to be converse differences: in the motor and somatosensory regions in the parietal lobe, the frontal pole, and the parahippocampal gyrus, females showed relatively higher thickness but males showed relatively higher volume and surface area. In the superior temporal lobe and orbitofrontal regions, males showed relatively higher volume and surface area, but there was no particular sex difference in thickness. Once more, all of the subregional analyses were confirmed using the alternative Bayes Factor analyses.

We also tested age-by-sex interactions for each of the 3 variables (volume, surface area, and cortical thickness; Table S2). After multiple-comparisons correction, only two interactions were significant: the left and right superior frontal regions showed significantly stronger volume relations with age in males. That is, males may have had steeper volume decline in this region bilaterally with age. There were no statistically significant age-by-sex interactions for surface area or cortical thickness.

We next adjusted the subregional volume, surface area, and cortical thickness measures for total brain volume, total surface area, or mean cortical thickness, respectively. As shown in Figure 2B (and Table S3, which also shows the surface area and cortical thickness measures adjusted for total volume, since this is a common procedure in neuroimaging analyses), 11 regions were still significantly larger in volume for males (mean d = −0.14; the largest being the left isthmus cingulate, d = −0.22). There were also 13 regions where females now had a significantly larger volume (mean d = 0.15; the largest difference being in the right superior parietal, d = 0.21). The majority of regions (44/68) no longer showed significant volume differences (all P_adj-values > .34). For surface area, males were significantly larger in 18/68 areas after total surface area adjustment (mean d = −0.12, the largest difference in the right pars orbitalis; d = −0.16), and females were larger in 9 (mean d = 0.11; the largest being the left caudal anterior cingulate, d = 0.18). The remaining 41 surface areas showed no significant difference (all P_adj-values > .05). For cortical thickness, after correction for mean thickness there were still significant differences favouring females in 24/68 regions (mean d = 0.17; the largest in the left inferior parietal, d = 0.32), but also 25 regions with differences favouring males (mean d = −0.16, the largest in the right insula, d = −0.34).

Next, we adjusted the cortical subregional measures for height (Table S4). For volume, all of the comparisons were still significant, but with reduced effect sizes (−0.33 < d < −0.07, mean d = −0.19, all P_adj-values < .05); this was the same for surface area (−0.35 < d < −0.10, mean d = −0.25, all P_adj-values < .002). For thickness, there were 34/68 regions that were still significantly thicker in females (mean d = 0.12, the largest in the left inferior parietal, d = 0.19), and one thicker in males (the left entorhinal cortex, d = −0.08).

Variance differences across the 3 structural measures are illustrated in Figure 2C. For volume and surface area, males showed significantly greater variance than females across almost all brain regions. The volume variance ratio was significant in 64/68 regions, ranging from 0.88 in the right temporal pole to 0.67 in the left isthmus cingulate, with all P_adj-values < .031 after correction. The surface area variance ratio was significant in 66/68 regions, ranging from 0.88 in the left pars orbitalis to 0.65 in the left isthmus cingulate, all P_adj-values < .018 after correction. For cortical thickness (Fig. 2C), there were no significant variance differences in any region (all P_adj-values > .14) except one: females showed significantly greater variance in the thickness of the left medial orbitofrontal cortex (VR = 1.19, P_adj = .01). As can be observed from Figure S5 (and Table S3), controlling for the respective overall measures made only a negligible difference to the pattern of variance ratios reported above.

We tested whether the regions showing larger mean differences were also those with larger variance differences, by correlating the vector of d-values with the vector of VRs for each brain measure. As shown in Figures S6 and S7, there was some correspondence between mean and variance: in the unadjusted analysis, mean and variance were correlated at r = .51 for volume, but there were smaller correlations for surface area and thickness (r-values = .25 and −.06, respectively). Adjusted for TBV, all 3 correlations were relatively weak (r-values = .22, .03, and −.25 for the 3 brain measures respectively).

To verify whether the pattern of results across the cortical mantle was agnostic to the gyral boundaries of the Desikan–Killiany atlas, we conducted a supplemental analysis, testing sex differences using a vertex-wise approach, the results of which are shown in Figures S8 (for mean differences) and S9 (for variance differences). This precisely replicated the subregional atlas-based results.

Sex Differences in White Matter Microstructure

We tested sex differences in 22 white matter tracts. We focused on two white matter microstructural properties that had previously been shown to demonstrate differences between males and females in the initial release of UK Biobank imaging data (Cox et al. 2016). The first was fractional anisotropy (FA), an index of the directionality of water diffusion through the white matter. The second was orientation dispersion (OD), a NODDI measure of white matter tract complexity. For FA, there were generally higher values in males, particularly in the cortico-spinal tract (d = −0.54) and the acoustic radiation (d = −0.51). The average difference across tracts was d = −0.19. OD was higher in all tracts for females (average d = 0.30). These mean differences are shown in Figure 3, and fully reported in Tables S5 and S6.

Figure 3.

Mean sex differences in white matter microstructural measures (A) fractional anisotropy and (B) orientation dispersion across 22 white matter tracts. For both measures, numerically the largest effect was found in the right cortico-spinal tract. See Figure S4 for tract atlas.

Variance differences are illustrated in Figure S10 (see also Tables S5 and S6). Generally, there was greater male variance in FA (average VR = 0.92); however, there was substantially greater female variance in the cortico-spinal tract in particular (VR = 1.17, P = .0003). For OD, the only tract that showed a significant variance difference following FDR correction was the left superior thalamic radiation, where males showed greater variance (VR = 0.79).

Adjusting the microstructural measures for total brain volume (TBV) attenuated the size of the mean sex differences in FA (in one case producing a significantly higher score in females that was not present in the raw analysis, for the left inferior fronto-occipital fasciculus; Table S5). For OD, this adjustment made very few changes to the mean differences (Table S6). The TBV adjustment left the variance differences broadly unchanged (in some cases with a small increase in the size of the sex difference) for both FA and OD. This attenuation in the size of the mean sex differences in FA, but not OD, after adjustment for TBV may be a result of FA’s sensitivity to “partial volume” effects.

Relation of Neurostructural Differences to Cognitive Differences

We linked the structural brain differences to scores on two cognitive tests taken at the time of the imaging visit: verbal-numerical reasoning and reaction time (see Method). Descriptive statistics for the cognitive tests are shown in Table 1. Note that we coded both tests (reflecting the reaction time variable) so that higher scores indicated better performance. The test scores correlated positively, but weakly (r = .12). Males had a slightly higher mean score than females on verbal-numerical reasoning (d = −0.18) and slightly faster mean reaction time (d = −0.22); there was no significant variance difference for verbal-numerical reasoning (VR = 0.97, P = .45), though males had marginally more variance in reaction time (VR = 0.92, P = .03).

As a first step toward the mediation analyses, we correlated performance on the two cognitive tests with the overall brain measures in the full sample (Table S7), and in two randomly-selected sample halves separately (Table S8). The sample was split in this way to avoid overfitting and assess the replicability of the results. We then ran the same correlations across all the raw brain subregions, for volume, surface area, and cortical thickness (Table S9). These correlations were generally small, with all brain-cognitive r-values < .20. We compared the size of the correlations across the sexes; after multiple comparisons correction, there were no significant sex differences in these correlations. Thus, there was no evidence in the present analysis for sex differences in how regional brain structure related to the two measured cognitive skills.

Next, we tested the extent to which the mean cognitive differences were mediated by any of the overall brain measures (total, grey, and white matter volumes, total surface area, mean cortical thickness, or general factors of FA or OD). We ran a separate model, illustrated in Figure S11, for each brain measure. Results are displayed in Tables S10 and S11 for verbal-numerical reasoning and reaction time, respectively. For verbal-numerical reasoning, the sex difference in test scores was mediated substantially by brain volume measures and by surface area (all mediation percentages >82%). Cortical thickness showed far smaller mediation percentages (7.1% and 5.4% in the two sample halves, respectively). For reaction time, total brain and white matter volumes had mediation percentages >27%, but the other measures all produced smaller percentages (<15.3%), particularly mean cortical thickness (mediating <3% of the variance).
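To make the notion of a "mediation percentage" concrete, the sketch below uses the standard product-of-coefficients decomposition for a single mediator: the indirect effect (sex to brain measure, times brain measure to test score holding sex constant) divided by the total effect of sex on the test score. This is an assumption about the general form of such a calculation, not a reproduction of the models in Figure S11; the function names and the toy data are hypothetical.

```python
import statistics


def cov(u, v):
    """Sample covariance of two equal-length lists."""
    mu, mv = statistics.fmean(u), statistics.fmean(v)
    return sum((a - mu) * (b - mv) for a, b in zip(u, v)) / (len(u) - 1)


def percent_mediated(x, m, y):
    """Percentage of the total x -> y effect carried by the path x -> m -> y."""
    vx, vm = cov(x, x), cov(m, m)
    cxm, cxy, cmy = cov(x, m), cov(x, y), cov(m, y)
    a = cxm / vx                                        # effect of x on m
    b = (cmy * vx - cxy * cxm) / (vx * vm - cxm ** 2)   # effect of m on y, holding x fixed
    total = cxy / vx                                    # effect of x on y, ignoring m
    return 100.0 * a * b / total


# Toy data (assumption): x codes sex as 0/1, m is a brain measure,
# and y is built as 0.5 * m + 0.25 * x with no noise.
x = [0, 0, 0, 0, 1, 1, 1, 1]
m = [1.0, 2.0, 3.0, 4.0, 3.0, 4.0, 5.0, 6.0]
y = [0.5 * mi + 0.25 * xi for mi, xi in zip(m, x)]
pct = percent_mediated(x, m, y)
print(f"{pct:.1f}% of the group difference is mediated")
```

Here the group difference in m is 2.0, so the indirect effect is 2.0 × 0.5 = 1.0 out of a total effect of 1.25, giving 80%. With real data the percentage is an estimate with sampling error, and it can fall outside the 0 to 100 range when the direct and indirect effects have opposite signs.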

Finally, we tested which (raw) brain subregions were most important in explaining the mediation of the sex-cognitive relation, by running mediation models that included multiple individual regions as mediators. These variables were selected for their association with the cognitive ability in question (again, either verbal-numerical reasoning or reaction time) using LASSO regression models (see Method for details). The percentage of mediation for each selected region is illustrated in Figure 4 (see Figure S12 and Table S12 for full results). For verbal-numerical reasoning, the volume and surface area of the superior temporal region mediated the largest amounts of variance (29.1% and 18.4% in their respective models), with other relatively substantial contributions coming from the precuneus and insula for volume, and the pars opercularis and rostral middle frontal regions for surface area. For the cortical thickness predictors, and for the outcome of reaction time, as expected on the basis of the overall mediation results reported above, few of the regions showed substantial mediation (there was some mediation by the volume of frontal regions; at most 7.3% by the frontal pole).

Figure 4.

Percentage of the sex-cognitive relation mediated by each of the brain regions selected in a LASSO model to be linked to either verbal-numerical reasoning (left column) or reaction time (right column). Results for volume, surface area, and cortical thickness are shown in each row. Regions were averaged across the hemispheres; thus only a medial and lateral view for each measure and each cognitive test is shown.

Sex Differences in Resting-state Functional Connectivity

For our final set of analyses, we examined sex differences in resting-state functional MRI (rsfMRI) responses within a number of functional networks. The connections between each pair of functional networks were estimated and then transformed into measures of strength (see Method). We found that 54.7% (811 of 1485) of network connections showed a statistically significant sex difference (absolute β-values = 0.071–0.447 for females; 0.071–0.519 for males). A map showing the strengths of the connections between the 55 network nodes, and whether each connection was stronger in males (blue) or females (red), is provided in Figure 5A (see also Table S13). The strength of connectivity between sensorimotor, visual, and rostral lateral prefrontal areas was absolutely higher in males than females (see the cluster of brain regions with orange numerals in Fig. 5A), whereas the strength of connectivity within the default mode network (DMN; cluster of regions with red numerals in Fig. 5A) was absolutely higher in females than males.

Figure 5.

Results for resting-state fMRI connectivity and weighted degree of nodes. (A) Spatial maps for individual connections. Colors and line thickness represent the effect sizes of sex on the strength of connections (red = stronger in females; blue = stronger in males; darker/thicker = larger effect size). Only effect sizes (Cohen’s d) larger than ±0.2 are shown. Nodes were clustered into 5 categories using FSLnets based on their group-mean full-correlation matrix (yellow/orange: sensorimotor network; red: default mode network; purple: salience network and executive control network; green: dorsal attention network; blue: visual network). (B) and (C) Weighted degrees of nodes with higher values in males and females, respectively. The spatial maps of significant group-ICA nodes were multiplied by the effect size of the sex correlation. In order to show the regions with the largest associations with sex, only regions that had intensity over 50% of the whole-brain peak value are presented. See Table S14 for values for each connection and for each node’s weighted degree.

To further analyze these functional sex differences, we calculated the mean strength of all 54 connections to each individual node, producing a “weighted degree” statistic. Sex differences in weighted degree are shown in Figure 5B,C. Males showed stronger weighted degree than females in bilateral sensorimotor areas, the visual cortex, and the rostral lateral prefrontal cortex. Females showed stronger weighted degree than males in cortical areas comprising the DMN: the bilateral posterior cingulate cortex/precuneus, the dorsal anterior cingulate cortex, medial prefrontal cortex, temporo-parietal junction, anterior temporal lobe, medial temporal lobe (e.g., hippocampus and surrounding areas), and some cerebellar regions (see Tables S13 and S14).
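The counts used in this section follow from the network size: 55 nodes give 55 × 54 / 2 = 1485 distinct connections, and each node has 54 connections to the others. A minimal sketch of the "weighted degree" statistic, taken here as the mean of a node's connection strengths, is given below; the function names and the 3-node toy matrix are hypothetical, and the actual strength transformation is described in the Method section rather than reproduced here.

```python
def n_connections(n_nodes):
    """Number of distinct undirected connections among n_nodes nodes."""
    return n_nodes * (n_nodes - 1) // 2


def weighted_degree(strength):
    """Mean strength of each node's connections to all other nodes.

    `strength` is a symmetric square matrix (list of lists); the
    diagonal is ignored.
    """
    n = len(strength)
    return [sum(row[j] for j in range(n) if j != i) / (n - 1)
            for i, row in enumerate(strength)]


print(n_connections(55))

# Toy 3-node strength matrix (assumption, for illustration only)
toy = [[0.0, 0.2, 0.4],
       [0.2, 0.0, 0.6],
       [0.4, 0.6, 0.0]]
wd = weighted_degree(toy)
print([round(v, 2) for v in wd])
```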

Discussion

In a single-scanner sample of over 5000 participants from UK Biobank, we mapped sex differences in brain volume, surface area, cortical thickness, diffusion parameters, and functional connectivity. One main theme of the neurostructural results was that associations with sex were global. Males generally had larger volumes and surface areas, whereas females had thicker cortices. The differences were substantial: in some cases, such as total brain volume, more than a standard deviation. The effect size of d = −1.41 for total brain volume (Table 1) translates to 92.1% of males being above the female mean, and an 84.1% chance that a randomly-chosen male will have a larger total brain volume than a randomly-chosen female. (The following online app can be used to calculate further such values: http://rpsychologist.com/d3/cohend/.)

We also found that volume and surface area mediated nearly all of the small sex difference in reasoning ability, but far less of the difference in reaction time. For white matter microstructure, females showed lower directionality (FA) and higher tract complexity (OD); white matter microstructure was a poor mediator of the cognitive sex difference. Resting-state fMRI analyses also revealed a global effect: around 54% of connections showed a sex difference. These differences clustered around specific networks, with stronger connectivity in females in the default mode network and stronger connectivity in males between unimodal sensory and motor cortices as well as high-level cortical areas in the rostral lateral prefrontal cortex. Even for the brain measures that showed large sex differences, there was always overlap between males and females (see Fig. 1): even in the case of the large difference in total brain volume noted above, there was 48.1% sample overlap.
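The percentages quoted above for total brain volume can be recovered from the effect size alone. The sketch below assumes that both groups are normally distributed with equal variances, which is the usual simplification behind such conversions (the variance differences reported in the Results mean it is only approximate here). The function names are hypothetical.

```python
import math


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def share_above_other_mean(d):
    """Proportion of the higher-scoring group lying above the other group's mean."""
    return normal_cdf(abs(d))


def probability_of_superiority(d):
    """Chance that a random member of the higher group exceeds a random member of the other."""
    return normal_cdf(abs(d) / math.sqrt(2.0))


def overlap(d):
    """Overlapping proportion of two equal-variance normal distributions."""
    return 2.0 * normal_cdf(-abs(d) / 2.0)


d_tbv = -1.41  # total brain volume, Table 1
u3 = share_above_other_mean(d_tbv)
ps = probability_of_superiority(d_tbv)
ovl = overlap(d_tbv)
print(f"{u3:.1%}  {ps:.1%}  {ovl:.1%}")
```

For d = −1.41 these three functions give approximately 92.1%, 84.1%, and 48.1%, matching the values reported in the text.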

The principal strengths of the present study are its sample size (providing sensitivity for the identification of small effects with high statistical power), the wide range of MRI modalities, and the consideration of both mean and variance differences. Given the surfeit of small-n studies in neuroscience (Button et al. 2013; Nord et al. 2017), it is of great importance to test hypotheses in large, well-powered samples, especially given that many neural sex differences are modest in size (Joel et al. 2015). Here, we had excellent statistical power to find small effects in brain subregions, providing a robust and detailed analysis. For our subregional analysis, we had a far larger sample size than the most recent meta-analysis (Ruigrok et al. 2014). In contrast to that meta-analysis, which found greater volume for females in areas such as the thalamus, the anterior cingulate gyrus, and the lateral occipital cortex, our study found no brain subregions where females had a larger volume than males. The reason for this may be the more restricted age range of the participants in our study (sex may relate differently to the brain at different ages, as we found for several brain regions in our age-by-sex interaction analyses, and as was found in a previous developmental study of children and adolescents; Gennatas et al. 2017) or, more likely, study size and heterogeneity: the data for the section of the meta-analysis on regional volumes came from many separate studies, on separate scanners, generally with small sample sizes (many with n < 100), whereas our contrasts were based on one very large, single-scanner study.

The higher male volume in our study appeared largest in some regions involved in emotion and decision-making, such as the bilateral orbitofrontal cortex, the bilateral insula, and the left isthmus of the cingulate gyrus (Craig 2009; MacPherson et al. 2015; Ochsner and Gross 2005; Wager et al. 2008), but also areas such as the right fusiform gyrus. Note that the insula showing the largest sex difference is consistent with a recent large-scale study of children and adolescents (Gennatas et al. 2017); it appears this region retains its substantial sex difference into later life. For surface area, which showed an even larger difference favouring males, the regions that showed the largest effects were broadly areas involved in the hypothesized intelligence-related circuit in the “P-FIT” model (Jung and Haier 2007): for example, the bilateral superior frontal gyri, the bilateral precentral gyri, the left supramarginal gyrus, and the bilateral rostral middle frontal areas. However, some of the regions involved in this theorized circuit were also larger, in terms of thickness, for females. For instance, the bilateral inferior parietal regions were the regions with numerically the largest difference favouring females in cortical thickness. Our finding that raw cortical thickness was greater for females (a finding with a smaller mean effect than the one in the opposite direction for volume or surface area) is consistent with a number of previous, smaller studies (e.g., Luders et al. 2006; Sowell et al. 2007; Lv et al. 2010; van Velsen et al. 2013; though not others: Salat et al. 2004; Reid et al. 2010). Our greater statistical power may have allowed us to find smaller differences in thickness across the cortex. It is interesting to note that a previous study from the animal literature found greater cortical thickness in male mice (Markham et al. 2003); however, this study had a small sample size and by necessity used very different methods from ours: further comparison of sex differences in the brains across different species might reveal more about the cellular mechanisms underlying these differences.

Whereas previous work has found some white matter regions where fractional anisotropy was higher for females (Kanaan et al. 2012; Dunst et al. 2014), we found that males showed higher FA in 18 of the 22 tracts we examined. FA also generally showed greater variance in males. On the other hand, higher orientation dispersion was found for females in all tracts. Unexpectedly, higher OD was found to be related to lower cognitive performance on the two tests examined here. Since OD is a relatively new measure of white matter microstructure (Daducci et al. 2015), further work should aim to clarify its behavioral correlates. The fact that (as described in the Method section) measurement invariance did not hold across the sexes for the latent variables of FA and OD, indicating that the tract-specific measurements may be assessing somewhat different latent variables in each sex, may also be of interest for future researchers examining general-level indicators of white matter microstructure.

The issue of adjusting for overall brain size in analyses of sex differences (Rippon et al. 2014) was addressed in each of our macrostructural analyses. As can be seen comparing Figure 2A and B, after this adjustment the higher male volume and surface area was substantially reduced, often to non-significance. For those latter brain regions, this implies that the sex difference was general and that the larger volume or surface area was a by-product of the overall larger male brain. However, for some regions, especially for surface area (particularly in areas such as the left isthmus of the cingulate gyrus and the right precentral gyrus), males still showed a significantly higher measurement, indicating specific sex differences in the proportional configuration of the cortex, holding brain size equal. Most interestingly, for some areas (for example the right insula, the right fusiform gyrus, and the left isthmus of the cingulate gyrus), the difference was reversed after adjustment, with females showing significantly larger brain volume.

A recent meta-analysis of sex differences in amygdala volume (Marwha et al. 2016) found that, although males showed larger raw volume, after correction for total brain volume there was no longer an appreciable sex difference. However, in our study the amygdala was significantly, but modestly, larger in males even after adjusting for total brain volume (d = 0.18 bilaterally). The heterogeneity in the methods of the studies being meta-analyzed may have led to the divergent conclusion from our single-sample study. With regard to the hippocampus, however, we found results consistent with another recent meta-analysis (Tan et al. 2016): there were no longer significant sex differences after adjustment for total brain volume (this was also the case for the thalamus and caudate). We recommend that future studies perform comparisons both before and after adjusting for total volume (or the relevant overall measure, in the case of measures such as cortical thickness or surface area), since these results pertain to quite different questions: unadjusted, they ask which regions differ in an absolute sense; adjusted, they ask which regions are different relative to the overall size (or other overall measure) of the brain. Both of these may be important questions to ask in different contexts.

One question that could not be addressed using the current data regards the underlying bio-social causes, ultimate or proximate, for the sex differences that we observed. Many variables were collected in UK Biobank that might be linked to the sex differences observed here (and may be proximal causes of them) but our intention in the present study was to characterize, not necessarily explain, these differences: future research should investigate more targeted hypotheses of the causes of the differences. Sex differences in brain structure are observed early in the life course (e.g., Knickmeyer et al. 2014), though this does not imply that the pattern of adult differences we observed is necessarily the same as is found in childhood. The literature on developmental sex differentiation of the brain highlights influences of factors, such as genes on sex chromosomes or sex hormones acting via receptors throughout the developing and adult brain, that were not analyzed in the present study (McCarthy and Arnold 2011; Lombardo et al. 2012; McEwen and Milner 2017). Likewise, understanding the potential neurobiological effects of social influences during development (Dawson et al. 2000) was beyond the scope of our research and our dataset.

Our analysis also focused on sex differences in variance. Here, for the first time in an adult sample, we directly tested sex differences in the variance of several brain measures, finding greater male variance across almost the entire brain for volume, surface area, and white matter fractional anisotropy, but only patchy and inconsistent variance differences for cortical thickness and white matter orientation dispersion. Our correlation of the vector of mean differences with that of variance ratios showed that there was some degree of correspondence between them for volume, but the relation was far smaller for surface area (and was near-zero for cortical thickness). In all cases, this analysis indicated that mean and variance differences appear to be largely independent aspects of sex differences in the brain; we should not expect to see that areas showing the largest difference in mean also show the largest difference in variance, at least to a great extent.

One potential candidate to explain greater male variability across multiple phenotypes is the hypothesized “female-protective” mechanism involving effects of the X chromosome (Craig et al. 2009; Johnson et al. 2009; Reinhold and Engqvist 2013), or other protective factors that might “buffer” females from potential deleterious consequences of rare genetic mutations (Jacquemont et al. 2014; Robinson et al. 2013). Such explanations are speculative at present; as studies like UK Biobank release even larger amounts of data on individuals who have both neurostructural and genotype data, researchers will be able to perform well-powered tests of these hypotheses.

In discussing any potential genetic effects on sex differences, analyses should take into account the fact that such effects are likely active at multiple points across the lifespan, representing a continuing, complex set of influences that may interact with environment and experience. This complexity is deepened by phenomena such as genomic imprinting (where the expression profiles of certain genes differ according to whether they were inherited paternally or maternally, which has been hypothesized to contribute to the aforementioned sex-biased nature of some psychiatric disorders; Dulac and Christopher 2013) and the “mother’s curse” (a buildup of male-specific mutations in mitochondrial DNA, which is passed on maternally; Gemmell et al. 2004).

Using the (limited) data on cognitive abilities available in our sample, we tested whether the data were consistent with any consequences of brain structural differences in terms of ability differences. There were only weak correlations between brain variables and the cognitive tests (consistent with previous evidence of these links: Karama et al. 2014; Schnack et al. 2014; Pietschnig et al. 2015), and these associations did not differ by sex (consistent with the brain-volume-cognitive meta-analysis of Pietschnig et al. 2015). Mediation modeling suggested that, for verbal-numerical reasoning, a very large portion (up to 99%) of the modest sex difference was mediated by brain volumetric and surface area measures. Smaller fractions (up to 38%) of the modest link between sex and reaction time could be explained by volume or surface area. Perhaps unexpectedly, given evidence and theory linking white matter microstructure to cognitive processing speed (Penke et al. 2012; Bennett and Madden 2014), white matter microstructural measures only mediated a small proportion of the sex difference in reaction time (this may have been due to weaknesses in this cognitive measure; see below). Cortical thickness had trivial mediating effects compared to volume and surface area: no more than 7.1% of the sex-cognitive relation was mediated by thickness in any analysis. With our multiple-mediator models, we built a map of which brain regions were most relevant in this mediation of the sex-cognitive relation (Fig. 4). Overall, the data were consistent with higher volume and cortical surface area—but not cortical thickness or microstructural characteristics—chiefly in the superior temporal region, but also spread across multiple other regions to a lesser extent, being of particular relevance to sex differences in reasoning (but not reaction time).

An additional hypothesis—one that is not incompatible with the hypothesis that some of the sex differences seen here are a proximate cause of behavioral differences—is that brain structural differences might sometimes be the result of compensatory mechanisms for differences in sex-specific hormones, and might thus act to reduce behavioral sex differences that would otherwise have been present (De Vries 2004; McCarthy and Arnold 2011). This perspective may in part explain an apparent paradox in human sex difference research: that the (raw) effect sizes found for brain measures such as volume and surface area are so large, whereas most behavioral sex differences are so small (Hyde 2014). Our descriptive results do not directly speak to any causal mechanisms, but it should be borne in mind that they are compatible with these multiple interpretations.

Sex differences in intrinsic functional connectome organization also revealed results that corroborate and extend prior work. We successfully replicated the results from the 1000 Functional Connectomes dataset (an entirely separate dataset); that is, we found female > male connectivity within the default mode network and some evidence for male > female connectivity in sensorimotor and visual cortices (Biswal et al. 2010). The higher female connectivity within circuits like the DMN may be particularly important, given that DMN regions are often considered an important part of the “social brain” (Kennedy and Adolphs 2012; Mars et al. 2012; Amft et al. 2015). Whether such an effect can help explain higher average female ability in domains like social cognition (Gur et al. 2012), and whether such functional differences can be integrated with differences in the structural connectome (Ingalhalikar et al. 2014), remains to be seen. Finally, recent work has shown that intrinsic functional connectome organization can be parsimoniously described as a small number of connectivity gradients (Margulies et al. 2016). The most prominent connectivity gradient has at one pole the DMN and at the other unimodal sensory and motor cortices. The pattern of sex differences in functional connectome organization observed here appears to recapitulate the two main poles of that principal connectivity gradient (Margulies et al. 2016). One potential way of describing the biological significance of these functional sex differences is that mechanisms involved in shaping sex differences (biological, cultural, or developmental) may influence this principal connectivity gradient; the result, which should be explored in future investigations of brain sex differences, may be the multiple network differences found in the present study.

Limitations

The UK Biobank sample was selective. It covered only one part of the life course (from approximately 45 to 75 years of age), and thus our findings may not apply to younger adults. With ageing may come larger variation in some brain parameters (Cox et al. 2016), meaning that the distributions found here may not fully represent those across the lifespan. Many of the female participants might have been undergoing, or have undergone, menopause; this (or associated hormone replacement therapy) might exert modest effects on the structure of some regions of the brain (Zhang et al. 2016), effects which may themselves change with increasing age. In addition, UK Biobank had a very low response rate to invitations to participate (5.47% in the full sample of ~500,000; Allen et al. 2012). We would thus not expect the individuals studied here to be fully representative of males and females from the general UK population. This was the case for education: individuals with college or university degrees were over-represented (see Method), though the male:female education ratio itself appeared representative. These selection effects may in part explain the differences between our results and those of previous studies and meta-analyses, as discussed above. Although we adjusted for the effects of age, it should also be noted, as for any study with a relatively wide age range, that there was substantial variation in the birth date of the participants, undoubtedly leading to different (unmeasured) social experiences during their development.

On the topic of age adjustment, it should also be noted that we adjusted for linear effects of age, whereas some variables may have nonlinear trends (although, given a preliminary analysis as described in the Supplementary Materials, we would not expect this to affect the sex differences in these variables to a substantial extent). We should also note that, as described in the Supplementary Materials, there was a significant, yet small, age difference between the sexes (men were older by 1.27 years on average). A final issue of representativeness concerns clinical outcomes. Although we noted above that there is much interest in sex-differential patterns of psychiatric disorder diagnoses, the unrepresentativeness of UK Biobank extends to generally low rates of such disorders in the sample. For this reason, we did not attempt to link the MRI sex differences observed here to clinical diagnoses, though studies of normal-range variation in traits linked to psychiatric disease (such as neuroticism, a known risk factor for Major Depressive Disorder; Kotov et al. 2010) may produce more fruitful results.
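The concern about linear versus nonlinear age adjustment can be illustrated with a small sensitivity check: estimate the sex coefficient once with a linear age term and once with an added quadratic term, then compare. The sketch below is hypothetical and runs on simulated data; only the 1.27-year mean age gap is taken from the text, while the function names, the outcome's effect sizes, and the age range are assumptions made for illustration. It is not the preliminary analysis reported in the Supplementary Materials.

```python
import numpy as np


def sex_effect_by_age_adjustment(outcome, sex, age):
    """Estimate the sex coefficient with linear and with linear-plus-quadratic age adjustment."""
    age_centered = age - age.mean()  # centering keeps the quadratic term well conditioned
    intercept = np.ones(len(outcome))
    linear_design = np.column_stack([intercept, sex, age_centered])
    quadratic_design = np.column_stack([intercept, sex, age_centered, age_centered ** 2])
    linear_fit, *_ = np.linalg.lstsq(linear_design, outcome, rcond=None)
    quadratic_fit, *_ = np.linalg.lstsq(quadratic_design, outcome, rcond=None)
    # Index 1 is the sex coefficient in both designs.
    return {"linear": linear_fit[1], "quadratic": quadratic_fit[1]}


def simulate_cohort(n=5000, seed=1):
    """Synthetic cohort: one sex is 1.27 years older on average; the outcome declines nonlinearly with age."""
    rng = np.random.default_rng(seed)
    sex = rng.integers(0, 2, size=n).astype(float)  # 1 = older group (arbitrary coding)
    age = rng.uniform(44, 76, size=n) + 1.27 * sex
    age_offset = age - 60
    # Hypothetical effect sizes: a 0.5 SD sex difference plus linear and quadratic age trends.
    outcome = 0.5 * sex - 0.02 * age_offset - 0.001 * age_offset ** 2 + rng.normal(size=n)
    return outcome, sex, age


if __name__ == "__main__":
    estimates = sex_effect_by_age_adjustment(*simulate_cohort())
    print({name: round(float(value), 3) for name, value in estimates.items()})
```

If the two estimates are close, as they are expected to be when the age distributions of the sexes differ only by a small shift, the choice of age adjustment matters little for the sex difference itself.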

Caution should be taken in interpreting the results of the analyses involving the cognitive tests (the mediation analyses in addition to the correlations). Whereas previous, representative studies (e.g., Johnson et al. 2008) have found no mean difference, but a variance difference, in general cognitive test performance, the tests examined here showed mean differences but no strong variance differences. This may be due to problems of sample representativeness (Dykiert et al. 2009), or due to the tests tapping specific cognitive skills rather than general ability (Burgaleta et al. 2012). The cognitive measures were relatively psychometrically poor compared to a full IQ assessment: the verbal-numerical reasoning test had only 13 items, and the reaction time test had only 4 trials that counted towards the final score (see Lyall et al. 2016, for analyses of the reliability of these tests). Although the tests, particularly verbal-numerical reasoning, have some external validity (Hagenaars et al. 2016), the above issues mean that the cognitive analyses reported here should be considered preliminary. Fuller cognitive testing, currently underway in UK Biobank, will allow a more comprehensive exploration. Studies that use tests where males or females are known to show higher average scores (such as 3D mental rotation tests, which generally show higher scores in males; Maeda and Yoon 2013) would potentially allow for more informative results. In addition, cross-sectional mediation models of observational data, such as those used here, are inherently limited: they cannot address causal relations between variables. The models were simple, including only 3 main variables (sex, the brain measure, and cognitive ability; Fig. S11). Note also that there exists a great deal of debate over testing and quantifying the indirect effect in mediation models (e.g., Hayes and Scharkow 2013). More complex models, using longitudinal data and latent variables derived from multiple cognitive tests, should be specified in future research.

Finally, although this study used a wide variety of neuroimaging measures, it should be noted that these were but a small selection of the possible modalities that we could have investigated, and that studies should address in future. Other diffusion and NODDI measures of white matter microstructure such as radial and axial diffusivity and intracellular volume fraction (Cox et al. 2016), cortical measures such as regional gyrification (Gregory et al. 2016) and grey matter density (Ruigrok et al. 2014), and pathological brain structures such as white matter hyperintensities (Wardlaw et al. 2015) and enlarged perivascular spaces (Potter et al. 2015) may show interesting patterns of sex differences both across the population, and in how they relate to healthy behavioral variation as well as disease states.

Conclusions

The present study is the largest single-sample study of neuroanatomical sex differences to date. We report evidence on the pattern of sex differences in brain volume, surface area, cortical thickness, white matter microstructure, and functional connectivity between adult males and females in the range between middle- and older-age. As has previously been argued (Fine 2017), providing a clear characterization of neurobiological sex differences is a step towards understanding patterns of differential prevalence in neurodevelopmental disorders such as autism spectrum disorder (Baron-Cohen et al. 2011), a variety of psychiatric conditions such as schizophrenia (Aleman et al. 2003), and neurodegenerative disorders such as Alzheimer’s Disease (Viña and Lloret 2010; Mazure and Swendsen 2016). We hope that the results provided here, given their large-scale, multimodal nature, will constitute an authoritative point of reference for future studies on a wide range of questions on brain sex differences. Insights into how and where the brain differs as a function of sex—with considerably more precision than in previous investigations—will enable more targeted examinations into potential drivers of these differences across psychiatric, psychological, and other domains. In particular, integrating macrostructural, microstructural, and functional data is an important long-term goal (Gur and Gur 2017). Data on many thousands of further MRI scans (to a maximum sample of 100,000 with MRI data) will be available from UK Biobank in the coming years, in addition to more complex cognitive testing batteries and genotypic data. Future studies will be able to explore in much greater depth the links between sex differences in the brain, their possible causes, and their potential medical and behavioral consequences.

Notes

We are grateful to the UK Biobank members for their participation in the study, and to the UK Biobank team, who collected, processed, and made available the data for analysis. This work was carried out under UK Biobank application number 10279. The work was primarily carried out in The University of Edinburgh Centre for Cognitive Ageing and Cognitive Epidemiology (CCACE), which is part of the UK Research Councils’ cross-council Lifelong Health and Wellbeing initiative (MR/K026992/1). We are grateful to Anne Scheel, Gina Rippon, Michel Nivard, and Odile Fillod for helpful comments on a previous draft. None of the funders or other acknowledged individuals are responsible for our analysis or interpretation of our results. Conflict of interest: None declared.

Supplementary Material

Supplementary material is available at Cerebral Cortex online.

Funding

Additional funding has been received from the Biotechnology and Biological Sciences Research Council (BBSRC), the Medical Research Council (MRC), and Age UK’s Disconnected Mind project. Authors S.J.R., S.R.C., D.C.M.L., and I.J.D. were supported by the MRC award to CCACE (MR/K026992/1). Author S.R.C. was supported by MRC grant MR/M013111/1. Author I.J.D. is additionally supported by the MRC Dementias Platform UK (MR/L015382/1). The work was also supported by a Wellcome Trust Strategic Award, “Stratifying Resilience and Depression Longitudinally” (STRADL; reference 104036/Z/14/Z). Author X.S. receives support from the China Scholarship Council. Author L.M.R. was supported by an Erasmus Traineeship Grant. Author H.C.W. is supported by a JMAS SIM fellowship from the Royal College of Physicians of Edinburgh, and by an ESAT College Fellowship from the University of Edinburgh. Authors A.M.M., H.C.W., and S.M.L. gratefully acknowledge the support of the Sackler Foundation. Author A.M.M. has previously received grant support from Pfizer, Lilly, and Janssen for work entirely unrelated to the data or analyses in this study.

References

  1. Aleman A, Kahn RS, Selten JP. 2003. Sex differences in the risk of schizophrenia: evidence from meta-analysis. Arch Gen Psychiatry. 60:565–571.
  2. Alfaro-Almagro F, Jenkinson M, Bangerter NK, Andersson JL, Griffanti L, Douaud G, Sotiropoulos SN, Jbabdi S, Hernandez-Fernandez M, Vallee E, et al. 2018. Image processing and Quality Control for the first 10,000 brain imaging datasets from UK Biobank. Neuroimage. 166:400–424.
  3. Allen N, Sudlow C, Downey P, Peakman T, Danesh J, Elliott P, Gallacher J, Green J, Matthews P, Pell J, et al. 2012. UK Biobank: current status and what it means for epidemiology. Health Policy Technol. 1:123–126.
  4. Amft M, Bzdok D, Laird AR, Fox PT, Schilbach L, Eickhoff SB. 2015. Definition and characterization of an extended social-affective default network. Brain Struct Func. 220:1031–1049.
  5. Andersson J, Jenkinson M, Smith S. 2007a. Non-linear registration aka spatial normalisation. Internal Technical Report TR07JA2, Oxford Centre for Functional Magnetic Resonance Imaging of the Brain, Department of Clinical Neurology, Oxford University, Oxford, UK.
  6. Andersson J, Smith S, Jenkinson M. 2007b. Non-linear optimisation. Internal Technical Report TR07JA1, Oxford Centre for Functional Magnetic Resonance Imaging of the Brain, Department of Clinical Neurology, Oxford University, Oxford, UK.
  7. Archer J. 2004. Sex differences in aggression in real-world settings: a meta-analytic review. Rev Gen Psychol. 8(4):291–322.
  8. Arnett AB, Pennington BF, Peterson RL, Willcutt EG, DeFries JC, Olson RK. 2017. Explaining the sex difference in dyslexia. J Child Psychol Psychiatry. 58(6):719–727. doi:10.1111/jcpp.12691.
  9. Baron-Cohen S, Lombardo MV, Auyeung B, Ashwin E, Chakrabarti B, Knickmeyer R. 2011. Why are autism spectrum conditions more prevalent in males? PLoS Biol. 9:e1001081.
  10. Beckmann C, Smith S. 2004. Probabilistic independent component analysis for functional magnetic resonance imaging. IEEE Trans Med Imaging. 23:137–152.
  11. Beery AK, Zucker I. 2011. Sex bias in neuroscience and biomedical research. Neurosci Biobehav Rev. 35:565–572.
  12. Behrens TEJ, Berg HJ, Jbabdi S, Rushworth MFS, Woolrich MW. 2007. Probabilistic diffusion tractography with multiple fibre orientations: what can we gain? Neuroimage. 34:144–155.
  13. Behrens TEJ, Woolrich MW, Jenkinson M, Johansen-Berg H, Nunes RG, Clare S, Matthews PM, Brady JM, Smith SM. 2003. Characterization and propagation of uncertainty in diffusion-weighted MR imaging. Magn Reson Med. 50:1077–1088.
  14. Benjamini Y, Hochberg Y. 1995. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Royal Stat Soc B. 57:289–300.
  15. Bennett IJ, Madden DJ. 2014. Disconnected aging: cerebral white matter integrity and age-related differences in cognition. Neurosci. 276:187–205.
  16. Biswal BB, Mennes M, Zuo X-N, Gohel S, Kelly C, Smith SM, Beckmann CF, Adelstein JS, Buckner RL, Colcombe S, et al. 2010. Toward discovery science of human brain function. Proc Natl Acad Sci USA. 107:4734–4739.
  17. Borkenau P, McCrae RR, Terracciano A. 2013. Do men vary more than women in personality? A study in 51 cultures. J Res Personality. 47:135–144.
  18. Burgaleta M, Head K, Álvarez-Linera J, Martínez K, Escorial S, Haier R, Colom R. 2012. Sex differences in brain volume are related to specific skills, not to general intelligence. Intelligence. 40:60–68.
  19. Button KS, Ioannidis JPA, Mokrysz C, Nosek BA, Flint J, Robinson ESJ, Munafò MR. 2013. Power failure: why small sample size undermines the reliability of neuroscience. Nat Rev Neurosci. 14:365–376.
  20. Cahill L. 2006. Why sex matters for neuroscience. Nat Rev Neurosci. 7:477–484.
  21. Cahill L. 2017. An issue whose time has come. J Neurosci Res. 95:12–13.
  22. Collins R. 2012. What makes UK Biobank special? Lancet. 379:1173–1174.
  23. Collins FS, Varmus H. 2015. A new initiative on precision medicine. N Engl J Med. 372:793–795.
  24. Costa PT, Terracciano A, McCrae RR. 2001. Gender differences in personality traits across cultures: robust and surprising findings. J Pers Soc Psychol. 81:322–331.
  25. Cox SR, Bastin ME, Ritchie SJ, Dickie DA, Liewald DC, Muñoz Maniega SM, Redmond P, Royle NA, Pattie A, Valdés Hernández M, et al. 2018. Brain cortical characteristics of lifetime cognitive ageing. Brain Struct Func. 223:509–518.
  26. Cox SR, Ritchie SJ, Tucker-Drob EM, Liewald DC, Hagenaars SP, Davies G, Wardlaw JM, Gale CR, Bastin ME, Deary IJ. 2016. Ageing and brain white matter structure in 3,513 UK Biobank participants. Nat Commun. 7:13629.
  27. Craig AD. 2009. How do you feel – now? The anterior insula and human awareness. Nat Rev Neurosci. 10:59–70.
  28. Craig IW, Haworth CM, Plomin R. 2009. Commentary on “A role for the X chromosome in sex differences in variability in general intelligence?” (Johnson et al., 2009). Perspect Psychol Sci. 4:615–621.
  29. Daducci A, Canales-Rodríguez EJ, Zhang H, Dyrby TB, Alexander DC, Thiran JP. 2015. Accelerated microstructure imaging via convex optimization (AMICO) from diffusion MRI data. Neuroimage. 105:32–44.
  30. Darwin C. 1871. The descent of man, and selection in relation to sex. London, UK: John Murray.
  31. Dawson G, Ashman SB, Carver LJ. 2000. The role of early experience in shaping behavioral and brain development and its implications for social policy. Dev Psychopathol. 12:695–712.
  32. de Groot M, Vernooij MW, Klein S, Ikram MA, Vos FM, Smith SM, Niessen WJ, Andersson JL. 2013. Improving alignment in Tract-based Spatial Statistics: evaluation and optimization of image registration. Neuroimage. 76:400–411.
  33. De Vries GJ. 2004. Minireview: sex differences in adult and developing brains: compensation, compensation, compensation. Endocrinology. 145:1063–1068.
  34. Deary IJ, Irwing P, Der G, Bates TC. 2007a. Brother–sister differences in the g factor in intelligence: analysis of full, opposite-sex siblings from the NLSY1979. Intelligence. 35:451–456.
  35. Deary IJ, Strand S, Smith P, Fernandes C. 2007b. Intelligence and educational achievement. Intelligence. 35:13–21.
  36. Desikan RS, Ségonne F, Fischl B, Quinn BT, Dickerson BC, Blacker D, Buckner RL, Dale AM, Maguire RP, Hyman BT, et al. 2006. An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. Neuroimage. 31:968–980.
  37. Dulac C, Christopher G. 2013. Genomic imprinting in the adult and developing brain. In: Pfaff D, Christen Y, editors. Multiple origins of sex differences in brain: research and perspectives in endocrine interactions, 35–41. Berlin: Springer.
  38. Dunst B, Benedek M, Koschutnig K, Jauk E, Neubauer AC. 2014. Sex differences in the IQ-white matter microstructure relationship: a DTI study. Brain Cogn. 91:71–78.
  39. Dykiert D, Gale CR, Deary IJ. 2009. Are apparent sex differences in mean IQ scores created in part by sample restriction and increased male variance? Intelligence. 37:42–47.
  40. Fine C. 2017. Testosterone Rex: unmaking the myths of our gendered minds. London, UK: Omnibus Books.
  41. Fischl B, Dale AM. 2000. Measuring the thickness of the human cerebral cortex from magnetic resonance images. Proc Natl Acad Sci USA. 97:11050–11055.
  42. Fischl B, van der Kouwe A, Destrieux C, Halgren E, Ségonne F, Salat DH, Busa E, Seidman LJ, Goldstein J, Kennedy D, et al. 2004. Automatically parcellating the human cerebral cortex. Cereb Cortex. 14:11–22.
  43. Gemmell NJ, Metcalf VJ, Allendorf FW. 2004. Mother’s curse: the effect of mtDNA on individual fitness and population viability. Trends Ecol Evol. 19:238–244.
  44. Gennatas ED, Avants BB, Wolf DH, Satterthwaite TD, Ruparel K, Ciric R, Hakonarson H, Gur RE, Gur RC. 2017. Age-related effects and sex differences in gray matter density, volume, mass, and cortical thickness from childhood to young adulthood. J Neurosci. 37:5065–5073.
  45. Gobinath AR, Choleris E, Galea LA. 2017. Sex, hormones, and genotype interact to influence psychiatric disease, treatment, and behavioral research. J Neurosci Res. 95:50–64.
  46. Gregory MD, Kippenhan JS, Dickinson D, Carrasco J, Mattay VS, Weinberger DR, Berman KF. 2016. Regional variations in brain gyrification are associated with general cognitive ability in humans. Curr Biol. 26:1301–1305.
  47. Gur RE, Gur RC. 2016. Sex differences in brain and behavior in adolescence: findings from the Philadelphia Neurodevelopmental Cohort. Neurosci Biobehav Rev. 70:159–170.
  48. Gur RC, Gur RE. 2017. Complementarity of sex differences in brain and behavior: from laterality to multimodal neuroimaging. J Neurosci Res. 95:189–199.
  49. Gur RC, Richard J, Calkins ME, Chiavacci R, Hansen JA, Bilker WB, Loughead J, Connolly JJ, Qiu H, Mentch FD, et al. 2012. Age group and sex differences in performance on a computerized neurocognitive battery in children age 8-21. Neuropsychology. 26:251–265.
  50. Hagenaars SP, Harris SE, Davies G, Hill WD, Liewald DC, Ritchie SJ, Marioni RE, Fawns-Ritchie C, Cullen B, Malik R, et al. 2016. Shared genetic aetiology between cognitive functions and physical and mental health in UK Biobank (N = 112 151) and 24 GWAS consortia. Mol Psychiatry. 21:1624–1632.
  51. Hayes AF, Scharkow M. 2013. The relative trustworthiness of inferential tests of the indirect effect in statistical mediation analysis: does method really matter? Psychol Sci. 24:1918–1927.
  52. Hyde JS. 2014. Gender similarities and differences. Ann Rev Psychol. 65:373–398.
  53. Iliescu D, Ilie A, Ispas D, Dobrean A, Clinciu AI. 2016. Sex differences in intelligence: a multi-measure approach using nationally representative samples from Romania. Intelligence. 58:54–61.
  54. Ingalhalikar M, Smith A, Parker D, Satterthwaite TD, Elliott MA, Ruparel K, Hakonarson H, Gur RE, Gur RC, Verma R. 2014. Sex differences in the structural connectome of the human brain. Proc Natl Acad Sci USA. 111:823–828.
  55. Isamah N, Faison W, Payne ME, MacFall J, Steffens DC, Beyer JL, Krishnan KR, Taylor WD. 2010. Variability in frontotemporal brain structure: the importance of recruitment of African Americans in neuroscience research. PLoS ONE. 5:e13642.
  56. Jacquemont S, Coe BP, Hersch M, Duyzend MH, Krumm N, Bergmann S, Beckmann JS, Rosenfeld JA, Eichler EE. 2014. A higher mutational burden in females supports a “female protective model” in neurodevelopmental disorders. Am J Hum Genet. 94:415–425.
  57. Jbabdi S, Sotiropoulos SN, Savio AM, Graña M, Behrens TEJ. 2012. Model-based analysis of multishell diffusion MR data for tractography: how to get over fitting problems. Magn Reson Med. 68:1846–1855.
  58. Jenkinson M, Bannister P, Brady J, Smith S. 2002. Improved optimisation for the robust and accurate linear registration and motion correction of brain images. Neuroimage. 17:825–841.
  59. Joel D, Berman Z, Tavor I, Wexler N, Gaber O, Stein Y, Shefi N, Pool J, Urchs S, Margulies DS, et al. 2015. Sex beyond the genitalia: The human brain mosaic. Proc Natl Acad Sci USA. 112:15468–15473.
  60. Johnson W, Carothers A, Deary IJ. 2008. Sex differences in variability in general intelligence: a new look at the old question. Perspect Psychol Sci. 3:518–531.
  61. Johnson W, Carothers A, Deary IJ. 2009. A role for the X chromosome in sex differences in variability in general intelligence. Perspect Psychol Sci. 4:598–611.
  62. Jung RE, Haier RJ. 2007. The Parieto-Frontal Integration Theory (P-FIT) of intelligence: converging neuroimaging evidence. Behav Brain Sci. 30:135–154.
  63. Jäncke L, Mérillat S, Liem F, Hänggi J. 2015. Brain size, sex, and the aging brain. Hum Brain Mapp. 36:150–169.
  64. Kanaan RA, Allin M, Picchioni M, Barker GJ, Daly E, Shergill SS, Woolley J, McGuire PK. 2012. Gender differences in white matter microstructure. PLoS ONE. 7:e38272.
  65. Karama S, Bastin ME, Murray C, Royle NA, Penke L, Muñoz Maniega S, Gow AJ, Corley J, Valdés Hernàndez MDC, Rousseau MÉ, et al. 2014. Childhood cognitive ability accounts for associations between cognitive ability and brain cortical thickness in old age. Mol Psychiatry. 19:555–559.
  66. Karp N, Mason J, Beaudet AL, Benjamini Y, Bower L, Braun RE, Brown SD, Chesler EJ, Dickinson ME, Flenniken AM, et al. 2017. Prevalence of sexual dimorphism in mammalian phenotypic traits. Nat Commun. 8:15475.
  67. Kennedy DP, Adolphs R. 2012. The social brain in psychiatric and neurological disorders. Trends Cogn Sci. 16:559–572.
  68. Knickmeyer RC, Wang J, Zhu H, Geng X, Woolson S, Hamer RM, Konneker T, Styner M, Gilmore JH. 2014. Impact of sex and gonadal steroids on neonatal brain structure. Cereb Cortex. 24:2721–2731.
  69. Kotov R, Gamez W, Schmidt F, Watson D. 2010. Linking “big” personality traits to anxiety, depressive, and substance use disorders: a meta-analysis. Psychol Bull. 136:768–821.
  70. Lakin JM. 2013. Sex differences in reasoning abilities: surprising evidence that male–female ratios in the tails of the quantitative reasoning distribution have increased. Intelligence. 41:263–274.
  71. Lange N, Giedd JN, Castellanos FX, Vaituzis AC, Rapoport JL. 1997. Variability of human brain structure size: ages 4–20 years. Psychiatr Res. 74:1–12.
  72. Lehre AC, Hansen A, Laake P. 2009b. Gender and the 2003 Quality Reform in higher education in Norway. Higher Ed. 58:585–597.
  73. Lehre AC, Lehre KP, Laake P, Danbolt NC. 2009a. Greater intrasex phenotype variability in males than in females is a fundamental aspect of the gender differences in humans. Dev Psychobiol. 51:198–206.
  74. Lombardo MV, Ashwin E, Auyeung B, Chakrabarti B, Taylor K, Hackett G, Bullmore ET, Baron-Cohen S. 2012. Fetal testosterone influences sexually dimorphic gray matter in the human brain. J Neurosci. 32:674–680.
  75. Luders E, Narr KL, Thompson PM, Rex DE, Woods RP, DeLuca H, Jancke L, Toga AW. 2006. Gender effects on cortical thickness and the influence of scaling. Hum Brain Mapp. 27:314–324.
  76. Lv B, Li J, He H, Li M, Zhao M, Ai L, Yan F, Xian J, Wang Z. 2010. Gender consistency and difference in healthy adults revealed by cortical thickness. Neuroimage. 53:373–382.
  77. Lyall DM, Cullen B, Allerhand M, Smith DJ, Mackay D, Evans J, Anderson J, Fawns-Ritchie C, McIntosh AM, Deary IJ, et al. 2016. Cognitive test scores in UK Biobank: data reduction in 480,416 participants and longitudinal stability in 20,346 participants. PLoS ONE. 11:e0154222.
  78. Machin S, Pekkarinen T. 2008. Assessment. Global sex differences in test score variability. Science. 322:1331–1332.
  79. MacPherson SE, Della Sala S, Cox SR, Girardi A, Iveson MH. 2015. Handbook of frontal lobe assessment. Oxford, UK: Oxford University Press.
  80. Maeda Y, Yoon S. 2013. A meta-analysis on gender differences in mental rotation ability measured by the Purdue Spatial Visualization Tests: Visualization of rotations (PSVT:R). Ed Psychol Rev. 25:69–94.
  81. Margulies DS, Ghosh SS, Goulas A, Falkiewicz M, Huntenburg JM, Langs G, Bezgin G, Eickhoff SB, Castellanos FX, Petrides M, et al. 2016. Situating the default-mode network along a principal gradient of macroscale cortical organization. Proc Natl Acad Sci USA. 113:12574–12579. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Markham JA, Jurgens HA, Auger CJ, De Vries GJ, Arnold AP, Juraska JM. 2003. Sex differences in mouse cortical thickness are independent of the complement of sex chromosomes. Neuroscience. 116:71–75. [DOI] [PubMed] [Google Scholar]
  83. Mars RB, Neubert FX, Noonan MP, Sallet J, Toni I, Rushworth MF. 2012. On the relationship between the “default mode network” and the “social brain”. Front Hum Neurosci. 21:189. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Marwha D, Halari M, Eliot L. 2016. Meta-analysis reveals a lack of sexual dimorphism in human amygdala volume. Neuroimage. 147:282–294. [DOI] [PubMed] [Google Scholar]
  85. Mazure CM, Swendsen J. 2016. Sex differences in Alzheimer’s disease and other dementias. Lancet Neurol. 15:451–452. [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. McCarthy MM, Arnold AP. 2011. Reframing sexual differentiation of the brain. Nat Neurosci. 14:677–683. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. McEwen BS, Milner TA. 2017. Understanding the broad influence of sex hormones and sex differences in the brain. J Neurosci Res. 95:24–39. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Miller KL, Alfaro-Almagro F, Bangerter NK, Thomas DL, Yacoub E, Xu J, Bartsch AJ, Jbabdi S, Sotiropoulos SN, Andersson JL, et al. 2016. Multimodal population brain imaging in the UK Biobank prospective epidemiological study. Nat Neurosci. 19:1523–1536. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Morey RD, Rouder JN. 2015. BayesFactor: Computation of Bayes Factors for common designs. R package v0.9.12-2. URL: http://CRAN.R-project.org/package=BayesFactor (accessed 2 May 2018).
  90. Nord CL, Valton V, Wood J, Roiser JP. 2017. Power-up: a re-analysis of ‘power failure’ in neuroscience using mixture modelling. J Neurosci. 37:8051–8061. 10.1523/JNEUROSCI.3592-16.2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Ochsner KN, Gross JJ. 2005. The cognitive control of emotion. Trends Cogn Sci. 9:242–249.
  92. Olds T, Tomkinson G, Léger L, Cazorla G. 2006. Worldwide variation in the performance of children and adolescents: an analysis of 109 studies of the 20-m shuttle run test in 37 countries. J Sports Sci. 24:1025–1038.
  93. Patenaude B, Smith SM, Kennedy DN, Jenkinson M. 2011. A Bayesian model of shape and appearance for subcortical brain segmentation. Neuroimage. 56:907–922.
  94. Penke L, Maniega SM, Bastin ME, Hernández MV, Murray C, Royle NA, Starr JM, Wardlaw JM, Deary IJ. 2012. Brain white matter tract integrity as a neural foundation for general intelligence. Mol Psychiatr. 17:1026–1030.
  95. Pietschnig J, Penke L, Wicherts JM, Zeiler M, Voracek M. 2015. Meta-analysis of associations between human brain volume and intelligence differences: How strong are they and what do they mean? Neurosci Biobehav Rev. 57:411–432.
  96. Potter GM, Doubal FN, Jackson CA, Chappell FM, Sudlow CL, Dennis MS, Wardlaw JM. 2015. Enlarged perivascular spaces and cerebral small vessel disease. Int J Stroke. 10:376–381.
  97. Reid AT, van Norden AG, de Laat KF, van Oudheusden LJ, Zwiers MP, Evans AC, de Leeuw FE, Kötter R. 2010. Patterns of cortical degeneration in an elderly cohort with cerebral small vessel disease. Hum Brain Mapp. 31:1983–1992.
  98. Reinhold K, Engqvist L. 2013. The variability is in the sex chromosomes. Evolution. 67:3662–3668.
  99. Revelle W. 2016. psych: Procedures for personality and psychological research. R package v1.6.4. URL: http://CRAN.R-project.org/package=psych (accessed 2 May 2018).
  100. Rippon G, Jordan-Young R, Kaiser A, Fine C. 2014. Recommendations for sex/gender neuroimaging research: key principles and implications for research design, analysis, and interpretation. Front Hum Neurosci. 8:650.
  101. Robinson EB, Lichtenstein P, Anckarsäter H, Happé F, Ronald A. 2013. Examining and interpreting the female protective effect against autistic behavior. Proc Natl Acad Sci USA. 110:5258–5262.
  102. Rousselet GA, Pernet CR, Wilcox RR. 2017. Beyond differences in means: robust graphical methods to compare two groups in neuroscience. Eur J Neurosci. 46:1738–1748. doi:10.1111/ejn.13610.
  103. Ruigrok AN, Salimi-Khorshidi G, Lai MC, Baron-Cohen S, Lombardo MV, Tait RJ, Suckling J. 2014. A meta-analysis of sex differences in human brain structure. Neurosci Biobehav Rev. 39:34–50.
  104. Rutter M, Caspi A, Moffitt TE. 2003. Using sex differences in psychopathology to study causal mechanisms: unifying issues and research strategies. J Child Psychol Psychiatry. 44:1092–1115.
  105. Salat DH, Buckner RL, Snyder AZ, Greve DN, Desikan RS, Busa E, Morris JC, Dale AM, Fischl B. 2004. Thinning of the cerebral cortex in aging. Cereb Cortex. 14:721–730.
  106. Scheinost D, Finn ES, Tokoglu F, Shen X, Papademetris X, Hampson M, Constable RT. 2015. Sex differences in normal age trajectories of functional brain networks. Hum Brain Mapp. 36:1524–1535.
  107. Schmitt DP, Realo A, Voracek M, Allik J. 2008. Why can’t a man be more like a woman? Sex differences in Big Five personality traits across 55 cultures. J Pers Soc Psychol. 94:168–182.
  108. Schnack HG, Van Haren NE, Brouwer RM, Evans A, Durston S, Boomsma DI, Kahn RS, Hulshoff Pol HE. 2014. Changes in thickness and surface area of the human cortex and their relationship with intelligence. Cereb Cortex. 25:1608–1617.
  109. Smith SM, Nichols TE, Vidaurre D, Winkler AM, Behrens TE, Glasser MF, Ugurbil K, Barch DM, Van Essen DC, Miller KL. 2015. A positive-negative mode of population covariation links brain connectivity, demographics and behavior. Nat Neurosci. 18:1565–1567.
  110. Sowell ER, Peterson BS, Kan E, Woods RP, Yoshii J, Bansal R, Xu D, Zhu H, Thompson PM, Toga AW. 2007. Sex differences in cortical thickness mapped in 176 healthy individuals between 7 and 87 years of age. Cereb Cortex. 17:1550–1560.
  111. Su R, Rounds J, Armstrong PI. 2009. Men and things, women and people: a meta-analysis of sex differences in interests. Psychol Bull. 135:859–884.
  112. Sundgren PC, Dong Q, Gomez-Hassan D, Mukherji SK, Maly P, Welsh R. 2004. Diffusion tensor imaging of the brain: review of clinical applications. Neuroradiology. 46:339–350.
  113. Ségonne F, Pacheco J, Fischl B. 2007. Geometrically accurate topology-correction of cortical surfaces using nonseparating loops. IEEE Trans Med Imaging. 26:518–529.
  114. Tan A, Ma W, Vira A, Marwha D, Eliot L. 2016. The human hippocampus is not sexually-dimorphic: meta-analysis of structural MRI volumes. Neuroimage. 124:350–366.
  115. Tang Y, Hojatkashani C, Dinov ID, Sun B, Fan L, Lin X, Qi H, Hua X, Liu S, Toga AW. 2010. The construction of a Chinese MRI brain atlas: a morphometric comparison study between Chinese and Caucasian cohorts. Neuroimage. 51:33–41.
  116. van den Heuvel MP, Hulshoff Pol HE. 2010. Exploring the brain network: a review on resting-state fMRI functional connectivity. Eur Neuropsychopharmacol. 20:519–534.
  117. van Velsen EF, Vernooij MW, Vrooman HA, van der Lugt A, Breteler MM, Hofman A, Niessen WJ, Ikram MA. 2013. Brain cortical thickness in the general elderly population: the Rotterdam Scan Study. Neurosci Lett. 550:189–194.
  118. Viña J, Lloret A. 2010. Why women have more Alzheimer’s disease than men: gender and mitochondrial toxicity of amyloid-β peptide. J Alzheimers Dis. 20:527–533.
  119. Wager TD, Davidson ML, Hughes BL, Lindquist MA, Ochsner KN. 2008. Prefrontal-subcortical pathways mediating successful emotion regulation. Neuron. 59:1037–1050.
  120. Wardlaw JM, Hernández MCV, Muñoz-Maniega S. 2015. What are white matter hyperintensities made of? Relevance to vascular cognitive impairment. J Am Heart Assoc. 4:e001140.
  121. Wierenga LM, Sexton JA, Laake P, Giedd JN, Tamnes CK. 2017. A key characteristic of sex differences in the developing brain: greater variability in brain structure of boys than girls. Cereb Cortex. 28:2741–2751.
  122. Zell E, Krizan Z, Teeter SR. 2015. Evaluating gender similarities and differences using metasynthesis. Am Psychol. 70:10–20.
  123. Zhang Y, Brady M, Smith SM. 2001. Segmentation of brain MR images through a hidden Markov random field model and the expectation maximization algorithm. IEEE Trans Med Imaging. 20:45–57.
  124. Zhang T, Casanova R, Resnick SM, Manson JE, Baker LD, Padula CB, Kuller LH, Bryan RN, Espeland MA, Davatzikos C. 2016. Effects of hormone therapy on brain volumes changes of postmenopausal women revealed by optimally-discriminative voxel-based morphometry. PLoS ONE. 11:e0150834.
  125. Zhang H, Schneider T, Wheeler-Kingshott CA, Alexander DC. 2012. NODDI: practical in vivo neurite orientation dispersion and density imaging of the human brain. Neuroimage. 61:1000–1016.
