No great mind ever existed without a touch of madness

Aristotle

Pain and suffering are always inevitable for a large intelligence and a deep heart. The really great men must, I think, have great sadness on earth

Fyodor Dostoevsky

BACKGROUND

A positive relationship between intelligence and mental health is well established by a number of high-quality studies in the field of cognitive epidemiology (e.g., Aichele et al., 2018; Gale et al., 2010; Keyes et al., 2017; Wraw et al., 2015). Intelligence is a strong predictor of educational and financial achievement (Gottfredson, 1997; Kuncel et al., 2004; Strenze, 2007), which can provide resources beneficial to managing psychological distress (Ali et al., 2013). Intelligent individuals tend to use more adaptive coping strategies and are more likely to have flexible schedules, allowing for more opportunities to engage in pleasant activities (Jokela, 2022; Ujma & Scherrer, 2021). Despite that, a stereotype which characterizes individuals with extremely high levels of intelligence (also referred to as gifted) as socially maladjusted, emotionally unstable, and eccentric is prevalent amongst the general population or teachers (Baudson, 2016; Baudson & Preckel, 2013; Solano, 1987). Several figures conforming to this stereotype can be found in pop culture (Cox, 2000) as well as among eminent real-life individuals considered “geniuses” (Simonton, 2014).

A theoretical framework congruent with the notion that high intelligence can be detrimental to one’s mental health is the idea of the too-much-of-a-good-thing effect (Grant & Schwartz, 2011; Pierce & Aguinis, 2013). The authors of the theory draw on the works of Aristotle and the idea of the golden mean, suggesting that an excess of virtue is as dangerous as a dearth of it (Aristotle, trans. 1999). They suggest that any positive phenomenon taken to extreme values eventually reaches an inflection point at which its effect becomes negative. Another theory that could explain the “mad genius” stereotype is the positive disintegration theory (Dąbrowski, 1979). It posits that high intelligence is positively related to psychological overexcitability, a heightened and more intense way of experiencing and responding to one’s environment. This type of overstimulation can potentially have a negative impact on mental health (Karpinski et al., 2018). Moreover, functioning distinctively from most of one’s peers on a basic cognitive level can be detrimental to an individual’s sense of relatedness (Falck, 2020; Flakus et al., 2021). The need for relatedness is, alongside the need for autonomy and the need for competence, one of the basic psychological needs according to the self-determination theory (Ryan & Deci, 2000, 2017). The fulfillment of these needs is essential for one’s mental health (Stanley et al., 2021; Tang et al., 2019).

Although this topic has existed in the minds of researchers for decades, the data published so far have been highly inconsistent and contradictory. A meta-analysis by Martin et al. (2010) showed no differences between gifted children and non-gifted children in terms of depression and suicidal ideation, and that gifted children showed lower levels of anxiety. Congruently, another meta-analysis reported no differences in happiness between the two groups (Zeidner, 2021). A systematic review on the subject concluded that intellectually gifted individuals had decreased levels of psychopathy operationalized as internalizing and externalizing problems (Francis et al., 2015). However, all three of those papers concerned a relatively small number of studies that met the criteria set by their authors, and in all three of them, the authors recognized that most studies included were problematic in terms of small and unrepresentative samples.

On the other hand, studies on Mensa members show that compared to the general population, they had an increased risk of affective disorders, anxiety disorders, attention deficit hyperactivity disorder, autism spectrum disorder, and physiological disease (Karpinski et al., 2018), as well as lower levels of meaningfulness, subjective well-being, and self-compassion (Vötter & Schnell, 2019). It is important to note, however, that Mensa members are also not considered representative of the entire population of the intellectually gifted (Haugen, 2018). A cohort study of over 1 million Swedish men showed that the relationship between intelligence and bipolar disorder was curvilinear U-shaped for the g factor and specific subtests, except for the logical ability scale (Gale et al., 2012). Another cohort study found a U-shaped relationship between intelligence and psychotic symptoms among 12-year-olds (Horwood et al., 2008). Brown et al. (2021) also reported that for the relationship between general intelligence and depression, as well as subjective well-being, a quadratic model explained additional variance over a linear one and that the correlation between the two decreased above a certain threshold. Autism spectrum disorder is more common at the extreme values of intelligence (Billeiter & Froiland, 2023) and has a complex genetic overlap with intelligence (see Crespi, 2016). Moreover, qualitative data show intellectually gifted individuals expressing feeling isolated due to having higher intelligence than their peers, overstimulated, and struggling with their identity (Jackson, 1998; Jackson & Peterson, 2003; Jacobsen, 1999). Some gifted go as far as to refer to their high intelligence as a disability “because you then can’t understand the norm” (Falck, 2020, p. 79).

When referring to research on the intellectually gifted, it is imperative to consider that most studies on this subject suffer from methodological limitations partially caused by the difficulty in accessing the target population. The aforementioned unrepresentative samples (such as students from special programs or Mensa members) and low sample sizes, which in turn cause low statistical power of the statistical methods used, could be resolved by making use of datasets from cohort studies (Ertl et al., 2020; Lubinski & Humphreys, 1997; Martin et al., 2010). Investigating the top 2% in IQ scores (the most common cut-off point for extremely high intelligence, used, for example, by Mensa) of the large samples usually obtained in such studies should supply a representative sample that will also be sizable enough to provide satisfactory statistical power. An overwhelming majority of the research focuses on children (Rinn & Bishop, 2015), even though having high intelligence is not a phenomenon exclusive to childhood.

Another crucial drawback of the existing research is the overreliance on comparing gifted samples to nongifted with mean comparisons. If mental health generally increases with intelligence up to a certain inflection point, then even if the relationship turns from positive to negative at that point and mental health will start to drop down, it will still start from a point that is higher than the overall mean (see Figure 1). All the positive consequences of having higher intelligence do not suddenly disappear at a certain threshold. It is just that at high enough intelligence one might start experiencing unique challenges that begin to outweigh the beneficial effects associated with intelligence. As such, comparing means cannot reliably detect a too-much-of-a-good-thing effect if the inflection point is substantially far from the mean.

Figure 1

Visualization of how a comparison of means can fail to detect a too-much-of-a-good-thing effect

https://hpr.termedia.pl/f/fulltexts/187337/HPR-13-187337-g001_min.jpg

Nonlinear relationships in psychological research are most often tested with polynomial quadratic regressions. This method, however, has recently found itself under some scrutiny (see Simonsohn, 2018). It imposes a parabolic shape, tends to overfit, and can be problematic when the inflection point is expected to be at extremely high or low values of the predictor variable (Harrell, 2015; Magee, 1998). An alternative approach might be spline regression (Durrleman & Simon, 1989; Marsh & Cormier, 2001; Poirier, 1973), which divides the dataset into intervals, creates a separate regression for each interval, and then smooths out the breakpoints into inflection points (called knots in this technique). This method has an additional advantage in that it can test a specified inflection point. It is more congruent with a confirmatory approach to science when there are theoretical premises about where such an inflection point might be. For this reason, both quadratic regression and spline regression were used in this study.

The aim of the present study was to investigate whether the relationship between intelligence and indicators of mental health is nonlinear in such a way that at extreme values of intelligence, it reaches an inflection point (see Figure 1). For this purpose, data from the 1970 British Cohort Study (BCS70) were used. The cut-off point for intellectual giftedness was set to the 98th percentile of intelligence scores; however, there are also definitions of giftedness that propose a 90th percentile cut-off point (Obergriesser & Stoeger, 2015; Reis et al., 2004). Hence, this will also be tested as an inflection point as auxiliary analyses. Moreover, it has been argued that investigating specific cognitive abilities alongside a general factor of intelligence (g) can provide significant additional insights for intelligence research overall (Coyle & Greiff, 2021), as well as specifically for research on giftedness (Wai et al., 2022). For this reason, alongside the findings from Gale et al. (2012), the nonlinear relationship with mental health was also tested for each intelligence subtest. However, these analyses were purely exploratory in nature. Although Brown et al. (2021) have already analyzed the data from BCS70 regarding curvilinear relationships of intelligence, they omitted several waves where mental health was measured and decided to use math ability and reading ability tests as a measure of intelligence. Those can be interpreted more as measures of educational progress than strictly intelligence.

PARTICIPANTS AND PROCEDURE

PARTICIPANTS

Data from BCS70 were used. BCS70 is a study conducted by the Centre for Longitudinal Studies (CLS) that follows the lives of 17 196 individuals born in the UK in a single week of 1970 (for a cohort profile, see Elliott & Shepherd, 2006). The data from BCS70 are freely available at https://ukdataservice.ac.uk/. Intelligence was measured at age 10, and mental health was measured in subsequent waves at ages 16, 26, 30, 34, 42, 46, and 50. A total of 11,212 individuals who completed all subscales of an intelligence measure were included in the analyses. The numbers of participants with complete data on both intelligence and each of the mental health measures at each successive wave are presented in Table 1.

Table 1

Mean scores and standard deviations (SD), numbers of observations with complete data for mental health measures and correlation coefficients with intelligence

VariableMean (SD)NGeneral intelligence (age 10)Word definitions (age 10)Recall of digits (age 10)Word similarities (age 10)Matrices (age 10)
Malaise Inventory9.19 (5.53)3968–.12**–.11**–.02−.08**–.06**
(age 16)
GHQ-12 (age 16)21.73 (5.53)3609.03.02.01.03.03*
Malaise Inventory3.85 (3.38)6403–.15**–.12**–.06**–.13**–.13**
(age 26)
Malaise Inventory3.50 (3.47)7984–.12**−.09**–.06**–.11**–.12**
(age 30)
GHQ-12 (age 30)22.61 (4.52)7980–.01.00.00–.01–.01
Malaise Inventory1.66 (1.89)6924–.12**–.09**–.05**–.10**–.11**
(age 34)
Malaise Inventory1.86 (1.98)6147–.11**–.08**–.04**–.09**–.09**
(age 42)
WEMWBS49.17 (8.26)5794.15**.13**.07**–.12**–.09**
(age 42)
Malaise Inventory1.78 (2.12)5623–.11**–.08**–.04**–.09**–.09**
(age 46)
WEMWBS50.09 (8.55)5579.10**.09**.00.07**.08**
(age 46)
SF-36 Role83.75 (31.89)5726–.04**.02.00–.04**.05**
limitations due
to emotional
problems (age 46)
SF-36 Vitality56.76 (22.14)5736.05**.04*.00.03*.04**
(age 46)
SF-36 Emotional74.14 (19.32)5736.09**.07**.03*.07**.09**
well-being
(age 46)
SF-36 Social85.48 (24.04)5738.13**.10**.03*.10**.10**
functioning
(age 46)
Malaise Inventory1.65 (1.95)2864–.08**–.05**–.02–.07**–.07**
(age 50)
GAD-2 (age 50)2.91 (1.36)2870–.05**–.03–.03–.03–.04*
PHQ-2 (age 50)2.98 (1.42)2874–.04*–.02–.03–.03–.02

[i] Note. GHQ-12 – General Health Questionnaire-12; WEMWBS – Warwick-Edinburgh Mental Wellbeing Scale; SF – Short Form Health Survey; GAD-2 – Generalized Anxiety Disorder scale-2; PHQ-2 – Patient Health Questionnaire-2; *p < .05, **p < .01.

INSTRUMENTS

The British Ability Scales. Four subtests from the British Ability Scales (BAS; Elliot et al., 1978) were used to measure intelligence at age 10. The four subscales were: word definitions (37 items), recall of digits (34 items), word similarities (21 items), and matrices (28 items). A 2PL bifactor item response theory model (Gibbons & Hedeker, 1992) was used to obtain latent scores for the g factor with the maximum a-posteriori (MAP) method. For the subscales raw z-transformed scores were used. The Cronbach’s α reliability coefficients in a previous study were .90 for word definitions, .91 for recall of digits, .86 for word similarities, and .75 for matrices (Sturgis et al., 2010).

The Malaise Inventory (Rutter et al., 1970) was used as a measure of general mental health at ages 16, 26, 30, 34, 42, 46, and 50 (e.g., item “Do you often get worried about things?”, “Do you often feel miserable or depressed?”). Higher scores indicate worse mental health. A 22-item version was used at age 16, a 24-item version was used at ages 26 and 30, and a 9-item version was used at ages 34, 42, 46 and 50. The scale utilizes a dichotomous (yes/no) response format, with the exception of the 22-item version, which utilized a 3-point Likert-type response format, ranging from 0 (rarely or never) to 2 (most of the time). It has shown scalar invariance across gender and successive waves (Ploubidis et al., 2019). The KR20 reliability coefficient reported in a previous study was .84 (Rodgers et al., 1999).

The General Health Questionnaire-12 (GHQ-12; Goldberg, 1972) was used as a measure of mental health at ages 16 and 30. The instrument consists of 12 items (e.g., “felt constantly under strain”) with a 4-point Likert-type response format, ranging from 1 (not at all) to 4 (much more than usual). The scale is coded in such a way that a higher score indicates worse mental health. The test-retest reliability reported in a previous study was .82 (Piccinelli et al., 1993).

The Warwick-Edinburgh Mental Wellbeing Scale (WEMWBS; Tennant et al., 2007) was used as a measure of mental well-being at ages 42 and 46. The instrument consists of 14 items (e.g., “I’ve been dealing with problems well”) with a 5-point Likert-type response format, ranging from 1 (none of the time) to 5 (all of the time). The test-retest reliability reported in a previous study was .83 (Tennant et al., 2007).

The Short Form Health Survey. Four subscales from the Short Form Health Survey (SF-36; Ware & Sherbourne, 1992) were used as a measure of mental health at age 46. Three items measured role limitations due to emotional problems (e.g., “Accomplished less than you would like”), 4 items measured vitality (e.g., “Did you have a lot of energy?”), 5 items measured emotional well-being (e.g., “Have you felt calm and peaceful?”), and 2 items measured social functioning (“During the past 4 weeks, to what extent has your physical health or emotional problems interfered with your normal social activities with family, friends, neighbors, or groups?”/”During the past 4 weeks, how much of the time has your physical health or emotional problems interfered with your social activities (like visiting with friends, relatives, etc.)?”). Items for role limitations due to emotional problems utilize a dichotomous (yes/no) response format. Items for vitality and emotional well-being utilize a 6-point Likert-type response format, ranging from 1 (all of the time) to 6 (none of the time). Items for social functioning utilize a 5-point Likert-type response format, ranging from 1 (not at all/all of the time) to 5 (extremely/none of the time). The test-retest reliabilities reported in a previous study were .76 for role limitations due to emotional problems, .84 for vitality, .81 for emotional well-being, and .80 for social functioning (Ruta et al., 1994).

Generalized Anxiety Disorder scale-2 (GAD-2; Kroenke et al., 2007) was used as a measure of generalized anxiety disorder at age 50. The instrument consists of 2 items: “Feeling nervous, anxious or on edge” and “Not being able to stop or control worrying” with a 4-point Likert-type response format, ranging from 1 (not at all) to 4 (nearly every day). The test-retest reliability reported in a previous study was .81 (Staples et al., 2019).

Patient Health Questionnaire-2 (PHQ-2; Löwe et al., 2005) was used as a measure of depression at age 50. The instrument consists of 2 items: “Feeling down, depressed, or hopeless” and “Little interest or pleasure in doing things” with a 4-point Likert-type response format, ranging from 1 (not at all) to 4 (nearly every day). The test-retest reliability reported in a previous study was .79 (Staples et al., 2019).

STATISTICAL ANALYSES

Means, standard deviations, and correlation coefficients were calculated. Eighty-five quadratic regression models and 170 spline regression models were conducted, where either the g factor or one of the BAS subscales was the independent variable, and the mental health measures at different BCS70 waves were the dependent variables. The inflection points for the independent variables in spine regressions were set at either +2 SD (98th percentile) or +1.28 SD (90th percentile). Natural splines were used due to the placement of the inflection points near the boundaries of the data (Perperoglou et al., 2019). The nonlinear models were then compared to the corresponding linear models using lack-of-fit tests. Analyses were conducted in R 4.2.0 (R Core Team, 2022) with the splines2 and mirt packages and visualized with the ggplot2 package. All tests were two-tailed, and the significance level was set to α = .05. The code for the analyses is available through Open Science Framework at the following link: https://osf.io/qxz27/?view_only=b72159cfe93d4b2c895d22e4ca9f6e57.

RESULTS

The supplementary material is available at https://osf.io/qxz27/?view_only=b72159cfe93d4b2c895d22e4ca9f6e57. Table 1 presents means, standard deviations, and correlation coefficients of intelligence with each mental health measurement at each wave of BCS70, as well as numbers of participants with complete data for each analysis.

Table 2, Figure 2, and Figure 3 present the results of quadratic regression and spline regression analyses where the general factor of intelligence was the independent variable. The regression coefficients for quadratic models with calculated inflection points for those models are available in the supplemental material. The nonlinear models fit the data better than the corresponding linear ones for 13 out of 17 model comparisons. The differences in explained variance between the different nonlinear models were mostly marginal. However, the only nonlinear model that had a better data fit for the vitality subscale of SF-36 was the spline model with the inflection point set at +1.28 SD and for PHQ-2 it was the only one that did not have a better fit.

Table 2

Variance explained by linear, quadratic and spline models and results of lack-of-fit tests comparing the nonlinear models to analogous linear models, for the analyses using the g factor

VariableGeneral intelligence (age 10)
LinearQuadratic Spline (+2 SD inflection point)Spline (+1.28 SD inflection point)
R2R2FR2FR2F
Malaise Inventory (age 16).01.010.90.010.07.010.00
GHQ-12 (age 16).00.000.00.000.05.000.08
Malaise Inventory (age 26).02.0210.86**.029.95**.028.28**
Malaise Inventory (age 30).01.017.50**.017.73**.016.23*
GHQ-12 (age 30).00.001.63.001.46.001.60
Malaise Inventory (age 34).01.0212.61**.0211.31**.028.84**
Malaise Inventory (age 42).01.0116.45**.0115.06**.0114.68**
WEMWBS (age 42).02.029.27**.025.98*.026.31*
Malaise Inventory (age 46).01.0116.81**.0116.02**.0115.74**
WEMWBS (age 46).01.011.53.010.02.010.75
SF-36 Role limitations due to.00.0013.55**.0013.48**.0013.31**
emotional problems (age 46)
SF-36 Vitality (age 46).00.002.30.002.83.004.64*
SF-36 Emotional well-being (age 46).01.019.71**.018.86**.018.73**
SF-36 Social functioning (age 46).02.0212.39**.0212.77**.0213.68**
Malaise Inventory (age 50).01.015.19*.014.51*.014.96*
GAD-2 (age 50).00.0111.02**.019.90**.0111.53**
PHQ-2 (age 50).00.004.37*.004.40*.003.82

[i] Note. GHQ-12 – General Health Questionnaire-12; WEMWBS – Warwick-Edinburgh Mental Wellbeing Scale; SF – Short Form Health Survey; GAD-2 – Generalized Anxiety Disorder scale-2; PHQ-2 – Patient Health Questionnaire-2; *p < .05, **p < .01.

Figure 2

Plots for the analyses conducted using the g factor and the mental health measurements using the Malaise Inventory (red line – linear model, blue line – quadratic model, green line – spline model)

Note. Only the spline models with the inflection points set at +2 SD are presented for image clarity. The plots with the inflection points set at +1.28 SD are included in the supplementary materials.

https://hpr.termedia.pl/f/fulltexts/187337/HPR-13-187337-g002_min.jpg
Figure 3

Plots for the analyses conducted using the g factor and the rest of the mental health measurements (red line – linear model, blue line – quadratic model, green line – spline model)

Note. Only the spline models with the inflection points set at +2 SD are presented for image clarity. The plots with the inflection points set at +1.28 SD are included in the supplementary materials.

https://hpr.termedia.pl/f/fulltexts/187337/HPR-13-187337-g003_min.jpg

Table 3 presents the results of quadratic regression and spline regression analyses where the subscales of BAS were the independent variables. The figures for these analyses and regression coefficients for the quadratic models with calculated inflection points for those models are available in the supplemental material. For word definitions the nonlinear models fit the data better in 12 out of 17 model comparisons for all nonlinear models. For recall of digits the nonlinear models fit the data better in 4 out of 17 model comparisons for quadratic models, 3 out of 17 for spline models with a +2 SD inflection point and 2 out of 17 for spline models with a +1.28 SD inflection point. For word similarities the nonlinear models fit the data better in 7 out of 17 model comparisons for quadratic models and 5 out of 17 for spline models with a +2 SD inflection point and 4 out of 17 for spline models with a +1.28 SD inflection point. For matrices the nonlinear models fit the data better in 3 out of 17 model comparisons for all nonlinear models.

Table 3

Variance explained by linear, quadratic and spline models, and results of lack-of-fit tests comparing the nonlinear models to analogous linear models, for the analyses using the subscales of BAS

VariableLinearQuadraticSpline (+2 SD inflection point)Spline (+1.28 SD inflection point)
R2R2FR2FR2F
Word definitions (age 10)
Malaise Inventory (age 16).01.014.27*.015.22*.015.54*
GHQ-12 (age 16).00.000.23.000.02.000.02
Malaise Inventory (age 26).01.0210.04**.0210.65**.0210.29**
Malaise Inventory (age 30).01.019.37**.0110.56**.0110.81**
GHQ-12 (age 30).00.000.95.000.88.000.94
Malaise Inventory (age 34).01.019.06**.018.80**.018.47**
Malaise Inventory (age 42).01.0119.23**.0120.25**.0120.84**
WEMWBS (age 42).02.0218.22**.0217.18**.0216.05**
Malaise Inventory (age 46).01.0113.49**.0113.51**.0113.46**
WEMWBS (age 46).01.015.77*.014.78*.013.89*
SF-36 Role limitations due to.00.004.71*.004.49*.004.53*
emotional problems (age 46)
SF-36 Vitality (age 46).00.002.67.002.21.002.00
SF-36 Emotional well-being (age 46).01.0110.32**.019.49**.018.93**
SF-36 Social functioning (age 46).01.016.79**.015.75*.015.62*
Malaise Inventory (age 50).00.002.18.001.16.000.63
GAD-2 (age 50).00.007.26**.006.28*.005.50*
PHQ-2 (age 50).00.010.95.000.75.000.44
Recall of digits (age 10)
Malaise Inventory (age 16).00.000.02.000.80.000.40
GHQ-12 (age 16).00.004.04*.001.70.002.16
Malaise Inventory (age 26).00.004.35*.004.59*.002.35
Malaise Inventory (age 30).00.007.32**.007.78**.005.60*
GHQ-12 (age 30).00.001.01.001.37.001.07
Malaise Inventory (age 34).00.003.82.002.27.003.69
Malaise Inventory (age 42).00.001.44.003.06.001.61
WEMWBS (age 42).00.000.45.012.11.000.00
Malaise Inventory (age 46).00.006.12*.005.75*.006.16*
WEMWBS (age 46).00.001.63.001.09.000.70
SF-36 Role limitations due to.00.001.84.001.34.000.56
emotional problems (age 46)
SF-36 Vitality (age 46).00.000.81.000.58.000.70
SF-36 Emotional well-being (age 46).00.001.09.000.16.000.00
SF-36 Social functioning (age 46).00.001.87.000.20.000.00
Malaise Inventory (age 50).00.000.21.000.00.000.00
GAD-2 (age 50).00.000.39.000.23.000.06
PHQ-2 (age 50).00.000.49.000.41.000.43
Word similarities (age 10)
Malaise Inventory (age 16).01.010.83.010.04.000.00
GHQ-12 (age 16).00.000.54.000.34.000.39
Malaise Inventory (age 26).02.001.98.020.00.020.00
Malaise Inventory (age 30).01.012.53.010.94.010.00
GHQ-12 (age 30).00.000.37.000.36.000.30
Malaise Inventory (age 34).01.015.97*.011.86.010.00
Malaise Inventory (age 42).01.012.02.010.27.010.00
WEMWBS (age 42).02.023.77.022.37.022.75
Malaise Inventory (age 46).01.015.57*.012.84.012.43
WEMWBS (age 46).00.001.63.001.09.010.04
SF-36 Role limitations due to.00.008.49**.007.39**.006.41*
emotional problems (age 46)
SF-36 Vitality (age 46).00.001.85.002.48.002.89
SF-36 Emotional well-being (age 46).00.013.81.014.08*.013.36
SF-36 Social functioning (age 46).01.0110.84**.018.23**.016.98**
Malaise Inventory (age 50).00.016.79**.013.70.013.15
GAD-2 (age 50).00.007.08**.004.92*.004.22*
PHQ-2 (age 50).00.008.52**.008.24**.006.82**
Matrices (age 10)
Malaise Inventory (age 16).00.002.32.001.44.000.00
GHQ-12 (age 16).00.000.18.000.27.000.08
Malaise Inventory (age 26).02.020.53.020.00.020.00
Malaise Inventory (age 30).01.010.24.010.00.010.00
GHQ-12 (age 30).00.001.49.001.10.000.96
Malaise Inventory (age 34).01.010.04.010.00.010.00
Malaise Inventory (age 42).01.011.60.010.84.012.40
WEMWBS (age 42).01.010.40.010.00.010.00
Malaise Inventory (age 46).01.013.33.012.83.013.46
WEMWBS (age 46).01.010.31.010.43.011.06
SF-36 Role limitations due to.00.000.68.000.74.001.12
emotional problems (age 46)
SF-36 Vitality (age 46).00.007.17**.007.23**.007.17**
SF-36 Emotional well-being (age 46).01.011.35.010.91.010.25
SF-36 Social functioning (age 46).01.017.40**.018.31**.017.30**
Malaise Inventory (age 50).01.012.76.012.18.012.22
GAD-2 (age 50).00.000.68.000.17.000.21
PHQ-2 (age 50).00.005.16*.004.73*.004.50*

[i] Note. GHQ-12 – General Health Questionnaire-12; WEMWBS – Warwick-Edinburgh Mental Wellbeing Scale; SF – Short Form Health Survey; GAD-2 – Generalized Anxiety Disorder scale-2; PHQ-2 – Patient Health Questionnaire-2; *p < .05, **p < .01.

DISCUSSION

The aim of this study was to test the nonlinearity of the relationship between intelligence and mental health. Congruently with the expected patterns, for 13 out of 17 model comparisons, the nonlinear model fit the data better than the linear one. For the 4 cases where nonlinearity was not confirmed, several explanations can be proposed. The relationship was linear both times GHQ-12 was used, hinting that this might be an artifact caused by this particular measure of mental health. Moreover, for GHQ-12 the baseline correlations with intelligence were nonsignificant, which is an outlier result in cognitive epidemiology (Aichele et al., 2018; Gale et al., 2010; Jokela, 2022; Keyes et al., 2017; Wraw et al., 2015). For both measures used at age 16, all model comparisons were in favor of the linear model. It is possible that with age, the more intellectually gifted individuals experience more specific challenges due to their exceptionally high cognitive abilities.

The relationship between intelligence and general mental health does indeed seem to be nonlinear. At high intelligence values, the relationship reaches an inflection point, creating a too-much-of-a-good-thing effect. It suggests that intellectually gifted individuals might experience unique issues, which outweigh the beneficial effects associated with high intelligence. It is however important to note that the nonlinearity of this relationship does not mean that intellectually gifted individuals have worse mental health compared to the general population. More research should be conducted on potential mechanisms underlying this effect and the related phenomena such as underachievement (Reis & Mccoach, 2000; White et al., 2018). While there are some general theoretical frameworks potentially explaining these effects, more comprehensive and intelligence-dedicated models should be developed and investigated empirically.

Issues of unidimensional identity (Coakley, 1992) based solely on cognitive prowess coupled with social, emotional, or physical development deficits may likely explain some of the findings. Analogous processes are perhaps best documented in cases of burnout in sports. Young athletes experience identity foreclosure based on mastery in a particular discipline, resulting in a unidimensional identity and perceived loss of autonomy leading to psychological and physical crises (Goodger et al., 2007; Gustafsson et al., 2017). Congruently with this theory, being formally labeled as intellectually gifted has been shown to be related to decreased mental health (Freeman 2006; Lavrijsen & Verschueren, 2023). This could also explain why the results from studies on Mensa members show that they have worse mental health compared to the general population. Being a member of Mensa is likely related to one’s intelligence being important to one’s identity. A similar phenomenon related to intellectual giftedness may occur among students who are overly focused on academic achievement. In their cases, obsessive and compulsive learning, to the exclusion of other spheres of life, may lead to harm in terms of general health, mental health, and quality of life (Atroszko et al., 2015; Czerwiński et al., 2023).

Auxiliary analyses showed that this effect occurred less frequently when analyzing specific cognitive abilities instead of g. This can be seen as a confirmation that specific abilities lack predictive power compared to the general intelligence factor (Thorndike, 1984). However, there was an overall tendency for the effect to occur more often for the subtests measuring crystallized intelligence than those measuring fluid intelligence. This result is somewhat congruent to the one obtained in the study on the relationship between intelligence and bipolar disorder (Gale et al., 2012). It might be of particular importance to other studies using intelligence tests that use items based on matrices. The estimated inflection points for the quadratic models were generally lower than +2 SD, especially in the case of particular subscales. This may suggest that the cut-off point where intelligence stops being beneficial for mental health might be lower than expected. Further investigation on the topic is warranted. Despite their shortcomings, the quadratic models provided similar results to more complex spline models. Although the polynomial quadratic models in most cases explained more variance than the spline models, this was expected due to the inflection point being freely estimated in these models.

The results of this study also fit within a wider research framework of showing that personality traits and individual differences that are widely considered to be beneficial can have negative effects on one’s life under certain circumstances (Carter et al., 2015; Czerwiński et al., 2023; Smith et al., 2017). Future studies could also use datasets from large cohort studies and statistical techniques such as spline regression to thoroughly investigate the too-much-of-a-good-thing effect in other psychological constructs.

To the authors’ knowledge, there have been no simulation studies on the statistical power of analyses for nonlinear relationships with different placements of inflection points. Hence, it may be that the analyses conducted may be underpowered. Although the sample sizes for each analysis are relatively large for psychological studies, the main group of interest consists of only 2% of the entire sample. Under such circumstances, even with a sample as large as 8,000 participants, only 160 of those represent intellectually gifted individuals. Therefore, studies on the intellectually gifted should aim to obtain even larger samples. Another limitation of the present study is that the intelligence test used in the BCS70 measures a fairly narrow scope of cognitive abilities. The Cattell-Horn-Carroll theory of cognitive abilities (CHC; Carroll, 1993; Schneider & McGrew, 2012) is the most widely utilized contemporary psychometric theory of intelligence and provides a much broader taxonomy of abilities. Therefore, the study’s results might not fully reflect the modern understanding of intelligence and its measurement. Similarly, for mental health, most of the measures used focus on symptoms of psychopathology. Although this is congruent with more traditional definitions of mental health, in recent years there have been increased efforts to focus on both positive and negative aspects of mental health (Keyes, 2005; Lukat et al., 2016). In line with this, future studies should utilize more comprehensive psychometric tools for both intelligence and mental health that reflect modern definitions of these phenomena.

Understanding the psychosocial problems of the intellectually gifted and the underlying mechanisms behind these problems will allow the development of educational programs, as well as prevention programs for psychological issues and therapies designed specifically to fit the unique experiences of this group. More integrated theoretical models explaining potential causes of issues that affect mental health among the intellectually gifted are needed alongside systematic empirical studies. This is particularly important as highly intelligent individuals have a greater potential to have a positive impact on the functioning of the whole society, and this potential might be wasted due to experiencing certain psychosocial difficulties (Wai & Lovett, 2021).