Skip to main content

An IERI – International Educational Research Institute Journal

Gender, affect, and math: a cross-national meta-analysis of Trends in International Mathematics and Science Study 2015 outcomes


Understanding why women are consistently underrepresented in STEM fields has been a constant puzzle, with a consistent feature of the puzzle being performance in math. This study uses data from TIMSS exams to investigate cross-national gender differences in math-related affect, more precisely liking mathematics, confidence in mathematics, and valuing mathematics. We compared fourth and eighth graders to track any differences in these gender-related affective characteristics. Our findings suggest that despite the variability and some changes to the magnitude and direction of gender differences in math affect, boys and girls are similar. We also found that cross-national sociocultural, political, and educational equality of adults does not necessarily predict positive affect for both genders. In fact, the researchers found that some countries with a smaller adult gender gaps have students with higher gender differences in mathematics-relevant affect.


The problem of female underrepresentation in mathematics and science related careers has been an ongoing concern despite the societies’ efforts to facilitate the process for women to hold “male-dominated” jobs (Ceci and Williams 2011; Frome et al. 2006; Wang et al. 2013). In the United States, for instance, based on NGCP (National Girls Collaborative Project 2018), cited from Science and Engineering Indicators 2016 (National Science Board (US) 2018), women earned 50.3% of science and engineering bachelor’s degrees. However, women’s percentage in science and engineering at the undergraduate level significantly differs by the fields of study: women receive over half of bachelor’s degrees awarded in the biological sciences, they receive 17.9% of bachelor’s degrees in computer sciences, 19.3% in engineering, 39% in physical sciences, and 43.1% in mathematics.

Based on the same report by NGCP, however, women are still underrepresented in the STEM labor force. The most imbalance exists in the fields of engineering, computer science, and the physical sciences with only 29% of the science and engineering jobs held by women.

Literature review

There is a voluminous literature on possible reasons for the female underrepresentation in STEM fields. What follows is only some of the explanations for the issue. Relatively, some of the older studies relied on evidence related to physical and cognitive differences, such as differences in brain size (Romanes 1887) and assumed evolution-based explanations like male superiority in spatial skills (Levine et al. 1999; Linn and Petersen 1985; Voyer et al. 1995). While there are some physical differences between the brains, the research findings on cognitive distinctions are inconclusive (Hill et al. 2010). For instance, Lynn and Irwing (2004) in their study of gender differences in the Standard and Advanced Progressive Matrices found that there was no difference among children aged 6–14 years on the progressive matrices. They also found that males obtain just slightly higher means from the age of 15 through the old age on the progressive matrices.

Some other researchers found differences in cognitive skills with females outperforming males in certain tasks and vice versa. For example, Hedges and Nowell (1995) found that males, on average, are disadvantaged in reading and writing skills. They found that females slightly outperformed males in tests of reading comprehension, perceptual speed, and associative memory. On the other hand, males slightly, outperformed females in the tests of mathematics and social studies.

Still some other studies found that the existing gender differences, such as differences in spatial and visualization skills in which boys usually outperform girls, could decline and disappear with appropriate training (Baenninger and Newcombe 1989; Sorby and Baartmans 2000; Vasta et al. 1996).

More recent explanations relate these differences to socio-cultural environments and gender stereotypes that support the channeling of young girls away from STEM studies (Beede et al. 2011; Else-Quest et al. 2010; Guiso et al. 2008; Hyde and Mertz 2009). For instance, Nosek et al. (2002a) found that gender stereotypes that mathematics is for males were related to identification with and attitudes towards mathematics, with stronger gender stereotypes corresponding with more negative mathematics attitudes for women but more positive attitudes for men. They suggest that fundamental categorization of ‘males’ and ‘females’ produces identification with one’s social group which consequently shapes and is shaped by experiences that are expected of that group by the society (p. 57). The gender stereotypes are not necessarily explicit and could be subconscious hypotheses and expectations of men’s and women’s careers (Nosek et al. 2002b; Valian 1999).

Gender stratification of educational and occupational opportunities (also called gender segregation and gender inequality) has also been recognized as a social factor that could explain women underrepresentation in STEM fields. It has been argued that women and men equality in having access to higher education and job opportunities is positively related to mathematics achievement (Baker and Jones 1993). Guiso et al. (2008) found support for the role of gender stratification in mathematics test performance cross-nationally. This hypothesis has also been supported strongly in reading achievement and partially supported in science achievement (Reilly 2012).

In a similar vein, self-perception of abilities in STEM school subjects, shaped by the children’s cultural and social milieu and gender stereotypes, could impact women’s interest in STEM careers (Correll 2001, 2004). Correll (2001), analyzing National Educational Longitudinal Study dataset on high school students, found that males have higher assessment of their own mathematical competence than their female counterparts with the same math grades and test scores. The reverse was observed for verbal skills; female students had higher assessment of their verbal skills than male students. Moreover, she found that the self-assessments of mathematics competence had an impact on both males’ and females’ decisions to continue the path towards quantitative careers (e.g. enrolling in calculus and choosing quantitative college majores). Ganley and Lubienski (2016) in their analysis of data from the a nationally representative sample of students in the United States over 5 years (3rd–8th grades) found that that girls are less confident and less interested in mathematics than boys across third through eighth grades. Similarly, Cvencek et al. (2011) claim that children as early as second grade demonstrate their understanding of the American cultural stereotype that math is for boys. They also found that elementary school boys identify with math more than girls.

Gender difference in science course selection patterns has also been observed for young children; Farenga and Joyce (1999) found that students perceived physical science and technology-related courses as appropriate subjects for boys and life sciences as appropriate for girls to study. Turner et al. (2008) in their study of gender differences in vocational personality types (Holland 1997) of eighth and ninth grade students found that boys had significantly greater Investigative vocational personality scores than girls associated with enjoying studying and solving mathematics and science problems and valuing science and mathematics.

Finally, some other researchers have focused on the society’s negative reactions to women in the workplaces that are perceived as male dominated (Heilman and Okimoto 2007; Heilman et al. 2004). Heilman et al. (2004) found that women who were acknowledged to have been successful in the STEM workplace were less liked than equivalently successful men. Moreover, they found that being disliked can have impacts on the career in terms of overall evaluation and for recommendations for reward allocation. These findings further highlight the impact of social and cultural factors in women decision to choose or stay in ‘male gender-typed’ jobs.

Regional differences in TIMSS: outliers or a cluster

In IEA’s Trends in International Mathematics and Science Study (TIMSS) 2015 results, a few countries, mainly from the Middle East, are (apparently) outliers in terms of gender difference in achievement (International Association for the Evaluation of Educational Achievement (IEA) 2017). The relationship between socio-economic and educational gender equities and girls’ achievement has been negative in this set of nations. In other words, while they are usually ranked low in terms of general gender gaps (e.g. World Economic Forum’s Global Gender Gap Report 2019), girls outperform boys in these countries. This observation is in contrast with gender stratification hypothesis which anticipates a positive relationship between general gender equity and scholastic achievement equity (for more information on gender stratification hypothesis see Fiorentine 1993; Kane 1992; Baker and Jones 1993).

The girls’ higher achievement in some of the Middle East nations also shows consistency across several waves of TIMSS. For instance, in TIMSS 2015, Saudi Arabi, Oman, Jordan, Bahrain, and Kuwait are among the countries with highest gender difference in mathematics achievement in favor of girls. In TIMSS 2011 (Arora et al. 2012), in addition to the above-mentioned nations, girls in Qatar, Yemen, and Palestinian Nat’l Auth outperformed boys. This trend also exists for the results of TIMSS 2007 (Gonzales et al. 2008) and to a considerably lesser extent for TIMSS 2003 (Mullis et al. 2004).

The literature related to some of these countries reveal that the gender gap for these countries is not limited only to international tests of mathematics and science. The girls have a better performance than boys in other subjects in secondary and tertiary education. There is also a big difference in the proportion of girls and boys at universities in these countries. For instance, 70% of students in tertiary education in the United Arab Emirates are women and at Jordan’s largest university, women to men ratio is two to one (Abdulla and Ridge 2011; AlSindi 2013; Ripley 2017; Ridge 2010). Based on Ripley (2017), fewer than one in every five workers in most of these countries is woman which is, as she claims, against the conventional sense in the Western nations: more female graduates must result in more employment for women but it is not what happens in these countries. This highlights the importance of cultural and motivational considerations and a need for some form of classification in cross-national gender studies.

Tables 11, 12 in Appendix 1 represents the mathematics achievement of the 10 countries with the highest gender gap based on Global Gender Gap Report and achievement data availability in TIMSS 2015. Among fourth graders, in eight countries girls outperformed boys in math achievement tests from which four of the differences were statistically significant. For eighth graders, out of the ten high-gap countries, nine countries had girls outperforming boys with two of them significantly different. It is noteworthy to mention that in both grades, all countries except Republic of Korea are in the Middle East.

Current study

The current study examined gender differences in mathematics-related affect among fourth and eighth grade students cross-nationally using a recent international database. Large and nationally representative data can provide more reliable and generalizable findings to decide on gender similarities and/or differences compared to smaller and selective samples (Reilly 2012). Moreover, the use of secondary national and international data has been a common practice in gender studies (e.g. Guiso et al. 2008; Wiseman 2008). More precisely, we investigated gender difference in confidence in mathematics, liking mathematics, and valuing mathematics. We were interested in investigating the possible affective gender difference related to mathematics in both elementary and middle schools using large and representative samples.

Additionally, this study took a bird’s eye view of gender, affect, and math using meta-analysis procedures. Compared to the regular meta-analysis which endorses the use of studies as units of analysis, the present study used nations as the units of analysis (Else-Quest et al. 2010; Reilly 2012). We meta-analyzed data from the International Association for the Evaluation of Educational Achievement’s TIMSS 2015 to assess the magnitude and direction of gender differences in mathematics related affect.

The analysis was conducted for both fourth and eighth grade students. Previous research found some evidence of gender difference in mathematics test performance in high school and college but not elementary school (e.g. Hyde et al. 1990). One of our goals was to see if there was a gender difference in mathematics affect between fourth and eighth grade participants.

We were also interested in investigating how countries gender disparity in socio-economic and cultural areas may affect their students’ attitude towards math by putting countries in different categories based on the gender gap indices. We used World Economic Forum’s Global Gender Gap Report (GGGR) 2017 to achieve this goal. This report includes rankings and indices of 144 countries on four dimensions including economic participation and opportunity, educational attainment, health and survival, and political empowerment (see World Economic Forum 2019).

In sum this study was conducted to answer the following questions:

  1. 1.

    Is there a gender difference in students’ interest in mathematics (Liking Mathematics) cross-nationally?

  2. 2.

    Is there a gender difference in students’ mathematics self-confidence cross-nationally?

  3. 3.

    Is there a gender difference in how much students value mathematics cross-nationally?

  4. 4.

    How is the general gender gap index (including education, economic participation, political representation, and health) associated with the gender difference in interest in mathematics, mathematics self-confidence, and valuing mathematics?

  5. 5.

    Is there a significant difference between fourth and eighth grade students in terms of mathematics-related affect?


Data sources

The data for mathematics affect came from TIMSS 2015. TIMSS is a set of international examinations measuring achievement in mathematics and science. It is sponsored by International Association for the Evaluation of Education Achievement (IEA) and developed by TIMSS & PIRLS International Study Center at Boston College. It is administered every 4 years. At the time of conducting this research, the 2015 wave was the most recent administration. The data were based on students’ answers to three sets of items comprising scales in the TIMSS 2015 international database available to the public. The scales were Students Confident in Mathematics scale, Students Like Learning Mathematics scale, and Students Value Mathematics scale. Each of these scales had nine statements which were scored on a four-point scale. The choices for each statement were ‘Agree a lot’, ‘Agree a little’, ‘Disagree a little’, and ‘Disagree a lot’. The scale statements across grade levels were not completely identical (see Appendix 2 for the items of each scale at grade 4 and grade 8).

In TIMSS 2015, 57 countries and 7 benchmarking entities (regional jurisdictions of countries such as states or provinces) participated. In total, more than 580,000 students participated in TIMSS 2015. The number of countries and sample sizes for the purpose of the current study are summarized in Table 1.

Table 1 Sample size and number of countries in each analysis

Gender parity measures came from Global Gender Gap Report 2017 (GGGR 2017). It was established by World Economic Forum in 2006 to provide a picture of global gender equality. GGGR 2017 provides a Global Gender Gap Index (GGGI) for each of the major and emerging world economies that was utilized in the current study. GGGR 2017 included 144 countries. GGGR measures gender equality in four areas including economic participation, education (primary, secondary, tertiary), political representation, and health (life expectancy ratio and sex ratio at birth).

One of the relatively new concerns in the use of self-reported Likert-based scales data collected from different cultures is whether they have measurement invariance (e.g. Karakoc Alatli et al. 2016). The measurement invariance refers to a statistical feature of a scale or measurement tool which ensures the same underlying construct is being measured across different groups or times (here across different cultures). In other words, it is possible that a scale scores is affected by some culture-specific features that the researcher did not intend to assess; in that case the scale lacks measurement invariance. Moreover, the aggregation of the scale data from different cultures may result in puzzling patterns and erroneous inferences (He et al. 2017, 2018). In the current study, the measurement invariance was assumed for the scales utilized and the inferences made from the findings.

Data analysis

This study is based on meta-analysis procedures and techniques proposed Lipsey and Wilson (2001) for the computation of the effect sizes, the overall effect size, and calculation of heterogeneity (i.e. the variation of effect sizes). The mean comparison was utilized as the statistic for the calculation of the effect sizes. Microsoft Excel, IBM SPSS, and IEA’s IDB Analyzer were utilized for the purpose of meta-analysis. To calculate the mean effect sizes both Wilson’s SPSS macros (Wilson 2017) and hand-written SPSS syntax were utilized.

To check the homogeneity of effect sizes across nations Q and τ2 (Tau-squared) statistics were calculated. Q values were assessed using the χ2 distribution with k − 1 degrees of freedom, where k stood for the number of countries in the analysis. The τ2 statistic, was assessed from Q random effects analysis. Based upon a rule of thumb, τ2 values greater than 1 indicated heterogeneity.

To merge and calculate the mean and standard deviation of the mathematics data into a single file IDB analyzer was used. IDB analyzer generates SPSS syntax for both merging and the analysis of the data in the TIMSS 2015 international database. It considers sampling design and standard errors using the jackknife repeated replication (JRR) method. It also makes appropriate use of sampling weights in the analysis.

Meta-analysis is a statistical method for synthesizing the results of multiple mean difference calculations to increase the power of estimates from different studies (Lipsey andWilson 2001). For this analysis, the researchers treated each nation in the TIMSS database as a separate study of gender differences. For each nation, the researchers calculated the effect size (ES) as the mean difference between girls and boys divided by within-country pooled standard deviation. Negative ES values represented the boys’ advantage over girls regarding the construct. Effect size refers to the difference of two groups in the standard deviation unit. As a generally accepted guideline, the effect size below 0.2 is considered negligible, effect size of 0.2 is small, 0.5 is moderate, and 0.8 and higher are large (Cohen 1988).

Besides the meta-analysis of all nations together to answer research questions 1–3 (i.e. gender difference in students’ interest in mathematics, gender difference in students’ mathematics self-confidence, and gender difference in how much students value mathematics), the researchers also decided to select two sets of countries as high-gap and low-gap countries and do separate meta-analysis on each to answer the research question 4. We thought this procedure could provide a better picture of the role of socioeconomic and cultural differences related to mathematics affect. The selected countries, with lowest gender gap, based on the GGGR 2017 report with TIMSS 2015 data available, are as follows: Norway (2), Finland (3), Sweden (5), Slovenia (7), Ireland (8), New Zealand (9), France (11), Germany (12), Denmark (14), and Canada (16). The numbers in the parentheses are the countries global ranking of gender equality. The selected countries with the highest gender gap are: Iran (140), Saudi Arabia (138), Morocco (136), Jordan (135), Turkey (131), Qatar (130), Kuwait (129), Bahrain (126), United Arab Emirates (120), and Republic of Korea (118).


The meta-analysis of Student Like Mathematics scales revealed that there was almost no gender difference in interest in mathematics between fourth graders (grand random mean ES = − 0.073, grand fixed mean ES = − 0.065); the effect sizes were heterogeneous [Q(47) = 1596.04, p < 0.001] and the τ2 (between nations true heterogeneity) was 0.024 (see Table 1).

The meta-analysis of gender difference in interest (i.e. Student Like Mathematics scales) for eighth graders showed that there was a slight gender difference favoring male students (Grand Mean ES = − 0.106, see Table 2). The effect sizes were heterogeneous [Q (39) = 935.85, p < 0.001]. The τ2 value was .014.

Table 2 Overall effect size of gender differences in “Liking Mathematics” for fourth and eighth graders

Regarding the unweighted effect sizes, for fourth graders, 15 countries out of 48 countries (31%), and for eighth graders 10 out of 40 countries (25%) had small to medium gender disparity in “liking mathematics” (Tables 3 and 4). In other words, they had ds of 0.2 or more but less than 0.5 which were either positive or negative (Cohen 1988).

Table 3 Unweighted effect sizes and descriptive statistics for “Liking Mathematics” for fourth graders
Table 4 Unweighted effect sizes and descriptive statistics for “Liking Mathematics” for eighth graders

Regarding mathematics self-confidence both fourth and eighth grade boys were slightly superior than girls and this difference were statistically significant (p <0.001), though very small. The grand mean effect size for gender difference in math self-confidence for fourth graders was − 0.133 and the effect sizes were heterogeneous [Q (47) = 2514.04, p < 0.001]. The τ2 value was 0.03. The grand mean effect size for gender difference in self-confidence among eighth grade students was − 0.141. The effect sizes were heterogeneous [Q (39) = 1076.27, p < 0.001]. The τ2 value was 0.01.

In other words, the gender difference in mathematics self-confidence among eighth graders was slightly higher than fourth graders, and for both they were in favor of boys (Table 5). In 26 out of 48 (54%) countries for fourth graders and 15 out of 40 (37%) countries for eighth graders the unweighted effect sizes were small to medium when the mean scores of boys and girls were compared (Tables 6 and 7).

Table 5 Overall effect size of gender differences in “Confidence in Mathematics” for fourth and eighth graders
Table 6 Unweighted effect sizes and descriptive statistics for “Confidence in Mathematics” for fourth graders
Table 7 Unweighted effect sizes and descriptive statistics for “Confidence in Mathematics” for eighth graders

In the TIMSS 2015 report the Students Value Mathematics scale data were available just for eighth graders. The meta-analysis of those data showed almost no difference between boys and girls in terms of valuing math (Grand Mean ES = − 0.066). The individual effect sizes were heterogeneous [Q (39) = 1295.5, p < 0.001, τ2 = 0.02]. Please see Table 8 for more details. Out of 40 countries, 10 countries (25%) had small effect sizes and none of these 10 effect sizes went over 0.3 when boys and girls were compared (Table 9).

Table 8 Overall effect size of gender differences in “Valuing Mathematics” for eighth graders
Table 9 Unweighted effect sizes and descriptive statistics for “Valuing Mathematics” for eighth graders

The meta-analysis results of ten high gap and ten low gap countries are summarized in Table 10. For liking mathematics construct, as mentioned previously, the overall effect size was − 0.073 for fourth graders. The mean effect size for the low gap countries was − 0.127 implying that boys like mathematics more than girls. However, for the high gap countries the effect size was 0.071 representing girls’ higher interest in mathematics. For eighth graders, the students in low gap countries revealed higher difference in interest (effect size = − 0.141) compared to students in high gap countries (effect size = − 0.128), both in favor of boys.

Table 10 Grand mean effect sizes (ES) for all countries together as well as classified based on their GGGR Index of gender parity

The meta-analysis of the self-confidence in mathematics scale also revealed that boys in both low gap and high gap countries show more confidence, however, in low gap countries the difference between girls and boys were considerably larger. For the low gap countries, the effect sizes were − 0.199 and − 0.248 for the fourth and eighth grade students respectively. For high gap countries the effect sizes were − 0.078 and − 0.052 for the fourth and eighth grade students respectively.

The same pattern was observed for valuing mathematics questionnaire as well. While boys revealed valuing mathematics more than girls, the gender difference was about three times higher for low gap countries (effect size = − 0.114) than the high gap countries (effect size = − 0.036).

The tau-squared values decreased for all three constructs when two subsets of countries were assigned to high gap and low gap categories (see Table 10). Tau-squared represents the extent of variation among the effects in different countries here (i.e. between-country variance); it reflects the variance of the true effect sizes (Borenstein et al. 2011). This decrease could be assumed natural since the number of the effect sizes were reduced to ten.


From an international perspective, the magnitude and direction of the mean effect sizes in mathematics affect (i.e. liking mathematics, confidence in mathematics, and valuing mathematics) imply that boys and girls are similar. The mean effect sizes for all three constructs were less than 0.2. However, the variations in the unweighted effect sizes possibly suggest the importance of social and cultural factors in the observed differences. The individual effect sizes of gender difference in mathematics constructs investigated here included both positive and negatives values. The implication is that biological explanations for gender difference in mathematics-related affect are not supported by the current findings. Moreover, the change in the magnitude of the gender differences when fourth and eighth grade students of individual countries were compared is an evidence of the possibility of fostering parity between males and females in mathematics affect. The reduction of gender difference in mathematics confidence among eighth graders compared to fourth graders in the United States is a typical instance. The similar argumentation about the possibility of developing spatial skills in females to match spatial and visual skills of the male counterparts has been made by some researchers (Baenninger and Newcombe 1989; Sorby and Baartmans 2000; Vasta et al. 1996).

The cross-sectional comparison of grand means also reveals that there is an increase in gender difference in mathematics affect (i.e. Liking Math and Confidence in Math) as students make a transition from fourth grade (elementary school) to eighth grade (middle school). The gender difference in interest rose to − 0.106 from − 0.073 and the gender difference in confidence rose to − 0.141 from − 0.131. While the magnitude of the differences is small in both grades, its growth from fourth to eighth grade could be a signal implying that losing interest and confidence in mathematics in female students becomes more noticeable as they transition to higher grades. It could be the beginning of the formation of a larger gender gap in mathematics affect as students make transition to colleges and universities.

Despite methodological and procedural differences, current findings are partially consistent with the previous findings. For instance, Ganley and Lubienski (2016) found that math confidence showed smaller gender differences at the eighth grade than at third and fifth grades which is consistent with the current findings. Moreover, like the current findings, they found that gender differences in math confidence are larger than disparities in interest. However, our findings do not seem to be consistent with Turner et al. (2008) findings.

Regional differences in international studies

In international studies of gender differences in scholastic achievement, the differential effects of grouping countries based on gender gaps, to the authors’ knowledge, has not been sufficiently investigated (e.g. Guiso et al. 2008; Hyde and Mertz 2009). However, some studies have identified the importance of grouping countries in such studies. Reilly (2012), for instance, investigated the gender differences in reading, mathematics and science literacy for nations that participated in Program for International Student Assessment 2009. In some of his analyses, he made a distinction between 34 countries that were members of Organization for Economic Co-operation and Development (OECD) and 31 nations that were non-members or what he called non-OECD countries. He found a larger mean effect size for non-OECD nations in science literacy than when OECD and non-OECD were combined. He claims that “a focus on the combined sample overlooks the pattern of gender differences at a national level where girls show small but meaningful gains over boys in science literacy across large parts of the world” (Reilly 2012, p. 8). The findings of the current study further highlight the need for some form of classification to capture the importance of sociocultural and motivational factors in shaping mathematics-related affect. One of the main implications of this study is that the smaller sociocultural, economic, and educational gender gap does not necessarily mean more parity in boys’ and girls’ affect related to mathematics. For instance, in terms of “liking mathematics”, the maximum gender disparity in favor of boys for fourth graders was observed for France which in World Economic Forum’s Global Gender Gap Report (2019) was among top eleven countries in terms of gender equality. Germany and England were two other examples of countries with high ratings for gender equality but among the countries with most gender disparity in liking mathematics in favor of boys. Regarding mathematics self-confidence, Germany, Belgium, and Netherlands were three countries with most gender disparity for fourth graders and England, Sweden, and Australia for eighth graders in favor of boys and again these are countries with high standards in terms of sociocultural, political, and educational gender equality in GGGR (2017). Note that all of these countries are in Europe.

This pattern was observed the other way around as well. For example, in terms of mathematics self-confidence among fourth graders, Saudi Arabia’s d was the highest positive value (d = 0.446) showing almost a medium effect size in favor of girls. Saudi Arabia is among 7 countries with the lowest gender parity in GGGR 2017. Another example of girls’ superiority in confidence over boys in countries with lower gender equality is Oman. Oman also had the maximum female superiority in our meta-analysis in liking mathematics. This apparently unexpected trend was observed for all three constructs in both grades with some exceptions (See Figs. 1, 2 and 3). As mentioned above (in “Regional differences in TIMSS: outliers or a cluster”), these countries are are known for boys that do worse than girls at school in general, not only in mathematics. Therefore, the trend observed here should not be surprising: higher interest, higher confidence, and putting higher value on mathematics resulted in better mathematics achievement for the girls in this set of countries. It is worth mentioning that the girls’ better performance is relative to their nations’ boys. Compared to countries that did well in TIMSS, the Middle East nations are behind. Some numbers would help in getting a vision of where these nations stand globally; in TIMSS 2015 the top achievers are East Asian countries including Singapore, Hong Kong SAR, Korea, Chinese Taipei, and Japan with relatively large performance gap between them and the next high performing country. In the eighth grade, for instance, Singapore achievement is 621. The closest neighbor to East Asian countries is Russian Federation with the achievement of 538. The highest achievement for the Middle East countries belongs to the United Arab Emirates (465) and the lowest belongs to Saudi Arabia (368).

Fig. 1
figure 1

Positive values of the effect size represent difference in “liking mathematics” in favor of girls and negative values represent difference in favor of boys. Countries with higher gender gaps mostly represent females’ superiority in mathematics interest

Fig. 2
figure 2

Positive values of effect size represent difference in “mathematics confidence” in favor of girls and negative values represent difference in math confidence in favor of boys. In countries with higher gender gap, girls showed a little more confidence in mathematics and in countries with lower gender gap, math confidence favored boys

Fig. 3
figure 3

Gender differences in math affect as represented by the mean effect sizes. Three different bars for each construct are for all countries, low gender gap countries and high gender gap countries

Some previous studies found that gender inequality and gaps could be a factor that negatively influences mathematics performance; these studies suggest that variation in the gender stratification of educational, and occupational opportunities is a factor that leads to variation in the mathematics performance in favor of men (Baker and Jones 1993; Guiso et al. 2008). Regarding mathematics-relevant interest and self-confidence, and valuing mathematics this hypothesis is not fortified. The socio-cultural features, especially in male dominated countries, could have contributed to this seemingly controversial correlation of gender gap and math affect.

In case of Middle Eastern nations, the gap between boys and girls in schools has been identified and studied relatively well. Ripley (2017) in her article entitled “Boys are not defective” describes motivation as the dark matter of education in these nations. She mentions several factors that resulted in this gap. In some of these countries, boys unlike girls are guaranteed a government job no matter how they perform at schools which gives them low incentives to study hard. Moreover, there are distinct parental expectations from boys and girls: while girls are kept closer to home with more observation over what they do, boys enjoy more freedom and consequently are subject to more distractions in their education. Girls also have fewer job options than boys: many service jobs (e.g. jobs at restaurants and hotels) are not considered socially appropriate for women. There are other explanations such as girls having better teachers and better schools than boys, and male teachers have less job satisfaction than female teachers which result in boys’ poor performance in the single-sex education systems of these countries (see Ripley 2017) (Fig. 3).

The contrastive affect pattern of some of the Middle East nations and Western nations, the negative correlation of general gender gap and gender difference in mathematics affect in the some of the European countries, and the achievement gap between East Asian nations and other nations imply the importance of regional differences in the interpretations made from cross-national studies. Moreover, in the calculation of the mean effect size for all nations together, the positive and negative effect sizes cancel out each other and obscure the significance of cultural and motivational patterns that are mainly regional.

Limitations of the study

There are some limitations for this study that should be considered in the interpretation of the findings. First, the idea of comparing fourth and eighth grade students’ affect based on two very similar scales for each construct at a single point in time overlooks the differences in mathematics perceptions of the two grades; the two groups are so different regarding the materials they have been taught and tested on and in terms of psychological characteristics. Future studies may get a better estimation of the change in affect by comparing boys and girls in two waves of TIMSS instead of one. For instance, they could compare the fourth graders in TIMSS 2011 with eighth graders in TIMSS 2015, assuming they are the same students after 4 years of education. However, this approach is not flawless either.

The second limitation is the assumption of measurement invariance made for the scales used in the study. Next studies could check the measurement invariance first before proceeding to deeper analyses. As previously mentioned, the lack of invariance could result in misleading interpretation in cross-cultural studies (He et al. 2017, 2018).

Finally, some generalizations were made for specific regions in this study such as Middle East nations, East Asian nations, European nations, and high and low gap countries. These generalizations are limited by the number countries that participated in TIMSS 2015 and is by no means all-inclusive.


The findings of the current study show that the gender difference in liking mathematics, confidence in mathematics, and valuing mathematics are very small and negligible in general. However, variations were observed in the magnitude of the differences across different nations. This provides further evidence for the malleable nature of the existing gender differences in mathematics. In addition, the comparison of the mean effect sizes of the gender difference in interest and mathematics-related confidence, revealed an increase in the gender gap from fourth grade to the eighth grade. Although the increase was relatively small, and the differences were still negligible, it could be a representation of an ascending trend in gender differences as students grow and get closer to the final years of compulsory education.

In contrary to some previous research findings, the current study did not support the idea that students develop ‘math is for boys’ gender stereotypes as early as elementary school. However, the scales used here touched on students’ explicit perceptions and attitudes and not the implicit ones. Finally, the social-cultural gender gaps could not consistently account for the direction and the magnitude of the effect size of gender difference in the investigated constructs. More precisely, the findings imply that more gender parities in social, educational, economic, and health standards did not lead to less gender distinctions in attitudes towards mathematics (in a number of European countries) and less gender parities in those standards did not lead to more gender differences, as in the case of Middle Eastern nations. Moreover, this implication further illustrates the significance of regional differences and motivational patterns in cross-cultural studies.

Availability of data and materials

Data sets used in this study come from TIMSS international database which is publicly available for use in research. Therefore, this study did not need IRB approval.


  • Abdulla, F., & Ridge, N. (2011, March). Where are all the men? Gender, participation and higher education in the United Arab Emirates. In Towards an Arab higher educational space: International challenges and societal responsibilities: Proceedings of the Arab Regional Conference on Higher Education, edited by Bechir Lamine (pp. 125–136).

  • AlSindi, N. (2013). Single-sex schooling and mathematics achievement in the Middle East: The case of Iran, Syria, Jordan, and Oman (Doctoral dissertation, Georgetown University).

  • Arora, A., Foy, P., Martin, M. O., & Mullis, I. V. S. (2012). TIMSS 2011: International results in mathematics. Chestnut Hill: TIMSS & PIRLS International Study Center, Lynch School of Education, Boston College/International Association for the Evaluation of Educational Achievement.

    Google Scholar 

  • Baenninger, M., & Newcombe, N. (1989). The role of experience in spatial test performance: A meta-analysis. Sex Roles, 20(5), 327–344.

    Article  Google Scholar 

  • Baker, D. P., & Jones, D. P. (1993). Creating gender equality: Cross-national gender stratification and mathematical performance. Sociology of Education, 66, 91–103.

    Article  Google Scholar 

  • Beede, D., Julian, T., Langdon, D., McKittrick, G., Khan, B., & Doms, M. (2011). Women in STEM: A gender gap to innovation. ESA Issue Brief# 04-11. US Department of Commerce.

  • Borenstein, M., Hedges, L. V., Higgins, J. P., & Rothstein, H. R. (2011). Introduction to meta-analysis. Hoboken: Wiley.

    Google Scholar 

  • Ceci, S. J., & Williams, W. M. (2011). Understanding current causes of women’s underrepresentation in science. In Proceedings of the National Academy of Sciences of the United States of America, 108(8), 3157–3162. Retrieved from

    Article  Google Scholar 

  • Cohen, J. (1988). Statistical power analysis for the behavioral sciences (p. 2). Hilsdale: Lawrence Earlbaum Associates.

    Google Scholar 

  • Correll, S. J. (2001). Gender and the career choice process: The role of biased self-assessments. American Journal of Sociology, 106(6), 1691–1730.

    Article  Google Scholar 

  • Correll, S. J. (2004). Constraints into preferences: Gender, status, and emerging career aspirations. American Sociological Review, 69(1), 93–113.

    Article  Google Scholar 

  • Cvencek, D., Meltzoff, A. N., & Greenwald, A. G. (2011). Math-gender stereotypes in elementary school children. Child Development, 82(3), 766–779.

    Article  Google Scholar 

  • Else-Quest, N. M., Hyde, J. S., & Linn, M. C. (2010). Cross-national patterns of gender differences in mathematics: A meta-analysis. Psychological Bulletin, 136(1), 103–127.

    Article  Google Scholar 

  • Farenga, S. J., & Joyce, B. A. (1999). Intentions of young students to enroll in science courses in the future: An examination of gender differences. Science Education, 83(1), 55–75.;2-O.

    Article  Google Scholar 

  • Fiorentine, R. (1993). Theories of gender stratification: Assumptions, evidence, and “agency” and “equity” implications. Rationality and Society, 5(3), 341–366.

    Article  Google Scholar 

  • Frome, P. M., Alfeld, C. J., Eccles, J. S., & Barber, B. L. (2006). Why don’t they want a male-dominated job? An investigation of young women who changed their occupational aspirations. Educational Research & Evaluation, 12(4), 359–372.

    Article  Google Scholar 

  • Ganley, C. M., & Lubienski, S. T. (2016). Mathematics confidence, interest, and performance: Examining gender patterns and reciprocal relations. Learning and Individual Differences, 47, 182–193.

    Article  Google Scholar 

  • Gonzales, P., Williams, T., Jocelyn, L., Roey, S., Kastberg, D., & Brenwald, S. (2008). Highlights from TIMSS 2007: Mathematics and science achievement of US fourth-and eighth-grade students in an international context. NCES 2009-001. National Center for Education Statistics.

  • Guiso, L., Monte, F., Sapienza, P., & Zingales, L. (2008). Culture, gender, and math. Science, 320(5880), 1164–1165.

    Article  Google Scholar 

  • He, J., Barrera-Pedemonte, F., & Buchholz, J. (2018). Cross-cultural comparability of noncognitive constructs in TIMSS and PISA. Assessment in Education: Principles, Policy & Practice, 26, 1–17.

    Google Scholar 

  • He, J., Buchholz, J., & Klieme, E. (2017). Effects of anchoring vignettes on comparability and predictive validity of student self-reports in 64 cultures. Journal of Cross-Cultural Psychology, 48(3), 319–334.

    Article  Google Scholar 

  • Hedges, L. V., & Nowell, A. (1995). Sex differences in mental test scores, variability, and numbers of high-scoring individuals. Science (New York, N.Y.), 269(5220), 41–45.

    Article  Google Scholar 

  • Heilman, M. E., & Okimoto, T. G. (2007). Why are women penalized for success at male tasks? The implied communality deficit. The Journal of Applied Psychology, 92(1), 81–92.

    Article  Google Scholar 

  • Heilman, M. E., Wallen, A. S., Fuchs, D., & Tamkins, M. M. (2004). Penalties for success: Reactions to women who succeed at male gender-typed tasks. The Journal of Applied Psychology, 89(3), 416–427.

    Article  Google Scholar 

  • Hill, C., Corbett, C., & St Rose, A. (2010). Why so few? Women in science, technology, engineering, and mathematics. American Association of University Women. 1111 Sixteenth Street NW, Washington, DC 20036.

  • Holland, J. L. (1997). Making vocational choices: A theory of vocational personalities and work environments (3rd ed.). Odessa: Psychological Assessment Resources.

    Google Scholar 

  • Hyde, J. S., Fennema, E., Ryan, M., Frost, L. A., & Hopp, C. (1990). Gender comparisons of mathematics attitudes and affect. Psychology of Women Quarterly, 14(3), 299.

    Article  Google Scholar 

  • Hyde, J. S., & Mertz, J. E. (2009). Gender, culture, and mathematics performance. In Proceedings of the National Academy of Sciences of the United States of America, 106(22), 8801-8807. Retrieved from

  • International Association for the Evaluation of Educational Achievement (IEA). (2017, March 25). TIMSS 2015 and TIMSS Advanced 2015 international results. Retrieved from:

  • Kane, E. W. (1992). Race, gender, and attitudes toward gender stratification. Social Psychology Quarterly, 55, 311–320.

    Article  Google Scholar 

  • Karakoc Alatli, B., Ayan, C., Polat Demir, B., & Uzun, G. (2016). Examination of the TIMSS 2011 fourth grade mathematics test in terms of cross-cultural measurement invariance. Eurasian Journal of Educational Research, 66, 389–406.

    Google Scholar 

  • Levine, S. C., Huttenlocher, J., Taylor, A., & Langrock, A. (1999). Early sex differences in spatial skill. Developmental Psychology, 35(4), 940.

    Article  Google Scholar 

  • Linn, M. C., & Petersen, A. C. (1985). Emergence and characterization of sex differences in spatial ability: A meta-analysis. Child Development, 56(6), 1479–1498.

    Article  Google Scholar 

  • Lipsey, M. W., & Wilson, D. B. (2001). Practical meta-analysis. Thousand Oaks: Sage Publications Inc.

    Google Scholar 

  • Lynn, R., & Irwing, P. (2004). Sex differences on the progressive matrices: A meta-analysis. Intelligence, 32(5), 481–498.

    Article  Google Scholar 

  • Mullis, I. V., Martin, M. O., Gonzalez, E. J., & Chrostowski, S. J. (2004). TIMSS 2003 International Mathematics Report: Findings from IEA’s Trends in International Mathematics and Science Study at the Fourth and Eighth Grades. TIMSS & PIRLS International Study Center. Boston College, 140 Commonwealth Avenue, Chestnut Hill, MA 02467.

  • National Girls Collaborative Project. (2018, December 20). State of girls and women in STEM. Retrieved from:

  • National Science Board (US). (2018, December 20). Science & engineering indicators. Retrieved from:

  • Nosek, B. A., Banaji, M. R., & Greenwald, A. G. (2002a). Math = Male, Me = Female, Therefore Math ≠ Me. Journal of Personality & Social Psychology, 83(1), 44–59.

    Article  Google Scholar 

  • Nosek, B. A., Banaji, M. R., & Greenwald, A. G. (2002b). Harvesting Implicit Group Attitudes and Beliefs From a Demonstration Web Site. Group Dynamics, 6(1), 101–115.

    Article  Google Scholar 

  • Reilly, D. (2012). Gender, culture, and sex-typed cognitive abilities. PLoS ONE, 7(7), e39904–e39904.

    Article  Google Scholar 

  • Ridge, N. (2010). Teacher quality, gender and nationality in the United Arab Emirates: A crisis for boys. Dubai: Dubai School of Government.

    Google Scholar 

  • Ripley, A. (21, 2017). Boys are not defective. Retrieved from

  • Romanes, G. J. (1887). Mental differences between men and women. Nineteenth Century, 21(123), 654–672.

    Google Scholar 

  • Sorby, S. A., & Baartmans, B. J. (2000). The development and assessment of a course for enhancing the 3-D spatial visualization skills of first year engineering students. Journal of Engineering Education, 89(3), 301–307.

    Article  Google Scholar 

  • Turner, S. L., Conkel, J. L., Starkey, M., Landgraf, R., Lapan, R. T., Siewert, J. J., et al. (2008). Gender differences in holland vocational personality types: Implications for school counselors. Professional School Counseling, 11(5), 317–326.

    Article  Google Scholar 

  • Valian, V. (1999). Why so slow? The advancement of women. Women & Language, 22(2), 62.

    Google Scholar 

  • Vasta, R., Knott, J. A., & Gaze, C. E. (1996). Can spatial training erase the gender differences on the water-level task? Psychology of Women Quarterly, 20, 549–567.

    Article  Google Scholar 

  • Voyer, D., Voyer, S., & Bryden, M. P. (1995). Magnitude of sex differences in spatial abilities: A meta-analysis and consideration of critical variables. Psychological Bulletin, 117, 250–270.

    Article  Google Scholar 

  • Wang, M.-T., Eccles, J. S., & Kenny, S. (2013). Not lack of ability but more choice: Individual and gender differences in choice of careers in science, technology, engineering, and mathematics. Psychological Science (0956-7976), 24(5), 770–775.

    Article  Google Scholar 

  • Wilson, D.B. (2017, July 20). Meta-analysis macros for SAS, SPSS, and Stata. 2005. Retrieved from

  • Wiseman, A. W. (2008). A culture of (in) equality? A cross-national study of gender parity and gender segregation in national school systems. Research in Comparative and International Education, 3(2), 179–201.

    Article  Google Scholar 

  • World Economic Forum. (2019, January 1). Global gender gap report 2017. Retrieved from:

Download references


Not applicable.


This study was conducted by the researchers’ initiatives and did not have any funding sources.

Author information

Authors and Affiliations



Both authors had equal dedications and contributions in compiling this manuscript. Both authors read and approved the final manuscript.

Corresponding author

Correspondence to Ehsan Ghasemi.

Ethics declarations

Competing interests

The author declares that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.



Appendix 1

See Tables 11, 12.

Table 11 Mathematics achievement of ten high GGGR 2017 gap countries adopted from TIMSS 2015 report for the 4th grade students
Table 12 Mathematics achievement of ten high GGGR 2017 gap countries adopted from TIMSS 2015 report for the 8th grade students

Appendix 2

Students confident in mathematics scale (4th grade)

figure a

Reprinted from “Student Engagement and Attitudes”, by IEA’s TIMSS and PRILS International Study Center, 2015, Retrieved from

Students confident in mathematics scale (8th grade)

figure b

Reprinted from “Student Engagement and Attitudes”, by IEA’s TIMSS and PRILS International Study Center, 2015, Retrieved from

Students like learning mathematics scale (4th grade)

figure c

Reprinted from “Student Engagement and Attitudes”, by IEA’s TIMSS and PRILS International Study Center, 2015, Retrieved from

Students like learning mathematics scale (8th grade)

figure d

Reprinted from “Student Engagement and Attitudes”, by IEA’s TIMSS and PRILS International Study Center, 2015, Retrieved from

Students value mathematics scale (8th grade) copied directly from TIMSS 2015 website

figure e

Reprinted from “Student Engagement and Attitudes”, by IEA’s TIMSS and PRILS International Study Center, 2015, Retrieved from

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ghasemi, E., Burley, H. Gender, affect, and math: a cross-national meta-analysis of Trends in International Mathematics and Science Study 2015 outcomes. Large-scale Assess Educ 7, 10 (2019).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: