288
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Trends in assortative mating in the United States, 1700–1910. Evidence from FamiLinx data

, &
Received 28 Aug 2023, Accepted 03 May 2024, Published online: 23 Jun 2024

ABSTRACT

Couple formation and assortative mating significantly influence societal structures, as marriages between individuals from diverse geographical or social backgrounds promote intra-family diversity. Understanding these patterns is crucial for grasping the demographic processes that shape contemporary societies. However, the scarcity of comprehensive data has impeded progress in this area. This paper aims to fill this gap by investigating assortative mating trends in the United States among birth cohorts from 1700 to 1910, utilizing data from FamiLinx, an online crowdsourced genealogical database. We focus on two primary dimensions: migration background (including natives, first and second-generation migrants) and age at marriage. Our analysis yields three major findings. First, we document significant changes in assortative mating trends over time, reflecting the dynamic nature of mate selection and its responsiveness to societal shifts. Second, we uncover substantial heterogeneity in assortative mating patterns across different social groups, indicating varying social dynamics and preferences. Third, we illustrate how these trends can be differently interpreted depending on whether the perspective is individual or familial. Additionally, we explore the advantages and limitations of using online genealogical data for historical studies of assortative mating, highlighting its potential for offering new insights while acknowledging the challenges posed by data quality and representativeness.

1. Introduction

Couple formation is a significant demographic phenomenon with multiple implications. At the individual level, having a partner influences various aspect of one’s social and economic trajectory (Van Bavel, Citation2021) and often serves as a prerequisite for biological reproduction. At the societal level, couples play a crucial role in shaping the distribution of characteristics and inequalities among families, influencing their transmission across generations (Schwartz, Citation2013).

Extensive research has demonstrated that individuals do not randomly choose their partners; instead, they tend to exhibit similarities in various traits. This phenomenon is known as assortative mating. Findings from different fields have highlighted the extent to which partners tend to resemble each other in socioeconomic aspects (e.g. education, income, and occupation) (Greenwood et al., Citation2014; Kalmijn, Citation1994; Schwartz, Citation2013), ethnic backgrounds (Chiswick & Houseworth, Citation2011), physical attributes (Little et al., Citation2006) and genetic traits (Robinson et al., Citation2017).

While the patterns and causes of assortative mating have been extensively studied in the social sciences, social history, and historical demography, there is still limited knowledge about long-term trends. Nevertheless, a comprehensive understanding of the historical transformations in assortative mating is essential to unravel how contemporary societies have emerged. Marriage merges families, increasing or limiting within-family diversity and social cohesion, and influencing the distribution of social characteristics and genetic traits (Thiessen & Greeg, Citation1980). Moreover, the combination of partner’s characteristics significantly influences the distribution of socioeconomic resources across households. Different levels of assortative mating are associated with other socio-demographic phenomena such as fertility (Conley et al., Citation2016; Nitsche et al., Citation2018) or health (Guner et al., Citation2018). Finally, the characteristics of parents determine the family background and life prospects of subsequent generations, influencing social mobility and stratification.

Despite its relevance, studying long-term mating patterns has traditionally been challenging due to the demanding data requirements. However, recent developments and the availability of crowdsourced online genealogies offer new and unprecedented opportunities to study historical demographic processes across geographically diverse populations. Genealogical data have already been utilized to examine longevity (Cozzani et al., Citation2023; Kaplanis et al., Citation2018; Minardi et al., Citation2023) and fertility (Blanc, Citation2021; Hsu et al., Citation2021). We contribute to this emerging body of literature by demonstrating, for the first time, how crowdsourced genealogical data can be leveraged to study assortative mating.

This paper documents long-term assortative mating trends in the United States from the 18th to the early 20th centuries, utilizing FamiLinx genealogical data. Specifically, we focus on assortative mating based on migration background and age. On the one hand, assortative mating by geographical origin is used as an indicator of the openness of a society, and it can be studied to assess the evolution of societies and the integration of migrants into them (Kalmijn, Citation1998). On the other, assortative mating by age is used as a proxy of changes in partner choice; the reduction in the age gap between partners is often interpreted as the rise of personal criteria over instrumental ones over time, which can be read as a further indicator of the changing meaning of the institution of marriage over time (Van de Putte et al., Citation2009). Thus, we believe that assortative mating by geographical origin and age are important dimensions to investigate when studying the evolution of a society over time. And genealogical data can be an interesting source to pursue such an investigation.

Our findings highlight the dynamic and multidimensional nature of assortative mating. First, changes in assortative mating did not follow a consistent trajectory over time but were concentrated within particular birth cohorts. Second, persistent variations existed in assortative mating between individuals defined by their migration histories and gender across different periods. Finally, results indicate the remarkable persistence in the intergenerational reproduction of mating behavior across history.

2. Assortative mating by migration background and age: concepts and previous studies

2.1. Migration background

Assortative mating based on place of origin is particularly interesting when examining the interaction between different social groups. High intermarriage rates indicate extensive contact and mutual acceptance between groups and successful integration and assimilation of migrants into the host society (Kalmijn, Citation1993).

Societal and economic changes may have influenced patterns of intermarriage over time. Firstly, intermarriage based on geographical origin is contingent upon the population structure and group sizes within a society (Blau & Schwartz, Citation1984). Migration has been a significant driver of population change throughout history, bringing individuals with distinct social, economic, and demographic characteristics into the host societies.

On the one hand, changes in the ethnic composition of a society may increase intermarriage rates due to a more diverse population. On the other hand, social groups, especially migrants, often experience spatial and social segregation, which can hinder intermarriage (Danubio & Pettener, Citation1997; De Marco, Citation1980). Previous studies have demonstrated the extent to which intermarriage has increased during periods of high migration and how migration has heightened the likelihood of intermarriage at the individual level (Dribe & Lundh, Citation2009; Dribe et al., Citation2019; Puschmann et al., Citation2014; Van de Putte, Citation2003; Van de Putte & Oris, Citation2005).

An significant historical barrier to geographical intermarriage was the influence of families, as they often sought to maintain continuity and connections with the original group through their children’s marriages. Families also played a significant role in shaping the social environment in which their children were raised and where they typically sought partners. Families were responsible for the primary socialization of children, instilling a sense of group solidarity that influenced their preference for endogamy. However, societal changes throughout history have weakened the influence of families on their children’s’ choices. Factors such as the development of educational systems (Klesment et al., Citation2020), the expansion of wage-based economic systems (Treiman, Citation1970), industrialization (Blau & Duncan, Citation1967), and the proliferation of mass communication and transportation (Ekamper et al., Citation2011; Maas & Zijdeman, Citation2010) have increased individuals’ independence from their families and are often associated with a higher likelihood of geographical heterogamy.

2.2. Age similarity

Age differences between partners have traditionally served as an indicator of the balance between personal and sentimental factors versus instrumental considerations in partner choice (Shorter, Citation1975; Van de Putte et al., Citation2009; Van Poppel et al., Citation2001).

In pre-industrial societies, newly formed households needed to achieve economic independence without jeopardizing the family’s overall economic situation. As a result, age hypergamy, where men were typically older and possessed greater economic security and power, was the prevailing couple arrangement (Ariès, Citation1983; Van de Putte et al., Citation2009). On the other hand, younger women, with their higher potential for reproduction, might have been preferred by families interested in lineage continuation. However, with modernization and industrialization, the economic imperatives associated with marriage gradually diminished, allowing more room for personal considerations in partner selection. Research has indicated a significant increase of age homogamy by the end of the 19th century, which has been interpreted as a reflection of the growing prominence of non-instrumental factors in choosing a partner.

2.3. Assortative mating as a multidimensional process

Most studies on assortative mating focus on specific dimensions, such as migration background or age, in isolation. However, in reality, individuals seek partners who possess multiple characteristics simultaneously (Kalmijn, Citation1998). In this paper, we approach assortative mating from a multidimensional perspective, specifically examining the interplay between migration background and age. We develop two competing hypotheses to explain this phenomenon. The first, known as the by-product hypothesis (Kennedy, Citation1944), suggests that if individuals choose partners with similar migration backgrounds, they are also likely to exhibit homogamy in terms of age. This could be attributed to the structure of local mating markets, which may exhibit both ethnic and age homogeneity. In contrast, the status exchange hypothesis (Davis, Citation1941; Merton, Citation1941) suggests that higher levels of age heterogamy may be observed when there is homogamy based on migration background. According to this hypothesis, individuals might be willing to trade similarities in age for similarities in migration background, particularly if they value maintaining strong connections with their community of origin.

Furthermore, we anticipate that there may be gender differences in these patterns (Sassler, Citation2005). Historical societies often restricted women’s opportunities for education and employment outside their parental homes (Sassler, Citation2005, 2000), resulting in a less diverse social environment for women than men. Additionally, the transmission of ethnic identity primarily occurred within the home environment through practices such as food, religion, and traditions, with women often playing a central role in preserving ethnic identity (Foner & Dreby, Citation2011). Lastly, women typically faced greater pressure from their families of origin to marry within their ethnic group. Therefore, we expect homogamy based on migration background to be of greater importance for women. As a result, they may be more willing to exchange other characteristics, such as age, to achieve homogamy. We might think of migrant women as an exception to this pattern, as migration might be a way to gain independence and emancipation (Sharpe, Citation2002), and this could be reflected also in their marriage choices.

2.4. The intergenerational persistence of mating behavior across generations

Studying the degree of resemblance in demographic behaviors among individuals within the same family can provide valuable insights into population transformation processes (Mare, Citation2011). The parent-child link has been extensively examined concerning various economic (income, occupation) and social (education, class) outcomes, highlighting the strength of this association (Breen & Jonsson, Citation2005; Erikson & Goldthorpe, Citation1993). Additionally, there has been a growing interest in understanding the intergenerational transmission of demographic behaviors and the resemblance between generations in recent years (Mare, Citation2011; Song et al., Citation2015).

Parents influence children’s behaviors through interconnected channels, including socialization, cultural factors, and genetic transmission. Children internalize preferences and attitudes through socialization and cultural influences, which can lead to replicating of their parents’ demographic choices. Moreover, recent research has shed light on the heritability of certain demographic characteristics, indicating genetic factors at play (Barban et al., Citation2016). Families can also directly shape their children’s choices, particularly in past societies where parental interference was more prevalent (Van Leeuwen et al., Citation2005).

Interestingly, there has been little focus on the intergenerational persistence of assortative mating between generations. However, a strong correlation between the mating patterns of parents and their children is one mechanism through which resources become concentrated within families across generations (Goñi, Citation2022; Marcassa et al., Citation2020). Moreover, the persistence of mating behavior between generations can be seen as an additional indicator of group isolation and polarization. Continuous reinforcement of group strength and cohesion can occur if endogamous marriages consistently renew and preserve group boundaries. Although research has shown that parents may actively influence their children’s marital choices (Van Leeuwen et al., Citation2005), to the best of our knowledge, there are no explicit studies examining the intergenerational persistence of assortative mating.

3. Historical patterns of assortative mating in the United States

Our study investigates the evolution of mating patterns in the United States among birth cohorts from the 18th and 19th centuries. The United States presents an ideal context for our research due to the remarkable economic, political, social, and demographic changes during this period. Particularly noteworthy is the country’s experience of multiple waves of migration throughout the 18th, 19th, and early 20th centuries (Fogleman, Citation1992; Martin, Citation2021; Potter, Citation1965). Immigration waves in the US followed some specific country patterns. Large-scale immigration in the 1830s and 1850s came from Britain, Ireland and Germany, followed by Catholics and Jewish migrants from Southern and Eastern Europe (Martin, Citation2021).

Previous empirical studies on intermarriage by migration background in the United States have predominantly focused on the Mass Migration period (1870–1920) (Abramitzky & Boustan, Citation2017; Angrist, Citation2002; Kalmijn, Citation1993; Meng & Gregory, Citation2005; Pagnini & Morgan, Citation1990; Sassler, Citation2005). Findings from these studies emphasize the persistent barriers to marriage between native-born individuals and migrants, despite the increasing proportion of migrants and their descendants in the overall population.

The first-generation migrants exhibited higher rates of endogamy, while the second-generation individuals demonstrated a greater inclination to marry within the native population, supporting the concept of assimilation (Kalmijn, Citation1993). Furthermore, variations emerged among ethnic groups, with more recent migrants, such as Italians and Poles, displaying a stronger inclination toward endogamy. Additionally, research suggests that gender plays a role, as women with a migration background exhibited a higher degree of homogamy in terms of ethnic origin, indicating potential differences in marriage market dynamics (Sassler, Citation2005).

Historical investigations into age homogamy in the United States have been limited. Atkinson and Glass (Citation1985) explored age differences between spouses using Census data from 1900, 1960, and 1980, revealing a shift toward age homogamy. While there is some understanding of earlier periods and long-term trends, we acknowledge the existence of two notable studies. Haines (Citation1996) examined the age of marriage from colonial times to the present, drawing from a range of pre-census sources and Census data. The marriage patterns in North America differed from the characteristic pattern of late marriage observed in Europe (Hajnal, Citation1953), with spouses in the United States marrying at younger ages. Similarly, Fitch and Ruggles (Citation2000) investigated trends in median marital age between 1850 and 1990, revealing their sensitivity to economic cycles.

4. The potential of genealogical data for the study of assortative mating

To the best of our knowledge, no study has specifically investigated long-term trends in assortative mating by geographical origin and age in the United States, primarily due to limited data availability. We fill this gap using online crowdsourced genealogical data, which suits our research questions for two main reasons. Firstly, they allow to investigate a broader period compared to other data sources, enabling us to trace individuals back to the 18th century. Secondly, genealogical data offer information on vertical relationships, such as parent-child connections over long periods, which is not always possible with official historical data.

Genealogical data have often suffered from issues of representativeness, because they often included very selective populations such as aristocracies or high-status families, with lower sample sizes and geographically and temporally located (Westendorp & Kirkwood, Citation1998). Compared to previously used genealogical data, FamiLinx provides a bigger sample with a larger geographical and temporal focus. Nevertheless, assessing how it resembles official statistics is crucial. Extensive data validation has been performed by Kaplanis et al. (Citation2018) and more recently for Europe by Blanc (Citation2023) showing it is consistent with official statistics. On the other hand, it has been documented that FamiLinx tends to underestimate women (Minardi et al., Citation2023), and mortality levels (Chong et al., Citation2022; Stelter & Alburez-Gutierrez, Citation2022). As for couples, given the genealogical design of the data, FamiLinx tends to underestimate childless couples. Moreover, some recent contributions (Minardi et al., Citation2023) have raised the attention of scholars toward the fact that individuals with more complete information might be selected in terms of socioeconomic status.

Keeping these limitations in mind, we conducted in Appendix D further data quality checks of our FamiLinx sample with the US Census data, when possible (see Appendix D). We compare trends in the population by migration background, and patterns of assortative mating by age and geographical origin to the Censuses for the years when it is possible. Overall, the Familinx estimates are consistent with the censuses.

5. Data, sample, variables, and methods

5.1. Data and sample

We use FamiLinx data, the publicly available dataset derived from the genealogical website Geni.com. This dataset encompasses comprehensive individual-level information and genealogical trees, encompassing 86 million individuals across various geographical locations and spanning over four centuries.

We started our sample selection process by keeping cases with valid information on the birth and death countries and the year of birth. Therefore, we selected individuals dead in the United States who were born between 1700 and 1910, with complete information on their dates and places of birth. We also restricted the sample to individuals and parents who reached at least 30 years old and not more than 110 years old. The upper limit is meant to exclude outliers and unreasonable values; the lower limit is because Familinx data are not acceptable representative for younger individuals, but become more accurate for middle-aged and older adults (Chong et al., Citation2022).

Leveraging the genealogical structure of the data, we were able to gather detailed information on the birthplaces of their parents. Since we do not have direct information on marriages, we identify as a partner the individual of the opposite with whom the subject shares at least one child. For this individual, we can retrieve information on his/her place and year of birth, and for his/her parents too. To visualize the relationships, refer to .

Figure 1. Id and relationships for the construction of the dataset.

Figure 1. Id and relationships for the construction of the dataset.

The resulting dataset comprises 19,275 individuals who possess complete information regarding their birth and death dates, and their parents’, partners’, and parents-in-law’s birthplaces. We ended up with a very strong reduced sample compared to the original FamiLinx data. This is the result of two processes. On the one hand, there is a selection directly coming from the completeness of the original data. For instance, of the original 86 million individuals, only around 8 million had complete information on the place of birth and death. On the other, the criteria that we impose of complete information for two different genealogical trees (the individual and his/her partner) further decreases our sample. Appendix C provides a more detailed explanation of the steps in the sample selection. Besides this selection, our sample is representative of the US census data of the same periods (Appendix D shows the comparison between US Census data and Familinx).

5.2. Variables

To examine patterns of assortative mating by geographical origin, we utilize the country of birth information to construct relevant variables. Firstly, we categorize individuals based on their migration histories. By considering the individual’s country of birth and that of their parents, we can differentiate between three groups. If the individual and their parents are born in the United States, we classify them as native individuals with native parents. In cases where the individual is born in the United States but has at least one parent born outside the country,Footnote1 we classify them as second-generation. To include an additional level of detail, we add in the models a variable accounting for the number of parents born inside or outside the United States. Individuals who were born outside the United States are classified as first-generation migrants.Footnote2 We apply the same methodology to compute the corresponding variables for partners.

We constructed a measure of assortative mating based on geographical origin using the migration information. If both the individual and their partner share the same migration background, we consider them to be homogamous, while they are classified as heterogamous if their migration histories differ. We apply the same categorization to parents. We further build a variable indicating whether the partners were born in the same country.

As for age assortative mating, we developed two measures. As suggested by Kolk, age heterogeneity and age hypergamy are two theoretically different dimensions. The first indicates just the age gap between the partners, and it is used as an indicator of the egalitarian nature of the union (Kolk, Citation2015). Empirically, it can be calculated in our data as the absolute difference between the year of birth of the male partner and that of the female partner. Age hypergamy rather indicates the gendered dimension of age assortative mating, which can be grounded in gendered age preferences and can have gendered consequences for partners in the union (Kolk, Citation2015). We created a categorical variable that identifies whether an individual is in an age homogamous (with a difference of 0 ± 2 years), hypergamous (with the man at least 3 years older), or hypogamous (with the woman at least 3 years older) couple. provides a summary of the characteristics of the main variables for our analyses.

Table 1. Variables.

We exploit a set of additional variables: the 50-years birth cohorts (1700–1749, 1750–1799, 1800–1849, 1850–1909), the state where the individual died, and the share of individuals with a migration background in the birth cohort calculated from FamiLinx data.

5.3. Methods

5.3.1. Mating trends description

We calculated crude rates of migration homogamy and heterogamy for moving birth cohorts of 10 years.Footnote3 Additionally, we document the prevalence of age pairing categories across birth cohorts and track the trend in the absolute age difference within couple arrangements based on geographical origin (both natives, both migrants, mixed).

5.3.2. Association between partners’ migration background

We employ log-linear analysis to examine how the association between partners’ migration histories evolved, while accounting for historical changes in marginal distributions (Hout, Citation1983). Specifically, we utilize the log-multiplicative layer effects model (Xie, Citation1992), which is well-suited for investigating changes across groups, such as cohorts. The model is represented as follows in EquationEquation 1:

(1) logFijk=λ+λiR+λjC+λkL++λjkRL+λijCL+exp(ψijφk)(1)

logFijk is the natural logarithm of the expected frequency in cell (I,j,k), λiR,λjC,λkL the marginal effects of R (row), C (column) & L (layer), λikRL,λjkCL the two-way interactions RxL and CxL. The parameters of interest are ψij, which expresses the origin-destination association, and φk, which indicates the origin-destination association levels for different layers being compared. It expresses in relative terms how much the association between R&C varies between layer k and layer k*. We specify the model from a 2 × 2 marriage table in which the individual and his/her partner’s migration history (R and C) are included (see ), the layers (1700–1749, 1750–1799, 1800–1849, 1850–1909). The empirical marriage table can be found in Appendix A1. We show results for φk, that gives a measure of changes in the association between variables across time. Further estimates are shown in Appendix A1.

Table 2. Marriage table for partners’ migration histories.

5.3.3. Association between parents and children’s mating behavior

We apply EquationEq. 1 to a 2 × 2 intergenerational contingency table in which the individual mating (homogamy and heterogamy) corresponds to the row (R), and the parents’ is the column (C) (see ). The empirical table and further estimates are in Appendix A2.

Table 3. Contingency table for individual’s and parents’ mating.

5.3.4. Assortative mating as a multidimensional process

We use multinomial regression models to analyze the categorical variable of age difference (age homogamy, age hypergamy, and age hypogamy). This variable is modeled as a function of mating by migration background (homogamy/heterogamy) and migration background (native/1st gen/2nd gen). We include interaction terms to examine potential heterogeneities across groups. Our models also incorporate the birth cohort, the individual’s death state to account for regional variations, and the share of migrants in the birth cohort to control – although indirectly – for the mating market structure. Additionally, we consider other couples’ characteristics, such as similarity in country of origin and the number of parents born in the United States. We conduct separate models for men and women and present the results as predicted probabilities by migration background and birth cohorts.

6. Results

6.1. Trends in mating patterns

illustrates the trends in rates of homogamy and heterogamy based on migration history across different birth years. In this analysis, we grouped first and second-generation migrants into a broader category of migrants and examined mobile cohorts of 10 years. Notably, we observed distinct patterns in trends between individuals with a migration history in their families (1st or 2nd generation) and natives (individuals whose parents were born in the United States). Throughout the two centuries, natives consistently displayed higher rates of homogamy.

Figure 2. Rates of homogamy and heterogamy by birth cohort and migration histories, 1700–1910.

Figure 2. Rates of homogamy and heterogamy by birth cohort and migration histories, 1700–1910.

Another noteworthy aspect is the cyclic pattern observed in the trends, particularly among individuals with migration histories. We identified two significant drops in rates of homogamy, which coincided with two prominent migration waves in the history of the United States: the wave spanning 1820–1860 and the Mass Migration Era (1870–1920) (Martin, Citation2021).

presents the trends in assortative mating by age for individuals born between 1700 and 1910. The trends are depicted based on the type of couple, including both natives, both migrants, and mixed couples. Throughout the analyzed time period and across all couple types, male hypergamy (where the man is older by more than 3 years) is the predominant age arrangement. It remains consistent over time. Age homogamy follows as the second most prevalent arrangement, while female hypergamy represents approximately 10% of couples throughout the years.

Figure 3. Trends in assortative mating by age, 1700–1910.

Figure 3. Trends in assortative mating by age, 1700–1910.

In addition to these consistent patterns, there are noticeable differences and changes in trends. Couples in which both partners are migrants exhibit greater fluctuations in their age arrangements, particularly in birth cohorts from the late 18th and 19th centuries. Furthermore, there is an increasing trend towards age homogamy among individuals born in the late 19th century, particularly among native and mixed couples, as it has been documented in other Western countries (Van de Putte et al., Citation2009).

The trend of growing age homogamy at the end of the 19th century is further supported by examining the changes in the absolute age difference between partners, as illustrated in . The average age difference is approximately 5 years, with an initial increase observed in cohorts from the first half of the 19th century. However, this is followed by a notable decrease in subsequent generations.

Figure 4. Trends in the absolute value of age difference between partners, 1700–1910.

Figure 4. Trends in the absolute value of age difference between partners, 1700–1910.

6.2. Association between partners’ migration background

reveals a cyclic pattern in the trends of assortative mating by migration, indicating lower rates of homogamy in specific birth cohorts. However, it is crucial to determine whether these trends result from changes in the population composition or actual shifts in partner choice criteria. To address this question, log-linear models are valuable as they enable control over marginal distributions. In our analysis, we apply these models to marriage tables, provided in Appendix A1, and examine them separately for each birth cohort.

Specifically, we model the association between an individual’s migration profile (native or migration history) and their partner’s migration profile across different birth cohorts. The log-multiplicative layer effect serves as the framework for our analysis, with birth cohorts acting as the layers. In , we present the φk parameters, which allow us to interpret the extent of changes in the association between partners’ migration histories compared to the reference birth cohort (1700–1749). Additional model measures such as k coefficients and measures of fit can be found in Appendix A1 and A2.

Table 4. Layer coefficients, multiplicative-log effect layer model of partners’ migration background association.

provides insights into the relative changes in the φk parameters across different birth cohorts. Consistent with the observed trends in , we observe significant decreases in the association between partners’ migration histories in the cohorts of 1750–1799 and 1850–1910 compared to the cohorts of 1700–1749 and 1800–1849. This confirms that, even when controlling for changing marginal distributions, the cyclic pattern of partner choice remains evident. It suggests that partner choice criteria, including personal preferences and the influence of third parties, may undergo transformations within a context characterized by substantial changes in the population structure.

6.3. Association between parents and children’s mating behavior

We explore the intergenerational persistency of mating behavior across birth cohorts using log-linear modeling. To analyze this, we apply the model to the intergenerational contingency tables provided in Appendix A2. In these tables, the rows represent the mating behavior of children, while the columns represent the mating behavior of their parents.

presents the φk parameters across various birth cohorts. Surprisingly, we observe a remarkable stability in the association between parents’ and children’s mating behavior, despite significant changes in the population structure over time. However, there is a notable reduction in the intergenerational persistency of mating behavior in the 1850–1910 cohort. This cohort demonstrates a lower degree of persistence than the previous cohorts, suggesting a shift in mating patterns during this period.

Table 5. Layer coefficients, multiplicative-log effect layer model of parents’ and children’s mating behavior association.

6.4. Assortative mating as a multidimensional process

We delve into assortative mating as a multidimensional process, specifically examining the dimensions of migration background and age. We explore two different scenarios based on prevailing theories: the by-product hypothesis and the status exchange theory. To present our findings, we utilize predicted probabilities for various age pairing outcomes (homogamy, hypergamy, or hypogamy) stratified by sex and migration background.

focuses on natives and provides valuable insights. Firstly, we observe that the trends in age homogamy outcomes align with what we observed in the descriptive section. There is an increasing trend of age homogamy across birth cohorts, accompanied by a decrease in age hypergamy. Notably, we identify differences between women and men in these outcomes. Among women, those in homogamous partnerships (indicated by the orange dot) exhibit higher levels of age homogamy (left panel) and lower levels of age hypergamy (middle panel) compared to women in heterogamous partnerships (represented by the blue dot). However, no significant differences are found for age hypogamy. In contrast, among men, we do not find such evidence, as there are no notable differences in age homogamy outcomes based on mating by migration. This finding supports the by-product hypothesis and suggests a gendered dynamic in the mating market. Women may have had access to a more homogeneous mating market in terms of geographical and age composition, possibly due to their relatively limited access to diverse environments like the labor market.

Figure 5. Predicted probabilities of age homogamy, male hypergamy and hypogamy conditional to mating and migration background. Natives, women and men.

Figure 5. Predicted probabilities of age homogamy, male hypergamy and hypogamy conditional to mating and migration background. Natives, women and men.

The picture interestingly changes when we look at first-generation migrants (). First, they display lower levels of age homogamy than natives across all birth cohorts, and higher rates of age hypogamy.

Figure 6. Predicted probabilities of age homogamy, male hypergamy and hypogamy conditional to mating and migration background. First generation migrants, men and women.

Figure 6. Predicted probabilities of age homogamy, male hypergamy and hypogamy conditional to mating and migration background. First generation migrants, men and women.

Surprisingly, we observe that migration homogamy (represented by the orange dot) is associated with higher levels of age hypogamy than marriages where one partner has a migration background (indicated by the blue dot), regardless of gender. This result suggests the presence of sex-biased dynamics within the mating market. Given that women constituted a smaller proportion of the migrant population (Martin, Citation2021), they may have been exposed to an excess of young men, creating room for non-traditional age pairings.

Turning our attention to second-generation individuals (), we observe a shift in the pattern. They exhibit levels of age homogamy comparable to those of natives, indicating a potential assimilation process. When considering age homogamy outcomes, we once again detect a gendered pattern. Among women, homogamy by migration background (i.e. partnering with a migrant with a migration history) is associated with lower age homogamy than intermarriages. Conversely, homogamy by migration history is linked to higher levels of age hypergamy. This finding supports to the exchange hypothesis, suggesting that when individuals partner within the migrant group, there is evidence of a potential trade-off involving age as a trait.

Figure 7. Predicted probabilities of age homogamy, male hypergamy and hypogamy conditional to mating and migration background. Second generation, men and women.

Figure 7. Predicted probabilities of age homogamy, male hypergamy and hypogamy conditional to mating and migration background. Second generation, men and women.

7. Discussion and conclusion

Family networks are complex social structures influenced by various factors, including individual choices related to fertility, marriage, and migration. These decisions, made across generations, shape the dynamics of families and have long-lasting effects on society. Among these decisions, marriage and assortative mating play a particularly significant role in how families reproduce themselves, both in terms of biological reproduction (through fertility) and social reproduction (through the transmission of resources and social advantages). Consequently, exploring and understanding long-term patterns of assortative mating is a crucial area of study in the social sciences. By examining how individuals select their partners based on various characteristics, such as socioeconomic background, education, and cultural factors, we can gain valuable insights into the evolution of societies over time and the mechanisms that contribute to the reproduction of social inequalities.

Online platforms have provided unprecedented access to vast amounts of data. One area that has benefited from these developments is the study of historical societies. Online genealogical data, in particular, has emerged as a valuable resource for investigating patterns of longevity, fertility, and population dynamics in the past (Blanc, Citation2021; Hsu et al., Citation2021; Kaplanis et al., Citation2018). In this study, we take advantage of the FamiLinx dataset, a rich source of genealogical information that offers extensive coverage across different time periods and geographical locations. Using this dataset, we documented assortative mating in the United States, focusing on individuals born between 1700 and 1910.

Our study uncovers three significant findings regarding assortative mating. Firstly, we reveal that the trends in partner selection, specifically concerning geographical origins, do not follow a consistent trajectory over time. Instead of a linear progression towards increased heterogamy, we observe fluctuations concentrated within specific birth cohorts. This pattern aligns with previous research by Haines (Citation1996) and Fitch and Ruggles (Citation2000), which highlight the sensitivity of mating behaviors to shifts in the social and economic landscape, such as migration waves. These findings emphasize the dynamic nature of mate selection and its responsiveness to broader societal changes, that should be pursued as a future venue for research in the long-term trends of assortative mating.

Second, our study uncovers significant variations among different groups, shedding light on the complexities of assortative mating. We find that migrants’ mating behavior is more responsive to changes than natives, and this could be due also to the differences in group sizes. Additionally, we observe that women are particularly sensitive to the interplay between age and geographical origin, whereas men exhibit less pronounced patterns. These findings underscore the multidimensional nature of mate selection, where various dimensions interact and vary depending on individual characteristics. The observed differences between men and women in mating dynamics may be attributed to their distinct societal positions throughout history (Sassler, Citation2005).

Third, an intriguing finding of our study is the notable slowdown in processes when shifting the focus from the individual to the family level. Intergenerational reproduction of mating behavior proves to be remarkably persistent and consistent across time, highlighting the significance of carefully considering the unit of analysis in investigations of long-term social trends.

It should be acknowledged that in this paper we aimed to document long-term trends of assortative mating. The exploration of the underlying mechanisms and the association with historical events and developments are beyond our scope. We find indications that changes in marriage choices coincide with major migrant waves, that notably changed the American marriage market and society (Martin, Citation2021), but we are not able to further investigate these aspects; an interesting venue for future research would be the integration of online crowdsourced genealogical data with other sources that allow measuring historical processes and events.

Nevertheless, it is crucial to acknowledge the limitations of our work. The reliance on reconstructed individual and family background information introduces a strong selection bias to our sample. As demonstrated in Appendix C, data availability constraints result in sample restrictions. Individuals who provide detailed and complete information are likely a subset of the population that is positively selected based on various characteristics. For instance, missing information tends to be more prevalent in earlier periods and diminishes gradually over time. Those with comprehensive information on individual traits and family networks are more likely to belong to groups that exhibit positive selection based on factors such as socioeconomic background, geographic location, and educational attainment (Minardi et al., Citation2023).

Moreover, genealogies of ascendant types, like FamiLinx, may be susceptible to additional sources of bias (Hollingsworth, Citation1976). It should be noted that childless individuals are unlikely to be represented in the data, which can be relevant as childless couples may display selectivity in terms of individual and couple characteristics. These considerations emphasize the importance of carefully acknowledging potential biases when utilizing crowdsourced genealogical data (Chong et al., Citation2022; Minardi et al., Citation2023; Stelter & Alburez-Gutierrez, Citation2022).

Supplemental material

Supplemental Material

Download MS Word (975.1 KB)

Disclosure statement

No potential conflict of interest was reported by the author(s).

Supplementary material

Supplemental data for this article can be accessed online at https://doi.org/10.1080/1081602X.2024.2352539.

Additional information

Funding

This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No 865356)

Notes

1. Because of the limited sample size, we are not able to further distinguish geographical areas.

2. Among first-generation migrants, we exclude individuals whose first child was born in the country of origin. This selection allows us to indirectly include individuals in our sample who formed a couple in the United States.

3. We calculate estimates on a sample defined as the reference year from 1700 to 1900 plus and minus ten years – i.e. 1690 (1700) 1710, 1691 (1701) 1711, 1692 (1702) 1712 … 1890 (1900)1910. This allows us to avoid a strict separation between cohorts and maximize sample size.

References

  • Abramitzky, R., & Boustan, L. (2017). Immigration in American economic history. Journal of Economic Literature, 55(4), 1311–1345. https://doi.org/10.1257/jel.20151189
  • Angrist, J. (2002). How do sex ratios affect marriage and labor markets? Evidence from America’s second generation. The Quarterly Journal of Economics, 117(3), 997–1038. https://doi.org/10.1162/003355302760193940
  • Ariès, P. (1983). Les Classes d’Age dans les societes modernes occidentales. In Les Ages De La Vie. VIIe Colloque National De Demographie. Actes Du Colloque. Tome II (pp. 18–20). Presses Universitaires de France.
  • Atkinson, M. P., & Glass, B. L. (1985). Marital age heterogamy and homogamy, 1900 – 1980. Journal of Marriage and the Family, 47(3), 685–691. https://doi.org/10.2307/352269
  • Barban, N., Jansen, R., De Vlaming, R., Vaez, A., Mandemakers, J. J., Tropf, F. C., Wilson, J. F., Chasman, D. I., Nolte, I. M., Tragante, V., van der Laan, S. W., Perry, J. R. B., Kong, A., Ahluwalia, T. S., Albrecht, E., Yerges-Armstrong, L., Atzmon, G., Auro, K., Ayers, K., … Shen, X. (2016). Genome-wide analysis identifies 12 loci influencing human reproductive behavior. Nature Genetics, 48(12), 1462–1472. https://doi.org/10.1038/ng.3698
  • Blanc, G. (2021). Modernization before industrialization: Cultural roots and economic consequences of the demographic transition in France. Working Paper. Brown University.
  • Blanc, G. (2023). The Cultural Origins of the Demographic Transition in France. Working paper. https://www.guillaumeblanc.com/secularization.html
  • Blau, P. M., & Duncan, O. D. (1967). The American occupational structure. New York, London: Sidney, J. Wiley & Sons, Inc.
  • Blau, P. M., & Schwartz, J. E. (1984). Crosscutting social circles. Academic Press.
  • Breen, R., & Jonsson, J. O. (2005). Inequality of opportunity in comparative perspective: Recent research on educational attainment and social mobility. Annual Review of Sociology, 31(1), 223–243. https://doi.org/10.1146/annurev.soc.31.041304.122232
  • Chiswick, B. R., & Houseworth, C. (2011). Ethnic intermarriage among immigrants: Human capital and assortative mating. Review of Economics of the Household, 9(2), 149–180. https://doi.org/10.1007/s11150-010-9099-9
  • Chong, M., Alburez-Gutierrez, D., Del Fava, E., Alexander, M., & Zagheni, E. (2022). Identifying and correcting bias in big crowd-sourced online genealogies (No. WP-2022-005). Max Planck Institute for Demographic Research.
  • Conley, D., Laidley, T., Belsky, D. W., Fletcher, J. M., Boardman, J. D., & Domingue, B. W. (2016). Assortative mating and differential fertility by phenotype and genotype across the 20th century. Proceedings of the National Academy of Sciences, 113(24), 6647–6652. https://doi.org/10.1073/pnas.1523592113
  • Cozzani, M., Minardi, S., Corti, G., & Barban, N. (2023). Birth month and adult lifespan. Demographic Research, 49, 201–218. https://doi.org/10.4054/DemRes.2023.49.9
  • Danubio, M. E., & Pettener, D. (1997). Marital structure of the Italian community of Boston, Massachusetts, 1880–1920. Journal of Biosocial Science, 29(3), 257–269. https://doi.org/10.1017/S0021932097002575
  • Davis, K. (1941). Intermarriage in caste societies 1. American Anthropologist, 43(3), 376–395. https://doi.org/10.1525/aa.1941.43.3.02a00030
  • De Marco, W. M. (1980). Boston’s Italian enclave, 1880–1930. Studi Emigrazione, 17(59), 331–359.
  • Dribe, M., Eriksson, B., & Scalone, F. (2019). Migration, marriage and social mobility: Women in Sweden 1880–1900. Explorations in Economic History, 71, 93–111. https://doi.org/10.1016/j.eeh.2018.09.003
  • Dribe, M., & Lundh, C. (2009). Status homogamy in the preindustrial marriage market: Partner selection according to age, social origin, and place of birth in nineteenth-century rural Sweden. Journal of Family History, 34(4), 387–406. https://doi.org/10.1177/0363199009344708
  • Ekamper, P., Poppel, F. V., & Mandemakers, K. (2011). Widening horizons? The geography of the marriage market in nineteenth and early-twentieth century Netherlands. In Navigating time and space in population studies (pp. 115–160). Springer Science & Business Media.
  • Erikson, R., & Goldthorpe, J. H. (1993). The constant flux. Clarendon Press.
  • Fitch, C. A., & Ruggles, S. (2000). Historical trends in marriage formation. In L. J. Waite & C. Bachrach (Eds.), The ties that bind: Perspectives on marriage and cohabitation (pp. 59–88). Transaction Publishers.
  • Fogleman, A. (1992). Migrations to the thirteen British North American colonies, 1700–1775: New estimates. The Journal of Interdisciplinary History, 22(4), 691–709. https://doi.org/10.2307/205241
  • Foner, N., & Dreby, J. (2011). Relations between the generations in immigrant families. Annual Review of Sociology, 37(1), 545–564. https://doi.org/10.1146/annurev-soc-081309-150030
  • Goñi, M. (2022). Assortative matching at the top of the distribution: Evidence from the world’s most exclusive marriage market. American Economic Journal: Applied Economics, 14(3), 445–487. https://doi.org/10.1257/app.20180463
  • Greenwood, J., Guner, N., Kocharkov, G., & Santos, C. (2014). Marry your like: Assortative mating and income inequality. American Economic Review, 104(5), 348–353. https://doi.org/10.1257/aer.104.5.348
  • Guner, N., Kulikova, Y., & Llull, J. (2018). Reprint of: Marriage and health: Selection, protection, and assortative mating. European Economic Review, 109, 162–190. https://doi.org/10.1016/j.euroecorev.2018.06.002
  • Haines, M. R. (1996). Long-term marriage patterns in the United States from colonial times to the present. The History of the Family, 1(1), 15–39. https://doi.org/10.1016/S1081-602X(96)90018-4
  • Hajnal, J. (1953). Age at marriage and proportions marrying. Journal of Population Studies, 7(2), 111–136. https://doi.org/10.1080/00324728.1953.10415299
  • Hollingsworth, T. H. (1976, January). Genealogy and historical demography. In Annales de demographie historique (pp. 167–170). Editions Belin.
  • Hout, M. (1983). Analysing mobility tables. Sage.
  • Hsu, C. H., Posegga, O., Fischbach, K., Engelhardt, H., & Ortega, J. A. (2021). Examining the trade-offs between human fertility and longevity over three centuries using crowdsourced genealogy data. PLOS ONE, 16(8), e0255528. https://doi.org/10.1371/journal.pone.0255528
  • Kalmijn, M. (1993). Spouse selection among the children of European immigrants: A comparison of marriage cohorts in the 1960 census. International Migration Review, 27(1), 51–78. https://doi.org/10.1177/019791839302700103
  • Kalmijn, M. (1994). Assortative mating by cultural and economic occupational status. American Journal of Sociology, 100(2), 422–452. https://doi.org/10.1086/230542
  • Kalmijn, M. (1998). Intermarriage and homogamy: Causes, patterns, trends. Annual Review of Sociology, 24(1), 395–421. https://doi.org/10.1146/annurev.soc.24.1.395
  • Kaplanis, J., Gordon, A., Shor, T., Weissbrod, O., Geiger, D., Wahl, M., Gershovits, M., Markus, B., Sheikh, M., Gymrek, M., Bhatia, G., MacArthur, D. G., Price, A. L., & Erlich, Y. (2018). Quantitative analysis of population-scale family trees with millions of relatives. Science, 360(6385), 171–175. https://doi.org/10.1126/science.aam9309
  • Kennedy, R. J. R. (1944). Single or triple melting pot? Intermarriage trends in New Haven, 1870-1940. American Journal of Sociology, 49(4), 331–339.
  • Klesment, M., Jaadla, H., & Gortfelder, M. (2020). Educational heterogamy during the early phase of the educational expansion. Demographic Research, 43, 329–364. https://doi.org/10.4054/DemRes.2020.43.13
  • Kolk, M. (2015). Age differences in unions: Continuity and divergence among Swedish couples between 1932 and 2007. European Journal of Population, 31(4), 365–382. https://doi.org/10.1007/s10680-015-9339-z
  • Little, A. C., Burt, D. M., & Perrett, D. I. (2006). Assortative mating for perceived facial personality traits. Personality and Individual Differences, 40(5), 973–984. https://doi.org/10.1016/j.paid.2005.09.016
  • Maas, I., & Zijdeman, R. L. (2010). Beyond the local marriage market: The influence of modernization on geographical heterogamy. Demographic Research, 23, 933–962. https://doi.org/10.4054/DemRes.2010.23.33
  • Marcassa, S., Pouyet, J., & Trégouët, T. (2020). Marriage strategy among the European nobility. Explorations in Economic History, 75, 101303. https://doi.org/10.1016/j.eeh.2019.101303
  • Mare, R. D. (2011). A multigenerational view of inequality. Demography, 48(1), 1–23. https://doi.org/10.1007/s13524-011-0014-7
  • Martin, S. F. (2021). A nation of immigrants. Cambridge University Press.
  • Meng, X., & Gregory, R. G. (2005). Intermarriage and the economic assimilation of immigrants. Journal of Labor Economics, 23(1), 135–174. https://doi.org/10.1086/425436
  • Merton, R. K. (1941). Intermarriage and the social structure: Fact and theory. Psychiatry, 4(3), 361–374. https://doi.org/10.1080/00332747.1941.11022354
  • Minardi, S., Corti, G., & Barban, N. (2023). Historical patterns in the intergenerational transmission of lifespan and longevity: Evidence from the United States, 1700–1900. https://doi.org/10.31235/osf.io/gfxcm
  • Nitsche, N., Matysiak, A., Van Bavel, J., & Vignoli, D. (2018). Partners’ educational pairings and fertility across Europe. Demography, 55(4), 1195–1232. https://doi.org/10.1007/s13524-018-0681-8
  • Pagnini, D. L., & Morgan, S. P. (1990). Intermarriage and social distance among U.S. Immigrants at the turn of the century. American Journal of Sociology, 96(2), 405–432. https://doi.org/10.1086/229534
  • Potter, J. (1965). The growth of population in America, 1700-1860. In Population in history (pp. 631–688). Routledge.
  • Puschmann, P., Grönberg, P. O., Schumacher, R., & Matthijs, K. (2014). Access to marriage and reproduction among migrants in Antwerp and Stockholm. A longitudinal approach to processes of social inclusion and exclusion, 1846–1926. The History of the Family, 19(1), 29–52. https://doi.org/10.1080/1081602X.2013.796889
  • Robinson, M. R., Kleinman, A., Graff, M., Vinkhuyzen, A. A., Couper, D., Miller, M. B., Peyrot, W. J., Abdellaoui, A., Zietsch, B. P., Nolte, I. M., van Vliet-Ostaptchouk, J. V., Snieder, H., Alizadeh, B. Z., Boezen, H. M., Franke, L., Harst, P. V. D., Navis, G., Rots, M., Snieder, H., … Visscher, P. M. (2017). Genetic evidence of assortative mating in humans. Nature Human Behaviour, 1(1), 0016. https://doi.org/10.1038/s41562-016-0016
  • Sassler, S. (2005). Gender and ethnic differences in marital assimilation in the early twentieth century. International Migration Review, 39(3), 608–636. https://doi.org/10.1111/j.1747-7379.2005.tb00282.x
  • Schwartz, C. R. (2013). Trends and variation in assortative mating: Causes and consequences. Annual Review of Sociology, 39(1), 451–470. https://doi.org/10.1146/annurev-soc-071312-145544
  • Sharpe, P. (Ed.). (2002). Women, gender and labour migration: Historical and cultural perspectives (vol. 5). Routledge.
  • Shorter, E. (1975). The making of the modern family. New York: Basic Books.
  • Song, X., Campbell, C. D., & James, Z. L. (2015). Ancestry matters: Patrilineage growth and extinction. American Sociological Review, 80(3), 574–602. https://doi.org/10.1177/0003122415576516
  • Stelter, R., & Alburez-Gutierrez, D. (2022). Representativeness is crucial for inferring demographic processes from online genealogies: Evidence from lifespan dynamics. Proceedings of the National Academy of Sciences, 119(10). https://doi.org/10.1073/pnas.2120455119
  • Thiessen, D., & Gregg, B. (1980). Human assortative mating and genetic equilibrium: An evolutionary perspective. Ethology and Sociobiology, 1(2), 111–140.
  • Treiman, D. J. (1970). Industrialization and social stratification. Sociological Inquiry, 40(2), 207–234. https://doi.org/10.1111/j.1475-682X.1970.tb01009.x
  • Van Bavel, J. (2021). Partner choice and partner markets. In Research Handbook on the Sociology of the Family (pp. 219–231). Edward Elgar Publishing.
  • Van de Putte, B. (2003). Homogamy by geographical origin: Segregation in nineteenth-century Flemish cities (gent, Leuven, and Aalst). Journal of Family History, 28(3), 364–390. https://doi.org/10.1177/0363199003256015
  • Van de Putte, B., & Oris, M. (2005). The influence of migration on marriage partner selection according to social class: A methodological contribution. Onderzoeksverslag Gezin en Bevolking GB/2005-35.
  • Van de Putte, B., Van Poppel, F., Vanassche, S., Sanchez, M., Jidkova, S., Eeckhaut, M., Oris, M., & Matthijs, K. (2009). The rise of age homogamy in 19th century Western Europe. Journal of Marriage and Family, 71(5), 1234–1253. https://doi.org/10.1111/j.1741-3737.2009.00666.x
  • Van Leeuwen, M. H. D., Maas, I., & Miles, A. (Eds). (2005). Marriage choices and class boundaries: Social endogamy in history. Cambridge University Press.
  • Van Poppel, F., Liefbroer, A. C., Vermunt, J. K., & Smeenk, W. (2001). Love, necessity and opportunity: Changing patterns of marital age homogamy in the Netherlands, 1850–1993. Population Studies, 55(1), 1–13. https://doi.org/10.1080/00324720127681
  • Westendorp, R. G., & Kirkwood, T. B. (1998). Human longevity at the cost of reproductive success. Nature, 396(6713), 743–746.
  • Xie, Y. (1992). The log-multiplicative layer effect model for comparing mobility tables. American Sociological Review, 57(3), 380–395. https://doi.org/10.2307/2096242