989
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Quality of the development of self-report instruments assessing women’s antepartum expectations of motherhood: a systematic review

, , , , , & ORCID Icon show all
Pages 74-94 | Received 03 Feb 2022, Accepted 22 Jul 2022, Published online: 03 Nov 2022

ABSTRACT

Background

The selection of valid and reliable instruments for research and clinical purposes is needed to increase our understanding of the role that expectations of motherhood have in early identification and treatment of perinatal mental health difficulties. We aimed to identify and assess the psychometric properties of self-report measures of expectations of motherhood.

Method

A systematic review was conducted and articles describing the development or validation of quantitative self-report measures of expectations of motherhood were included. The developmental quality (risk of bias) and psychometric properties of each measure were assessed according to the 2018 COSMIN guidelines.

Results

Twenty-five articles describing 16 measures were identified and included. Eleven measures were designed to assess a broad range of expectations, while the remaining measures mainly assessed expectations of childbirth. The quality of measure development was mostly poor and rated as adequate or better for only three measures. The most suitable measures with the strongest evidence of good psychometric properties are the Slade-Pais Expectations About Childbirth Scale and the Rigidity of Maternal Beliefs Scale.

Conclusions

This systematic review of measures of expectations has identified relatively few measures with adequate psychometric properties. Further development of existing measures is needed for use in both research and clinical settings.

KEY POINTS

What is already known about this topic:

  1. Maternal mental health difficulties can have significant and long-term adverse consequences on mothers, their children, and families.

  2. Previous evidence suggests that holding unrealistic maternal expectations may contribute to postpartum mental health difficulties.

  3. Early identification of the potentially modifiable risk factor of maternal expectations may contribute to improving postpartum mental health outcomes.

What this topic adds:

  1. This is the first systematic review to assess the developmental quality and psychometric properties of 16 measures of expectations of motherhood.

  2. The Slade-Pais Expectations About Childbirth Scale had the strongest evidence for good psychometric properties.

  3. For the majority of the measures, adequate methods for ensuring good content validity during measure development and validation were not used.

The perinatal period, from pregnancy through to the early postpartum, is a time of psychological and physical transition that is often accompanied by elevated levels of stress and increased risk of mental health problems (Austin & Highet, Citation2017). There is a high estimated prevalence of self-reported anxiety symptoms during pregnancy (23%) and early postpartum (17.8%; Dennis et al., Citation2017), while the prevalence of depressive disorders appeared to be stable across the perinatal period (12%; Woody et al., Citation2017). Symptoms of perinatal depression and anxiety are also often comorbid with other psychological disorders (e.g., post-traumatic stress disorder, eating disorders, personality disorders) (Howard et al., Citation2014). The effects of maternal mental health difficulties during the perinatal period are of particular concern, as there are serious adverse consequences for women, their children, and families (Dowse et al., Citation2020; Wachs et al., Citation2009). For example, in a systematic review, Stein et al. (Citation2014) reported significant associations between maternal perinatal depression and anxiety with their child’s emotional, behavioural, cognitive, and developmental difficulties.

During the perinatal period, women often have increased contact with a range of health professionals, providing an opportunity for early identification and intervention with women at risk of developing or presenting with perinatal mental health difficulties. A history of psychopathology and psychosocial difficulties are predictors of perinatal mental health difficulties (Howard et al., Citation2014). For instance, a previous diagnosis of a depressive disorder increases a woman’s risk for developing postpartum depression (PPD) more than 20 times (Silverman et al., Citation2017) and a history of either depressive or anxiety disorders prior to pregnancy are robust risk factors for perinatal anxiety (Martini et al., Citation2015). Current methods used to identify women at increased risk of developing postpartum symptoms of depression and anxiety are brief screening instruments aimed at identifying the presence of symptoms during pregnancy (Austin & Highet, Citation2017). However, antepartum screening is limited in its predictive power, as many significant risk factors occur during delivery and in the postpartum period (Austin & Lumley, Citation2003). For example, Wisner et al. (Citation2013) suggested that 40% of depressive episodes started in the postpartum compared to 33% during pregnancy and 26% before pregnancy. In addition, Heron et al. (Citation2004) reported that in a large community sample 44% of women with symptoms of PPD and 35% of women with symptoms of postpartum anxiety did not exhibit depression or anxiety in the antepartum. Thus, the identification of future risk factors occurring onwards from late pregnancy is needed. Researchers have identified several future pregnancy-related risk factors including gestational diabetes, preterm birth (Silverman et al., Citation2017), problems with breastfeeding, lack of social support, relationship problems (Robertson et al., Citation2004), and mode of delivery (Martini et al., Citation2015).

An area of research where attempts have been made to identify other potentially modifiable risk factors related to mental health focuses on women’s expectations of what the future holds across a broad range of domains. Results from two qualitative meta-syntheses of women’s experiences with perinatal depression identified the incongruity between expectations and the reality of motherhood (Beck, Citation2002) and disconfirmed maternal role expectations (Mollard, Citation2014) as overarching themes. Additionally, in the qualitative literature, researchers have consistently identified that the discrepancy between expectations and experiences of motherhood is associated with difficulties with PPD (O’Mahen et al., Citation2012), anxiety (Harrison et al., Citation2020), perceptions of birth experience (Hauck et al., Citation2007), and adjustment to motherhood (Choi et al., Citation2005; Read et al., Citation2012; Young, Citation2008).

Several avenues of research have developed measures to quantify the construct of expectations in a perinatal context by assessing areas such as maternal role fulfilment (e.g., Henshaw et al., Citation2014), parenthood (Harwood et al., Citation2007), postpartum support (e.g., Dennis, & Ross, Citation2006), the infant (Muscat et al., Citation2014), societal influence (e.g., Thomason et al., Citation2015), breastfeeding (Gregory et al., Citation2015) and obstetric events (e.g., Houston et al., Citation2015). However, results are often inconsistent. For instance, some studies have found that most women have positive expectations which align with their postpartum experiences (e.g., Robakis et al., Citation2015), while others have found significant associations between expectations and poor postpartum emotional adjustment (e.g., Henshaw et al., Citation2014). These inconsistencies may be explained by the quality of existing measures of expectations, as an appropriate interpretation of results depends on the degree to which measures accurately assess the construct of interest (Vogt et al., Citation2004).

Upon initial review of the literature, several overarching problems with measurement instruments were observed. First, there is an absence of a gold standard measure of expectations and many of the instruments used are novel (i.e. lack of validation studies) and study specific (i.e. assess a narrow range of domains). Second, the construct of perinatal expectations is often defined inconsistently across studies and used interchangeably with other concepts (e.g., beliefs and attitudes). A precise definition of the construct of interest and its function within a theoretical framework is necessary to ensure that a measurement instrument is relevant and representative of the target construct (Vogt et al., Citation2004). Third, inconsistent methods in instrument development coupled with limited reporting of procedures provide inadequate information about the validity and reliability of many instruments. Thus, the limitations of existing instruments of expectations may prevent appropriate interpretation of results and impede the development of a theoretical framework encompassing expectations of motherhood. To date, no reviews of instruments measuring expectations of motherhood have been identified. To provide researchers and clinicians with a comprehensive, systematic, and standardised review of instruments, the evidence-based COnsensus-based Standards for the selection of health Measurement INstruments guidelines (COSMIN; Mokkink et al., Citation2018; Prinsen et al., Citation2018) were selected to evaluate both the psychometric properties and the developmental quality of the instruments.

The aim of the current systematic review is to provide researchers and clinicians with a critical systematic analysis of available quantitative instruments intended to measure antepartum expectations of motherhood. The selection of valid and reliable instruments for research and clinical purposes has the potential to increase our understanding of the role that expectations of motherhood have in early identification and treatment of perinatal mental health difficulties. Specifically, the aims of this review are to 1) identify measures of women’s antepartum expectations of motherhood; 2) evaluate the quality of development studies, 3) evaluate the psychometric properties of instruments and 4) offer recommendations for use and further development of instruments.

Methods

The general methodology for this systematic review was conducted in accordance with the Preferred Reporting of Items for Systematic Reviews and Meta-Analyses (PRISMA; Moher et al., Citation2009) guidelines. The search strategy was guided by recommendations from the Joanna Briggs Institute (Aromataris & Munn, Citation2017). Data extraction, quality (risk of bias) assessment, summary of results, and recommendations were conducted according to the COSMIN guidelines (Mokkink et al., Citation2018; Prinsen et al., Citation2018). The protocol for this review was registered with the PROSPERO International prospective register of systematic reviews on 26 May 2016 (Registration number: CRD42016039534).

Eligibility criteria

To ensure that the predefined eligibility criteria for this review would allow for appropriate differentiation between expectations and other constructs, it was necessary to define the common terms which are often used interchangeably in this area of research. For the purposes of this review, expectations are conceptualised as mental representations of specific future outcomes and may be about an individual’s internal responses or external events (Atlas & Wager, Citation2013). Beliefs are defined as ideas held to be true by an individual about their world and themselves and are a result of complex cognitive processes (for detailed review see Connors & Halligan, Citation2014). Belief systems provide individuals with a framework for interpreting and processing information and generating expectations (Connors & Halligan, Citation2014). Attitudes can be defined as an evaluation of an object or concept based on an individual’s beliefs, feelings, and experiences with the attitude object (for detailed review see Maio & Haddock, Citation2004). Taken together, outcome expectations related to motherhood are based in belief systems, may be influenced by attitudes towards specific future outcomes, and are differentiated from these concepts as they are specifically future-oriented estimations about the likelihood of an event occurring.

Inclusion and exclusion criteria

Inclusion criteria were (1) studies describing the development or validation of quantitative self-report measures of antepartum expectations of motherhood, (2) in English, (3) peer-reviewed primary research studies where full text was available, (4) systematic reviews and meta-analyses of relevant measures and (5) theses describing the development of published measures. Exclusion criteria were (1) articles citing measures used to assess constructs other than expectations, (2) non-research articles (i.e. literature reviews, editorials, commentaries) and (3) qualitative instruments (i.e., structured interview).

Search strategy

The Cochrane library and PROSPERO register were searched to ensure that no other systematic reviews have been published on this topic. The final search was conducted on 22 February 2021. To identify articles related to expectations of motherhood a 3-step search strategy was used recommended by Aromataris and Munn (Citation2017). First, an initial limited search of PsycINFO and EMBASE was conducted to analyse the keywords contained in the titles and abstracts of the index terms used to describe target articles. A logic grid was created to guide the search strategy including key words for the (a) domain being studied (e.g., expectations), (b) population of interest (e.g., mothers), (c) timing (e.g., antepartum), and (d) type of instrument (e.g., scale). Second, a search of databases using the identified keywords and index terms was conducted (see Appendix A for a sample search). The following electronic databases were searched: PsycINFO, PsycTESTS, Mental Measurements Yearbook and Tests in Print, EMBASE, Maternity & Infant Care Database (MIDIRS), PubMed, Scopus, and CINAHL Plus. Third, the reference lists of identified systematic reviews and a hand search of one journal devoted to the population of interest (Archives of Women’s Mental Health) were reviewed for additional studies. To identify development and validation studies that may have been missed in the original search, an additional step was added. This final step included a database search of PsycINFO and Google Scholar for each identified self-report measure and a review of reference list of all included articles.

Study selection

EndNote X9 reference management software was used to review search results. After the database searches, duplicate references were removed, and the remaining titles and abstracts were screened for relevance. Next, a full-text review was conducted of the remaining articles and of the identified self-report measures. Studies that did not meet eligibility criteria were excluded with reasons. The screening and eligibility review were conducted by the first author (IB). A random 10% sample of full-text articles was assessed for inclusion by an independent researcher and the inter-rater reliability analysis using Cohen’s Kappa statistic was conducted (κ = .83) and interpreted as “almost perfect” agreement (Landis & Koch, Citation1977). If the methods of the development study of a measure referred to another article or thesis, where possible this was obtained and included in the analysis. Study authors were contacted to obtain any further relevant information about the measures. Data collection was conducted independently by IB, RR, LB, RK, and KC using standardised forms.

Evaluation of measurement properties of included measures

This review used the COSMIN definitions of measurement properties (see ) (Prinsen et al., Citation2018). The procedure for evaluating the measurement properties of included instruments consists of three steps: Step 1) Quality assessment of studies using the COSMIN Risk of Bias (RoB) checklist (Mokkink et al., Citation2018); Step 2) Rating of study results against the criteria for good measurement properties (Prinsen et al., Citation2018); and Step 3) Summary of pooled evidence was rated against the criteria for good measurement properties (Prinsen et al., Citation2018) and the quality of the evidence was graded using the modified Grading of Recommendations, Assessment, Development, and Evaluation (GRADE) approach (Mokkink et al., Citation2018). The 3-step procedure was applied to the evaluation of each of the following measurement properties: 1) content validity, 2) structural validity, 3) internal consistency, 4) reliability, and 5) hypothesis testing (for construct validity). Criterion validity was not evaluated as a “gold standard” instrument in this area of research is not available. None of the included studies reported measurement error, cross-cultural validity, or responsiveness, and these measurement properties were excluded from the analysis. A detailed description of each step is provided below.

Table 1. COSMIN Definitions of measurement properties.

Step 1: Quality assessment of studies

The COSMIN RoB checklist (Mokkink et al., Citation2018) was used to assess the methodological quality of each single study on a measurement property (see Appendix B for order of boxes). The purpose of assessing the quality of studies is to determine whether results based on the study are trustworthy (Mokkink et al., Citation2018). The quality of a study on each measurement property was rated on a “worst score counts” procedure and yielded overall ratings of “very good”, “adequate”, “doubtful”, or “inadequate” quality. As per COSMIN guidelines, if the content validity or the internal structure of a study was evaluated as “inadequate”, that instrument was excluded from further evaluation at that point (Prinsen et al., Citation2018).

Step 2: Assessment of study results

The results of each single study on a measurement property (see ) were evaluated against the updated criteria for good measurement properties (Prinsen et al., Citation2018) and each result was rated as “sufficient” [+], “insufficient” [-], “indeterminate” [?], or “inconsistent” [±]. Content validity is considered to be the most important measurement property of self-report instruments (Mokkink et al., Citation2018). The content of an instrument should reflect the experiences of members of the target population and as such their involvement is essential in both content development and validity studies (Terwee et al., Citation2018). An evaluation of content validity requires a subjective judgement by the reviewers based on a summary of multiple available evidence including: (1) the quality of the instrument development study, (2) the quality of additional studies on content validity, (3) reviewers’ subjective ratings of content, and (4) an assessment of the quality of available evidence using the modified GRADE approach (Mokkink et al., Citation2018).

Table 2. Updated criteria for good measurement properties.

An evaluation of internal structure refers to an assessment of item-level relationships and their quality, including an assessment of structural validity (unidimensionality) and internal consistency (Prinsen et al., Citation2018). Structural validity is usually assessed by factor or IRT/Rasch analysis and is considered to be a prerequisite for the interpretation of an internal consistency statistic (i.e. Cronbach’s alpha). The criteria for structural validity were amended to include an evaluation of Exploratory Factor Analysis (EFA) as proposed in a similar recent systematic review by Wittkowski et al. (Citation2020). Thus, EFA was rated as “sufficient” if ≥50% of the variance was explained by the proposed factor structure.

An evaluation of the remaining reported measurement properties (reliability and hypothesis testing) refers to an assessment of the quality of a scale or sub-scale (Prinsen et al., Citation2018). Although the preferred statistics for reporting reliability are the intraclass correlation (ICC) or weighted Kappa (Streiner & Norman, Citation2008), in practice other correlation coefficients (e.g., Pearson) are commonly reported. Thus, the criteria for reliability were amended to include an evaluation of Pearson or Spearman correlation coefficients and rated as “sufficient” if coefficient is ≥0.70 (Wittkowski et al., Citation2020).

Step 3: Summary of evidence and recommendations

The ratings of all available evidence from steps 1 and 2 were qualitatively summarised or quantitatively pooled and the results on each measurement property were rated against the criteria for good measurement properties (Prinsen et al., Citation2018). The quality of the evidence was graded using the modified GRADE approach (Mokkink et al., Citation2018). The GRADE ratings are presented in and further information about this approach can be found in the COSMIN manuals (Mokkink et al., Citation2018; Prinsen et al., Citation2018; Terwee et al., Citation2018). The summarised results yield an overall rating of “sufficient” [+], “insufficient” [-], “inconsistent” [±], or “indeterminate” [?]. Recommendations are based on guidance by COSMIN () (Prinsen et al., Citation2018).

Table 3. COSMIN Definitions of quality levels using the modified GRADE approach.

Table 4. Recommendations for future use and development of instruments per category.

Note. Adapted from Prinsen et al. (Citation2018).

Results

Systematic literature search and study identification

shows a flow-chart of the study selection process and a list of exclusion reasons. The database search identified 4,342 records and another 278 were identified by a search of relevant systematic review references and one journal devoted to women’s mental health. After duplicates were removed, 3,361 articles remained which were screened for relevance based on the title and abstract. The full texts of the remaining 240 articles were assessed for eligibility and 217 studies were excluded with reasons. At this stage, 23 articles describing 14 instruments were identified and included. Next, a database search of each instrument and a reference check was conducted identifying an additional 214 studies. After duplicates were removed, the remaining 141 articles were screened for relevance. The full texts of the remaining 35 articles were assessed for eligibility and 33 were excluded with reasons. In total, 25 articles describing the development or validation of 16 instruments were identified and included (see reference list for included articles).

Figure 1. PRISMA flow diagram showing study selection process and exclusion criteria.

Figure 1. PRISMA flow diagram showing study selection process and exclusion criteria.

Characteristics of self-report instruments

The characteristics of the instruments are presented in . Twenty-five articles were identified, describing the development and/or validation of 16 self-report measures of expectations of motherhood. Eleven instruments were designed to address general expectations of motherhood: Maternal Expectations Scale (MES; Flynn et al., Citation2010; Henshaw et al., Citation2014; O’Mahen et al., Citation2012), Attitudes Towards Motherhood Scale (AToM; Sockol et al., Citation2014; Sockol & Battle, Citation2015), Childbearing Attitudes Questionnaire (CAQ; Ruble et al., Citation1990), Rigidity of Maternal Beliefs Scale (RMBS; O’Mahen et al., Citation2013; Thomason et al., Citation2015), Prenatal Experiences for Mexican Americans (PESMA; Gress-Smith et al., Citation2013), Parenting Expectations Measure (PEM; Harwood et al., Citation2007; Harwood, Citation2004), Prenatal Maternal Expectations Scale (PMES; Coleman et al., Citation1999), Maternal Attitudes Questionnaire (MAQ; Warner et al., Citation1997), Mother’s Expectations of Parenthood Scale (MEP; Lazarus & Rossouw, Citation2015), Pregnancy Related Beliefs Questionnaire (PRBQ; Leach et al., Citation2018; Moorhead et al., Citation2003), and Measure of Maternal Optimism (MMO; Robakis et al., Citation2015). Four instruments were designed to measure the domain of expectations of childbirth: Slade-Pais Expectations About Childbirth Scale (SPECS; Slade et al., Citation2016), Expectations About Childbirth Scale (EC1; DeLuca, Citation1999; DeLuca & Lobel, Citation2014), Childbirth Expectations Questionnaire (CEQ; Gupton et al., Citation1991), and Expectations of the Childbirth Experience (ECBE; Oweis, & Abushaikha, Citation2004). One instrument was designed to measure infant expectations: Prenatal Expectations (compared to postnatal experience) Questionnaire (PEQ; Evans et al., Citation2012).

Table 5. Characteristics of included self-report instruments.

Characteristics of samples in development studies

The sample characteristics representing the target population are presented in Appendix E. A substantial lack of consistency of reporting demographic characteristics between studies was observed. One measure was designed for a specific target population: Mexican American women of low income (<25,000 USD) and was excluded from the following summary. The parity of the sample was reported for 15 measures with six measures developed in a primiparous sample and nine measures in a combination of primiparous and multiparous participants. Mean maternal age was reported for 15 measures and ranged from 25 to 32.5 years of age. Ten measures were developed in a sample of pregnant women, three measures in a postpartum sample, and one measure was developed in a sample of pre-pregnant, pregnant, and postpartum women. The mean gestation age was reported in nine studies and ranged from 23 to 34 weeks of pregnancy. The ethnicity of the sample was reported for 12 measures with 10 measures developed in a predominantly Caucasian/White sample (range of 78–100%), while the remaining two measures reported the nationality of the sample. Relationship status was reported for 15 measures and the majority of the women were partnered (78–100%). Some information about educational attainment of the sample was reported for 12 measures. For 10 studies, a range of 0 to 22% of participants did not attain a high school education, while 13 studies reported a range of 21 to 70% of participants with a college/university education, and 10 studies reported a range of 23 to 58% of participants with a postgraduate degree. Information regarding income was reported for eight measures, with the majority of participants described as “middle class”.

Results of step 1: Quality assessment of studies

Quality of instrument development

Instrument design

For three studies concept elicitation was based on a widely recognised or well justified qualitative method in a sample of the target population and provided sufficient information about the methods and data analysis (MES, CEQ, SPECS). Another five reported using some form of qualitative methods, but did not provide sufficient information (PMES, PRBQ, AToM, CAQ, MEP). Three studies incorporated open-ended questions regarding relevance, comprehensiveness, and comprehensibility as part of a quantitative survey (AToM, PEM, MAQ). For two studies, concept elicitation was based on quantitative methods only (RMBS, PESMA). Four studies did not report any involvement from the target population (qualitative or quantitative) during concept elicitation (MMO, PEQ, EC1, ECBE). Out of the 12 studies that involved members of the target population in the concept elicitation stage, four provided sufficient information (SPECS, CEQ, MES, PESMA), while eight provided limited, unclear, or no information regarding data collection and analysis procedures which contributed to reduced ratings (PEM, PMES, RMBS, AToM, PRBQ, CAQ, MAQ, MEP).

Pilot testing

An attempt to assess the comprehensibility and/or comprehensiveness of the instrument during the pilot test phase was reported in six studies (SPECS, CEQ, AToM, PEM, MAQ, CAQ). The CEQ used brief interviews with the initial ten participants, the SPECS compared outcome of empirical procedures to initial interviews with target population, and the CAQ used follow-up interviews with a small pilot, which resulted in very good ratings. The AToM, PEM, and MAQ used open-ended questions incorporated into the survey which resulted in an adequate rating. The authors of one study (PRBQ) explicitly reported that participants were not asked about the comprehensibility of the instrument which resulted in an inadequate rating. A pilot test was conducted for the remaining nine studies, and none reported whether participants were asked about the comprehensibility and/or comprehensiveness of the draft instrument which resulted in doubtful ratings (PMES, MES, RMBS, MMO, PESMA, PEQ, EC1, MEP, ECBE).

Summary

Detailed ratings of the quality of studies on instrument development are presented in Appendix C and overall quality of development is provided in . The total quality of instrument development studies was assessed as very good for two (SPECS, CEQ), adequate for one (CAQ), doubtful for eight (PEM, PMES, MES, RMBS, PESMA, AToM, MAQ, MEP), and inadequate for five (MMO, PEQ, PRBQ, EC1, ECBE). The studies rated as having inadequate quality were excluded from subsequent data extraction. No qualitative content validity studies were identified by this review.

Table 6. Synthesis of quality of studies on a measurement property, pooled ratings of results, and the quality of evidence (GRADE).

Quality of internal structure (). Of the remaining 11 development studies, the quality of studies on structural validity were assessed as very good for four (PESMA, AToM, CAQ, CEQ), adequate for three (MES, RMBS, SPECS), and inadequate for four (PEM, PMES, MAQ, MEP). The quality of studies on internal consistency were assessed as very good for seven (MES, RMBS, PESMA, AToM, CAQ, CEQ, SPECS) and inadequate for the four measures that did not demonstrate sufficient structural validity (PEM, PMES, MAQ, MEP). As evidence of structural validity is considered a prerequisite for the interpretation of an internal consistency statistic, these studies were excluded from further analyses. No studies described cross cultural validity.

Quality of remaining measurement properties ()

The quality of studies on reliability (test-retest) was assessed as adequate for one (CEQ) and inadequate for one (RMBS). The quality of studies on hypothesis testing for construct validity was assessed as very good for five studies (MES, RMBS, PESMA, AToM, SPECS). No studies reported measurement error, cross-cultural validity, responsiveness, or criterion validity.

Results of step 2: Assessment of study results

The results of each single study on a measurement property were summarised and overall ratings presented in . Content validity was rated as sufficient for four instruments (SPECS, CEQ, RMBS, PEM) and inconsistent for seven (CAQ, AToM, PESMA, MES, PMES, MAQ, MEP). Structural validity and internal consistency were rated as sufficient for seven instruments (SPECS, CEQ, RMBS, CAQ, AToM, PESMA, MES). Four instruments were rated as having insufficient structural validity which resulted in an indeterminate rating of internal consistency (PEM, PMES, MAQ, MEP). Two studies reported reliability (test-retest) of instruments: one was rated as sufficient (CEQ) and one insufficient (RMBS). All three studies that reported hypothesis testing for construct validity were rated as sufficient (SPECS, RMBS, AToM).

Results of step 3: Overall summary of the evidence and recommendations

A synthesis of the evidence per instrument is provided in . Results and recommendations are presented according to the intended measurement domain, and additional detailed information is provided for instrument characteristics in and target population characteristics in Appendix D. Instruments were categorised into three categories of recommendations and suggestions for ongoing development of these instruments are provided in . A narrative overview of development procedures and recommendations is provided per instrument in each recommendation category in the next section.

Table 7. Recommendations per instrument.

Category A: Recommended for research

There are three instruments (SPECS, RMBS, CEQ) in category A with evidence of any level for sufficient content validity and internal consistency that can be recommended for use in research and results obtained with these instruments can be trusted.

The Slade-Pais Expectations About Childbirth Scale (SPECS; Slade et al., Citation2016) was developed to assess pregnant women’s expectations of childbirth. Concept elicitation and item selection were based on a literature review, semi-structured interviews with members of the target population (n = 18), and an expert review. Comprehensibility was assessed in a small sample (n = 5) of the target population and comprehensiveness was reviewed by comparing the domains of the final measure with the initial qualitative interviews. The quality of measure development was assessed as very good and further development is recommended to establish clinical utility (see Slade et al. (Citation2016) for suggestions).

The Childbirth Expectations Questionnaire (Beaton & Gupton, Citation1990; Bramadat, Citation1990; Gupton et al., Citation1991) was developed to assess childbirth expectations. Concept elicitation and item selection were based on a literature review and semi-structured interviews with members of the target population (n = 11). Draft 1 of the CEQ was administered antepartum to a sample of the target population (n = 202) and included open-ended questions regarding relevance, comprehensiveness, and comprehensibility. Ten participants were briefly interviewed regarding comprehensibility and comprehensiveness of the scale. The quality of measure development was assessed as very good. As the CEQ was developed 30 years ago, thus an updated assessment of content validity is suggested.

The Rigidity of Maternal Beliefs Scale (RMBS; O’Mahen et al., Citation2012, Citation2013; Thomason et al., Citation2015) was developed to assess the rigidity of maternal parenting-specific beliefs. Concept elicitation was based on an expert consensus approach (i.e., expert researchers and clinicians), a literature review, a review of a qualitative study (O’Mahen et al., Citation2012), and a small pilot test (n = 39) in an ongoing intervention study (O’Mahen et al., Citation2013). No details were provided describing the pilot test study design, data collection, or analysis procedures, and it was not reported whether either the ante- or postpartum administrations of the draft were used to inform comprehensibility and comprehensiveness. Content validity was rated as sufficient; however, the quality of measure development was assessed as doubtful, thus a content validity assessment is suggested.

Category B: Potential to be recommended for research

Four instruments in category B have the potential to be recommended for use with some further development required (CAQ, AToM, MES, PESMA). Category B instruments have inconsistent content validity ratings and require additional qualitative assessment of comprehensibility and comprehensiveness or a content validity study to increase confidence in this measurement property.

The Childbearing Attitudes Questionnaire (CAQ; Ruble et al., Citation1990) was developed to assess attitudes towards pregnancy, birth, self-esteem, and interpersonal relationships. Concept elicitation and item selection were based on previous research, a small pilot and discussions with pregnant women, and interviews with pregnant and postpartum women. No details were provided describing qualitative data collection and analysis procedures. Comprehensibility and comprehensiveness were assessed with follow-up interviews (n = not reported). The quality of measure development was assessed as adequate. Internal structure was rated as sufficient; however, content validity was rated as inconsistent as the measure reflected the experiences of participants from 30 years ago which may differ significantly from experiences of the target population today. Further required development includes an updated qualitative content validity assessment.

The Attitudes Towards Motherhood Scale (AToM; Sockol et al., Citation2014) was developed to assess first-time mothers’ (perinatal) attitudes about motherhood. Concept elicitation and item selection were based on previous research, modification of items from scales of general cognitive bias and from an expectations measure (PEM; Harwood et al., Citation2007), a manual for cognitive-behavioural therapy for postpartum depression, interviews with pregnant women (details of study not cited), an expert review, and a pilot test (n = not reported) with women who provided qualitative feedback. No details were provided describing qualitative data collection and analysis procedures, and it was not reported whether the pilot was used to inform comprehensibility and comprehensiveness. The quality of measure development was assessed as doubtful. Two follow-up studies were conducted (Sockol et al., Citation2014; Sockol & Battle, Citation2015) which were not considered to be content validity studies as it was not reported if participants were asked about comprehensiveness or comprehensibility. Internal structure was rated as sufficient; however, content validity was rated as inconsistent as aspects of maternal attitudes appear to be missing. Further required development includes a qualitative content validity assessment to verify comprehensiveness.

The Maternal Expectations Scale (MES; Flynn et al., Citation2010; Henshaw et al., Citation2014; O’Mahen et al., Citation2012) was developed to assess the rigidity of maternal expectations about common cultural beliefs regarding motherhood in the immediate postpartum. Concept elicitation and item selection were based on a literature review, clinical experience, an expert review, and qualitative work (Flynn et al., Citation2010; O’Mahen et al., Citation2012). It was not reported whether the pilot test was used to inform comprehensibility and comprehensiveness. The quality of measure development was assessed as doubtful. Internal structure was rated as sufficient; however, content validity was rated as inconsistent as some aspects of maternal expectations appear to be missing. Further required development includes a qualitative content validity assessment to verify comprehensiveness.

The Prenatal Experiences for Mexican Americans (PESMA; Gress-Smith et al., Citation2013) was developed to assess specific cultural aspects of prenatal expectations. Concept elicitation and item selection were based on previous research (Coleman et al., Citation1999), theory (not cited), prior empirical research (not cited), and preliminary statistical analyses (EFA) which contributed to significant modification of the scale. It was unclear whether the pilot test was used to inform comprehensibility and comprehensiveness. The quality of measure development was assessed as doubtful. Internal structure was rated as sufficient; however, content validity was rated as inconsistent based on a lack of qualitative input from the target population. Further required development includes a content validity study in a sample of the target population, followed by a structural validity study.

Category C: Not recommended for use

The remaining nine instruments in category C have evidence of an insufficient measurement property and are not recommended for use at this time. Four instruments (PEM, PMES, MAQ, MEP) have insufficient internal structure as factor analysis (FA) was not performed. Five instruments (MMO, EC1, ECBE, PRBQ, PEQ) were inadequately developed as target population involvement was not reported. Substantial further development is required before drawing conclusions for research purposes for these measures.

The Parenting Expectations Measure (PEM; Harwood et al., Citation2007; Harwood, Citation2004) was developed to assess first time mothers’ parenting expectations during pregnancy and the early postpartum period. Concept elicitation and item identification were based on a literature review and a quantitative survey of a pool of items that were adapted from previous measures and written for the study. Relevance and comprehensiveness of items were assessed in a sample of the target population (n = 87) using survey methods that included open-ended questions. Limited information was provided about qualitative data analysis and the quality of measure development was assessed as doubtful. A follow-up study was conducted (Harwood et al., Citation2007) which was not considered to be a content validity study as it was not reported whether participants were asked about comprehensiveness or comprehensibility. FA was not performed and the study on structural validity was rated as insufficient which excluded the PEM from further assessment. Further required development includes a structural validity study.

The Prenatal Maternal Expectations Scale (PMES; Coleman et al., Citation1999) was developed to assess first time mothers’ expectations of the maternal role and their infant in the early postpartum period. Concept elicitation and item identification were based on a literature review, interviews, and an expert review. However, no details were provided describing the qualitative data collection and analysis procedures. It was not reported whether the pilot test was used to inform comprehensibility and comprehensiveness. The quality of measure development was assessed as doubtful. Content validity was rated as inconsistent as the measure reflected the experiences of participants from 21 years ago which may differ significantly from experiences of the target population today. FA was not performed and the study on structural validity was rated as insufficient which excluded the PMES from further assessment. Further required development includes an updated content validity assessment, followed by a structural validity study.

The Maternal Attitudes Questionnaire (MAQ; Warner et al., Citation1997) was developed to assess expectations and attitudes towards motherhood in relation to postnatal depressive illness in the postpartum period. Concept elicitation was based on a literature review, clinical experience, and items were written for the study or identified from an existing expectations measure (Kumar et al., Citation1984). A survey pilot test (n = 50) with open-ended questions was used to assess relevance and comprehensiveness, although it was not reported whether results informed comprehensibility. No details were provided describing the qualitative data collection and analysis procedure. The quality of measure development was assessed as doubtful. Content validity was rated as inconsistent as the measure reflected the experiences of participants from 23 years ago which may differ significantly from experiences of the target population today. FA was not performed and the study on structural validity was rated as insufficient which excluded the MAQ from further assessment. Further required development includes an updated content validity assessment, followed by a structural validity study.

Mother’s Expectations of Parenthood Scale (MEP; Lazarus & Rossouw, Citation2015) was developed to assess mothers’ expectations (antepartum) and experiences (postpartum) about her infant, support, and herself. Concept elicitation was based on a literature review and items were identified from an existing parenting expectations measure (Harwood et al., Citation2007), an infant characteristics measure (Bates et al., Citation1979), and written specifically for this study based on personal experience and feedback from a panel of mothers. No details were provided describing the qualitative data collection and analysis procedure and it was not reported whether the pilot was used to inform comprehensibility and comprehensiveness. The quality of measure development was assessed as doubtful. Content validity was rated as inconsistent as some aspects of mother’s expectations appear to be missing. FA was not performed and the study on structural validity was rated as insufficient which excluded the MEP from further assessment. Further required development includes an updated content validity study, followed by a structural validity study.

The remaining five measures in category C were excluded from assessment based on an inadequate quality of the development study. According to COSMIN standards, evidence of target population involvement in concept elicitation is required to ensure good content validity. However, no details were provided describing target population involvement for the MMO, PEQ, EC1, PRBQ, and ECBE. For these measures, substantial further development is required before drawing conclusions for research purposes including a content validity study in a sample of the target population, followed by a structural validity study.

The Measure of Maternal Optimism (MMO; Robakis et al., Citation2015) was developed to assess expectations about experiences related to motherhood. Concept elicitation was based on discussions with a group of clinical providers in women’s mental health (i.e., obstetricians, psychiatrists) and clinical experience. The Prenatal Expectations (compared to postnatal experience) Questionnaire (PEQ; Evans et al., Citation2012) was developed to assess the difference between women’s antepartum expectations of the experience with their baby to their actual experience, when the delivery was preterm. The general design requirements of the PEQ were assessed as inadequate as a limited description of the construct was provided. The Expectations About Childbirth Scale (EC1; DeLuca, Citation1999; DeLuca & Lobel, Citation2014) was developed to assess expectations about the level of general control that women will have during labour and delivery. Concept elicitation and item selection for the EC1 were based on a literature review, previous control research (Krantz et al., Citation1980), and previous qualitative studies of control during caesarean deliveries (not referenced). The Expectations of the Childbirth Experience (ECBE; Oweis & Abushaikha, Citation2004) was developed to assess Jordanian women’s expectations of their first childbirth experience. The psychometric properties of the ECBE were assessed in a recent review of women’s childbirth experiences (Nilvér et al., Citation2017). However, an appraisal of the quality of instrument development was not conducted by the authors and the ECBE was included in the current review. Concept elicitation and item selection were based on a literature review and the measure was designed by the researchers. The Pregnancy Related Beliefs Questionnaire (PRBQ; Moorhead et al., Citation2003) was developed to assess underlying beliefs associated with postpartum depression. Concept elicitation and item selection were based on a literature review, a book of interviews with women with PPD experience (Aiken, Citation2001), an expert review, and interviews with two women with PPD; however, no details were provided describing the qualitative data collection or analysis procedure for the interviews and the authors reported that the pilot test was not used to inform the comprehensibility of the measure. A published validation study (Leach et al., Citation2018) was identified; however, no information was provided regarding further content validation procedures; thus, this study was not assessed.

Discussion

In this systematic review, we aimed to identify and evaluate instruments intended to measure women’s antepartum expectations of motherhood based on available development and validation studies. To appropriately investigate the association of women’s expectations about the future and postpartum adjustment difficulties, such as symptoms of depression and anxiety, it is important to use valid and reliable measurement instruments to assess expectations. As a result of a comprehensive 4-step search strategy, we identified 16 measures described in 25 articles for inclusion. We evaluated the methodological quality of instrument development studies and of studies on psychometric properties and rated the psychometric properties of included instruments according to a rigorous multi-step systematic process using the updated COSMIN guidelines (Mokkink et al., Citation2018; Prinsen et al., Citation2018). The findings of this review highlight the importance of evaluating the methodological quality of instrument development studies as an integral first step in the appropriate interpretation of psychometric properties of self-report measures and of the results based on that measure. In particular, assessing the extent of the involvement of the target population in concept elicitation is crucial to evaluating the content validity of a measure based on self-report. This review identified that several of the measures used in this area of research were not developed with input from the target population or had insufficient measurement properties. As such, the use of unvalidated measures may be contributing to the inconsistent results observed in this area of research. However, by providing an overview of the gaps in instrument development studies, we offered future researchers evidence-based guidance for further development of these instruments.

Implications for research and practice

Based on the results of this systematic review and in line with COSMIN guidelines (Prinsen et al., Citation2018), we advise on the most suitable instruments for each of the two main measurement domains: motherhood and childbirth. Overall, none of the identified instruments can be recommended for clinical use and more evidence on clinical utility (e.g., responsiveness, predictive validity) is needed for all of the instruments. To establish clinical utility, future investigations exploring associations between measure outcomes and clinical symptoms (e.g., depression, anxiety, adjustment) are needed. In addition, it is unclear why the two domains of expectations of motherhood and childbirth appear to be consistently considered separately by researchers. A possible future direction for research may be to combine these two domains in a single measure that reflects a comprehensive view of the perinatal experience and transition to motherhood.

To measure self-reported expectations of childbirth for research purposes, we recommend the use of the SPECS. We recommend that the development of new questionnaires of expectations of childbirth is not necessary, and that future research should focus on further development of the SPECS for possible use in clinical populations. To measure general expectations of motherhood for research purposes, we recommend the use of the RMBS. However, we suggest further development of this scale to increase confidence in all measurement properties. In this domain, we have identified an opportunity for researchers to develop new measures designed to assess a comprehensive range of sub-domains of expectations of motherhood including childbirth. Alternatively, researchers may wish to focus on further development of comprehensive instruments, such as the PEM or PRBQ.

Methodological considerations

Updated COSMIN guidelines

Until recently, common reporting procedures by authors of systematic reviews of measures did not often include an evaluation of methodological quality of studies (Mokkink et al., Citation2009). The first COSMIN guidelines (Mokkink et al., Citation2010; Terwee et al., Citation2012) proposed criteria for the evaluation of content validity but recommended that the results of all measurement properties are presented regardless of inadequate ratings. One of the changes in the latest COSMIN guidelines is the new order of evaluating measurement properties whereas an instrument is excluded from further evaluation if evidence of an insufficient measurement property is identified (Prinsen et al., Citation2018). Specifically, an instrument is not further evaluated if there is evidence that the quality of the instrument development study is inadequate, next if content validity is insufficient, and last if structural validity is insufficient (Prinsen et al., Citation2018). In a recent systematic review of measures using the latest COSMIN guidelines, Wittkowski et al. (Citation2020) amended this procedure and evaluated the remaining measurement properties despite inadequate quality of development or insufficient content validity. The authors suggested that this was done to present a comprehensive overview of a particular measurement property. However, based on the results of the current review, we propose that reporting the results of measurement properties that are based on instruments with a serious risk of bias or insufficient content validity may encourage the use of instruments that are untrustworthy. In addition, the use of inadequately developed measures may be unethical or misleading. For example, we identified two instruments (PEM, PMES) with good internal consistency (Cronbach’s alpha) per sub-scale. However, both of these instruments had inadequate structural validity as factor analysis was not conducted and therefore there was no evidence of unidimensionality of the proposed sub-scales. Thus, the interpretation of an internal consistency statistic would not be appropriate. We suggest that in future systematic reviews, the new order of evaluating measurement properties is observed by review teams. Consequently, this practice may encourage greater consideration of methodological quality and content validity procedures during measure development.

Risk of bias checklist

The RoB checklist aims to provide a systematic approach to evaluating the trustworthiness of the results based on the methodological quality of a study (Mokkink et al., Citation2018). Although we have attempted to document all alterations to the checklist, we observed difficulties when attempting to apply the standards and a consequent need to make several subjective judgements. For example, some studies used quantitative methods, but incorporated qualitative questions during concept elicitation and pilot testing. Although this method is not considered to be a “widely recognised qualitative method”, such as focus groups or interviews, the review team agreed that the inclusion of a qualitative component differentiates this method from studies that used purely quantitative methods. Thus, our interpretations of some checklist items may differ to those of another review team. In addition, despite the extensive written guidance provided in COSMIN manuals, we found that many of the checklist items included vague wording that may increase the subjectivity of the quality assessment. For example, the RoB checklist (Box 1b.18) requires reviewers to assess whether “an appropriate qualitative method” was “widely recognised”, “well justified”, “assumable”, “doubtful”, or “not appropriate”. To maintain the systemic nature of the checklist, when evaluating methods that deviated from best practice, the review team resolved issues through discussion and consensus and documented our approach.

Criteria for good measurement properties

Based on the results of this review, the decision to include criteria for results of several additional statistical procedures for structural validity and reliability was made by the review team. The first alteration is the addition of EFA as sufficient when examining the structural validity in a development study. This decision was made by the review team as often EFA is necessary when a new construct is being explored and there are no clear hypotheses about underlying constructs. Second, we included correlation coefficients (Pearson and Spearman) as an acceptable statistic for reporting test-retest reliability. Third, when considering hypothesis testing for construct validity, we tested hypotheses proposed by the authors of development studies rather than formulating reviewer hypotheses. This criterion was relaxed, and construct validity was considered sufficient if the hypotheses proposed by authors were supported. In cases where no hypotheses about relationships were proposed, the review team did not suggest hypotheses.

Lack of adequate reporting

First, we found that the rigorous method recommended by COSMIN for ensuring good quality of content validity was rarely fully implemented or reported in most of the identified studies. This is unsurprising, as there is a paucity of empirical guidance concerning content validity procedures for psychological scale development. Out of the 12 development studies that involved members of the target population, reduced ratings were most often observed as a result of a lack of or insufficient reporting of qualitative research during both instrument design (for six studies) and pilot testing (for nine studies). Second, there is a lack of sufficient information and consistency when reporting results to allow for a full assessment of study results against the stringent COSMIN criteria. For example, authors most commonly reported correlation coefficients for reliability or omitted reporting detailed statistical procedures (i.e., percentage of variance explained for EFA). Third, none of the included articles reported assessment of measurement error, cross-cultural validity, criterion validity, or responsiveness.

Lack of content validity studies

The findings of this review did not identify any additional content validity studies for any of the included measures and this affected the evaluation process in several ways. First, the rating of content validity is based in part on the evaluation of the development study, which includes several subjective judgements, and in part by a “reviewer rating”. However, this was found to be beneficial for most studies. For example, most studies did not ask about comprehensibility of instructions and items, and therefore the total comprehensibility score was ultimately based on the reviewer’s rating of instructions and items which improved the rating for all of the instruments. Thus, the review team observed that the lack of additional content validity studies increased the subjective nature of the rating of content validity, and that this rating should be interpreted with caution. Second, with a lack of content validity studies, the grading of the quality of evidence (GRADE) was based on the quality of the development study only. Specifically, an assessment of a very good or adequate development study equalled a “moderate” level of quality, doubtful equalled a low level, and inadequate equalled a very low level, each time. Although the grading was somewhat redundant, it was included in the review as the semantic description of the quality of evidence helped to describe confidence of the assessments.

Development sample characteristics

The findings of this review identified substantial homogeneity in the characteristics of the sample populations across the majority of the studies. Specifically, the majority of the measures were developed in samples of Caucasian/white, partnered, well educated, middle class, and cis gender women. Future research may wish to investigate whether significant differences in expectations can be identified across various demographic characteristics.

Strengths and limitations

There are several strengths of this review resulting from the comprehensive approach to the search strategy and data extraction. The 4-step search strategy was designed according to evidence-based guidance and expanded to include an additional second search using the names of all identified measures identified in the initial search. Eligibility criteria were predefined and registered. At all points in the data extraction procedure, multiple independent reviewers were involved providing assurance of reliability. The systematic application of the current COSMIN guidelines provides users with a multi-layered evaluation of each instrument. Unpublished scales were included if referred to in published work. We implemented COSMIN advice to exclude measures from subsequent review if the quality of development, content validity, or structural validity studies was inadequate. A final strength of this review is the narrative synthesis of evidence and recommendations per instrument provided to simplify the process of interpreting results and choosing a measure.

Several limitations were identified. Although a comprehensive search strategy was employed, it is possible that not all measures of expectations were captured in our search. Specifically, our search may have missed measures developed in languages other than English. In addition, the modification of some standards and criteria may have increased the risk of bias in this review. However, we note that every attempt was made to apply these modifications strictly to all measures so as to maintain the systematic approach of this process. Lastly, we deviated from our original protocol (Best et al., Citation2016) on several occasions. First, after the original three-step search procedure that was detailed in the protocol was conducted, it became apparent that this was not sufficient to capture all relevant studies, and an additional step was added to conduct a second database search for each identified instrument. Second, our original aim included the evaluation of feasibility for the use of measures in clinical contexts. The primary reason for not including a feasibility appraisal is that none of the measures identified in this review were recommended for clinical populations. Although we did not present a formal appraisal of feasibility, we reported on many of the characteristics that we had specified in the protocol (i.e. administration considerations, target population) that researchers may wish to consider when choosing a measure. Finally, after the publication of the protocol, the COSMIN guidelines were updated, and we decided to utilise these in our review in place of the previous procedure.

Conclusion

This review provides an overview of existing measures of expectations of motherhood to help researchers in identifying the most suitable measure for their purposes. To select an instrument that is most appropriate for their use, we recommend that researchers first review the relevant characteristics of each of the identified instruments, with a primary focus on: 1) the measurement construct, 2) characteristics of the target population, and 3) time of administration. Following this, researchers can consider our recommendations on the use and further development of instruments. At this stage, none of the instruments are recommended for clinical use. We advise on the use of the SPECS to measure expectations of childbirth and the RMBS to measure general expectations of motherhood for research purposes.

Supplemental material

Appendix A Search strategy

Download MS Word (20.5 KB)

Appendix B Methods quality assessment

Download MS Word (21.5 KB)

Appendix C Detailed quality assessment

Download MS Word (21.2 KB)

Appendix D Sample characteristics

Download MS Word (21.2 KB)

Disclosure statement

No potential conflict of interest was reported by the author(s).

Supplemental material

Supplemental data for this article can be accessed at https://doi.org/10.1080/13284207.2022.2108694.

References

  • Aiken, C. (2001). Surviving post-natal depression. Jessica Kingsley.
  • Aromataris, E. & Munn, Z. (2017). Joanna Briggs Institute Reviewer’s Manual. The Joanna Briggs Institute. https://reviewersmanual.joannabriggs.org/
  • Atlas, L. Y. & Wager, T. (2013). Expectancies and beliefs: Insights from cognitive neuroscience (Vol. 2). Oxford University Press.
  • Austin, M.-P., Highet, N., & Expert Working Group. (2017). Mental health care in the perinatal period: Australian clinical practice guideline. Centre of Perinatal Excellence. https://www.cope.org.au/?s=guidelines
  • Austin, M.-P. & Lumley, J. (2003). Antenatal screening for postnatal depression: A systematic review. Acta Psychiatrica Scandinavica, 107(1), 10–17. https://doi.org/10.1034/j.1600-0447.2003.02024.x
  • Bates, J. E., Freeland, C. A., & Lounsbury, M. L. (1979). Measurement of infant difficultness. Child Development, 50(3), 794–803. https://doi.org/10.2307/1128946
  • *Beaton, J. & Gupton, A. (1990). Childbirth expectations: A qualitative analysis. Midwifery, 6(3), 133–139. https://doi.org/10.1016/S0266-6138(05)80170-6
  • Beck, C. T. (2002). Postpartum depression: A metasynthesis. Qualitative Health Research, 12(4), 453–472. https://doi.org/10.1177/104973202129120016
  • Best, I., Breen, L., Rooney, R., Egan, S., Kane, R., & Somerville, S. (2016). A systematic review protocol of the measurement properties and clinical feasibility of instruments used to assess women’s antepartum expectations of motherhood. PROSPERO International Prospective Register of Systematic Reviews, CRD42016039534, 1–4. http://www.crd.york.ac.uk/PROSPERO_REBRANDING/display_record.asp?ID=CRD42016039534
  • *Bramadat, I. J. (1990). Relationships among maternal expectations for childbirth, maternal perception of the birth experience, and maternal satisfaction with childbirth in women undergoing induction, augmentation and spontaneous labor [ Doctoral dissertation]. University of Texas at Austin. ProQuest Dissertations and Theses. http://search.ebscohost.com/login.aspx?direct=true&db=rzh&AN=109869637&site=ehost-live
  • Choi, P., Henshaw, C., Baker, S., & Tree, J. (2005). Supermum, superwife, supereverything: Performing femininity in the transition to motherhood. Journal of Reproductive and Infant Psychology, 23(2), 167–180. https://doi.org/10.1080/02646830500129487
  • Coleman, P., Nelson, E. S., & Sundre, D. L. (1999). The relationship between prenatal expectations and postnatal attitudes among first-time mothers. Journal of Reproductive and Infant Psychology, 17(1), 27–39. https://doi.org/10.1080/02646839908404582
  • Connors, M. H. & Halligan, P. W. (2014). A cognitive account of belief: A tentative road map. Frontiers in Psychology, 5(February), 1588. https://doi.org/10.3389/fpsyg.2014.01588
  • *DeLuca, R. S. (1999). Diminished control and unmet expectations: Testing a theory of adjustment to cesarean delivery (childbirth satisfaction) [ Doctoral dissertation]. State University of New York at Stony Brook. http://ovidsp.ovid.com/ovidweb.cgi?T=JS&PAGE=reference&D=psyc3&NEWS=N&AN=1999-95024-396
  • *DeLuca, R. S. & Lobel, M. (2014). Diminished control and unmet expectations: Testing a model of adjustment to unplanned cesarean delivery. Analyses of Social Issues and Public Policy, 14(1), 183–204. https://doi.org/10.1111/asap.12040
  • Dennis, C.-L., Falah-Hassani, K., & Shiri, R. (2017). Prevalence of antenatal and postnatal anxiety: Systematic review and meta-analysis. British Journal of Psychiatry, 210(5), 315–323. https://doi.org/10.1192/bjp.bp.116.187179
  • Dennis, C.-L. & Ross, L. (2006). Women’s perceptions of partner support and conflict in the development of postpartum depressive symptoms. Journal of Advanced Nursing, 56(6), 588–599. https://doi.org/10.1111/j.1365-2648.2006.04059.x
  • Dowse, E., Chan, S., Ebert, L., Wynne, O., Thomas, S., Jones, D., Fealy, S., Evans, T.-J., & Oldmeadow, C. (2020). Impact of perinatal depression and anxiety on birth outcomes: A retrospective data analysis. Maternal and Child Health Journal, 24(6), 718–726. https://doi.org/10.1007/s10995-020-02906-6
  • *Evans, T., Whittingham, K., & Boyd, R. (2012). What helps the mother of a preterm infant become securely attached, responsive and well-adjusted? Infant Behavior & Development, 35(1), 1–11. https://doi.org/10.1016/j.infbeh.2011.10.002
  • *Flynn, H. A., Henshaw, E., O’Mahen, H., & Forman, J. (2010). Patient perspectives on improving the depression referral processes in obstetrics settings: A qualitative study. General Hospital Psychiatry, 32(1), 9–16. https://doi.org/10.1016/j.genhosppsych.2009.07.005
  • *Gress-Smith, J., Roubinov, D., Tanaka, R., Cirnic, K., Gonzales, N., Enders, C., & Luecken, L. (2013). Prenatal expectations in Mexican American women: Development of a culturally sensitive measure. Official Journal of the Section on Women’s Health of the World Psychiatric Association, 16(4), 303–314. https://doi.org/10.1007/s00737-013-0350-2
  • *Gupton, A., Beaton, J., Sloan, J., & Bramadat, I. (1991). The development of a scale to measure childbirth expectations. The Canadian Journal of Nursing Research [Revue Canadienne de Recherche En Sciences Infirmieres], 23(2), 35–47. http://ovidsp.ovid.com/ovidweb.cgi?T=JS&PAGE=reference&D=emed4&NEWS=N&AN=22907102
  • Gregory, E. F., Butz, A. M., Ghazarian, S. R., Gross, S. M., & Johnson, S. B. (2015). Met expectations and satisfaction with duration: A patient-centered evaluation of breastfeeding outcomes in the infant feeding practices study II. Journal of Human Lactation, 31(3), 444–451. https://doi.org/10.1177/0890334415579655
  • Harrison, V., Moore, D., & Lazard, L. (2020). Supporting perinatal anxiety in the digital age a qualitative exploration of stressors and support strategies. BMC Pregnancy and Childbirth, 20(1), 1–20. https://doi.org/10.1186/s12884-020-02990-0
  • Hauck, Y., Fenwick, J., Downie, J., & Butt, J. (2007). The influence of childbirth expectations on Western Australian women’s perceptions of their birth experience. Midwifery, 23(3), 235–247. https://doi.org/10.1016/j.midw.2006.02.002
  • Heron, J., O’Connor, T. G., Evans, J., Golding, J., & Glover, V. (2004). The course of anxiety and depression through pregnancy and the postpartum in a community sample. Journal of Affective Disorders, 80(1), 65–73. https://doi.org/10.1016/j.jad.2003.08.004
  • *Harwood, K. (2004). Women’s parenting expectations and their influence on adjustment to parenthood [ Doctoral dissertation]. The University of Western Australia. UWA Profiles and Research Repository. www.research-repository.uwa.edu.au
  • *Harwood, K., McLean, N., & Durkin, K. (2007). First-Time mothers’ expectations of parenthood: What happens when optimistic expectations are not matched by later experiences? Developmental Psychology, 43(1), 1–12. https://doi.org/10.1037/0012-1649.43.1.1
  • *Henshaw, E. J., Fried, R., Teeters, J. B., & Siskind, E. E. (2014). Maternal expectations and postpartum emotional adjustment in first-time mothers: Results of a questionnaire survey. Journal of Psychosomatic Obstetrics and Gynecology, 35(3), 69–75. https://doi.org/10.3109/0167482X.2014.937802
  • Houston, K. A., Kaimal, A. J., Nakagawa, S., Gregorich, S. E., Yee, L. M., & Kuppermann, M. (2015). Mode of delivery and postpartum depression: The role of patient preferences. American Journal of Obstetrics and Gynecology, 212(2), 221–227. https://doi.org/10.1016/j.ajog.2014.09.002
  • Howard, L. M., Molyneaux, E., Dennis, C.-L., Rochat, T., Stein, A., & Milgrom, J. (2014). Non-psychotic mental disorders in the perinatal period. Lancet, 384(9956), 1775–1788. https://doi.org/10.1016/S0140-6736(14)61276-9
  • Krantz, D. S., Baum, A., & Wideman, M. V. (1980). Assessment of preferences for self-treatment and information in health care. Journal of Personality and Social Psychology, 39(5), 977–990. https://doi.org/10.1037/0022-3514.39.5.977
  • Kumar, R., Robson, K. M., & Smith, A. M. (1984). Development of a self-administered questionnaire to measure maternal adjustment and maternal attitudes during pregnancy and after delivery. Journal of Psychosomatic Research, 28(1), 43. https://doi.org/10.1016/0022-3999(84)90039-4
  • Landis, J. R. & Koch, G. G. (1977). The measurement of observer agreement for categorical data. Biometrics, 33(1), 159–174. https://doi.org/10.2307/2529310
  • *Lazarus, K. & Rossouw, P. J. (2015). Mother’s expectations of parenthood: The impact of prenatal expectations on self-esteem, depression, anxiety, and stress post birth. International Journal of Neuropsychotherapy, 3(2), 102–123. https://doi.org/10.12744/ijnpt.2015.0102-0123
  • *Leach, D. M., Terry, P., & Nikcevic, A. V. (2018). The Pregnancy Related Beliefs Questionnaire (PRBQ): An examination of the psychometric properties in perinatal samples. Clinical Psychology & Psychotherapy, 25(1), 152–162. https://doi.org/10.1002/cpp.2149
  • Maio, G. R. & Haddock, G. (2004). Theories of attitude: Creating a witches’ brew. In G. Haddock & G. R. Maio (Eds.), Contemporary perspectives on the psychology of attitudes: An introduction and overview (pp. 425–453). Psychology Press. www.api.taylorfrancis.com
  • Martini, J., Petzoldt, J., Einsle, F., Beesdo-Baum, K., Höfler, M., & Wittchen, H.-U. (2015). Risk factors and course patterns of anxiety and depressive disorders during pregnancy and after delivery: A prospective-longitudinal study. Journal of Affective Disorders, 175(January), 385–395. https://doi.org/10.1016/j.jad.2015.01.012
  • Moher, D., Liberati, A., Tetzlaff, J., & Altman, D. G. (2009). Preferred reporting items for systematic reviews and meta-analyses: The PRISMA statement. British Medical Journal, 339 (7716), 332–336. https://doi.org/10.1136/bmj.b2535
  • Mokkink, L. B., Terwee, C. B., Patrick, D. L., Alonso, J., Stratford, P. W., Knol, D. L., Bouter, L. M., & Vet, H. C. W. (2010). The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: An international Delphi study. Quality of Life Research, 19(4), 539–549. https://doi.org/10.1007/s11136-010-9606-8
  • Mokkink, L. B., Terwee, C. B., Stratford, P., Alonso, J., Patrick, D., Riphagen, I., Knol, D., Bouter, L., & Vet, H. (2009). Evaluation of the methodological quality of systematic reviews of health status measurement instruments. Quality of Life Research, 18(3), 313–333. https://doi.org/10.1007/s11136-009-9451-9
  • Mokkink, L. B., Vet, H. C. W., Prinsen, C. A., Patrick, D. L., Alonso, J., Bouter, L. M., & Terwee, C. B. (2018). COSMIN Risk of Bias checklist for systematic reviews of patient-reported outcome measures. Quality of Life Research, 27(5), 1171–1179. https://doi.org/10.1007/s11136-017-1765-4
  • Mollard, E. K. (2014). A qualitative meta-synthesis and theory of postpartum depression. Issues in Mental Health Nursing, 35(9), 656–663. https://doi.org/10.3109/01612840.2014.893044
  • *Moorhead, S. R. J., Owens, J., & Scott, J. (2003). Development and piloting of the Pregnancy Related Beliefs Questionnaire (PRBQ). Behavioural and Cognitive Psychotherapy, 31(2), 207–213. https://doi.org/10.1017/S1352465803002091
  • Muscat, T., Obst, P., Cockshaw, W., & Thorpe, K. (2014). Beliefs about infant regulation, early infant behaviors and maternal postnatal depressive symptoms. Birth, 41(2), 206. https://doi.org/10.1111/birt.12107
  • Nilvér, H., Begley, C., & Berg, M. (2017). Measuring women’s childbirth experiences: A systematic review for identification and analysis of validated instruments. BMC Pregnancy and Childbirth, 17(1), 203. https://doi.org/10.1186/s12884-017-1356-y
  • *O’Mahen, H., Fedock, G., Henshaw, E., Himle, J. A., Forman, J., & Flynn, H. A. (2012). Modifying CBT for perinatal depression: What do women want? Cognitive and Behavioral Practice, 19(2), 359–371. https://doi.org/10.1016/j.cbpra.2011.05.005
  • *O’Mahen, H., Himle, J. A., Fedock, G., Henshaw, E., & Flynn, H. (2013). A pilot randomized controlled trial of cognitive behavioral therapy for perinatal depression adapted for women with low incomes. Depression and Anxiety, 30(7), 679–687. https://doi.org/10.1002/da.22050
  • *Oweis, A. & Abushaikha, L. (2004). Jordanian pregnant women’s expectations of their first childbirth experience. International Journal of Nursing Practice, 10(6), 264–271. https://doi.org/10.1111/j.1440-172x.2004.00488
  • Prinsen, C. A., Mokkink, L. B., Bouter, L. M., Alonso, J., Patrick, D. L., Vet, H. C., & Terwee, C. B. (2018). COSMIN guideline for systematic reviews of patient-reported outcome measures. Quality of Life Research, 27(5), 1147–1157. https://doi.org/10.1007/s11136-018-1798-3
  • Prinsen, C. A. C., Vohra, S., Rose, M. R., Boers, M., Tugwell, P., Clarke, M., Williamson, P. R., & Terwee, C. B. (2016). How to select outcome measurement instruments for outcomes included in a “Core Outcome Set” – a practical guideline. Trials, 17(1), 449–449. https://doi.org/10.1186/s13063-016-1555-2
  • Read, D. M. Y., Crockett, J., & Mason, R. (2012). “It was a horrible shock”: The experience of motherhood and women’s family size preferences. Women’s Studies International Forum, 35(1), 12–21. https://doi.org/10.1016/j.wsif.2011.10.001
  • *Robakis, T. K., Williams, K. E., Crowe, S., Kenna, H., Gannon, J., & Rasgon, N. L. (2015). Optimistic outlook regarding maternity protects against depressive symptoms postpartum. Archives of Women’s Mental Health, 18(2), 197–208. https://doi.org/10.1007/s00737-014-0446-3
  • Robertson, E., Grace, S., Wallington, T., & Stewart, D. E. (2004). Antenatal risk factors for postpartum depression: A synthesis of recent literature. General Hospital Psychiatry, 26(4), 289. https://doi.org/10.1016/j.genhosppsych.2004.02.006
  • *Ruble, D. N., Brooks-Gunn, J., Fleming, A. S., Fitzmaurice, G., Stangor, C., & Deutsch, F. (1990). Transition to motherhood and the self: Measurement, stability, and change. Journal of Personality and Social Psychology, 58(3), 450–463. https://doi.org/10.1037/0022-3514.58.3.450
  • Silverman, M. E., Reichenberg, A., Savitz, D. A., Cnattingius, S., Lichtenstein, P., Hultman, C. M., Larsson, H., & Sandin, S. (2017). The risk factors for postpartum depression: A population-based study. Depression and Anxiety, 34(2), 178–187. https://doi.org/10.1002/da.22597
  • *Slade, P., Pais, T., Fairlie, F., Simpson, A., & Sheen, K. (2016). The development of the Slade-Pais Expectations of Childbirth Scale (SPECS). Journal of Reproductive and Infant Psychology, 34(5), 495–510. https://doi.org/10.1080/02646838.2016.1209300
  • Stein, A., Pearson, R. M., Goodman, S. H., Rapa, E., Rahman, A., McCallum, M., Howard, L. M., & Pariante, C. M. (2014). Effects of perinatal mental disorders on the fetus and child. Lancet, 384(9956), 1800–1819. https://doi.org/10.1016/S0140-6736(14)61277-0
  • Streiner, D. L. & Norman, G. R. (2008). Health measurement scales: A practical guide to their development and use. Oxford University Press. https://doi.org/10.1093/acprof:oso/9780199231881.001.0001
  • *Sockol, L. E. & Battle, C. L. (2015). Maternal attitudes, depression, and anxiety in pregnant and postpartum multiparous women. Archives of Women’s Mental Health, 18(4), 585–593. https://doi.org/10.1007/s00737-015-0511-6
  • *Sockol, L. E., Epperson, C. N., & Barber, J. P. (2014). The relationship between maternal attitudes and symptoms of depression and anxiety among pregnant and postpartum first-time mothers. Archives of Women’s Mental Health, 17(3), 199–212. https://doi.org/10.1007/s00737-014-0424-9
  • Terwee, C. B., Bot, S. D. M., de Boer, M. R., van Der Windt, D. A., Knol, D. L., Dekker, J., Bouter, L. M., & de Vet, H. C. (2007). Quality criteria were proposed for measurement properties of health status questionnaires. Journal of Clinical Epidemiology, 60(1), 34–42. https://doi.org/10.1016/j.jclinepi.2006.03.012
  • Terwee, C. B., Mokkink, L. B., Knol, D. L., Ostelo, R. W., Bouter, L. M., & de Vet, H. C. (2012). Rating the methodological quality in systematic reviews of studies on measurement properties: A scoring system for the COSMIN checklist. Quality of Life Research, 21(4), 651–657. https://doi.org/10.1007/s11136-011-9960-1
  • Terwee, C. B., Prinsen, C. A., Chiarotto, A., Westerman, M. J., Patrick, D. L., Alonso, J., Bouter, L. M., de Vet, H. C. W., & Mokkink, L. B. (2018). COSMIN methodology for evaluating the content validity of patient-reported outcome measures: A Delphi study. Quality of Life Research, 27(5), 1159–1170. https://doi.org/10.1007/s11136-018-1829-0
  • *Thomason, E., Flynn, H., Himle, J., & Volling, B. (2015). Are women’s parenting-specific beliefs associated with depressive symptoms in the perinatal period? Development of the Rigidity of Maternal Beliefs Scale. Depression and Anxiety, 32(2), 141–148. https://doi.org/10.1002/da.22280
  • Vogt, D., King, D., & King, L. (2004). Focus groups in psychological assessment: Enhancing content validity by consulting members of the target population. Psychological Assessment, 16(3), 231–243. https://doi.org/10.1037/1040-3590.16.3.231
  • Wachs, T. D., Black, M. M., & Engle, P. L. (2009). Maternal depression: A global threat to children’s health, development, and behavior and to human rights. Child Development Perspectives, 3(1), 51–59. https://doi.org/10.1111/j.1750-8606.2008.00077.x
  • Warner, R., Appleby, L., Whitton, A., & Faragher, B. (1997). Attitudes toward motherhood in postnatal depression: Development of the Maternal Attitudes Questionnaire. Journal of Psychosomatic Research, 43(4), 351–358. https://doi.org/10.1016/S0022-3999(97)00128-1
  • Wisner, K. L., Sit, D. K. Y., McShea, M. C., Rizzo, D. M., Zoretich, R. A., Hughes, C. L., Eng, H. F., Luther, J. F., Wisniewski, S. R., Costantino, M. L., Confer, A. L., Moses-Kolko, E. L., Famy, C. S., & Hanusa, B. H. (2013). Onset timing, thoughts of self-harm, and diagnoses in postpartum women with screen-positive depression findings. JAMA Psychiatry, 70(5), 1–9. https://doi.org/10.1001/jamapsychiatry.2013.87
  • Wittkowski, A., Vatter, S., Muhinyi, A., Garrett, C., & Henderson, M. (2020). Measuring bonding or attachment in the parent-infant-relationship: A systematic review of parent-report assessment measures, their psychometric properties and clinical utility. Clinical Psychology Review, 82(September), 101906. https://doi.org/10.1016/j.cpr.2020.101906
  • Woody, C. A., Ferrari, A. J., Siskind, D. J., Whiteford, H. A., & Harris, M. G. (2017). A systematic review and meta-regression of the prevalence and incidence of perinatal depression. Journal of Affective Disorders, 219(May), 86–92. https://doi.org/10.1016/j.jad.2017.05.003
  • Young, E. (2008). Maternal expectations: Do they match experience? Community Practitioner, 81(10), 27–30.