Generalized fiducial methods for testing quantitative trait locus effects in genetic backcross studies

Abstract

In this paper, we propose generalized fiducial methods and construct four generalized p-values to test the existence of quantitative trait locus effects under phenotype distributions from a location-scale family. Compared with the likelihood ratio test based on simulation studies, our methods perform better at controlling type I errors while retaining comparable power in cases with small or moderate sample sizes. The four generalized fiducial methods support varied scenarios: two of them are more aggressive and powerful, whereas the other two appear more conservative and robust. A real data example involving mouse blood pressure is used to illustrate our proposed methods.

1. Introduction

In medical and biological genetic research, quantitative trait locus (QTL) mapping is important in studies of the traits of all types of organisms. For example, QTLs can be identified and mapped to analyse the genetic factors contributing to blood pressure in animals (Sugiyama et al., 2001) or to the length of rice grains (Huang et al., 1997; R. Wu et al., 2007). As a standard process at the beginning of QTL mapping studies, tests for the existence of QTL effects (that is, whether the gene related to the traits is on the specified chromosome) should be deployed.

Interval mapping, proposed by Lander and Botstein (1989), is a popular method for detecting QTLs. Suppose that a putative QTL, denoted by Q, is located between the left and right flanking markers, M and N, in a backcross design. For individuals in the backcross population, the possible genotypes are MM and Mm at M, NN and Nn at N, and QQ and Qq at Q. Hence, the individuals in the backcross population have four marker genotypes: MM/NN, Mm/NN, MM/Nn, and Mm/Nn, where Mm/NN and MM/Nn are recombinant types. For each individual, M and N can be observed but Q cannot. A testing method based on these data for detecting a QTL in the interval M–N is referred to as the interval mapping method.

Let $r$, $r_{1}$, and $r_{2}$ be the recombination frequencies (that is, the proportions of recombinant genotypes) between M and N, between M and Q, and between Q and N, respectively. In this paper, we only consider backcross designs without double recombination or interference between two-marker-QTL intervals, i.e., $r = r_{1} + r_{2}$ (R. Wu et al., 2007). Denote by $C$ the coding variable for the genotypes at the two markers, with $C = 1, 2, 3, 4$ representing the genotypes MM/NN, Mm/NN, MM/Nn, and Mm/Nn, respectively. The probabilities of QTL genotypes are shown in Table 1; see also Chen and Chen (2005), R. Wu et al. (2007), and Zhang et al. (2008).

Table 1. Probabilities of QTL genotypes.

Let $f_{1}$ and $f_{2}$ be the phenotype density functions corresponding to the two QTL genotypes QQ and Qq. Denote by $\{Y_{11}, \dots, Y_{1 n_{1}}\}$, $\{Y_{21}, \dots, Y_{2 n_{2}}\}$, $\{Y_{31}, \dots, Y_{3 n_{3}}\}$, and $\{Y_{41}, \dots, Y_{4 n_{4}}\}$ the phenotype data corresponding to the marker genotypes MM/NN, Mm/NN, MM/Nn, and Mm/Nn, respectively. Then, we have the following statistical model under the considered background:

$$\begin{aligned} Y_{1 j} & \sim f_{1} (y), & j & = 1, \dots, n_{1}, \\ Y_{2 j} & \sim θ f_{1} (y) + (1 - θ) f_{2} (y), & j & = 1, \dots, n_{2}, \\ Y_{3 j} & \sim (1 - θ) f_{1} (y) + θ f_{2} (y), & j & = 1, \dots, n_{3}, \\ Y_{4 j} & \sim f_{2} (y), & j & = 1, \dots, n_{4}, \end{aligned} \tag{1}$$

where $θ = r_{1} / r$ and $y_{i j}$ are the trait values corresponding to $C = i$, $i = 1, \dots, 4$, $j = 1, \dots, n_{i}$. Here, $r$ is known, as the two markers M and N are pre-specified, whereas $r_{1}$ and $r_{2}$ are unknown because the location of Q is unknown. The samples $\{Y_{2 j}, j = 1, \dots, n_{2}\}$ and $\{Y_{3 j}, j = 1, \dots, n_{3}\}$ are modelled by mixture distributions because of the recombination of non-sister chromatids in these individuals. Denoting the total sample size by $n = \sum_{i = 1}^{4} n_{i}$, we have $n_{i} / n \to \Pr (C = i)$ when each $n_{i}$ tends to $\infty$ at the same rate (R. Wu et al., 2007).
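As an illustration of model (1), the following minimal sketch draws phenotypes for the four marker classes. It assumes normal densities for $f_{1}$ and $f_{2}$; the function name `simulate_backcross` and its argument layout are hypothetical and are not part of the paper.

```python
import random

def simulate_backcross(n_sizes, theta, mu1, sigma1, mu2, sigma2, rng):
    """Draw phenotypes for marker classes C = 1..4 under model (1).

    Assumption: f1 = N(mu1, sigma1^2) and f2 = N(mu2, sigma2^2).
    """
    n1, n2, n3, n4 = n_sizes

    def draw(p_f1):
        # With probability p_f1 the QTL genotype is QQ (density f1), else Qq (f2).
        if rng.random() < p_f1:
            return rng.gauss(mu1, sigma1)
        return rng.gauss(mu2, sigma2)

    return [
        [draw(1.0) for _ in range(n1)],          # MM/NN: f1 only
        [draw(theta) for _ in range(n2)],        # Mm/NN: theta f1 + (1 - theta) f2
        [draw(1.0 - theta) for _ in range(n3)],  # MM/Nn: (1 - theta) f1 + theta f2
        [draw(0.0) for _ in range(n4)],          # Mm/Nn: f2 only
    ]
```

Only the two recombinant classes carry information about $θ$, which is why their labels have to be treated as missing data later on.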

Under model (1), testing the existence of QTL effects is equivalent to testing the null hypothesis

$$H_{0} : f_{1} = f_{2} . \tag{2}$$

The null hypothesis in (2) means that there are no QTL effects. In the literature, parametric methods are usually applied by assuming specific distributions for $f_{1}$ and $f_{2}$. For example, Chen and Chen (2005) and Zhang et al. (2008) assume that $f_{1}$ and $f_{2}$ are normal densities with the same variance, although in fact a QTL effect in variance may be more crucial (Korol et al., 1996; Liu et al., 2020). Recently, Liu et al. (2020) extended the likelihood ratio (LR) test to detect QTL effects where $f_{1}$ and $f_{2}$ are from a general location-scale family with unknown locations and/or scales, i.e., $f_{k} (y) = f (y; μ_{k}, σ_{k})$ with $f (y; μ, σ) = f ((y - μ) / σ; 0, 1) / σ$. Here, $f (\cdot; 0, 1)$ is a known probability density function, and $μ$ and $σ$ are the location and scale parameters, respectively. Then, the hypothesis testing problem in (2) is transformed into

$$H_{0} : (μ_{1}, σ_{1}) = (μ_{2}, σ_{2}) . \tag{3}$$

In particular, under the assumption that $σ_{1} = σ_{2} = σ$, the testing problem in (3) becomes

$$H_{0} : μ_{1} = μ_{2} . \tag{4}$$

For the null hypotheses (3) and (4), Liu et al. (2020) proposed explicit representations of the limiting distribution of the LR statistics and obtained more accurate asymptotic p-values than those of Rebai et al. (1994, 1995). The LR test in Liu et al. (2020) was shown to be more powerful than the Kolmogorov–Smirnov test and the Anderson–Darling test. However, we found that the LR test inflated type I errors when the sample size was small or moderate, as shown in our simulation study in Section 3.

Based on the above discussion, it is desirable to develop new methods that control type I errors more accurately while retaining good power with small or moderate sample sizes. One efficient method is the generalized fiducial inference developed by Hannig et al. (2006) and Hannig (2009), which can be used to construct generalized p-values for the null hypotheses (3) and (4) under the fiducial inference framework introduced by Fisher (1930). In the literature, generalized fiducial inference is widely applied to homogeneous data, i.e., assuming that all labels of samples are known. Recent research includes Lai et al. (2015), Hannig et al. (2016), Li et al. (2018), Cui and Hannig (2019), and Williams and Hannig (2019). However, generalized fiducial inference has not received much attention as a means of testing QTL effects under model (1). In this paper, generalized fiducial inference is applied by constructing four types of generalized p-values to test the null hypotheses (3) and (4). Our methods have two advantages. (i) They can control type I errors more accurately than LR methods, especially for small and moderate sample sizes, although they may not be optimal in a conventional sense. (ii) They retain power comparable with or even greater than that of the LR methods.

The remainder of this article is organized as follows. In Section 2, four generalized fiducial methods are proposed for the null hypothesis (3). In Section 3, we compare these proposed methods with the method of Liu et al. (2020) through simulated examples. In addition, a real genetic dataset is analysed by applying our methods in Section 4. Finally, Section 5 concludes the article. The proof of Theorem 2.1 and additional comparisons among some generalized pivotal quantities (GPQs) are provided in the supplementary material.

2. New test

Generalized fiducial inference is one of the most important ways to construct generalized p-values. In the following, we explain the general procedure proposed by Li et al. (2007, 2018) for obtaining generalized p-values based on a data-generating equation (DGE).

Let $Y$ be a random vector following a known distribution $F_{δ} (\cdot)$ , where $δ$ is an unknown parameter vector. Suppose $ξ = ξ (δ) = (ξ_{1}, ξ_{2}^{T})^{T}$ , where $ξ_{1}$ is the parameter of interest and $ξ_{2}$ is the nuisance parameter vector. An observation of $Y$ is denoted by $y$ . Suppose we have the DGE $Y = G (δ, E),$ where $E$ is a random variable that has a known distribution. The observed version of DGE $y = G (δ, e)$ has a unique solution for $δ$ , i.e., $G^{- 1} (y, e)$ . Then, the random quantities $G^{- 1} (y, E)$ and $ξ (G^{- 1} (y, E))$ are the GPQs of $δ$ and $ξ (δ)$ , and the distributions of $G^{- 1} (y, E)$ and $ξ (G^{- 1} (y, E))$ are the fiducial distributions of $δ$ and $ξ (δ)$ . Furthermore, if $y = G (δ, e)$ has a unique solution for any $e$ and $y$ , $ξ_{1} - ξ_{1} (G^{- 1} (y, E))$ is a generalized test variable of $ξ_{1}$ , so that the generalized p-value for the one-sided hypothesis $H_{0} : ξ_{1} \leq ξ_{10} \leftrightarrow H_{1} : ξ_{1} > ξ_{10}$ is $p = Pr (ξ_{1} (G^{- 1} (y, E)) < ξ_{10})$ .

Denote by $T_{δ} = (T_{θ}, T_{μ_{1}}, T_{μ_{2}}, T_{σ_{1}}, T_{σ_{2}})^{T}$ the GPQ of the parameter vector $δ = (θ, μ_{1}, μ_{2}, σ_{1}, σ_{2})^{T}$. Based on the ideas above, if $T_{δ}$ can be obtained by fiducial inference, we can find the GPQs of the parameters of interest $ξ_{1} = μ_{2} - μ_{1}$ and $ξ_{1'} = σ_{2} / σ_{1}$, denoted by $T_{ξ_{1}} = T_{μ_{2}} - T_{μ_{1}}$ and $T_{ξ_{1'}} = T_{σ_{2}} / T_{σ_{1}}$. Thus, the generalized p-values for the hypotheses $μ_{2} - μ_{1} = 0$ and $σ_{2} / σ_{1} = 1$ are

$$p_{1} = 2 \min \{\Pr (T_{μ_{2}} - T_{μ_{1}} < 0), \Pr (T_{μ_{2}} - T_{μ_{1}} > 0)\} \tag{5}$$

and

$$p_{2} = 2 \min \{\Pr (T_{σ_{2}} / T_{σ_{1}} < 1), \Pr (T_{σ_{2}} / T_{σ_{1}} > 1)\} . \tag{6}$$

According to Theorem 2.1 in Section 2.1, under the null hypothesis $p_{1}$ and $p_{2}$ follow the standard uniform distribution $U (0, 1)$ independently. Then, according to Fisher's combined method (Fisher, 1932), the generalized p-value for testing the hypothesis in (3) is

$$p_{GV} = \Pr \{χ_{4}^{2} > - 2 \log (p_{1} p_{2})\} . \tag{7}$$

For a given significance level $α$, the null hypothesis (3) is rejected if $p_{GV} \leq α$. Similarly, under the condition $σ_{1} = σ_{2} = σ$, the generalized p-value for the hypothesis testing problem in (4) becomes

$$\bar{p}_{GV} = 2 \min \{\Pr (\bar{T}_{μ_{2}} - \bar{T}_{μ_{1}} < 0), \Pr (\bar{T}_{μ_{2}} - \bar{T}_{μ_{1}} > 0)\}, \tag{8}$$

where $\bar{T}_{μ_{k}}$ is the GPQ of $μ_{k}$ ($k = 1, 2$) under same-scale conditions. The null hypothesis (4) is rejected if $\bar{p}_{GV} \leq α$.
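The combination in (7) has a closed form, because the survival function of a $χ_{4}^{2}$ variable is $\Pr (χ_{4}^{2} > x) = e^{- x / 2} (1 + x / 2)$. Substituting $x = - 2 \log (p_{1} p_{2})$ gives $p_{GV} = p_{1} p_{2} \{1 - \log (p_{1} p_{2})\}$. The sketch below evaluates this expression; the function name `fisher_combine` is hypothetical, and returning 0 when $p_{1} p_{2} = 0$ is a convention assumed here for the limiting case.

```python
import math

def fisher_combine(p1, p2):
    """Fisher's combination of two independent p-values, as in (7).

    Uses Pr(chi2_4 > x) = exp(-x / 2) * (1 + x / 2) with x = -2 * log(p1 * p2).
    """
    q = p1 * p2
    if q <= 0.0:
        # Limiting value as p1 * p2 -> 0 (assumed convention).
        return 0.0
    return q * (1.0 - math.log(q))
```

For instance, two component p-values of 0.05 combine to roughly 0.0175, so joint evidence against (3) can be stronger than either component alone.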

For the mixture distribution framework in (1), the DGEs for the sample data $Y = (Y_{11}, \dots, Y_{1 n_{1}}, Y_{21}, \dots, Y_{2 n_{2}}, Y_{31}, \dots, Y_{3 n_{3}}, Y_{41}, \dots, Y_{4 n_{4}})^{T}$ are

$$\begin{aligned} Y_{1 j} & = μ_{1} + σ_{1} Z_{1 j}, & j & = 1, \dots, n_{1}, \\ Y_{2 j} & = (μ_{1} + σ_{1} Z_{2 j}) I_{(0, θ]} (U_{2 j}) + (μ_{2} + σ_{2} Z_{2 j}) I_{(θ, 1)} (U_{2 j}), & j & = 1, \dots, n_{2}, \\ Y_{3 j} & = (μ_{1} + σ_{1} Z_{3 j}) I_{(θ, 1]} (U_{3 j}) + (μ_{2} + σ_{2} Z_{3 j}) I_{(0, θ]} (U_{3 j}), & j & = 1, \dots, n_{3}, \\ Y_{4 j} & = μ_{2} + σ_{2} Z_{4 j}, & j & = 1, \dots, n_{4}, \end{aligned} \tag{9}$$

where $Z_{i j} \sim f (\cdot; 0, 1)$ and $U_{i j} \sim U (0, 1)$ independently, for $j = 1, \dots, n_{i}$, $i = 1, 2, 3, 4$. The explicit expressions of $T_{θ}$, $T_{μ_{1}}$, $T_{μ_{2}}$, $T_{σ_{1}}$, and $T_{σ_{2}}$ are difficult to obtain based on the observed version of (9), as the labels of the observations $\{y_{2 j}, j = 1, \dots, n_{2}\}$ and $\{y_{3 j}, j = 1, \dots, n_{3}\}$ are missing. Our solution is to introduce a random configuration assignment for $\{y_{2 j}, j = 1, \dots, n_{2}\}$ and $\{y_{3 j}, j = 1, \dots, n_{3}\}$, i.e., to randomly assign each $y_{2 j}$ and $y_{3 j}$ to the distribution $f_{1}$ or $f_{2}$ (Hannig, 2009).

Inspired by the Bayesian method of McLachlan and Peel (2000) and Frühwirth-Schnatter (2006), we obtain the GPQs through a two-block design. Specifically, in the first step, we find the GPQs conditional on a given configuration assignment, which can be obtained much more easily, and denote them by $R_{θ}$, $R_{μ_{1}}$, $R_{μ_{2}}$, $R_{σ_{1}}$, and $R_{σ_{2}}$. In the second step, a new configuration assignment is randomly generated based on Bernoulli random numbers $D_{2 j} \sim \mathrm{Bin} (1, R_{τ_{2 j}})$, $j = 1, \dots, n_{2}$, and $D_{3 j} \sim \mathrm{Bin} (1, R_{τ_{3 j}})$, $j = 1, \dots, n_{3}$, where

$$R_{τ_{2 j}} = \frac{R_{θ} f (y_{2 j}; R_{μ_{1}}, R_{σ_{1}})}{R_{θ} f (y_{2 j}; R_{μ_{1}}, R_{σ_{1}}) + (1 - R_{θ}) f (y_{2 j}; R_{μ_{2}}, R_{σ_{2}})} \tag{10}$$

and

$$R_{τ_{3 j}} = \frac{R_{θ} f (y_{3 j}; R_{μ_{2}}, R_{σ_{2}})}{(1 - R_{θ}) f (y_{3 j}; R_{μ_{1}}, R_{σ_{1}}) + R_{θ} f (y_{3 j}; R_{μ_{2}}, R_{σ_{2}})} . \tag{11}$$

For example, we randomly generate $D_{2 j}$ from $\mathrm{Bin} (1, R_{τ_{2 j}})$ and assign $y_{2 j}$ according to $D_{2 j}$; that is, we specify that $y_{2 j}$ is generated from the distribution $f_{1}$ if $D_{2 j} = 1$ or from the distribution $f_{2}$ if $D_{2 j} = 0$, $j = 1, \dots, n_{2}$. According to $\{D_{2 j}\}$ and $\{D_{3 j}\}$, we may update $R_{θ}$, $R_{μ_{1}}$, $R_{μ_{2}}$, $R_{σ_{1}}$, and $R_{σ_{2}}$. By iterating the steps above, five Markov chains can be obtained to approximate the distributions of $T_{θ}$, $T_{μ_{1}}$, $T_{μ_{2}}$, $T_{σ_{1}}$, and $T_{σ_{2}}$. Then, the generalized p-values in (5), (6), and (7) can be obtained. A similar method can be applied for the generalized p-value in (8).
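The assignment probabilities in (10) and (11) can be sketched as follows, assuming a normal density for $f$. The helper names `normal_pdf`, `tau2`, `tau3`, and `draw_assignment` are hypothetical. Note that (10) gives the probability that a class-2 observation comes from $f_{1}$, whereas (11) gives the probability that a class-3 observation comes from $f_{2}$.

```python
import math
import random

def normal_pdf(y, mu, sigma):
    z = (y - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))

def tau2(y, theta, mu1, s1, mu2, s2):
    """Eq. (10): probability that an Mm/NN observation comes from f1."""
    a = theta * normal_pdf(y, mu1, s1)
    b = (1.0 - theta) * normal_pdf(y, mu2, s2)
    return a / (a + b)  # undefined if both densities underflow to zero

def tau3(y, theta, mu1, s1, mu2, s2):
    """Eq. (11): probability that an MM/Nn observation comes from f2."""
    a = theta * normal_pdf(y, mu2, s2)
    b = (1.0 - theta) * normal_pdf(y, mu1, s1)
    return a / (a + b)

def draw_assignment(probs, rng):
    # One Bernoulli draw D_j ~ Bin(1, tau_j) per observation.
    return [1 if rng.random() < p else 0 for p in probs]
```

With identical components the probabilities reduce to $θ$, so the data carry no information about the labels under the null hypothesis (3).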

Sections 2.1 and 2.2 provide the constructions of GPQs conditional on the configuration assignment. The Gibbs algorithm for the computation of the generalized p-values is explained in Section 2.3.

2.1. GPQs of location and scale parameters conditional on assignment

To find the GPQs of $(μ_{1}, σ_{1})$, we combine the observations $\{y_{1 j}, j = 1, \dots, n_{1}\}$, $\{y_{2 j} : y_{2 j} = μ_{1} + σ_{1} z_{2 j}, j = 1, \dots, n_{2}\}$, and $\{y_{3 j} : y_{3 j} = μ_{1} + σ_{1} z_{3 j}, j = 1, \dots, n_{3}\}$ into $v = \{v_{1}, \dots, v_{\hat{g}_{1}}\} = \{y_{11}, \dots, y_{1 n_{1}}, y_{2 j_{1}}, \dots, y_{2 j_{s_{2}}}, y_{3 j_{s_{3} + 1}}, \dots, y_{3 j_{n_{3}}}\}$, where $\hat{g}_{1} = n_{1} + s_{2} + n_{3} - s_{3}$, and $s_{2}$ and $s_{3}$ are the observed values of $S_{2} = \sum_{j = 1}^{n_{2}} I_{(0, θ]} (U_{2 j})$ and $S_{3} = \sum_{j = 1}^{n_{3}} I_{(0, θ]} (U_{3 j})$. Here, $v$ provides the information for $(μ_{1}, σ_{1})$. Similarly, $w = \{w_{1}, \dots, w_{\hat{g}_{2}}\} = \{y_{2 j_{s_{2} + 1}}, \dots, y_{2 j_{n_{2}}}, y_{3 j_{1}}, \dots, y_{3 j_{s_{3}}}, y_{41}, \dots, y_{4 n_{4}}\}$ contains the information for $(μ_{2}, σ_{2})$, where $\hat{g}_{2} = n_{4} + s_{3} + n_{2} - s_{2}$. In this sense, we make a configuration assignment of the observations into two groups to infer $(μ_{1}, σ_{1})$ and $(μ_{2}, σ_{2})$, respectively. Denote by $σ$ the common scale of the data. Conditional on this configuration assignment, under the null hypothesis (3), we have the following conditional DGEs:

$$\begin{cases} \hat{μ}_{1} = μ_{1} + σ E_{11}, \\ \hat{μ}_{2} = μ_{2} + σ E_{21}, \\ \hat{σ} = σ E_{2}, \\ \hat{σ}_{1} = σ_{1} E_{12}, \\ \hat{σ}_{2} = σ_{2} E_{22}, \end{cases} \tag{12}$$

where $(\hat{μ}_{1}, \hat{σ}_{1})$, $(\hat{μ}_{2}, \hat{σ}_{2})$, and $\hat{σ}$ are the MLEs determined by $\{v_{1}, \dots, v_{\hat{g}_{1}}\}$, $\{w_{1}, \dots, w_{\hat{g}_{2}}\}$, and their combination, respectively, and $(E_{k 1}, E_{k 2})$ and $E_{2}$ have known distributions that are, respectively, identical to those of the MLEs $(\tilde{μ}_{k}, \tilde{σ}_{k})$ and $\tilde{σ}$, based on a sample of $\hat{g}_{k}$ ($k = 1, 2$) observations and their combined sample from the standard location-scale distribution $f (\cdot; 0, 1)$. Hence, the GPQs of $(μ_{k}, σ_{k})$ under the null hypothesis (3) should be (Nkurunziza & Chen, 2011; Xu & Li, 2006)

$$\begin{aligned} R_{μ_{k}} & = \hat{μ}_{k}^{obs} - \hat{σ}^{obs} \frac{E_{k 1}}{E_{2}} = \hat{μ}_{k}^{obs} - \hat{σ}^{obs} \frac{\hat{μ}_{k} - μ_{k}}{\hat{σ}}, \\ R_{σ_{k}} & = \frac{\hat{σ}_{k}^{obs}}{E_{k 2}} = \frac{\hat{σ}_{k}^{obs}}{\hat{σ}_{k}} σ_{k}, \quad k = 1, 2, \end{aligned} \tag{13}$$

where $\hat{μ}_{k}^{obs}$, $\hat{σ}_{k}^{obs}$, and $\hat{σ}^{obs}$ are the observed values of $\hat{μ}_{k}$, $\hat{σ}_{k}$, and $\hat{σ}$. Here, $R_{μ_{k}}$ and $R_{σ_{k}}$ are independent because of the independence of $E_{k 1}$, $E_{k 2}$, and $E_{2}$.

In particular, in the normal distribution case, $E_{k 1}$ is the random element following $N (0, 1 / \hat{g}_{k})$, and $E_{k 2}$ and $E_{2}$ are the random elements following $\sqrt{χ_{\hat{g}_{k} - 1}^{2} / \hat{g}_{k}}$ and $\sqrt{χ_{\hat{g}_{1} + \hat{g}_{2} - 2}^{2} / (\hat{g}_{1} + \hat{g}_{2})}$, respectively. Our method, given the configuration assignment in the normal distribution case, is identical to that of Perng and Littell (1976).
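For the normal case just described, one draw of the conditional GPQs in (13) can be sketched as below. This is an illustration under stated assumptions rather than the authors' code: the helper names are hypothetical, $E_{k 1}$, $E_{k 2}$, and $E_{2}$ are drawn independently from the distributions listed above, and each group must contain at least two observations so that the chi-square degrees of freedom are positive.

```python
import math
import random

def mle_normal(x):
    # Normal MLEs: mean, sqrt(SS / n), plus the sum of squares SS.
    m = sum(x) / len(x)
    ss = sum((xi - m) ** 2 for xi in x)
    return m, math.sqrt(ss / len(x)), ss

def chi2(df, rng):
    # A chi-square variable with df degrees of freedom is Gamma(df / 2, scale 2).
    return rng.gammavariate(df / 2.0, 2.0)

def conditional_gpqs(v, w, rng):
    """One draw of (R_mu1, R_mu2, R_sigma1, R_sigma2) from (13), normal case."""
    g1, g2 = len(v), len(w)
    mu1, s1, ss1 = mle_normal(v)
    mu2, s2, ss2 = mle_normal(w)
    s_pool = math.sqrt((ss1 + ss2) / (g1 + g2))          # observed pooled MLE of sigma
    e2 = math.sqrt(chi2(g1 + g2 - 2, rng) / (g1 + g2))   # E_2, shared by both means
    out = {}
    for k, (mu, s, g) in enumerate(((mu1, s1, g1), (mu2, s2, g2)), start=1):
        ek1 = rng.gauss(0.0, math.sqrt(1.0 / g))         # E_k1 ~ N(0, 1 / g_k)
        ek2 = math.sqrt(chi2(g - 1, rng) / g)            # E_k2
        out[f"mu{k}"] = mu - s_pool * ek1 / e2
        out[f"sigma{k}"] = s / ek2
    return out
```

Repeated calls with the same groups `v` and `w` approximate the conditional fiducial distribution of the location and scale parameters for that assignment.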

Theorem 2.1 gives the null distributions of $p_{1}$ and $p_{2}$. Its proof, which relies on the distributions of the conditional GPQs $R_{μ_{k}}$ and $R_{σ_{k}}$, is given in Section A of the supplementary material together with some simulated results.

Theorem 2.1

The generalized p-values $p_{1}$ and $p_{2}$ defined in (5) and (6) follow $U (0, 1)$ independently.

In particular, under $σ_{1} = σ_{2} = σ$, given the configuration assignment, the GPQs are

$$\begin{aligned} \bar{R}_{μ_{k}} & = \hat{μ}_{k}^{obs} - \hat{σ}^{obs} \frac{E_{k 1}}{E_{2}} = \hat{μ}_{k}^{obs} - \hat{σ}^{obs} \frac{\hat{μ}_{k} - μ_{k}}{\hat{σ}}, \quad k = 1, 2, \\ \bar{R}_{σ} & = \frac{\hat{σ}^{obs}}{E_{2}} = \frac{\hat{σ}^{obs}}{\hat{σ}} σ, \end{aligned} \tag{14}$$

where $(\hat{μ}_{1}^{obs}, \hat{μ}_{2}^{obs}, \hat{σ}^{obs})$ are the observed values of $(\hat{μ}_{1}, \hat{μ}_{2}, \hat{σ})$, and $(E_{11}, E_{21}, E_{2})$ have the same distributions as the MLEs $(\tilde{μ}_{1}, \tilde{μ}_{2}, \tilde{σ})$ based on samples of $\hat{g}_{1}$ and $\hat{g}_{2}$ observations and their combined sample from $f (\cdot; 0, 1)$.

2.2. GPQs of the mixing proportion conditional on assignment

The GPQ of the mixing proportion θ given the configuration assignment is not unique. In the literature, several GPQs have been developed. Among them, three types of GPQs are popular because they have been shown to have relatively good properties even for small and moderate-sized samples.

The mixture-beta generalized variable recommended by Efron (1998) and Hannig (2009), called GVM hereafter, is

$$R_{θ}^{M} \sim 0.5\, \mathrm{Beta} (s_{2} + s_{3}, n_{2} + n_{3} - s_{2} - s_{3} + 1) + 0.5\, \mathrm{Beta} (s_{2} + s_{3} + 1, n_{2} + n_{3} - s_{2} - s_{3}) . \tag{15}$$

Jeffreys' generalized variable recommended by Cai (2005) and Krishnamoorthy and Lee (2010), called GVJ hereafter, is

$$R_{θ}^{J} \sim \mathrm{Beta} (s_{2} + s_{3} + 0.5, n_{2} + n_{3} - s_{2} - s_{3} + 0.5) . \tag{16}$$

Wilson's generalized variable proposed by Li et al. (2013), called GVW hereafter, is

$$R_{θ}^{W} = \frac{s_{2} + s_{3} + Z^{2} / 2}{n_{2} + n_{3} + Z^{2}} - \frac{Z}{n_{2} + n_{3} + Z^{2}} \left\{(s_{2} + s_{3}) \left(1 - \frac{s_{2} + s_{3}}{n_{2} + n_{3}}\right) + \frac{Z^{2}}{4}\right\}^{1 / 2}, \tag{17}$$

where $Z \sim N (0, 1)$.
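The three GPQs in (15), (16), and (17) are straightforward to sample. The sketch below writes $s = s_{2} + s_{3}$ and $n = n_{2} + n_{3}$ and assumes $n > 0$; the function names are hypothetical, and treating $\mathrm{Beta} (0, b)$ and $\mathrm{Beta} (a, 0)$ as point masses at 0 and 1 is a convention assumed here for the boundary cases $s = 0$ and $s = n$.

```python
import math
import random

def _beta(a, b, rng):
    # Assumed convention: degenerate Beta(0, b) -> 0 and Beta(a, 0) -> 1.
    if a <= 0.0:
        return 0.0
    if b <= 0.0:
        return 1.0
    return rng.betavariate(a, b)

def gvm(s, n, rng):
    """Eq. (15): equal-weight mixture of Beta(s, n - s + 1) and Beta(s + 1, n - s)."""
    if rng.random() < 0.5:
        return _beta(s, n - s + 1, rng)
    return _beta(s + 1, n - s, rng)

def gvj(s, n, rng):
    """Eq. (16): Beta(s + 0.5, n - s + 0.5)."""
    return rng.betavariate(s + 0.5, n - s + 0.5)

def gvw(s, n, rng):
    """Eq. (17): Wilson-type generalized variable with Z ~ N(0, 1)."""
    z = rng.gauss(0.0, 1.0)
    centre = (s + z * z / 2.0) / (n + z * z)
    half = z / (n + z * z) * math.sqrt(s * (1.0 - s / n) + z * z / 4.0)
    return centre - half
```

Each function returns one draw, so inside a sampler a single call per iteration is enough.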

Besides the quantities above, we propose a new generalized variable by modifying the variance-stabilizing transformation of θ.

W. H. Wu and Hsieh (2014) and Bebu et al. (2016) constructed a generalized variable with a variance-stabilizing transformation for the binomial proportion, called GVV hereafter. For $S_{2} + S_{3} \sim \mathrm{Bin} (n_{2} + n_{3}, θ)$, by asymptotic normality,

$$2 \sqrt{n_{2} + n_{3}} \left(\arcsin \sqrt{\hat{θ}} - \arcsin \sqrt{θ}\right) \xrightarrow{d} N (0, 1), \tag{18}$$

where $\hat{θ} = \frac{S_{2} + S_{3}}{n_{2} + n_{3}}$. However, if this result is applied directly to construct the GVV, the resulting variable $R_{θ}^{V} = \sin^{2} \left(\arcsin \sqrt{\hat{θ}^{obs}} - \frac{Z}{2 \sqrt{n_{2} + n_{3}}}\right)$ becomes inaccurate, as can be seen in the simulation results in Section B of the supplementary material, because (18) only holds when $n_{2} + n_{3} \to \infty$. Here, $Z \sim N (0, 1)$ and $\xrightarrow{d}$ stands for convergence in distribution.

To avoid the problem of liberality in the Wald confidence interval for the binomial proportion in small or moderate sample size cases, Agresti and Coull (1998), Agresti and Caffo (2000), and Schaarschmidt et al. (2008) considered adding some numbers of pseudo variables, half of which were ‘successful’ variables. The frequentist properties of their methods were thus much better than those of the Wald interval. In fact, Schaarschmidt et al. (2008) pointed out that this kind of adjustment was ‘not motivated by statistical theory but determined on a rather heuristic basis’.

Motivated by the results above, we consider adding one or two pseudo variables to adjust the GVV of W. H. Wu and Hsieh (2014) and Bebu et al. (2016) based on a variance-stabilizing transformation. To compare the frequentist properties of GVV and its two modifications, we construct generalized confidence intervals for the binomial proportion $θ$; the coverage probabilities and average lengths are given in Section B of the supplementary material. The results show that this kind of adjustment can improve the frequentist properties of GVV, and the coverage probabilities when adding one pseudo variable have the smallest oscillations when the sample size is not more than 15, although its average lengths are greater than those when two pseudo variables are added. Therefore, we choose to add one pseudo variable, containing 0.5 ‘success’ and 0.5 ‘failure’, which results in

$$2 \sqrt{n_{2} + n_{3} + 2 + \frac{1}{n_{2} + n_{3}}} \left(\arcsin \sqrt{\tilde{θ}} - \arcsin \sqrt{θ}\right) \xrightarrow{d} N (0, 1),$$

where $\tilde{θ} = \frac{S_{2} + S_{3} + 0.5}{n_{2} + n_{3} + 1}$. Note that $\tilde{θ}$ is identical to the Bayesian estimator based on Jeffreys' prior $\mathrm{Beta} (0.5, 0.5)$, and this convergence is identical to (18) when $n_{2} + n_{3} \to \infty$. This modified variance-stabilizing transformation generalized variable, called GVMV hereafter, is

$$R_{θ}^{MV} = \sin^{2} \left(\arcsin \sqrt{\tilde{θ}^{obs}} - \frac{Z}{2 \sqrt{n_{2} + n_{3} + 2 + \frac{1}{n_{2} + n_{3}}}}\right), \tag{19}$$

where $\tilde{θ}^{obs} = \frac{s_{2} + s_{3} + 0.5}{n_{2} + n_{3} + 1}$ is the observed value of $\tilde{θ}$ and $Z \sim N (0, 1)$.
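A draw from the GVMV in (19) can be sketched in a few lines, again writing $s = s_{2} + s_{3}$ and $n = n_{2} + n_{3}$ with $n > 0$. The function name `gvmv` is hypothetical, and the formula is implemented exactly as stated, without any extra truncation of the angle.

```python
import math
import random

def gvmv(s, n, rng):
    """Eq. (19): modified variance-stabilizing generalized variable for theta."""
    theta_tilde = (s + 0.5) / (n + 1.0)            # one pseudo variable: 0.5 success, 0.5 failure
    scale = 2.0 * math.sqrt(n + 2.0 + 1.0 / n)     # adjusted normalising constant
    z = rng.gauss(0.0, 1.0)                        # Z ~ N(0, 1)
    return math.sin(math.asin(math.sqrt(theta_tilde)) - z / scale) ** 2
```

Because the result is a squared sine, every draw lies in $[0, 1]$, and for moderate $|Z|$ the draws are centred near $\tilde{θ}^{obs}$.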

2.3. Gibbs algorithm

According to Gelman et al. (2014), a Markov chain Monte Carlo method can be used to approximate the distributions of the GPQs. For convenience, we consider the two-block Gibbs sampler used by McLachlan and Peel (2000) and Frühwirth-Schnatter (2006). The initial values $R_{τ_{2 j}}^{(0)}$, $j = 1, \dots, n_{2}$, and $R_{τ_{3 j}}^{(0)}$, $j = 1, \dots, n_{3}$, can be determined by

$$\begin{aligned} R_{τ_{2 j}}^{(0)} & = \frac{\check{θ} f (y_{2 j}; \check{μ}_{1}, \check{σ}_{1})}{\check{θ} f (y_{2 j}; \check{μ}_{1}, \check{σ}_{1}) + (1 - \check{θ}) f (y_{2 j}; \check{μ}_{2}, \check{σ}_{2})}, & j & = 1, \dots, n_{2}, \\ R_{τ_{3 j}}^{(0)} & = \frac{\check{θ} f (y_{3 j}; \check{μ}_{2}, \check{σ}_{2})}{(1 - \check{θ}) f (y_{3 j}; \check{μ}_{1}, \check{σ}_{1}) + \check{θ} f (y_{3 j}; \check{μ}_{2}, \check{σ}_{2})}, & j & = 1, \dots, n_{3}, \end{aligned}$$

where $(\check{θ}, \check{μ}_{1}, \check{μ}_{2}, \check{σ}_{1}, \check{σ}_{2}) = \arg \max_{0 \leq θ \leq 1} l_{n} (θ, μ_{1}, μ_{2}, σ_{1}, σ_{2})$, with

$$\begin{aligned} l_{n} (θ, μ_{1}, μ_{2}, σ_{1}, σ_{2}) = {} & \sum_{j = 1}^{n_{1}} \log f_{1} (y_{1 j}) + \sum_{j = 1}^{n_{2}} \log \{θ f_{1} (y_{2 j}) + (1 - θ) f_{2} (y_{2 j})\} \\ & + \sum_{j = 1}^{n_{3}} \log \{(1 - θ) f_{1} (y_{3 j}) + θ f_{2} (y_{3 j})\} + \sum_{j = 1}^{n_{4}} \log f_{2} (y_{4 j}) . \end{aligned}$$

Then, iterate the following steps for $b = 1, \dots, B$.

Step 1. Randomly generate $D_{2 j}^{(b)}$ from $\mathrm{Bin} (1, R_{τ_{2 j}}^{(b - 1)})$, $j = 1, \dots, n_{2}$. If $D_{2 j}^{(b)} = 1$, the corresponding individual $y_{2 j}$ is assigned to Group 1 ($v$, defined in Section 2.1); otherwise, it is assigned to Group 2 ($w$, defined in Section 2.1). Similarly, generate $D_{3 j}^{(b)}$ from $\mathrm{Bin} (1, R_{τ_{3 j}}^{(b - 1)})$, $j = 1, \dots, n_{3}$. If $D_{3 j}^{(b)} = 0$, the corresponding individual $y_{3 j}$ is assigned to $v$; otherwise, it is assigned to $w$. Then, calculate $s_{2} = \sum_{j = 1}^{n_{2}} D_{2 j}^{(b)}$, $s_{3} = \sum_{j = 1}^{n_{3}} D_{3 j}^{(b)}$, and $(\hat{μ}_{k}, \hat{σ}_{k})$, $k = 1, 2$, under the current assignment.

Step 2. Under the random assignment in Step 1, generate $R_{μ_{k}}^{(b)}$, $R_{σ_{k}}^{(b)}$, and $R_{θ}^{(b)}$.

Step 2.1. Obtain $(\hat{μ}_{1}^{obs}, \hat{σ}_{1}^{obs})$, $(\hat{μ}_{2}^{obs}, \hat{σ}_{2}^{obs})$, and $\hat{σ}^{obs}$ from $v$, $w$, and their combination, respectively, according to the MLE method.

Step 2.2. Generate a random sample of size $\hat{g}_{1} = n_{1} + s_{2} + n_{3} - s_{3}$ from $f (\cdot; 0, 1)$, and obtain the MLE $(\tilde{μ}_{1}, \tilde{σ}_{1})$. Let $E_{11} = \tilde{μ}_{1}$ and $E_{12} = \tilde{σ}_{1}$. Similarly, $(E_{21}, E_{22})$ and $E_{2}$ are obtained from random variables drawn from $f (\cdot; 0, 1)$. Then, calculate $R_{μ_{k}}^{(b)}$ and $R_{σ_{k}}^{(b)}$ by (13).

Step 2.3. Generate $R_{θ}^{(b)}$, a random observation from one of the distributions in (15), (16), (17), or (19).

Step 2.4. Update $R_{τ_{2 j}}^{(b)}$, $j = 1, \dots, n_{2}$, and $R_{τ_{3 j}}^{(b)}$, $j = 1, \dots, n_{3}$, as in (10) and (11) with the $R_{μ_{k}}^{(b)}$, $R_{σ_{k}}^{(b)}$, and $R_{θ}^{(b)}$ obtained in Steps 2.2 and 2.3.
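The steps above can be sketched as a compact sampler for the normal case with the Jeffreys-type variable (16) for $θ$. This is a simplified illustration, not the authors' implementation, and it rests on several labelled assumptions: the chain starts from assignment probabilities of 0.5 instead of the MLE-based starting values, $E_{k 1}$, $E_{k 2}$, and $E_{2}$ are drawn from their stated normal-case distributions, a fallback of 0.5 is used if both densities underflow, and all function names are hypothetical. The inputs `y1` to `y4` are Python lists, and both groups need at least two distinct values.

```python
import math
import random

def npdf(y, mu, s):
    z = (y - mu) / s
    return math.exp(-0.5 * z * z) / (s * math.sqrt(2.0 * math.pi))

def mle(x):
    # Normal MLEs: mean, sqrt(SS / n), plus the sum of squares SS.
    m = sum(x) / len(x)
    ss = sum((xi - m) ** 2 for xi in x)
    return m, math.sqrt(ss / len(x)), ss

def chi2(df, rng):
    return rng.gammavariate(df / 2.0, 2.0)

def posterior(a, b):
    # a / (a + b); 0.5 is an assumed fallback when both terms underflow.
    return a / (a + b) if a + b > 0.0 else 0.5

def gibbs_normal_gvj(y1, y2, y3, y4, B, rng):
    """Return B draws of (R_theta, R_mu1, R_mu2, R_sigma1, R_sigma2)."""
    tau2 = [0.5] * len(y2)  # assumed start, not the MLE-based start
    tau3 = [0.5] * len(y3)
    n23 = len(y2) + len(y3)
    chain = []
    for _ in range(B):
        # Step 1: draw the configuration assignment.
        d2 = [rng.random() < t for t in tau2]  # True -> f1 (group v)
        d3 = [rng.random() < t for t in tau3]  # True -> f2 (group w)
        v = y1 + [y for y, d in zip(y2, d2) if d] + [y for y, d in zip(y3, d3) if not d]
        w = y4 + [y for y, d in zip(y2, d2) if not d] + [y for y, d in zip(y3, d3) if d]
        s = sum(d2) + sum(d3)
        # Step 2.1: observed MLEs from v, w, and their combination.
        m1, s1, ss1 = mle(v)
        m2, s2, ss2 = mle(w)
        g1, g2 = len(v), len(w)
        sp = math.sqrt((ss1 + ss2) / (g1 + g2))
        # Step 2.2: GPQs of the locations and scales, eq. (13).
        e2 = math.sqrt(chi2(g1 + g2 - 2, rng) / (g1 + g2))
        r_mu1 = m1 - sp * rng.gauss(0.0, math.sqrt(1.0 / g1)) / e2
        r_mu2 = m2 - sp * rng.gauss(0.0, math.sqrt(1.0 / g2)) / e2
        r_s1 = s1 / math.sqrt(chi2(g1 - 1, rng) / g1)
        r_s2 = s2 / math.sqrt(chi2(g2 - 1, rng) / g2)
        # Step 2.3: GPQ of theta, here the GVJ of eq. (16).
        r_th = rng.betavariate(s + 0.5, n23 - s + 0.5)
        # Step 2.4: update the assignment probabilities, eqs. (10) and (11).
        tau2 = [posterior(r_th * npdf(y, r_mu1, r_s1),
                          (1.0 - r_th) * npdf(y, r_mu2, r_s2)) for y in y2]
        tau3 = [posterior(r_th * npdf(y, r_mu2, r_s2),
                          (1.0 - r_th) * npdf(y, r_mu1, r_s1)) for y in y3]
        chain.append((r_th, r_mu1, r_mu2, r_s1, r_s2))
    return chain
```

Swapping the line in Step 2.3 for a draw from (15), (17), or (19) would give the other three variants; in practice an initial portion of the chain would typically be discarded as burn-in, which the source does not specify.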

After repeating Steps 1 and 2 $B$ times, Markov chains of the GPQs of length $B$ are obtained. Then, the generalized p-value (7) can be obtained by calculating

$$p_{1} = 2 \min \left[\frac{\sum_{b = 1}^{B} 1 \{R_{μ_{2}}^{(b)} - R_{μ_{1}}^{(b)} < 0\}}{B}, \frac{\sum_{b = 1}^{B} 1 \{R_{μ_{2}}^{(b)} - R_{μ_{1}}^{(b)} > 0\}}{B}\right]$$

and

$$p_{2} = 2 \min \left[\frac{\sum_{b = 1}^{B} 1 \{R_{σ_{2}}^{(b)} / R_{σ_{1}}^{(b)} < 1\}}{B}, \frac{\sum_{b = 1}^{B} 1 \{R_{σ_{2}}^{(b)} / R_{σ_{1}}^{(b)} > 1\}}{B}\right],$$

where $1 \{\cdot\}$ denotes the indicator function.
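The two empirical proportions translate directly into code. In the sketch below, the chains are passed as plain lists of equal length; the function names `two_sided_from_draws` and `chain_p_values` are hypothetical.

```python
def two_sided_from_draws(draws, null_value):
    """2 * min(share of draws below null_value, share of draws above it)."""
    B = len(draws)
    below = sum(1 for d in draws if d < null_value) / B
    above = sum(1 for d in draws if d > null_value) / B
    return 2.0 * min(below, above)

def chain_p_values(mu1, mu2, sigma1, sigma2):
    """Empirical p1 (location difference) and p2 (scale ratio) from the chains."""
    p1 = two_sided_from_draws([b - a for a, b in zip(mu1, mu2)], 0.0)
    p2 = two_sided_from_draws([b / a for a, b in zip(sigma1, sigma2)], 1.0)
    return p1, p2
```

If every draw falls on one side of the null value, the corresponding p-value is exactly 0, so its resolution is limited to multiples of $2 / B$.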

Similarly, under the condition $σ_{1} = σ_{2} = σ$, the Markov chains of the GPQs, $\bar{R}_{μ_{1}}^{(b)}$, $\bar{R}_{μ_{2}}^{(b)}$, and $\bar{R}_{σ}^{(b)}$, $b = 1, \dots, B$, can be produced by the above steps; then, the generalized p-value (8) can be obtained.

3. Simulations

In this section, we compare the generalized p-values in Equation (7), $p_{GV} = \Pr \{ χ_{4}^{2} > -2 \log (p_{1} p_{2}) \}$, and Equation (8), $\bar{p}_{GV} = 2 \min \{ \Pr (\bar{T}_{μ_{2}} - \bar{T}_{μ_{1}} < 0),\ \Pr (\bar{T}_{μ_{2}} - \bar{T}_{μ_{1}} > 0) \}$, for the hypothesis testing problem (3), $H_{0} : (μ_{1}, σ_{1}) = (μ_{2}, σ_{2})$, with the p-values of the LR methods proposed in Liu et al. (2020) via Monte Carlo simulation. As the four generalized fiducial methods differ only in the GPQ used for the mixing proportion, we use the abbreviations GVJ, GVM, GVW, and GVMV to represent them. The significance level is $α = 0.05$, and the total sample sizes n are 30, 50, 100, 200, and 300. The recombination frequency r is determined by the Haldane map $r = 0.5 (1 - \exp (-2d/100))$, where d is the map distance between two loci, defined as ‘the expected number of crossovers occurring between them on a single chromatid during meiosis’ and measured in centiMorgans (R. Wu et al., 2007). As the value of d is usually not large in practice (Zhang et al., 2008), we set d to 5, 10, or 20 following Liu et al. (2020), with the corresponding values of r being 0.048, 0.091, and 0.165. The four sample sizes $(n_{1}, n_{2}, n_{3}, n_{4})$ are generated from $\mathrm{Multi} (n; \frac{1 - r}{2}, \frac{r}{2}, \frac{r}{2}, \frac{1 - r}{2})$.
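
A minimal sketch of this design step is given below: it evaluates the Haldane map for the three map distances and draws one set of group sizes from the multinomial distribution. The function names are hypothetical, and the multinomial draw is implemented by counting categorical draws from Python's standard library rather than by any routine used in the original study.

```python
import math
import random


def haldane_r(d_cm):
    """Recombination frequency from the map distance d (in centiMorgans)."""
    return 0.5 * (1.0 - math.exp(-2.0 * d_cm / 100.0))


def draw_group_sizes(n, r, rng=random):
    """Draw (n1, n2, n3, n4) from Multi(n; (1-r)/2, r/2, r/2, (1-r)/2)."""
    probs = [(1 - r) / 2, r / 2, r / 2, (1 - r) / 2]
    counts = [0, 0, 0, 0]
    for group in rng.choices(range(4), weights=probs, k=n):
        counts[group] += 1
    return tuple(counts)


if __name__ == "__main__":
    for d in (5, 10, 20):
        r = haldane_r(d)
        print(d, round(r, 3), draw_group_sizes(100, r))
```

Rounded to three decimals, `haldane_r` gives 0.048, 0.091, and 0.165 for d = 5, 10, and 20, matching the values quoted in the text.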

First, the type I errors of the five approaches are compared under $N = 10{,}000$ repeated simulations, with the data generated from standard normal and logistic distributions, i.e., $f_{1} = f_{2} = N (0, 1)$ and $f_{1} = f_{2} = \mathrm{Logis} (0, 1)$. Under the nominal significance level $α = 0.05$, the standard error of this Monte Carlo simulation is $\sqrt{0.05 \times 0.95 / 10{,}000} \approx 0.218\%$. The distributions of the GPQs are approximated by Markov chains with size B = 5000 as described in Section 2.3, whereas those of the two LR statistics are approximated by generating $M = 100{,}000$ simulated quantities from Equations (6) and (7) in Liu et al. (2020). The type I errors and their standard errors (%) are shown in Table 2. The tests based on the generalized p-values proposed here are more conservative than the LR method. For large sample sizes, the type I errors of the five methods are close to the significance level. As the total sample size n decreases, the LR method becomes more liberal and no longer controls type I errors well when $n \leq 100$. In contrast, the generalized p-values become more conservative as the sample size n decreases. GVJ and GVMV give type I errors relatively close to the nominal level, whereas GVM and GVW are more conservative.
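
The Monte Carlo standard error quoted above, and the way an empirical type I error could be computed from simulated p-values, can be sketched as follows. The function names are hypothetical, and it is an assumption of this sketch that the reported standard errors use the binomial formula evaluated at the estimated rejection rate.

```python
import math


def mc_standard_error(rate, n_rep):
    """Binomial standard error of a rejection rate estimated from n_rep runs."""
    return math.sqrt(rate * (1.0 - rate) / n_rep)


def empirical_size(p_values, alpha=0.05):
    """Rejection rate at level alpha and its Monte Carlo standard error."""
    rate = sum(p < alpha for p in p_values) / len(p_values)
    return rate, mc_standard_error(rate, len(p_values))


if __name__ == "__main__":
    # Standard error at the nominal level with N = 10,000 replications, in %.
    print(100 * mc_standard_error(0.05, 10_000))
```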

Table 2. Type I errors (%) and standard errors (%) of the five methods.


Further, we compare the power of these methods. To control the type I errors of the LR method, we take the above 10,000 LR quantities for each setting of n and d as the empirical null distributions of the LR statistics under hypotheses (3), $H_{0} : (μ_{1}, σ_{1}) = (μ_{2}, σ_{2})$, and (4), $H_{0} : μ_{1} = μ_{2}$. After this correction, the type I errors of the LR method are close to the nominal level. The four types of generalized p-values are calculated by generating B = 5000 simulated quantities. As the type I errors of the four generalized p-values are already controlled, no corrections are required for these methods. For $θ = 0.5$ and 0.7, the powers of these tests are obtained from N = 2000 repetitions for each of the six mixture distributions below, using settings similar to those of Liu et al. (2020).
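
One way to read the correction described above is as an empirical-null calibration: the corrected p-value of an observed LR statistic is the fraction of LR statistics simulated under the null hypothesis that are at least as large. The sketch below follows that reading; the function names are hypothetical, and the use of the plain proportion (with no continuity adjustment) is an assumption rather than a detail confirmed by the text.

```python
def calibrated_p_value(observed, null_statistics):
    """Fraction of null-simulated statistics that are >= the observed one."""
    exceed = sum(t >= observed for t in null_statistics)
    return exceed / len(null_statistics)


def estimated_power(p_values, alpha=0.05):
    """Share of repetitions under the alternative that reject at level alpha."""
    return sum(p < alpha for p in p_values) / len(p_values)
```

With this calibration, the rejection rate under the null hypothesis equals the nominal level up to Monte Carlo error, which is what makes the power comparison fair.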

Case I: $f_{1} = N (0, 1)$ and $f_{2} = N (0.5, 1)$ ;

Case II: $f_{1} = N (0, 1)$ and $f_{2} = N (0, {1.5}^{2})$ ;

Case III: $f_{1} = N (0, 1)$ and $f_{2} = N (0.5, {1.5}^{2})$ ;

Case IV: $f_{1} = \mathrm{Logis} (0, 1)$ and $f_{2} = \mathrm{Logis} (0.5, 1)$ ;

Case V: $f_{1} = \mathrm{Logis} (0, 1)$ and $f_{2} = \mathrm{Logis} (0, 1.5)$ ;

Case VI: $f_{1} = \mathrm{Logis} (0, 1)$ and $f_{2} = \mathrm{Logis} (0.5, 1.5)$ .
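
To make the six settings concrete, the sketch below generates one data set from model (1) for a chosen case: group 1 is drawn from $f_{1}$, group 2 from $θ f_{1} + (1 - θ) f_{2}$, group 3 from $(1 - θ) f_{1} + θ f_{2}$, and group 4 from $f_{2}$. All names are hypothetical; the second argument of each distribution is treated as a scale (the standard deviation for the normal cases), and logistic draws use the inverse-CDF formula $μ + s \log\{u/(1 - u)\}$.

```python
import math
import random


def draw_normal(mu, sigma, rng):
    return rng.gauss(mu, sigma)


def draw_logistic(mu, scale, rng):
    u = rng.random()
    while u == 0.0:  # avoid log(0); random() lies in [0, 1)
        u = rng.random()
    return mu + scale * math.log(u / (1.0 - u))


# (sampler, (location, scale) of f1, (location, scale) of f2)
CASES = {
    "I": (draw_normal, (0.0, 1.0), (0.5, 1.0)),
    "II": (draw_normal, (0.0, 1.0), (0.0, 1.5)),
    "III": (draw_normal, (0.0, 1.0), (0.5, 1.5)),
    "IV": (draw_logistic, (0.0, 1.0), (0.5, 1.0)),
    "V": (draw_logistic, (0.0, 1.0), (0.0, 1.5)),
    "VI": (draw_logistic, (0.0, 1.0), (0.5, 1.5)),
}


def simulate_backcross(case, sizes, theta, rng=random):
    """Return (y1, y2, y3, y4) drawn from model (1) for the given case."""
    draw, par1, par2 = CASES[case]
    n1, n2, n3, n4 = sizes

    def f1():
        return draw(par1[0], par1[1], rng)

    def f2():
        return draw(par2[0], par2[1], rng)

    y1 = [f1() for _ in range(n1)]
    # Group 2: f1 with probability theta, otherwise f2.
    y2 = [f1() if rng.random() < theta else f2() for _ in range(n2)]
    # Group 3: f2 with probability theta, otherwise f1.
    y3 = [f2() if rng.random() < theta else f1() for _ in range(n3)]
    y4 = [f2() for _ in range(n4)]
    return y1, y2, y3, y4
```

A call such as `simulate_backcross("III", (12, 2, 3, 13), 0.7)` would give one replicate for Case III with $θ = 0.7$; the group sizes shown are arbitrary illustrative values.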

Note that in Cases II and V, $f_{1}$ and $f_{2}$ differ only in their scale parameters, so tests that assume a common scale have no location difference to detect. These two cases are therefore omitted from the comparison under $σ_{1} = σ_{2} = σ$.

As shown in Tables 3–5, the power of all five methods increases with the sample size, and the power is very similar across methods in most cases. Under the condition $σ_{1} = σ_{2} = σ$, the four generalized fiducial methods are slightly more powerful than the LR method. Among the four generalized fiducial methods, GVMV has the greatest power, although GVJ and GVW are typically very close to GVMV.

Table 3. Powers (%) of the five methods for Normal mixture model.


Table 4. Powers (%) of the five methods for Logistic mixture model.


Table 5. Powers (%) of the five methods for mixture model under $σ_{1} = σ_{2} = σ$ .


In summary, the LR method becomes liberal in the case of small and moderate sample sizes ( $n \leq 100$ ), whereas GVM and GVW become slightly conservative, and GVMV and GVJ show better performance.

4. Real example

In this section, we apply the generalized fiducial methods to a real QTL analysis and further develop a comparison with the LR method.

Sugiyama et al. (2001) performed a QTL analysis on male mice from a reciprocal backcross between the salt-sensitive C57BL/6J (B6) and the normotensive A/J (A) inbred strains after they had been provided with water containing 1% salt for 2 weeks. They were mainly concerned with the genetic control of salt-induced hypertension. Here, we use the five methods to analyse blood pressure data in the 250 male backcross mice typed at 174 markers; the data are available in the R package ‘qtl’ under the name ‘hyper’ or can be downloaded from https://phenome.jax.org/projects/Sugiyama2. The detailed process of the experiment can be found in Sugiyama et al. (2001). In this example, we focus only on the QTL locations, not the QTL–QTL interactions, on chromosome 1. The 22 markers on this chromosome define 21 intervals, each of which corresponds to four groups of data. For $\{y_{1j}, j = 1, \dots, n_{1}\}$ and $\{y_{4j}, j = 1, \dots, n_{4}\}$ in each interval, we apply the Shapiro–Wilk test to determine whether the observations come from normal distributions. The results are consistent with normal $f_{1}$ and $f_{2}$ in 10 of the 21 intervals; their p-values from the Shapiro–Wilk test are listed in Table 6. QTL detection in these 10 intervals can then be modelled by Equation (1), $\begin{aligned} Y_{1j} & \sim f_{1} (y), & j = 1, \dots, n_{1}, \\ Y_{2j} & \sim θ f_{1} (y) + (1 - θ) f_{2} (y), & j = 1, \dots, n_{2}, \\ Y_{3j} & \sim (1 - θ) f_{1} (y) + θ f_{2} (y), & j = 1, \dots, n_{3}, \\ Y_{4j} & \sim f_{2} (y), & j = 1, \dots, n_{4}, \end{aligned}$ under normal distributions.

Table 6. p-values of Shapiro–Wilk test.


Table 7 provides the sample sizes of the 10 intervals. Nine of them have small or moderate sample sizes (n<50). The asymptotic distributions of the LR method are approximated by $M = 100{,}000$ simulated realizations (Liu et al., 2020), and the lengths of the Markov chains of the four GPQs are set to B = 5000. Taking the interval D1Mit156–D1Mit178 as an example, we obtain the recombination proportion $\check{θ} \approx 0.4006$, the locations $\check{μ}_{1} \approx 100.1$ and $\check{μ}_{2} \approx 103.4$, and the scales $\check{σ}_{1} \approx 4.645$ and $\check{σ}_{2} \approx 6.274$. The observed value of the LR statistic is 4.867 and its p-value is 0.1678. The trace plots of the Markov chains for the four generalized p-value methods are shown in Figure 1; the p-values for GVJ, GVM, GVW, and GVMV are 0.1039, 0.1078, 0.1033, and 0.1009, respectively. Therefore, at a significance level of $α = 0.05$, the null hypothesis (3), $H_{0} : (μ_{1}, σ_{1}) = (μ_{2}, σ_{2})$, cannot be rejected by any of the five methods, and the existence of QTL effects cannot be confirmed in this interval.

Figure 1. Trace plots of the Markov chains for the four generalized p-value methods for D1Mit156–D1Mit178.

Table 7. Sample sizes of 10 intervals.


Similarly, the results for the remaining nine intervals are shown in Tables 8 and 9. The four generalized fiducial methods lead to the same conclusions as the LR method. A QTL effect exists in the D1Mit14–D1Mit105 interval, at least with respect to the mean. The D1Mit105–D1Mit159 and D1Mit159–D1Mit267 intervals contain QTLs that affect the variance but not the mean. Note that in the interval D1Mit267–D1Mit15, without the equal-scale assumption, the p-value (0.0673) of the LR method is much smaller than those of the four generalized fiducial methods. At a significance level of 0.1, the LR method declares that a QTL effect exists in the variance but not in the mean. However, as shown in Figure 2, $\{y_{1j}, j = 1, \dots, n_{1}\}$ and $\{y_{4j}, j = 1, \dots, n_{4}\}$ have similar distributions, and the p-value of the F-test comparing the variances of $\{y_{1j}, j = 1, \dots, n_{1}\}$ and $\{y_{4j}, j = 1, \dots, n_{4}\}$ is 0.3982, much larger than the nominal significance level of 0.05. From this perspective, the results of the four generalized p-value methods are more reliable, whereas the LR test is liberal to some degree.

Figure 2. The boxplot of $\{y_{1j}, j = 1, \dots, n_{1}\}$ and $\{y_{4j}, j = 1, \dots, n_{4}\}$ for D1Mit267–D1Mit15.


Table 8. Five kinds of p-values in 10 intervals.


Table 9. Five kinds of p-values in 10 intervals under $σ_{1} = σ_{2} = σ$ assumption.


5. Conclusion

In this paper, we propose four generalized fiducial methods to test the existence of QTL effects between two flanking markers. Based on the simulation results, we find that the generalized fiducial methods can control type I errors fairly well even when sample sizes are less than 50. These four generalized fiducial methods have almost the same power as the LR method under a fair comparison, and they are slightly more powerful than the LR method when $σ_{1} = σ_{2} = σ$. The four methods can be extended to test the existence of QTL effects when double recombination occurs, in which case the data from each of the four groups come from a mixture distribution in both location and scale. Meanwhile, more efficient algorithms should be explored, as our two-block algorithm is somewhat time-consuming. We leave these as directions for future research.

Acknowledgments

We are grateful to the Associate Editor and two reviewers for their thorough reading of our manuscript and the constructive comments that led to significant improvement of our work.

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Funding

This work was supported by the China Scholarship Council [Grant Number 201906140047] and National Natural Science Foundation of China [Grant Numbers 11801210, 11801359 and 11771145].

References

Agresti, A., & Caffo, B. (2000). Simple and effective confidence intervals for proportions and differences of proportions result from adding two successes and two failures. The American Statistician, 54(4), 280–288. https://doi.org/10.1080/00031305.2000.10474560
Agresti, A., & Coull, B. A. (1998). Approximate is better than ‘exact’ for interval estimation of binomial proportions. The American Statistician, 52(2), 119–126. https://doi.org/10.2307/2685469
Bebu, I., Luta, G., Mathew, T., & Agan, B. K. (2016). Generalized confidence intervals and fiducial intervals for some epidemiological measures. International Journal of Environmental Research and Public Health, 13(6), 605. https://doi.org/10.3390/ijerph13060605
Cai, T. T. (2005). One-sided confidence intervals in discrete distributions. Journal of Statistical Planning and Inference, 131(1), 63–88. https://doi.org/10.1016/j.jspi.2004.01.005
Chen, Z., & Chen, H. (2005). On some statistical aspects of the interval mapping for QTL detection. Statistica Sinica, 15(4), 909–925.
Cui, Y., & Hannig, J. (2019). Nonparametric generalized fiducial inference for survival functions under censoring. Biometrika, 106(3), 501–518. https://doi.org/10.1093/biomet/asz016
Efron, B. (1998). RA Fisher in the 21st century. Statistical Science, 13(2), 95–122. https://doi.org/10.1214/ss/1028905930
Fisher, R. A. (1930). Inverse probability. In B. J. Green (Ed.), Mathematical Proceedings of the Cambridge Philosophical Society (Vol. 26, pp. 528–535). Cambridge University Press.
Fisher, R. A. (1932). Statistical Methods for Research Workers (4th ed.). Oliver & Boyd.
Frühwirth-Schnatter, S. (2006). Finite Mixture and Markov Switching Models. Springer.
Gelman, A., Stern, H. S., Carlin, J. B., Dunson, D. B., Vehtari, A., & Rubin, D. B. (2014). Bayesian Data Analysis (3rd ed.). Chapman and Hall/CRC.
Hannig, J. (2009). On generalized fiducial inference. Statistica Sinica, 19(2), 491–544.
Hannig, J., Iyer, H., Lai, R. C., & Lee, T. C. (2016). Generalized fiducial inference: A review and new results. Journal of the American Statistical Association, 111(515), 1346–1361. https://doi.org/10.1080/01621459.2016.1165102
Hannig, J., Iyer, H., & Patterson, P. (2006). Fiducial generalized confidence intervals. Journal of the American Statistical Association, 101(473), 254–269. https://doi.org/10.1198/016214505000000736
Huang, N., Parco, A., Mew, T., Magpantay, G., McCouch, S., Guiderdoni, E., Xu, J., Subudhi, P., Angeles, E. R., & Khush, G. S. (1997). RFLP mapping of isozymes, RAPD and QTLs for grain shape, brown planthopper resistance in a doubled haploid rice population. Molecular Breeding, 3(2), 105–113. https://doi.org/10.1023/A:1009683603862
Korol, A., Ronin, Y., Tadmor, Y., Bar-Zur, A., Kirzhner, V., & Nevo, E. (1996). Estimating variance effect of QTL: An important prospect to increase the resolution power of interval mapping. Genetical Research, 67(2), 187–194. https://doi.org/10.1017/S0016672300033632
Krishnamoorthy, K., & Lee, M. (2010). Inference for functions of parameters in discrete distributions based on fiducial approach: Binomial and Poisson cases. Journal of Statistical Planning and Inference, 140(5), 1182–1192. https://doi.org/10.1016/j.jspi.2009.11.004
Lai, R. C., Hannig, J., & Lee, T. C. (2015). Generalized fiducial inference for ultrahigh-dimensional regression. Journal of the American Statistical Association, 110(510), 760–772. https://doi.org/10.1080/01621459.2014.931237
Lander, E. S., & Botstein, D. (1989). Mapping mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics, 121(1), 185–199. https://doi.org/10.1093/genetics/121.1.185
Li, X., Su, H., & Liang, H. (2018). Fiducial generalized p-values for testing zero-variance components in linear mixed-effects models. Science China Mathematics, 61(7), 1303–1318. https://doi.org/10.1007/s11425-016-9068-8
Li, X., Xu, X., & Li, G. (2007). A fiducial argument for generalized p-value. Science in China Series A: Mathematics, 50(7), 957–966. https://doi.org/10.1007/s11425-007-0067-7
Li, X., Zhou, X., & Tian, L. (2013). Interval estimation for the mean of lognormal data with excess zeros. Statistics & Probability Letters, 83(11), 2447–2453. https://doi.org/10.1016/j.spl.2013.07.004
Liu, G., Li, P., Liu, Y., & Pu, X. (2020). Hypothesis testing for quantitative trait locus effects in both location and scale in genetic backcross studies. Scandinavian Journal of Statistics, 47(4), 1064–1089. https://doi.org/10.1111/sjos.v47.4
McLachlan, G., & Peel, D. (2000). Finite Mixture Models. John Wiley & Sons.
Nkurunziza, S., & Chen, F. (2011). Generalized confidence interval and p-value in location and scale family. Sankhya B, 73(2), 218–240. https://doi.org/10.1007/s13571-011-0026-8
Perng, S., & Littell, R. C. (1976). A test of equality of two normal population means and variances. Journal of the American Statistical Association, 71(356), 968–971. https://doi.org/10.1080/01621459.1976.10480978
Rebai, A., Goffinet, B., & Mangin, B. (1994). Approximate thresholds of interval mapping tests for QTL detection. Genetics, 138(1), 235–240. https://doi.org/10.1093/genetics/138.1.235
Rebai, A., Goffinet, B., & Mangin, B. (1995). Comparing power of different methods for QTL detection. Biometrics, 51(1), 87–99. https://doi.org/10.2307/2533317
Schaarschmidt, F., Sill, M., & Hothorn, L. A. (2008). Approximate simultaneous confidence intervals for multiple contrasts of binomial proportions. Biometrical Journal, 50(5), 782–792. https://doi.org/10.1002/bimj.v50:5
Sugiyama, F., Churchill, G. A., Higgins, D. C., Johns, C., Makaritsis, K. P., Gavras, H., & Paigen, B. (2001). Concordance of murine quantitative trait loci for salt-induced hypertension with rat and human loci. Genomics, 71(1), 70–77. https://doi.org/10.1006/geno.2000.6401
Williams, J. P., & Hannig, J. (2019). Nonpenalized variable selection in high-dimensional linear model settings via generalized fiducial inference. The Annals of Statistics, 47(3), 1723–1753. https://doi.org/10.1214/18-AOS1733
Wu, R., Ma, C., & Casella, G. (2007). Statistical Genetics of Quantitative Traits: Linkage, maps and QTL. Springer.
Wu, W. H., & Hsieh, H. N. (2014). Generalized confidence interval estimation for the mean of delta-lognormal distribution: An application to New Zealand trawl survey data. Journal of Applied Statistics, 41(7), 1471–1485. https://doi.org/10.1080/02664763.2014.881780
Xu, X., & Li, G. (2006). Fiducial inference in the pivotal family of distributions. Science in China Series A, 49(3), 410–432. https://doi.org/10.1007/s11425-006-0410-4
Zhang, H., Chen, H., & Li, Z. (2008). An explicit representation of the limit of the LRT for interval mapping of quantitative trait loci. Statistics & Probability Letters, 78(3), 207–213. https://doi.org/10.1016/j.spl.2007.05.020
