333
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Heterogeneous logistic regression for estimation of subgroup effects on hypertension

, , , , , , , , & show all
Pages 969-985 | Received 18 Jan 2021, Accepted 18 Mar 2022, Published online: 16 May 2022
 

ABSTRACT

Personalized medicine has gained much attention in the past decades, and identifying the effects of factors is essential for personalized preventions and treatments. Hypertension is a major modifiable risk factor for cardiovascular disease and is influenced by complex factors. In order to decrease the incidence of hypertension effectively, the subjects should be divided into subgroups according to their characteristics. In this study, we proposed to use a heterogeneous logistic regression combined with a concave fusion penalty to analyze the population-based survey data, including common influencing factors of hypertension. The analytic steps include: (1) identifying the most important predictor; (2) estimating subgroup-based heterogeneous effects. In the present context of primary hypertension data, the modeling results showed that the calculated prediction accuracy under our method was greater than 99%, while zero under the classical logistic regression. The findings could provide a practical guide for further individualized measures implementation.

Disclosure statement

No potential conflict of interest was reported by the authors.

7.2. Initial value

To facilitate the update of (α(k+1),ν(k+1),β(k+1)), at the (k+1)th step in (2.8) to (2.10) of the ADMM iterative algorithm, we need to specify a proper initial value. We obtain the regression estimators β(0) at the first step by minimizing a ridge fusion criterion

R(β)=12YXβ2+λ21i<jnβiβj2,

by setting λ=0.001, and using the matrix notation, we have

β(0)=(XTX+λΩTΩ)1{XTY}.

7.3. Tuning parameter

From a grid of λ values, we select the optimal tuning parameter λˆ by minimizing a modified BIC,

(2.13) BIC(λ)=i=1n[Yi(X˜iTβˆ(λ))log{1+exp(X˜iTβˆ(λ))}]+Cnlognn(Kˆ(λ)P),(2.13)

where Cn is a positive number dependent on n. We adopt the strategy of Ma and Huang (2016) and take Cn=log(np),φ=1, and a=3.

7.4. Proof of Theorem 1

First, we prove the consistency in the Kp-dimensional space {ρRKp} by constraining l(ρ), we prove that there exists a strict local maximizer ρˆ of l(w) satisfies ρˆρ02=Op(Kp/Gmin). We define an event,

H=l(ρ0)>maxwNιl(w),

where

l(ρ)=1ni=1nYi(UiTρ)log{1+exp(UiTρ)},

and Nι denotes the boundary of the closed set Nι={wRKp:∥wρ02ιKp/Gmin}, and ι(0,). It is easy to know that there exists a local maximizer ρˆ of ln(w) on the event H in Nι. Then we need to prove that when ι is large enough, P(H) is close to 1 as n. So we need to analyze the function l on the boundary Nι.

By Taylor’s theorem, we have for any wNι, and the Taylor’s expansion is

(2) l(w)l(ρ0)=(wρ0)Tv12(wρ0)TD(wρ0),(2)

where v=1nUT[Y1exp(θ0)+1], θ0=Uρ0,

D=1nUTdiag{exp(θ)(1+exp(θ))2}U,

θ=Uρ, and ρ lies on the segment joining \bw and ρ0. Based on

Emin(D)cGmin/n,

we get

maxwNιl(w)l(ρ0)Kp/Gminι(v2cGminnιKp/Gmin/2),

also along with Markov’s inequality entails that

P(H)P(v22<Gminc2Kpι24n2)14n2Ev22c2GminKpι2.

From E(Y|U)=μ(θ0)=1exp(θ0)+1,cov([Y1exp(θ0)+1])=Σ(θ0)=diag{exp(θ0)(1+exp(θ0))2}, we have

(3) Ev22=n2EUT[Y1exp(θ0)+1]22n2tr[UTdiag{Σ(θ0)}U]=O(1/n),(3)

with Gmin=O(n), we get P(H)1O(ι2). This proves ρˆρ02=Op(Kp/Gmin).

Next we prove the asymptotic normality of ρˆ. On the event H, we have shown that ρˆNιN0 is a strict local maximizer of l(w), then we can easily get that l(ρˆ)=0. Next, we expand l(ρˆ) around ρ0 to the first-order componentwise. Then, by the properties of X and ρˆρ02=Op(Kp/Gmin), we have

0=l(ρ0)n1UTdiag{exp(θ0)(1+exp(θ0))2}U(ρˆρ0)
+o(1)ρˆρ022
=n1UT[Y1exp(θ0)+1]n1UTdiag{exp(θ0)(1+exp(θ0))2}U(ρˆρ0)
(4) +op(Gmin1).(4)

It follows from ρˆN0, then we have,

n1UTdiag{exp(θ0)(1+exp(θ0))2}U(ρˆρ0)=n1UT[Y1exp(θ0)+1]+op(Gmin1/2),

then along with the properties of U entails,

(5) Vn1/2(ρˆρ0)=Vn1/2UT[Y1exp(θ0)+1]+op(1),(5)

where Vn=UTdiag{exp(θ0)(1+exp(θ0))2}U, the small order term can be understood under the L2 norm.

Then, we can show the asympotic normality of ρˆ. Define Gn as a 1×(Kp) row vector such that Gn∥=1, Then follows from (5) that

GnVn1/2(ρˆρ0)=un+op(1),

where un=GnVn1/2UT[Y1exp(θ0)+1]. Thus, by Slutlsy’s lemma, to show GnVn1/2(ρˆρ0)DN(0,1), it suffices to prove unDN(0,1). Next, we consider the asymptotic distribution of the linear combination

vn=GnVn1/2UT[Y1exp(θ0)+1]=i=1nξi,

where ξi=GnVn1/2Ui[Yi1exp(θ0i)+1]. Clearly, ξi‘s are independent and have mean 0, and

i=1nvar(ξi)=GnVn1/2[UTdiag{exp(θ0)(1+exp(θ0))2}U]Vn1/2GnT
(6) =GnGnT1,(6)

as n. By the Cauchy-Schwarz inequality, we have

i=1nE|ξi|3=i=1n|GnVn1/2Ui|3E|Yi1exp(θ0)+1|3
=O(1)i=1n|GnVn1/2Ui|3
O(1)i=1nGn23Vn1/2Ui23
(7) =O(1)i=1n(UiTVn1Ui)3/2=o(1).(7)

Then with use of Lyapunov’s

un=i=1nξiDN(0,1),

we complete the proof

7.5. Proof of Theorem 2

Define

(β)=i=1n[Yi(XiTβ)log{1+exp(XiTβ)}];Pλ(β)=1i<jnλρλ(βiβj),
G(ρ)=i=1n[Yi(UiTρ)log{1+exp(UiTρ)}];PλG(ρ)=1l<l Lλ|GlGl |ρλ(ρlρl ),

and let p(β)=(β)+Pλ(β), pG(ρ)=G(ρ)+PλG(ρ). Let H:MGRKp be the mapping which H(β) is the Kp×1 vector consisting of K vectors with dimension p and its lth vector component equals the common value of βi for iGk. Let H:RnpRKp be the mapping which H(β)={|Gk|1iGkβiT,k=1,,K}T.

Consider the neighborhood of β0:

Θ={βRnp:supiβiβ0i∥≤cvn},

where vn=Kp/Gmin. We show that βˆor) is a strictly local minimizer of the proposed penalized objective function almost surely through the following two steps:

(i) In event A1, where A1={supiβˆiorβ0i∥≤cvn}, p(β)>p(βˆor) for any βΘ and ββˆ, where β=H1(H(β)).

(ii) There is an event A2 such that P(A2C)2n and in A1A2, there is a neighborhood Θn of βˆ, and for βΘnΘ, p(β)>p(β).

It is easy to show (i) following Ma and Huang (2016). To show the result in (ii), we consider Θn={βi:supiβiβˆior∥≤sn} for a positive sequence sn. For βΘnΘ, by Taylor’s expansion, we have

p(β)p(β)=H1+H2,

where

H1=i=1nXiT[Yi1exp{Xiβ˜i}+1](βiβi),andH2=i=1nPλ(β˜)βiT(βiβi).

Here, β˜=aβ+(1a)β, Note that

H2k=1Ki,jGk,i<jλρλ (4sn)βiβj.

Setting Qi=Xi[Yi1exp{Xiβ˜i}+1], we have

H1=l=1Li,jGk,i<j(QjQi)T(βjβi)|Gk|,
supiQi∥≤P1+P2

where

P1=supiXisupi{|Yi1exp{Xiβ0i}+1|},
P2=supiXi{|1exp{Xiβ˜i}+11exp{Xiβ0i}+1|}.

For P1, since E(Yi|Xi)=1exp{Xiβ0i}+1, with Condition (C2) we have

Psupi|Yi1exp{Xiβ0i}+1|>2log(n)/c1
i=1nP|Yi1exp{Xiβ0i}+1|>2log(n)/c1
2n,

we conclude that there is an event A2 such that P(A2C)2n, and under the event A2 and condition (C3) (i),

P1c2(2log(n)/c1).

Thus, we have

|(QjQi)T(βjβi)|Gk||2Gmin1supiQi∥∥βjβi
(8) 4c2Gmin1[2log(n)/c1+c2Kp/Gmin]βjβi,(8)

and

p(β)p(β)k=1Ki,jGk,i<j{λρλ(4sn)4c2Gmin1[2log(n)/c1+c2Kp/Gmin]}βiβj.

Let sn0, and then λρλ(4sn)cλ. Since λmax(log(n)/Gmin,Kp/Gmin/Gmin), we have p(β)p(β)0 for a sufficiently large n, which completes the proof of Theorem 2.

Additional information

Funding

This study was equally supported by the National Natural Science Foundation of China [grant number 11901352], the Social Science Foundation of Ministry of Education of China [grant number:18XJC910001], the Natural Science Foundation of Shandong Province [grant number ZR2019BA017], the Social Science Foundation of Shandong Province [grant number 19DTJJ03] and the Young Scholars Program of Shandong University [YSPSDU: 11020088964008]. Xinglin Scholar Academic Backbone Project of Chengdu University of TCM [XSGG2020006].

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 717.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.