8,890
Views
3
CrossRef citations to date
0
Altmetric

Abstract

The effectiveness of duration and convexity hedging strategies deteriorates in the presence of non-parallel shifts of the yield curve. In the absence of appropriate constraints, the extension of these strategies accounting for changes in the shape of the yield curve generates unstable weights and extreme leverage, leading to poor out-of-sample hedging performance. To address this conundrum, we recast the bond portfolio immunization problem as a multifactor optimization program with leverage constraints and weight regularization. These regularized immunization strategies offer a robust improvement in hedging performance and are particularly well-suited to secure future cash flow needs such as pension liabilities.

PL Credits: 2.0:

    A relatively recent strand of the portfolio selection literature has shown that regularization methods play a central role in the successful implementation of Markowitz (Citation1952) mean-variance analysis. In particular, it has been shown that performing shrinkage of the input risk and return parameters can satisfactorily address the lack of robustness plaguing optimization models documented by Michaud (Citation1989) or DeMiguel, Garlappi, and Uppal (Citation2009). In a nutshell, this approach, applied to expected return estimates by Stein (Citation1956) and Black and Litterman (Citation1992) and to covariance matrix estimates by Ledoit and Wolf (Citation2003, Citation2004), combines a structured but biased target for a parameter estimator with the unbiased but low convergence sample estimator. In parallel, other authors have documented the positive impact on out-of-sample performance of regularizing the model outputs, i.e., restricting portfolio weights. For instance, Frost and Savarino (Citation1986), and Jagannathan and Ma (Citation2003) show that excluding short sales reduces the gap between ex-ante and ex-post performance, and substantially decreases the out-of-sample volatility for the minimum variance portfolio. These two approaches are actually connected, as the regularization of output weights is under certain conditions equivalent to the regularization of input parameter estimates (see Jagannathan and Ma Citation2003; DeMiguel et al. Citation2009).

    While this strand of research provides compelling theoretical and empirical arguments in favor of the introduction of regularization constraints in portfolio optimization, its focus has been mostly restricted to mean-variance analysis in equity markets. Our paper extends this literature by showing that regularization also has a fundamental role to play in the construction of robust interest rate hedging portfolios. From a theoretical standpoint the foundational work of Merton (Citation1969, Citation1971, Citation1973) has highlighted the importance for long-term investors to hedge against unexpected changes in key state variables impacting the investment opportunity set, including notably interest rates. From a practical standpoint, interest rate hedging is particularly relevant for institutional investors endowed with long-term liabilities, such as pension funds or insurance companies. Furthermore, interest rate hedging portfolios also constitute a fundamental building-block in retirement investing solutions for individuals and households (e.g., Martellini and Milhau Citation2010, Citation2012; Martellini, Milhau, and Mulvey Citation2019, Citation2020).

    Historically the first factor-based approach to interest rate hedging was focused on matching Macaulay’s (Citation1938) duration, introduced in seminal papers by Samuelson (Citation1945), Redington (Citation1952), Fisher and Weil (Citation1971) and Bierwag and Khang (Citation1979).Footnote1 This widely used hedging strategy and its well-known extension to convexity matching are designed to protect against parallel shifts of the term structure of interest rates. However, there is ample evidence that changes in the shape of the yield curve, such as slope and curvature movements, can also be substantial (see Diebold and Li Citation2006), and can have a strong negative impact on the hedging quality of standard duration and convexity matching strategies (Ingersoll, Skelton, and Weil Citation1978; Fong and Vasicek Citation1984). Early papers have proposed to relax the pure parallel shift assumption, starting with Khang (Citation1979) who derived a hedging strategy that accounts for changes in the slope of the yield curve, and Fong and Vasicek (Citation1984) who proposed an immunization strategy to hedge a single future cash flow under general movements of the yield curve. Immunization strategies were then generalized to hedge multiple cash flows in a multifactor (or “multi-directional”) setting in Reitano (Citation1991) and implemented using principal components analysis (PCA) in Barber and Copper (Citation1996). The three principal components that best capture variations in interest rates, estimated from historical yields, are routinely identified with the level, slope and curvature factors of the popular Nelson and Siegel (Citation1987) model.

    Multifactor hedging strategies that account for changes in the shape of the yield curve present, however, an important robustness challenge. In the absence of regularization restrictions, multifactor bond portfolio immunization strategies often result in hedging portfolios with extreme weights and high leverage, leading to poor out-of-sample hedging performance.Footnote2 As confirmed in our empirical analysis, an unconstrained multifactor immunization strategy can lead to inferior hedging performance compared to simple Duration-matching strategies. Investors are thus left with the following dilemma, typical of a bias-variance trade-off. On the one hand, Duration-Convexity strategies that present persistent biases in their exposures to slope and curvature factors leave portfolios exposed to changes in the shape of the yield curve. On the other hand, multifactor strategies that match exposures to the level, slope and curvature factors present important instability in those factor sensitivities, i.e., large variance in their factor exposures, as well as extremely high levels of leverage.

    We propose to address this conundrum using techniques borrowed from the machine learning literature. More specifically we recast the factor hedging problem in a flexible regularization framework incorporating an explicit control of the portfolio leverage that presents several key advantages over the unconstrained factor-matching approach. First, the proposed immunization strategies are designed to achieve a superior bias-variance trade-off, because they minimize exposure biases to any number of yield factors considered (such as level, slope, and curvature) but also present stable out-of-sample factor exposures due to the regularization and control of the hedging portfolio weights and leverage. Our empirical analysis confirms that these regularized general immunization strategies offer a remarkable improvement in terms of out-of-sample robustness. Additionally, our proposed strategies do not require a pre-selection of a specific subset of the hedging instruments, so that all available assets can be considered to form the hedging portfolio. This latter feature, together with the type of regularization constraints that we use, result in portfolios with much lower turnover and weight concentration levels compared to unrestricted factor-matching strategies. Overall, the proposed strategies dominate unconstrained immunization strategies in terms of out-of-sample performance, robustness, and portfolio turnover.

    A Regularization Approach to Multifactor Interest Rate Immunization

    In this section, we begin with a description of the Nelson and Siegel (Citation1987) term structure model and we discuss different factor immunization approaches. Readers not interested in technical details could skip this section and directly move to the next section, which provides a self-contained empirical analysis of the benefits of the regularization approach to interest rate risk management.

    Yield Curve and Bond Return Approximation in Nelson–Siegel Model

    Motivated by the observation that three factors, namely, level, slope, and curvature, explain almost all the time variation in sovereign interest rates, Nelson and Siegel (Citation1987) postulate that the yields on zero-coupon bonds can be written as linear combinations of three factors. Specifically, the zero-coupon yield of maturity u at time t is given by yt,u=β0t+f(uτ)β1t+g(uτ)β2t, where f(u)=1euu, g(u)=f(u)eu.

    Here τ is a constant parameter, and the coefficients βit represent the three factor values at time t. β0t can be interpreted as the long-term yield or level factor. β1t, the negative of the spread between the long- and the short-maturity yields, is the slope factor in the sense that an increase in β1t implies a flatter yield curve. Finally, β2t can be regarded as a curvature factor in the sense that an increase in β2t leads to a more concave or less convex yield curve.

    The price of a coupon-paying bond, given by the sum of its future cash flows discounted with the zero-coupon yields of the appropriate maturities, is a function of the three factors at each point in time. If CFs denotes the cash flow paid at time s, the (dirty) price at time t is (1) Bt=s>tCFse[st]yt,st=F(β0t,β1t,β2t,t),(1) where the summation is over all payment dates after time t. Note that the function F is twice differentiable with respect to the three factor values and is differentiable with respect to time except on coupon payment dates.Footnote3

    From time t to time t + h, the total return on the bond, that is the return with any intermediate payment reinvested in the bond, can be approximated with the following Taylor expansion: (2) rt,t+hF0FΔβ0t+F1FΔβ1t+F2FΔβ2t+F3Fh,(2) where F0, F1, and F2 are short-hand notations for the derivatives of F with respect to β0t, β1t, and β2t, and F3 is the right derivative with respect to time. A more accurate approximation is obtained by introducing the second-order partial derivatives, including the cross derivatives:Footnote4 (3) rt,t+hF0FΔβ0t+F1FΔβ1t+F2FΔβ2t+F3Fh+12i,j=03FijFΔβitΔβjt.(3)

    Detailed expressions for the derivatives are provided in Appendix A. It should be noted that the quantity F0/F bears a close resemblance to the modified duration indicator in that it is a weighted average of the cash flow maturities. The difference is that in the standard expression for duration, all cash flows are discounted with the same rate, namely, the yield-to-maturity, while in the expression for F0, every cash flow has its own discount rate, which depends on maturity. Similarly, F00/F is a weighted average of squared cash flow maturities, similar to the standard convexity indicator except that it involves maturity-dependent discount rates.

    Bond Portfolio Immunization Strategies

    Let n denote the total number of constituents in a bond portfolio and wit be the weight of constituent i at time t. Applying the Taylor expansion (2) to the pricing function of each constituent, F(i), we have the following approximation for the portfolio return: Rt,t+hi=1nwit[F0(i)F(i)Δβ0t+F1(i)F(i)Δβ1t+F2(i)F(i)Δβ2t+F3(i)F(i)h].

    To immunize a given stream of liability cash flows with respect to unexpected changes in factor values, an investor needs to hold a bond portfolio with factor exposures matching those of the target liability portfolio to the first-order, and possibly to the second-order in case larger changes in factor values can occur within the trading interval.Footnote5

    Different immunization strategies have a focus on matching different factor exposures. While the level factor is the most important factor to hedge because it explains the largest fraction of bond returns, an investor may also decide to match slope and/or curvature exposures to be better immunized with respect to non-parallel shifts of the yield curve. Assume that a subset of k exposures to match has been selected and denote the exposures of the liability portfolio as D1, …, Dk. Collect the exposures of the hedging portfolio constituents in k row vectors H1, …, Hk, so that the ith element of Hp is the pth exposure of constituent i, and stack portfolio weights in the column vector x. All exposures and weights depend on time, but we drop the time subscript to alleviate notation. A factor-matching portfolio must by construction have the same exposures as the target liability portfolio, and it must also be fully invested in its constituents, implying that x must satisfy the following system of linear equations: (4) Hx=D.(4) H is the matrix whose first k rows are the vectors H1, …, Hk and the last row is filled with ones, and D is the column vector of length k + 1 whose elements are the target exposures D1, …, Dk and 1. Since k is the number of exposures that the strategy aims to match, for Duration-matching k is 1 and H has two rows, and for Duration-Convexity matching k is 2, and H has three rows. If one wanted to match the first-order sensitivities to the yield curve level, slope and curvature factors of the liability portfolio, then H would have four rows (k = 3).

    The system (4) has a unique solution if the matrix H is square and non-singular. Hence, a first approach to bond portfolio immunization, consists in selecting k + 1 bonds, so that the system in EquationEquation (4) has a unique exact solution, given byFootnote6 (5) x*=H1D(5)

    The weights of hedging strategies are set back to the target allocation x* in discrete time and not necessarily rebalanced at every possible trading date, to limit transaction costs. Hence, between rebalancing dates, hedging strategies do not always have an exact match of the target bond factor exposures. This implies possibly non-zero deviations between the sensitivities of the hedging portfolio and the ones of the replication target, i.e., ε=DHx with ε0.

    Replication errors ε can be caused by discrete hedging or by the presence of factor exposures not explicitly accounted for by the hedging strategy. For example, a strategy aiming to match only the first-order sensitivity to the level factor can present a persistent deviation or bias in its exposure to the slope factor. Indeed, in the empirical analysis presented in the next section, we find that the strategies matching only duration or duration and convexity can present significant and persistent biases in their exposures to the slope and curvature factors (even on rebalancing dates). On the other hand, while the strategy matching the first-order sensitivities to the three Nelson–Siegel (NS) factors using the exact-solution approach in EquationEquation (5) allows by construction for a perfect match of all three first-order factor exposures on any trading date, it exhibits extreme weights and high leverage levels, which results in very unstable factor exposures between rebalancing dates and ultimately leads to poor out-of-sample hedging performance.Footnote7

    A Regularization Method

    To address the problems posed by extreme weights and highly leveraged positions, we introduce a regularization approach to interest rate multifactor hedging, which is designed to match a higher number of factor sensitivities (hence reduce the biases in factor exposures), and at the same time produce stable factor exposures (i.e., reduce variance in their factor exposures). It also allows us to relax the restriction in the exact-solution approach (EquationEquation (5)) of having as many hedging instruments as linear constraints, that is k + 1, where k is the number of factor sensitivities to match. Hence, the portfolio weights vector x has length n, where n is the total number of available bonds, which can be different from (greater than) the number of constraints. The vector of the instruments’ sensitivities now has n elements, i.e., Hi=[hi1,hi2,  hin], and the matrix H has k + 1 rows and n columns.

    The objective is to find the weights of the hedging portfolio, x, that minimize the mean squared error (MSE) in yield curve factor exposures: min x(DHx)(DHx)

    This program is equivalent to (6) minx12xHHxxHD(6)

    Deriving EquationEquation (6) with respect to x and equating the gradient to zero yields the following solution to this unconstrained program: (7) x*=(HH)1(HD)(7)

    This unrestricted solution is not feasible in general, because the matrix HH in EquationEquation (7) is not invertible whenever the number of bonds n exceeds k+1.Footnote8 On the other hand, it is possible to find other solutions to the MSE minimization problem in EquationEquation (6), using a standard approach in the machine learning literature to address the bias-variance trade-off in MSE programs such as EquationEquation (6). This consists of adding to the objective function a penalization on the norms of the output vector. The first norm of vector x is: (8) L(x)i=1n|xi|(8) which coincides with the gross leverage of the portfolio. A fully invested portfolio with no leverage has L(x)=1. Denoting Λ=HH and Θ=HD, the program with leverage penalization is:Footnote9 (9) minxxΛx2xΘ+λ(i=1n|xi|)(9) u.c. i=1nxi=1 where the norm penalization coefficient λ can be set such that the leverage of the portfolio is below a given leverage budget δ¯. The optimization program in EquationEquation (9) is similar to a lasso regression with an additional constraint, which is that weights should add up to 1. For this reason, we refer to the resulting portfolio x* as a lasso hedging strategy.

    Given that short-selling of long-dated coupon-paying bonds can be operationally very cumbersome in practice, we restrict short-selling to the bond with the shortest duration, i.e., “cash” (see Appendix B for more details on the optimization program implemented).Footnote10 We also consider a similar approach to the optimization program in EquationEquation (9) but using a penalization on the second norm, which can be labeled a ridge hedging strategy. To determine the penalization coefficient of the second norm in that case, we used a heuristic to determine the coefficient that would cap the leverage of the portfolio to the leverage budget δ¯. In our empirical tests, we found that the ridge approach presents a very similar improvement in terms of hedging performance as the lasso approach when δ¯=3. However, we found that the ridge approach is not well adapted when a long-only constraint is required, i.e., for δ¯=1, which requires large penalization coefficients. Intuitively, and as formally shown in Appendix C, the ridge approach converges toward an equal-weighted portfolio when the penalization coefficient tends to infinity. Naturally, an equal-weighted strategy implies large deviations in factor exposures and a deterioration in hedging performance. For this reason, in the empirical tests presented in the following section, we focus on the lasso regularization.Footnote11

    Empirical Analysis

    It is well-known that three factors, namely, level, slope, and curvature, which value can be obtained by calibration of the NS model with respect to observed bond prices, explain a large fraction of sovereign bond returns. Hereafter we present an empirical analysis of factor-based interest rate immunization strategies and discuss how the introduction of regularization techniques aiming at penalizing large weights improves the robustness and out-of-sample hedging efficiency of these strategies. In a nutshell, investors have a choice between a narrow single-factor focus on hedging changes in the level of the yield curve and a broader multifactor focus on hedging changes in the level, slope and curvature of the yield curve. While it intuitively represents a more comprehensive and effective immunization strategy, the latter approach unfortunately typically involves unstable and exceedingly large portfolio weights that lead to poor out-of-sample hedging performance. In what follow, we demonstrate that regularization techniques can solve this conundrum via the introduction of leverage constraints allowing for better behaved hedging portfolios in a multifactor setting. We first present the liability stream to be replicated, the dataset and the empirical protocol that we use. Then, we present the results of the out-of-sample analysis for the hedging strategies considered.

    Liability Stream to Be Replicated

    One of the most important applications of bond portfolio immunization strategies is pension liability-hedging, where the focus is on securing replacement income. In this context, the target liability stream to hedge is taken to be the payoff of a retirement bond, a fictitious financial security that pays fixed monthly cash flows starting at a future (retirement) date T. After this date, the stream of monthly cash flows continue for 20 years, which roughly correspond to average life expectancy in retirement.Footnote12 The series of 240 (12×20) monthly cash flows starts at a normalized $1 value and grow at a monthly rate g to incorporate an annualized cost-of-living adjustment of 2%.Footnote13 We refer the interested reader to Martellini and Milhau (Citation2020) for a detailed analysis of the benefits of retirement bonds, or replicating portfolios for retirement bonds, in retirement investing strategies. A retirement bond is in fact a simplified version of a non-redundant security called retirement SeLFIES (see Merton and Muralidhar Citation2017, Citation2020, for arguments in favor of the issuance of such securities), which features an indexation of the cash flows with respect to aggregate per capita consumption to hedge standard-of-living risk.

    In the absence of arbitrage opportunities, the fair price for the retirement bond can be obtained as the discounted value of its promised cash flows based on the current sovereign yield curve. Let t denote the current date and g be the growth rate in cash flows representing the cost-of-living adjustment. The N = 240 cash flows occur at dates T, T + 1…,T + N-1 and are discounted at the zero-coupon rates yt,T+ht, which corresponds to the yield of a zero bond paying $1 T + h-t periods ahead from current time t. With this notation, the no-arbitrage price of the retirement bond at date t is: Bt=h=0N1[1+g]T+he[T+ht]yt,T+ht

    Dataset and Empirical Protocol

    In our empirical analysis, we use individual bond price data from the US sovereign debt market. Specifically, we download from Bloomberg historical daily prices for all US Treasury bonds (“T” bond series), along with issuance date, maturity date, coupon rate, coupon frequency, nominal value and name of the issuer.Footnote14 Over the sample period, the number of US Treasury bonds is 1,183 in total, and the number of available bonds in a given day ranges from 112 to 312. The minimum time-to-maturity of the bonds available at any point in time ranges from 1 to 31 days, and the maximum time-to-maturity ranges from 25 to 30 years.

    To estimate the yield curve, we proceed as follows. Following Diebold and Li (Citation2006) we set the parameter τ in NS model equal to 1.37 = 1/(12 × 0.0609) at all dates, and choose the three remaining parameters that minimize the weighted sum of the absolute deviations between market prices of the US Treasuries and prices implied by the model. Following Gürkaynak, Sack, and Wright (Citation2007), the weights chosen in the objective function are inversely proportional to the duration of each individual security. The reason is that the deviation between the actual and predicted prices of an individual security is approximately equal to its duration multiplied by the deviation between the actual and model implied yields. Thus, this procedure aims to minimize the (unweighted) sum of the absolute deviations between the actual and model implied yields on all of the securities. The resulting yield curve parameters are presented in , which are then used to estimate the retirement bond price, its sensitivities relative to the three NS factors, as well as the sensitivities of the bonds used as hedging instruments.

    Figure 1. Yield Curve’s Level (Left Panel), Slope (Middle) and Curvature (Right Panel) Nelson–Siegel Model Parameters Estimated from Bond Prices, Assuming a Fixed Shape Parameter Equal to τ = 1.37 = 1/(12 × 0.0609) as in Diebold and Li (Citation2006)

    Figure 1. Yield Curve’s Level (Left Panel), Slope (Middle) and Curvature (Right Panel) Nelson–Siegel Model Parameters Estimated from Bond Prices, Assuming a Fixed Shape Parameter Equal to τ = 1.37 = 1/(12 × 0.0609) as in Diebold and Li (Citation2006)

    We consider a retirement bond with an initial time-to-maturity of 25 years, with a 5-year accumulation phase, during which no coupons are paid, and a 20-year decumulation period during which income is paid every month.Footnote15 Our sample period covers 25 years of historical data from July 3, 1995, to June 30, 2020.

    To build the hedging portfolios, on each rebalancing date, we pick the bonds with over one month of remaining time-to-maturity to avoid having to drop bonds in between rebalancing dates. To remove redundant bonds, and try to avoid matrix singularity problems, we sort bonds by duration on each rebalancing date, then we round the duration figures to two digits and remove duplicates (i.e., we remove bonds with almost the same duration). All hedging strategies are rebalanced monthly.

    Hedging Performance Measures

    In this exercise, we aim to build a liability-hedging portfolio that replicates as closely as possible the value of the aforementioned retirement bond. To measure the quality of replication by a given hedging strategy, we first measure for each hedging strategy the time variations of its funding ratio, defined as the value of the cumulative total return of the replicating portfolio divided by the cumulative total return of the retirement bond portfolio to be replicated. Intuitively the funding ratio provides an estimate for the coverage level of the liability stream with the assets held in the portfolio. Setting both portfolios to have the same initial value, the initial funding ratio is equal to 100%. A perfect replication would lead to a funding ratio staying constant at 100%, and large changes in the funding ratio indicate the presence of large residual risk in hedging strategies. More generally, a change of x% in the funding ratio implies the same percentage change in terms of the affordable retirement income generated by the hedging portfolio relative to what would be affordable with perfect replication, i.e., with the retirement bond itself. We thus measure the out-of-sample quality of replication with the following indicators:

    • FR.MEAD: median absolute deviation (with respect to 100%) of the daily funding ratio of the strategy, in percentage points, a measure of average risk.

    • FR.MAAD: maximum absolute deviation (with respect to 100%) of the daily funding ratio of the strategy, in percentage points, a measure of extreme risk.

    Besides measuring deviations in funding ratios, we also characterize the out-of-sample deviations in yield curve factor sensitivities of the hedging strategies relative to the sensitivities of the retirement bond. In particular, we report for all strategies the median and the maximum absolute deviations in the first- and second-order NS factor sensitivities with respect to the corresponding sensitivities for the retirement bond. Intuitively, the first-order sensitivity of a bond portfolio with respect to a given factor (say the slope of the yield curve) represents an estimate of the approximate (positive or negative) change in value of the bond portfolio following a small change in the corresponding factor (say a small steepening or flattening move). In the same spirit, the second-order sensitivity of a bond portfolio with respect to the slope of the yield curve represents an estimate of the approximate change in the first-order sensitivity value of the bond portfolio following a small steepening or flattening move. Incorporating second-order sensitivities in interest rate immunization strategies allows one to obtain a better hedging efficiency in case of a larger change in the value of underlying factors.

    We divide the absolute deviations series by the median absolute value of the corresponding sensitivity of the retirement bond, to express them as percentages, thus controlling for differences in scale. In the tables, these median and maximum deviations in first-order factor exposures are denoted as Di.MEAD and Di.MAAD, respectively (for i = 0,1,2), when they apply to the ith factor in NS model. For the deviations in second-order exposures they are denoted DiDj.MEAD and DiDj.MAAD, respectively (for i,j = 0,1,2).

    In addition, we report the median and maximum leverage of each strategy over the sample period (denoted Med Lev and Max Lev in the tables). The (gross) leverage of the portfolio x on each date is calculated as in EquationEquation (8). We also report the average one-way monthly turnover (TO), and the average number of constituents (NC) of each hedging portfolio.

    Out-of-Sample Performance of Hedging Strategies

    For comparison purposes, first consider two simple single-factor strategies aiming at immunizing changes in portfolio values with respect to changes in the level of interest rates without regularization constraints.Footnote16 The first strategy only aims to match the duration of the retirement bond. This “Duration Barbell” strategy is implemented by selecting the bond with the longest duration available at each rebalancing date and the shortest duration bond (“cash”) to make the balance, and its weights are chosen to match the duration of the retirement bond Duration (hence H in EquationEquation (5) is a 2 by 2 matrix in this case). The second strategy matches the Duration and Convexity of the retirement bond at each rebalancing date (see the exact-solution approach in EquationEquation (5)), which requires selecting three bonds. This Duration-Convexity strategy is implemented by selecting the bond with the longest duration, the bond with the median duration, and the bond with the shortest duration (“cash”) among the set available at each rebalancing date. Technically, the weights are calculated on each month end by EquationEquation (5), where H is a 3 by 3 matrix containing the Duration and Convexity of the three bonds (stacked with a vector of ones to ensure that the sum of weights equals 1), and the vector D contains the Duration and Convexity of the retirement bond, and the constant 1 (the sum of weights).

    present the time series of the first- and second-order sensitivities to the NS factors of the retirement bond, and of the Duration Barbell and Duration-Convexity strategies. As expected, in the upper-right panel of we observe that these strategies do not exhibit significant deviations in terms of the first-order (FO) sensitivity to the level factor (denoted D0=Fβ0F) at any point in time. Furthermore, in the left panel of we can see that the Duration-Convexity strategy does not present any noticeable deviations in the second-order (SO) sensitivity to the level factor (denoted D0D0), as expected. On the other hand, we observe that the Duration Barbell strategy does present a significant and persistent bias in its second-order sensitivity to the level factor. This can also be observed in , where we see that the median absolute deviation to the level factor, D0.MEAD, is equal to 1.9% for the Duration Barbell strategy, compared to 0.6% for the Duration-Convexity strategy. Moreover, the maximum absolute deviation in first-order sensitivities to the level factor, D0.MAAD, is 18% for the Duration Barbell strategy compared to 5.2% for the Duration-Convexity strategy.

    Figure 2. Funding Ratio and First-Order Sensitivities to the NS Factors of the Retirement Bond (Continuous Line), the Duration Barbell (Dashed Line), and the Duration-Convexity (Dotted Line) strategies implemented with k + 1 Bonds, Where k Is the Number of Factor Sensitivities Matched by the Strategy (2 and 3 for the Duration and Duration-Convexity Strategies, Respectively)

    The upper-left panel presents the time series of the funding ratios, the upper-right panel presents the first-order sensitivities to the level factor (D0), the bottom-left panel the first-order sensitivities to the slope factor (D1), and the bottom-right panel presents the first-order sensitivities to the curvature factor (D2).
    Figure 2. Funding Ratio and First-Order Sensitivities to the NS Factors of the Retirement Bond (Continuous Line), the Duration Barbell (Dashed Line), and the Duration-Convexity (Dotted Line) strategies implemented with k + 1 Bonds, Where k Is the Number of Factor Sensitivities Matched by the Strategy (2 and 3 for the Duration and Duration-Convexity Strategies, Respectively)

    Figure 3. Second-Order Sensitivities to the NS Factors of the Retirement Bond (Continuous Line), the Duration Barbell (Dashed Line) and the Duration-Convexity (Dotted Line) Strategies Implemented with k + 1 Bonds, Where k Is the Number of Factor Sensitivities Matched by the Strategy (2 and 3 for the Duration and Duration-Convexity Strategies, Respectively)

    The left panel presents the time series of the second-order sensitivities to the level factor (D0D0), the middle panel the second-order sensitivities to the slope factor (D1D1), and the bottom-right panel presents the first-order sensitivities to the curvature factor (D2D2).
    Figure 3. Second-Order Sensitivities to the NS Factors of the Retirement Bond (Continuous Line), the Duration Barbell (Dashed Line) and the Duration-Convexity (Dotted Line) Strategies Implemented with k + 1 Bonds, Where k Is the Number of Factor Sensitivities Matched by the Strategy (2 and 3 for the Duration and Duration-Convexity Strategies, Respectively)

    The larger deviations in the second-order sensitivity to β0 of the Duration Barbell strategy is expected to negatively impact the out-of-sample hedging performance of the strategy in case of large changes in the level of the yield curve between rebalancing dates. Our analysis confirms this intuition since it shows that the Duration-Convexity strategy, which exhibits lower bias in its second-order sensitivity to the level factor, presents a median and a maximum absolute deviation in funding ratio of 11.27% and 14.28%, while the Duration Barbell strategy exhibits a median and maximum deviation of 11.73% and 22.46%, respectively (see , panel A). The latter imply large levels of uncertainty in the purchasing power of the retirement assets in terms of retirement income.

    Table 1. Number of Constituents (NC), Median and Maximum Leverage (Med Lev, Max Lev), Final Funding Ratio (FR.Final), Funding Ratio Median Absolute Deviation (FR.MEAD) and Maximum Absolute Deviation (FR.MAAD), and Average One-Way Monthly Turnover (TO) of Hedging Strategies Matching the Retirement Bond Yield Curve Factor Sensitivities Rebalanced Monthly

    The lower panels of also show that the Duration Barbell and Duration-Convexity strategies present large and persistent deviations in the first-order (FO) sensitivities to the slope and curvature factors (denoted by D1 and D2), as expected for single-factor immunization strategies that solely focus on changes in the level of interest rates. Indeed, in the first two columns of we see that the (scaled) median absolute deviation and maximum absolute deviation in the FO sensitivity to the slope factor (D1.MEAD and D1.MAAD) of the Duration Barbell strategy are 47.0% and 71.1%, respectively, and the corresponding numbers for the Duration-Convexity are 38.8% and 102.4%. Moreover, the median and maximum absolute deviations in the FO sensitivity to the curvature factor (D2.MEAD and D2.MAAD) of the Duration Barbell strategy are 46.1% and 66%, respectively, and 32.9% and 70.1% for the Duration-Convexity strategy. Furthermore, shows that both strategies also present important biases in the second-order sensitivities relative to the slope, and curvature factors, D1D1 and D2D2. Besides, from and , note that the biases in all factor sensitivities of the Duration Barbell and Duration-Convexity strategies have opposite signs during most of the sample, and their deviations in terms of funding ratio from the retirement bond, observed in the upper-left panel of , also have opposite signs.

    Table 2. Summary of Deviations in NS Factor Sensitivities of Hedging Strategies Relative to the Retirement Bond

    The key insight in this analysis is that these single-factor immunization strategies can imply very significant levels of unintended biases in the first- and second-order sensitivities to the slope and curvature factors, as expected since the exposure to these factors are not controlled for by the Duration and Convexity strategies. In contrast, we confirm that a multifactor “NS k + 1” strategy presents relatively low biases in the first-order sensitivities to the three NS factors. As shows, the median absolute deviations in its FO sensitivity to the level factors (D0.MEAD) is 1.8% (slightly lower than the respective figure of 1.9% for the Duration Barbell strategy), and the median absolute deviations in its FO sensitivities to the slope and curvature factors are 0.9% and 0.8%, which are significantly lower than the corresponding figures for the Duration Barbell and Duration-Convexity strategies (see the D1.MEAD and D2.MEAD in for these two strategies, which are 47%, 38.8%, and 46.1%, 32.9%, respectively).

    These results suggest that strategies aiming to match factor exposures beyond the level factor could a priori improve hedging performance. However, we also find that using the exact-solution approach (see EquationEquation 5) to match the FO sensitivities to the three NS factors results in a strategy with an extreme leverage level (see panel A of ). Indeed, the strategy is found to exhibit extremely large deviations from its well-aimed median sensitivities, as reflected in its maximum deviations relative to all the NS factor exposures. Indeed, all the MAAD figures in that correspond to the NS k + 1 strategy, are above 161,461, contrasting with the corresponding figures of the other strategies. The extreme variability in its factor exposures is caused by the extreme weights and leverage that results for this strategy using the exact-solution approach. This large instability in factor exposures results in very poor hedging performance (see the NS k + 1 row in ).

    To address the conundrum of having either uncontrolled biases in slope and curvature factors exposures, as with Duration and Duration-Convexity strategies, or the extreme instability in factor exposures of the NS k + 1 strategy, we now analyze the performance of a regularization approach (that we call the lasso strategies) which minimize the squared deviations in first- and second-order NS factor exposures under leverage constraints. As observed in , panel B, the NS lasso strategy implemented with a maximum gross leverage limit (δ¯) of 3, presents lower median and maximum absolute deviations in all NS factor sensitivities than the Duration Barbell strategy and in 16 out of the 18 MEAD and MAAD metrics compared to the Duration-Convexity strategy. The enhanced control in factor exposures implied by leverage constraints translates into an economically significant improvement in hedging performance. As observed in , the maximum absolute deviations in funding ratio for the Duration Barbell and Duration-Convexity strategies are 22.46% and 14.28%, respectively, while it is only 3.39% for the NS lasso strategy. This represents a remarkable low level of risk, leaving investors with a funding ratio that stays in a narrow (96.61%–103.39%) range. In practice, the improvement in hedging performance net of transaction costs would actually be higher given that the average monthly turnover of the NS lasso strategy is almost four times lower than the turnover of the Duration Barbell strategy, and almost nine times lower than the turnover of the Duration-Convexity strategy.Footnote17

    We also simulate the NS lasso strategy with a limit on gross leverage (δ¯) of 1, effectively corresponding to a long-only constraint. Comparing panels B and C of , we can observe that tightening the leverage constraint to its maximum, produced some slight improvement in hedging performance for the strategy. The maximum absolute deviation in funding ratio decreases from 3.39% to 2.92%. These results are in stark contrast with the corresponding numbers obtained for the Duration and Duration-Convexity, of 22.46% and 14.28%. The conclusion from this analysis is that limiting leverage to a reasonable and controlled level in NS factor immunization strategies can induce large improvements in out-of-sample hedging performance. Interestingly, the NS lasso strategy under the leverage limit of 3 does not use all the available leverage budget, since it exhibits a maximum leverage over the period of only 1.53, a number higher than 1 but much lower than the maximum leverage level of 3.48 obtained with the unrestricted Duration-Convexity strategy (see Max Lev column in panel A of ).

    Regularized Duration and Convexity Hedging

    To assess the marginal improvement generated by hedging slope and curvature factors, we next consider a regularized version of the single-factor hedging strategies, namely, the Duration-Matching and Duration-Convexity matching strategies. The goal here is to minimize the squared difference between the duration (or duration and convexity) of the hedging portfolio and the retirement bond liability to be replicated, under the constraints of keeping the gross portfolio leverage under 3, and shorting allowed only in “cash”. One key difference between these regularized strategies and the duration strategy (respectively, duration and convexity) tested before, is that the latter requires the selection of a basket of two bonds (respectively, three bonds) to avoid an over-specified problem. On the other hand, in the regularized optimization setting, the strategy can potentially allocate to all the bonds available. As can be observed in , this so-called Duration lasso strategy presents lower median and maximum absolute deviations in all NS factor sensitivities than the unconstrained Duration Barbell strategy. This reduction in bias and variance of factor exposures is expected to reflect in lower replication error. Indeed, the maximum absolute deviation in funding ratio of the Duration lasso strategy is 3.80% compared to 22.46% for the Duration Barbell strategy (see panels A and B of ), which represents a reduction of almost 6 times the uncertainty around the level of retirement income secured with the bond portfolio. In practice, this improvement in hedging performance would be higher because the strategy involves a remarkable reduction in turnover. The average monthly turnover of the Duration lasso strategy is 8.58% compared to 68.1% for the Duration Barbell strategy.

    A similar improvement is observed when comparing the Duration-Convexity lasso strategy (with δ¯=3) with the unrestricted Duration-Convexity strategy. The regularized strategy presents a median and maximum absolute deviation in funding ratio of 2.37% and 3.77%, compared to 11.27% and 14.28% for the unrestricted Duration-Convexity strategy. Furthermore, the Duration-Convexity lasso strategy generates an average turnover of 12.6% compared to 159.93% for the unrestricted strategy. The notable improvement in hedging performance can be explained by the significant decrease in the median and maximum deviations in exposures to the slope and curvature factors. For instance, the median absolute deviation in exposures to the slope and curvature factors for the unrestricted Duration-Convexity strategy are 38.8% and 32.9%, compared to 4.1% and 4.2% for the Duration-Convexity lasso strategy (see , columns DurConv k + 1 and DurConv lasso, rows D1.MEAD and D2.MEAD). More generally, this regularized strategy presents lower values in all the MEAD and MAAD factor exposure metrics, compared to the unrestricted Duration-Convexity strategy.

    It is interesting to notice that the regularized Duration lasso and Duration-Convexity lasso strategies implemented with leverage constraints allow for such a significant improvement in terms of their deviations in factor exposures to the slope and curvature factors, even though they do not explicitly aim to minimize these deviations in their design. Given that the lasso strategies are not restricted to use a subset of the bonds available, they are much less conditioned by the individual characteristics (i.e., yield factor exposures) of the constituent bonds than the unrestricted exact-solution version of the strategy, which leads to enhanced robustness. However, we expect that explicitly targeting factor exposure matching for all 3 factors, as done with the NS lasso strategy, should lead to improved hedging performance. To analyze this, we report in the time series of first- and second-order sensitivities of the three lasso strategies matching Duration, Duration-Convexity and the NS factor sensitivities. Indeed, the latter strategy presents factor sensitivities that are consistently closer to the ones of the retirement bond liability. Moreover, as observed in , the NS lasso strategy presents lower values in 13 and 14 out of the 18 MEAD and MAAD metrics compared to the Duration lasso and Duration-Convexity lasso strategies. As expected, this better matching of factor exposures results in an improvement in terms of hedging performance. Indeed, the maximum absolute deviations of the funding ratio for Duration lasso and Duration-Convexity lasso strategies, with a value reaching 3.80% and 3.77%, respectively, are higher than the corresponding measure for the NS lasso strategy at 3.39%, and this for the same leverage limit of 3. Moreover, the median absolute deviations in funding ratio of the Duration lasso and Duration-Convexity lasso strategies, reaching, respectively, 2.05% and 2.37%, is also higher in an economic sense, than the 1.74% value obtained with the NS lasso strategy (see panel B in ).

    Figure 4. Funding Ratio and First-Order Sensitivities to the NS Factors of the Retirement Bond (Continuous Line), and the Hedging Strategies Implemented with All the Bonds Available and with a Lasso Regularization that Restricts the Gross Leverage below Three at All Times

    Strategies matching Duration, Duration and Convexity, and the strategy matching the first- and second-order sensitivities to the three NS factors. The upper-left panel presents the time series of the funding ratios, the upper-right panel presents the first-order sensitivities to the level factor (D0), the bottom-left panel the first-order sensitivities to the slope factor (D1), and the bottom-right panel presents the first-order sensitivities to the curvature factor (D2).
    Figure 4. Funding Ratio and First-Order Sensitivities to the NS Factors of the Retirement Bond (Continuous Line), and the Hedging Strategies Implemented with All the Bonds Available and with a Lasso Regularization that Restricts the Gross Leverage below Three at All Times

    Figure 5. Second-Order Sensitivities to the Three NS Factors of the Retirement Bond, and of the Hedging Strategies Implemented with All the Bonds Available and with a Lasso Regularization that Restricts the Gross Leverage below Three at All Times

    Strategies matching Duration, Duration and Convexity, and the strategy matching the first- and second-order sensitivities to the three NS factors. The left panel presents the time series of the second-order sensitivities to the level factor (D0D0), the middle panel the second-order sensitivities to the slope factor (D1D1), and the right panel presents the second-order sensitivities to the curvature factor (D2D2).
    Figure 5. Second-Order Sensitivities to the Three NS Factors of the Retirement Bond, and of the Hedging Strategies Implemented with All the Bonds Available and with a Lasso Regularization that Restricts the Gross Leverage below Three at All Times

    Panel C of presents the performance summary for the Duration lasso and Duration-Convexity lasso strategies implemented with the long-only constraint (δ¯=1). In comparison with the long-only NS lasso strategy, we observe a similar pattern, that is, we find that the latter strategy presents an economically meaningful improvement in hedging performance relative to the strategies that only take into account Duration or Duration and Convexity under the same regularized framework. Indeed, the long-only NS lasso presents a FR.MEAD of 0.61% compared to 1.54% and 1.48% for the long-only Duration-Convexity lasso and Duration lasso strategies, respectively. This improvement in hedging performance is also consistent with the lower deviations in factor exposures. We report in the median and maximum absolute deviations in the factor sensitives of the lasso strategies matching Duration, Duration-Convexity and NS factors implemented with the long-only constraint (δ¯=1). We find that the long-only NS lasso strategy presents lower or equal values in 14 out of the 18 MEAD and MAAD measurements than the Duration lasso and Duration-Convexity lasso strategies, respectively.

    Table 3. Summary of Deviations in NS Factor Sensitivities of Long-Only Hedging Strategies Relative to the Retirement Bond

    Comparing the corresponding columns in and , we note that the Duration lasso strategy implemented with a leverage limit of 3 has lower values in all of the MEAD and MAAD factor exposure deviations compared to the corresponding Duration lasso strategy implemented with zero tolerance for leverage (i.e., δ¯=1). This suggests that a reasonable degree of leverage, if limited and controlled, is expected to improve hedging performance.

    Finally, we have also simulated strategies minimizing the squared deviations in the first- and second-order sensitivities of the NS bond price with respect to time. These results suggest that matching the time sensitivities has a negligible effect on the hedging performance of the lasso strategy that takes into account the three NS factors. This is presumably because matching the exposure to a relatively exhaustive set of risk factors implies by the absence of arbitrage a reasonably good matching of expected return components. On the other hand, we find that adding the minimization of deviations in the time sensitivities to strategies matching only Duration or Duration-Convexity with the lasso approach significantly decreases deviations in slope and curvature sensitivities and improves the hedging performance to an economically meaningful extent.Footnote18

    Conclusion

    Hedging against non-parallel shifts in the yield curve is of major practical importance for institutions like pension funds or insurance companies endowed with long-term liabilities. Indeed, the fair value of a long-term stream of liability cash flows, such as promised pension payments for the beneficiaries of a defined-benefit pension fund, can be obtained as the discounted value of the cash flows. Thus being a complex function of the whole yield curve, an infinite-dimensional risk factor. Fortunately, it has been shown that three main factors, namely, the level, slope, and curvature of the yield curve, explain a large fraction of the changes in the term structure of interest rates. In this context, efficient immunization strategies can be obtained in principle by holding liability-hedging bond portfolios matching at every trading date the exposure of the stream of liability cash flows with respect to changes in these three risk factors to the first and possibly second-order.

    In implementation, however, the weights of these hedging portfolios obtained as unconstrained solutions to the yield curve multifactor exposure matching problem, results in extremely large portfolio weights. The latter implies exceedingly high levels of leverage, turnover, and unstable factor exposures between portfolio rebalancing dates.

    This paper proposes to address this issue with regularization procedures in the factor exposure matching problem, based on the introduction of a lasso penalty and leverage constraints. These procedures, borrowed from the machine learning literature, have proven effective in addressing the lack of robustness of mean-variance portfolio optimization in equity markets, and this paper shows that they are also helpful in the construction of interest rate hedging strategies. In out-of-sample empirical backtests, we confirm that these generalized immunization strategies offer both a substantial improvement in hedging performance and a strong reduction in leverage and turnover compared to an unconstrained approach to bond portfolio immunization.

    Acknowledgements

    Daniel acknowledges financial support from Universidad de los Andes (grant number P18.100322.008).

    Disclosure Statement

    No potential conflict of interest was reported by the author(s).

    Additional information

    Funding

    Daniel acknowledges financial support from Universidad de los Andes (grant number P18.100322.008).

    Notes on contributors

    Daniel Mantilla-Garcia

    Daniel Mantilla-Garcia is an Assistant Professor at Universidad de Los Andes, School of Management.

    Lionel Martellini

    Lionel Martellini is a Professor of Finance at EDHEC Business School, Nice, France.

    Vincent Milhau

    Vincent Milhau is a Research Director of EDHEC-Risk Institute, Nice, France.

    Hector Enrique Ramirez-Garrido

    Hector Enrique Ramirez-Garrido is a Research Assistant at Universidad de Los Andes, School of Management.

    Notes

    1 Furthermore, Redington (Citation1952) determined the conditions under which Duration-matching of multiple cash flows results in local immunization, while Bierwag, Kaufman, and Toevs (Citation1983) established the conditions under which Duration-matching implies global immunization in presence of level shifts, and Barber (Citation1999) generalized these results to the single-factor affine term structure model.

    2 We show that the unconstrained multifactor approach produces extreme weights even if the input parameters, e.g., yield curve’s level, slope and curvature in the Nelson and Siegel (Citation1987) model, present stable dynamics reflecting their economic interpretation.

    3 However, F is still right differentiable with respect to time on these dates.

    4 By Schwarz’s theorem, the cross derivatives Fij and Fji are equal for any two indices i and j.

    5 See Appendix A for a formal discussion of this argument.

    6 This unique and exact-solution hedging approach in EquationEquation (5) appears for instance in text books such as Fabozzi, Martellini, and Priaulet (Citation2005) and Martellini, Priaulet, and Priaulet (Citation2003). When there are more constituent bonds than constraints, the linear system (4) is underdetermined.

    7 A portfolio with extremely large weights has factor exposures that are very sensitive to small changes in the factor exposures of the constituents. To illustrate this effect, we have simulated a strategy matching the first-order sensitivities to the three factors rebalanced every day, and it turns out to have large deviations in second-order sensitivities and poor out-of-sample hedging performance. These results are unreported for the sake of brevity but are available upon request from the authors. Moreover, the strategy aiming to match first- and second-order sensitivities to the NS factors with the exact solution in EquationEquation (5), was not feasible as the matrix H was ill-conditioned and numerically not invertible.

    8 If nk+1, the rank of H is at most k+1. Therefore, HH, which is a n×n matrix, is singular except if n=k+1, because HH and H have the same rank. Note EquationEquation (7) has the same functional form as the OLS estimator in a multivariate regression. Note as well that in the particular case in which the number of bonds available is equal to k + 1, then H is a squared matrix and the weight vector x* simplifies to the exact-unique solution strategy in EquationEquation (5). Hence, the exact-solution approach to multifactor immunization (EquationEquation (5)) is a special case of this generalized approach, as the latter reduces to the exact-solution immunization strategies if (i) the hedging portfolio is constrained to use only a small subset of the available hedging instruments so that the number of portfolio constituents equals the number of factor exposures to match, n = k + 1, and if (ii) the forecasts of factor exposures are assumed to have zero error (in which case, no regularization constraints would be needed).

    9 Since the matrix Λ=HH in EquationEquation (9) is singular, in order to use a faster standard quadratic programming optimizer, we add a small perturbance term (0.00001) to its diagonal, which is equivalent to a ridge regularization.

    10 We also consider the ridge and lasso strategies without the restriction to limit short positions to cash. In our empirical simulations of these strategies, we observe that for the strategy hedging only the first-order sensitivities to the three NS factors, this additional restriction did have some negative effect on hedging performance. However, for the strategies hedging up to the second-order sensitivities to the NS factors, that additional restriction did not have a negative impact on hedging performance. These results are available from the authors upon request.

    11 If the number of bonds, n, is greater than the number of linear constraints, i.e., k + 1, the system in Hx=D has infinitely many solutions, all of which are factor-matching portfolios. Some of those solutions have more extreme weights, and hence more unstable factor exposures. Another alternative to regularize the strategy is selecting the portfolio that minimizes the weights’ squared second norm, subject to the linear constraints Hx=D. The main problem that we saw with that approach, is that the constraint to have an exact match of the target factor exposures can be too restrictive, in the sense that it can result in solutions in which the weights are still too large, hence still leading to high factor exposure instability of the hedging portfolio. Indeed, in our empirical tests of this norm minimization strategy, the maximum leverage presented over the simulation was above 4, which might be too high in some contexts, and beyond the leverage limit of 3 that we considered. The results for the ridge strategy and the norm minimization are unreported for the sake of brevity, but available upon request from the authors.

    12 Life expectancy at age 65 for a generic US individual was 19.5 years in 2018 (see Table B in National Vital Statistics Reports 2021: Arias, Brigham, and Tejada-Vera Citation2021). Note that the retirement bonds replicated in this exercise do not address longevity risk beyond life expectancy.

    13 To obtain protection against unexpected inflation risk, one would need an inflation-linked retirement bond, which could be priced using the real, as opposed to nominal, zero-coupon yield curve, implying that the hedging instruments would be inflation-protected bonds.

    14 The data extraction process is as follows: using the SRCH command in the Bloomberg terminal, we obtain the full bonds’ ID list (for both corporate and public debt). Then, we restrict the “Asset Classes” query to “Government bonds”, and for “Security Status” we allow “Active and Matured securities”, set the “Issuer” column to “United States”, and fix the “Maturity Type” as “Normal” (excluding for instance callable bonds). Then, we filter to use only “T” bond series, i.e., Treasury bonds, thus excluding SP, SPX, SPY series (i.e., Strip Principal bonds).

    15 We aim to replicate a retirement bond with 25 years of initial time-to-maturity as Bierwag, Kaufman, and Toevs’s (Citation1983) immunization theorem suggests that the range of dates of cash flow for the liability stream should be within the range of available durations for the hedging instruments.

    16 These strategies are special cases of the exact-solution approach in EquationEquation (5).

    17 This large improvement in turnover is partly related to the fact that the NS lasso strategy does not need to use a new selection of k+1 bonds at each rebalancing date but uses all bonds available.

    18 The results of the strategies matching time derivatives are unreported for the sake of brevity but are available upon request from the authors.

    19 Note that because second-order derivatives are symmetric in the indices p and q, it suffices to consider only pairs for which pq.

    References

    • Arias, E., B. Brigham, and B. Tejada-Vera. 2021. “U.S. State Life Tables, 2018.” National Vital Statistics Reports 70 (1). https://www.cdc.gov/nchs/data/nvsr/nvsr70/nvsr70-1-508.pdf.
    • Barber, J. R. 1999. “Bond Immunization for Affine Term Structures.” The Financial Review 34 (2): 127–139. doi:10.1111/j.1540-6288.1999.tb00458.x.
    • Barber, J. R., and M. L. Copper. 1996. “Immunization Using Principal Component Analysis.” The Journal of Portfolio Management 23 (1): 99–105. doi:10.3905/jpm.1996.409574.
    • Bierwag, G. O., G. G. Kaufman, and A. Toevs. 1983. “Immunization Strategies for Funding Multiple Liabilities.” The Journal of Financial and Quantitative Analysis 18 (1): 113–123. doi:10.2307/2330807.
    • Bierwag, G. O., and C. Khang. 1979. “An Immunization Strategy is a Minimax Strategy.” The Journal of Finance 34 (2): 389–399. doi:10.1111/j.1540-6261.1979.tb02101.x.
    • Black, F., and R. Litterman. 1992. “Global Portfolio Optimization.” Financial Analysts Journal 48 (5): 28–43. doi:10.2469/faj.v48.n5.28.
    • DeMiguel, V., L. Garlappi, F. Nogales, and R. Uppal. 2009. “A Generalized Approach to Portfolio Optimization: Improving Performance by Constraining Portfolio Norms.” Management Science 55 (5): 798–812. doi:10.1287/mnsc.1080.0986.
    • DeMiguel, V., L. Garlappi, and R. Uppal. 2009. “Optimal versus Naive Diversification: How Inefficient is the 1/N Portfolio Dtrategy?” Review of Financial Studies 22 (5): 1915–1953. doi:10.1093/rfs/hhm075.
    • Diebold, F. X., and C. Li. 2006. “Forecasting the Term Structure of Government Bond Yields.” Journal of Econometrics 130 (2): 337–364. doi:10.1016/j.jeconom.2005.03.005.
    • Fabozzi, F. J., L. Martellini, and P. Priaulet. 2005. Advanced Bond Portfolio Management: Best Practices in Modeling and Strategies. Hoboken, NJ: John Wiley & Sons. doi:10.1002/9781119201151.
    • Fisher, L., and R. L. Weil. 1971. “Coping with the Risk of Interest-rate Fluctuations: Returns to Bondholders from Naïve and Optimal Strategies.” The Journal of Business 44 (4): 408–431. doi:10.1086/295402.
    • Fong, H. G., and O. A. Vasicek. 1984. “A Risk Minimizing Strategy for Portfolio Immunization.” The Journal of Finance 39 (5): 1541–1546. doi:10.1111/j.1540-6261.1984.tb04923.x.
    • Frost, P., and J. Savarino. 1986. “An Empirical Bayes Approach to Efficient Portfolio Selection.” The Journal of Financial and Quantitative Analysis 21 (3): 293–305. doi:10.2307/2331043.
    • Gürkaynak, R. S., B. Sack, and J. H. Wright. 2007. “The US Treasury Yield Curve: 1961 to the Present.” Journal of Monetary Economics 54 (8): 2291–2304. doi:10.1016/j.jmoneco.2007.06.029.
    • Ingersoll, J. E., J. Skelton, and R. L. Weil. 1978. “Duration Forty Years Later.” The Journal of Financial and Quantitative Analysis 13 (4): 627–650. doi:10.2307/2330468.
    • Jagannathan, R., and T. Ma. 2003. “Risk Reduction in Large Portfolios: Why Imposing the Wrong Constraints Helps.” The Journal of Finance 58 (4): 1651–1683. doi:10.1111/1540-6261.00580.
    • Khang, C. 1979. “Bond Immunization When Short-term Interest Rates Fluctuate More than Long-term Rates.” The Journal of Financial and Quantitative Analysis 14 (5): 1085–1090. doi:10.2307/2330309.
    • Ledoit, O., and M. Wolf. 2003. “Improved Estimation of the Covariance Matrix of Stock Returns with an Application to Portfolio Selection.” Journal of Empirical Finance 10 (5): 603–621. doi:10.1016/S0927-5398(03)00007-0.
    • Ledoit, O., and M. Wolf. 2004. “A Well-conditioned Estimator for Large-Dimensional Covariance Matrices.” Journal of Multivariate Analysis 88 (2): 365–411. doi:10.1016/S0047-259X(03)00096-4.
    • Macaulay, F. 1938. Some Theoretical Problems Suggested by the Movements of Interest Rates, Bond Yields and Stock Prices since 1856. New York: National Bureau of Economic Research.
    • Markowitz, H. 1952. “Portfolio Selection.” Journal of Finance 7 (1): 77–91.
    • Martellini, L., and V. Milhau. 2010. “Towards the Design of Improved Forms of Target-date Funds.” Bankers, Markets and Investors 109:6–24.
    • Martellini, L., and V. Milhau. 2012. “Dynamic Allocation Decisions in the Presence of Funding Ratio Constraints.” Journal of Pension Economics and Finance 11 (4): 549–580. doi:10.1017/S1474747212000194.
    • Martellini, L., and V. Milhau. 2020. Advances in Retirement Investing (Elements in Quantitative Finance). Cambridge, UK: Cambridge University Press. doi:10.1017/9781108917377.
    • Martellini, L., V. Milhau, and J. Mulvey. 2019. “Flexicure’ Retirement Solutions: A Part of the Answer to the Pension Crisis?” The Journal of Portfolio Management 45 (5): 136–151. doi:10.3905/jpm.2019.45.5.136.
    • Martellini, L., V. Milhau, and J. Mulvey. 2020. “Securing Replacement Income with Goal-Based Retirement Investing Strategies.” The Journal of Retirement 7 (4): 8–26. doi:10.3905/jor.2019.1.062.
    • Martellini, L., P. Priaulet, and S. Priaulet. 2003. Fixed-income Securities: Valuation, Risk Management and Portfolio Strategies. Chichester, UK: John Wiley & Sons.
    • Merton, R. 1971. “Optimal Portfolio and Consumption Rules in a Continuous-time Model.” Journal of Economic Theory 3 (4): 373–413. doi:10.1016/0022-0531(71)90038-X.
    • Merton, R., and A. Muralidhar. 2017. Time for Retirement ‘SelFIES’? Brussels, Belgium: Investments and Pensions Europe.
    • Merton, R. C. 1969. “Lifetime Portfolio Selection under Uncertainty: The Continuous-time Case.” The Review of Economics and Statistics 51 (3): 247–257. doi:10.2307/1926560.
    • Merton, R. C. 1973. “An Intertemporal Capital Asset Pricing Model.” Econometrica 41 (5): 867–887. doi:10.2307/1913811.
    • Merton, R. C., and A. Muralidhar. 2020. “SeLFIES: A New Pension Bond and Currency for Retirement.” Journal of Financial Transformation 51:8–20.
    • Michaud, R. O. 1989. “The Markowitz Optimization Enigma: Is Optimized Optimal?” Financial Analysts Journal 45 (1): 31–42. doi:10.2469/faj.v45.n1.31.
    • Nelson, C. R., and A. F. Siegel. 1987. “Parsimonious Modeling of Yield Curves.” The Journal of Business 60 (4): 473–489. doi:10.1086/296409.
    • Redington, F. M. 1952. “Review of the Principles of Life-office Valuations.” Journal of the Institute of Actuaries 78 (3): 286–340. doi:10.1017/S0020268100052811.
    • Reitano, R. R. 1991. “Multivariate Immunization Theory.” Transactions of the Society of Actuaries 43:393–442.
    • Samuelson, P. A. 1945. “The Effect of Interest Rate Increases on the Banking System.” The American Economic Review 35 (1): 16–27.
    • Stein, C. 1956. “Inadmissibility of the Usual Estimator for the Mean of a Multivariate Normal Distribution.” Technical report, Stanford University.

    Appendix A. Bond Factor Exposures in Nelson–Siegel Model

    If the excess return of the portfolio over the bond is perfectly immunized against changes in factor values, it must be the case that the portfolio exposures match those of the target replicated bond. So, we have, for p ranging from 0 to 3, i=1NwitFp(i)F(i)=FpF.

    In other words, a necessary condition for replication is that the portfolio exposure to each factor, defined as the weighted sum of the constituents’ exposures, match the target bond exposure. The previous calculation can be repeated with the second-order Taylor expansion of EquationEquation (3), to show that the matching of second-order exposures is another necessary condition for replication. Mathematically, for any two indices p and q ranging from 0 to 3, we must haveFootnote19 i=1NwitFpq(i)F(i)=FpqF.

    The first-order exposures of the coupon-paying bond to the level, slope, and curvature factors are given by F0F=1Fs>tCFse[tjt]yt,st[st], F1F=1Fs>tCFse[st]yt,st[st]f(stτ), F2F=1Fs>tCFse[st]yt,st[st]g(stτ).

    The pricing function is differentiable with respect to time before the first payment date. The right derivative is given by Ft=s>tCFse[st]yt,st[yt,st[st]yt,stt], where the time derivative of the zero-coupon yield is yt,stt=1τf(stτ)β1t1τg(stτ)β2t.

    The second-order exposures with respect to the risk factors are given by F00F=1Fs>tCFse[st]yt,st[st]2, F11F=1Fs>tCFse[st]yt,st[st]2f(stτ)2, F22F=1Fs>tCFse[st]yt,st[st]2g(stτ)2, F01F=1Fs>tCFse[st]yt,st[st]2f(stτ), F02F=1Fs>tCFse[st]yt,st[st]2g(stτ), F12F=1Fs>tCFse[st]yt,st[st]2f(stτ)g(stτ).

    Appendix B. Implementation Details of Lasso Optimization

    The optimization problem in EquationEquation (9) is equivalent to minimizing deviations in factor sensitivities subject to a leverage upper bound constraint: (B1) minxxΛx2xΘ.(B1) u.c. {1x=1i=1n|xi| δ¯

    For numerical efficiency reasons, rewrite the program (B1) by replacing the non-linear inequality constraint (the leverage constraint) with a set of linear constraints. To do so, define auxiliary variables denoted x+ and x, such that xi=xi+xi i{1,n} with xi+0 and xi0. The leverage is then given by i=1n|xi|=i=1nxi++i=1nxi, and we rewrite program (B1) as: minxxΛx2xΘ u.c {1x=1xi=xi+xii=1nxi++i=1nxiδ¯xi+0;xi0 

    Furthermore, this n-dimensional optimization problem is equivalent to the following standard quadratic programing (QP) problem of dimension of 3n with linear constraints: minXXGX2XR u.c {AX=BCXF where X=(x1,x2,,xn,x1+,x2+,,xn+,x1,x2,,xn), and G=(Λ00000000), R=(Θ00), C=(01T1TI3n),F=(δ¯03n1) A=(1T00InInIn),and B=(10)

    Note matrix F includes the constraints that all but the weight of the shortest maturity bond (i.e., cash) be positive. The matrix G is not positive definite, which can be a problem to use some QP optimizers that assume a matrix with positive eigenvalues. Adding a small disturbance term (0.00001) do its diagonal eliminates the singularity problem. This is equivalent to a ridge regularization. We refer to the resulting portfolio x, which is made of the first n elements of the resulting X, as the lasso strategy in the paper.

    Appendix C. Ridge Strategy and the Leverage Constraint

    Consider a variation of the program EquationEquation (6), with penalization for the output vector 2-norm: (C1) x(ρ)=argminxxΛx2xΘ+ρ2||x||22u.c 1x=1(C1) where ||x||22=i=1nxi2 is the squared 2-norm of x. Recall the relationship between the Manhattan norm (leverage) and the 2-norm: i=1n|xi|=||x||1n ||x||2

    Hence, a larger penalization parameter ρ on the squared 2-norm in EquationEquation (C1) implies a more strict leverage restriction on x(ρ). In order to further characterize the ridge strategy, we solve analytically for x. The Lagrangian of problem (C1) reads L(x,λ)=xΛx2xΘ+ρ2xInxλ(1x1)

    The first-order optimality conditions are x=(Λ+ρIn)1(Θ+λ1) 1x=1

    Solving for λ, we obtain λ=11(Λ+ρIn)1Θ1(Λ+ρIn)11 and the optimal ridge portfolio is (C2) x(ρ)=(Λ+ρIn)1Θ+[11(Λ+ρIn)1Θ1(Λ+ρIn)11](Λ+ρIn)11(C2)

    Hence, for a chosen leverage budget δ¯, we can solve numerically for the value of ρ such that ||x(ρ)||1δ¯ is satisfied. Consider the case of a long-only constraint, where the leverage-penalization parameter ρ tends to infinity. Denoting the symmetric matrix Q:=(Λ+ρIn)1, the ridge solution (C2) can be written as x()=limρ(QΘ+[11QΘ1Q1]Q1)

    Notice that the Λ matrix in Q becomes negligible as ρ: limρQ=limρ(Λ+ρIn)1=limρ(ρIn)1=limρ1ρIn and the portfolio becomes (C3) x()=limρ(1ρΘ+1ρ[11Θρ11ρ]1)=1111=1n1(C3)

    Hence, when the penalization parameter ρ the ridge strategy becomes an equally-weighted portfolio of all the bonds available.