Publication Cover
Statistics
A Journal of Theoretical and Applied Statistics
Volume 51, 2017 - Issue 3
204
Views
5
CrossRef citations to date
0
Altmetric
Original Articles

Characterizing variation of nonparametric random probability measures using the Kullback–Leibler divergence

, &
Pages 558-571 | Received 27 Apr 2015, Accepted 20 Jul 2016, Published online: 16 Nov 2016
 

ABSTRACT

This work characterizes the dispersion of some popular random probability measures, including the bootstrap, the Bayesian bootstrap, and the Pólya tree prior. This dispersion is measured in terms of the variation of the Kullback–Leibler divergence of a random draw from the process to that of its baseline centring measure. By providing a quantitative expression of this dispersion around the baseline distribution, our work provides insight for comparing different parameterizations of the models and for the setting of prior parameters in applied Bayesian settings. This highlights some limitations of the existing canonical choice of parameter settings in the Pólya tree process.

Acknowledgments

We are grateful to Judith Rousseau for helpful comments. This work was done whilst Nieto-Barajas was visiting the Department of Statistics at the University of Oxford.

Disclosure statement

No potential conflict of interest was reported by the authors.

Notes

1. The digamma function is defined as the logarithmic derivative of the gamma function, that is, ψ0(x)=(d/dx)logΓ(x)=Γ(x)/Γ(x). In similar fashion, the trigamma function is defined as the second derivative.

2. The variance of each element is defined in terms of first and second moments and rely on independence properties to compute them. Working out the algebra with patience and noting that E{(logYmj)Ymj}=λ2(m)/2, E{(logYmj)Ymj2}=12((αρ(m)+1)/(2αρ(m)+1))λ3(m), E{(logYmj)Ymj(1Ymj)}=12(αρ(m)/(2αρ(m)+1))λ4(m), E{(logYmj)2Ymj2}=12((αρ(m)+1)/(2αρ(m)+1))λ5(m) and E{(logYmj)log(1Ymj)Ymj(1Ymj)}=12(αρ(m)/(2αρ(m)+1))λ6(m), the result is obtained.

3. Figure  appears to show that E{KL(f||f0)} remains constant, but this is an artefact due to the scale.

4. We use this notation to emphasize the fact that w represents a random probability mass function, but taking values on the set {0,1/n,2/n,,1}. A factor of n is needed for the vector to be distributed according to a multinomial distribution.

5. It is interesting to note that in the original work they only consider this special case.

6. Var(wi)=pi(1pi)/(αn+1), Cov(wi,wj)=pipj/(αn+1), Var(wilogwi)=pi(αnpi+1)/(αn+1){ψ1(αnpi+2)ψ1(αn+2)+[ψ0(αnpi+2)ψ0(αn+2)]2}pi2{ψ0(αnpi+1)ψ0(αn+1)}2, Cov(wilogwi,wi)=pi(αnpi+1)/(αn+1){ψ0(αnpi+2)ψ0(αn+2)}pi2{ψ0(αnpi+1)ψ0(αn+1)}, Cov(wilogwi,wj)=pipj{ψ0(αnpi+1)/(αn+1)+ψ0(αn+1)αnψ0(αn+2)/(αn+1)}, Cov(wilogwi,wjlogwj)=αnpipj/(αn+1)[{ψ0(αnpi+1)ψ0(αn+2)}{ψ0(αnpj+1)ψ0(αn+2)}ψ1(αn+2)]pipj{ψ0(αnpi+1)ψ0(αn+1)}{ψ0(αnpj+1)ψ0(αn+1)}.

7. Proposition 3 of Ferguson [Citation4] states that any fixed density (measure) g absolutely continuous with respect to f0 can be arbitrarily approximated pointwise with a draw f from a Dirichlet process.

Additional information

Funding

Watson is supported by the Industrial Doctoral Training Centre (SABS-IDC) at Oxford University and Hoffman-La Roche. Nieto-Barajas is supported by CONACYT grant 244459 and Asociación Mexicana de Cultura, A.C.–Mexico. Holmes gratefully acknowledges support for this research from the Oxford-Man Institute, the EPSRC i-Like programme grant EP/K014463/1 and the Medical Research Council program leader's award MC_UP_A390_1107.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.