2,124
Views
8
CrossRef citations to date
0
Altmetric
New topics/Others

Development of spectral decomposition based on Bayesian information criterion with estimation of confidence interval

ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon
Pages 402-419 | Received 04 Mar 2020, Accepted 20 May 2020, Published online: 02 Jul 2020

Figures & data

Figure 1. Three examples of artificially measured spectra with Gaussian noise. The solid line is the true curve fx;θ and the dots are the artificially measured spectral data: (a) S/N=20.0 and Δ=0.5, (b) S/N=20.0 and Δ=0.1, and (c) S/N=0.2 and Δ=0.5.

Figure 1. Three examples of artificially measured spectra with Gaussian noise. The solid line is the true curve fx;θ∗ and the dots are the artificially measured spectral data: (a) S/N=20.0 and Δ=0.5, (b) S/N=20.0 and Δ=0.1, and (c) S/N=0.2 and Δ=0.5.

Table 1. Settings of the inverse temperature βmm=1M corresponding to Eq. (18).

Figure 2. (a)–(c) Results of model selection by Bayesian estimation respectively corresponding to spectral data in )–(c).

Figure 2. (a)–(c) Results of model selection by Bayesian estimation respectively corresponding to spectral data in Figure 1(a)–(c).

Figure 3. Results of model selection for various peak-to-peak distances Δ and S/N ratios. The values in the figure indicate the posterior probability pK=2|D for K=2.

Figure 3. Results of model selection for various peak-to-peak distances Δ and S/N ratios. The values in the figure indicate the posterior probability pK=2|D for K=2.

Figure 4. Posterior distribution of each parameter when Bayesian estimation is performed on the spectral data in ). Dashed lines indicate the true parameter values used to generate the spectral data.

Figure 4. Posterior distribution of each parameter when Bayesian estimation is performed on the spectral data in Figure 1(a). Dashed lines indicate the true parameter values used to generate the spectral data.

Figure 5. STD of the posterior distribution as a function of the S/N ratio for Δ = 0.5. Triangles and inverted triangles are respectively the STDs of the parameters for the first and second peaks.

Figure 5. STD of the posterior distribution as a function of the S/N ratio for Δ = 0.5. Triangles and inverted triangles are respectively the STDs of the parameters for the first and second peaks.

Figure 6. STD as a function of the peak-to-peak distance Δ for S/N = 100.0. Triangles and inverted triangles are respectively the STDs of the parameters for the first and second peaks.

Figure 6. STD as a function of the peak-to-peak distance Δ for S/N = 100.0. Triangles and inverted triangles are respectively the STDs of the parameters for the first and second peaks.

Table 2. Values of the fitted parameters 2Bj,Cj,Dj,Ej in Eq. (19).

Figure 7. Schematic diagrams of values obtained using EquationEq. (19). (a) Scaled STD s˜ as a function of S/N =h0/σ for B=10.0,1.0,0.1 and 0.01, where the peak-to-peak distance Δ is sufficiently large: Δ=10.0,C=3.0,D=2.0 and E=0.0. (b) s˜ as a function of the peak-to-peak distance Δ for several conditions of C,D,E when we set the prefactor of EquationEq. (19) σB/h0=1.

Figure 7. Schematic diagrams of values obtained using EquationEq. (19)(A-19) sΔ,σh0=σh0BΔ−EDC+1(A-19) . (a) Scaled STD s˜ as a function of S/N =h0/σ for B=10.0,1.0,0.1 and 0.01, where the peak-to-peak distance Δ is sufficiently large: Δ=10.0, C=−3.0, D=2.0 and E=0.0. (b) s˜ as a function of the peak-to-peak distance Δ for several conditions of C,D,E when we set the prefactor of EquationEq. (19(A-19) sΔ,σh0=σh0BΔ−EDC+1(A-19) ) σB/h0=1.

Figure 8. Results of regression using the fitting function in EquationEq. (19) of the STDs of the posterior distributions for each parameter. Parameters are the (a) Lorentz–Gauss mixing ratio rk, (b) peak position μk, (c) peak height hk, and (d) HWHMs of the peaks wk.

Figure 8. Results of regression using the fitting function in EquationEq. (19)(A-19) sΔ,σh0=σh0BΔ−EDC+1(A-19) of the STDs of the posterior distributions for each parameter. Parameters are the (a) Lorentz–Gauss mixing ratio rk, (b) peak position μk, (c) peak height hk, and (d) HWHMs of the peaks wk.

Figure 9. Fitted spectra from Bayesian estimation (a) and BIC-fitting (b) for the experimental valence spectrum of SiO2. Open circles are the experimental spectrum, the orange line is the fitted spectrum, the green line is the background, and the black lines are all peaks above the background.

Figure 9. Fitted spectra from Bayesian estimation (a) and BIC-fitting (b) for the experimental valence spectrum of SiO2. Open circles are the experimental spectrum, the orange line is the fitted spectrum, the green line is the background, and the black lines are all peaks above the background.

Figure 10. Results of model selection through Bayesian estimation (a) and BIC-fitting (b) for the experimental valence spectrum of SiO2. The red circle in (b) indicates the model with the minimum BIC.

Figure 10. Results of model selection through Bayesian estimation (a) and BIC-fitting (b) for the experimental valence spectrum of SiO2. The red circle in (b) indicates the model with the minimum BIC.

Figure 11. Two-dimensional histogram of PC1 and PC2 obtained by PCA of EMC sampling.

Figure 11. Two-dimensional histogram of PC1 and PC2 obtained by PCA of EMC sampling.

Table 3. Calculated confidence intervals of the posterior distribution for all peak parameters and those estimated using Eq. (19).

Figure 12. Posterior probability densities of peak parameters μk,hk,wk,rkk=1,2 for two peaks located at about EB=10 and 16 eV for the valence spectrum of SiO2.

Figure 12. Posterior probability densities of peak parameters μk,hk,wk,rkk=1,2 for two peaks located at about EB=10 and 16 eV for the valence spectrum of SiO2.

Figure A-1. Results of model selection and posterior distributions pwk|D,K and prk|D,K. (a)–(e) only differ in the initial random seeds used in generating the spectral data sets under the same conditions as those in ).

Figure A-1. Results of model selection and posterior distributions pwk|D,K and prk|D,K. (a)–(e) only differ in the initial random seeds used in generating the spectral data sets under the same conditions as those in Figure 1(a).

Figure A-2. Two-dimensional distribution of parameters θ. The diagonal components of the figure show the histogram of the posterior distribution of each parameter. The lower off-diagonal components are two-dimensional distributions, whereas the upper off-diagonal components show the coefficients of correlation between two parameters. The dotted lines indicate the true parameter values used to generate the spectral data.

Figure A-2. Two-dimensional distribution of parameters θ. The diagonal components of the figure show the histogram of the posterior distribution of each parameter. The lower off-diagonal components are two-dimensional distributions, whereas the upper off-diagonal components show the coefficients of correlation between two parameters. The dotted lines indicate the true parameter values used to generate the spectral data.

Figure E-1. Two examples of artificial spectra that have two peaks with different heights: (a) h1=1.0, h2=0.5, (b) h1=1.0, h2=0.1. The other parameters are the same. Open circles are the artificial spectrum, the orange line is the fitted spectrum, and the black lines are the peak components.

Figure E-1. Two examples of artificial spectra that have two peaks with different heights: (a) h1∗=1.0, h2∗=0.5, (b) h1∗=1.0, h2∗=0.1. The other parameters are the same. Open circles are the artificial spectrum, the orange line is the fitted spectrum, and the black lines are the peak components.

Figure E-2. Posterior distribution of each parameter when Bayesian estimation is performed on the spectral data in . The dashed lines indicate the true parameter values used to generate the spectral data.

Figure E-2. Posterior distribution of each parameter when Bayesian estimation is performed on the spectral data in Fig. E-1(a). The dashed lines indicate the true parameter values used to generate the spectral data.

Figure E-3. Posterior distribution of each parameter when Bayesian estimation is performed on the spectral data in . The dashed lines indicate the true parameter values used to generate the spectral data.

Figure E-3. Posterior distribution of each parameter when Bayesian estimation is performed on the spectral data in Fig. E-1(b). The dashed lines indicate the true parameter values used to generate the spectral data.

Figure E-4. Confidence intervals of peak parameters as a function of the height of the second peak h2 for peaks 1 (a) and 2 (b). (sim) indicates values simulated by the Bayesian EMC method, (approx) indicates values calculated with the approximated formula (19).

Figure E-4. Confidence intervals of peak parameters as a function of the height of the second peak h2 for peaks 1 (a) and 2 (b). (sim) indicates values simulated by the Bayesian EMC method, (approx) indicates values calculated with the approximated formula (19).