ABSTRACT
Crystalline phase structure is essential for understanding the performance and properties of a material. Therefore, this study identified and quantified the crystalline phase structure of a sample based on the diffraction pattern observed when the crystalline sample was irradiated with electromagnetic waves such as X-rays. Conventional analysis necessitates experienced and knowledgeable researchers to shorten the list from many candidate crystalline phase structures. However, the Conventional diffraction pattern analysis is highly analyst-dependent and not objective. Additionally, there is no established method for discussing the confidence intervals of the analysis results. Thus, this study aimed to establish a method for automatically inferring crystalline phase structures from diffraction patterns using Bayesian inference. Our method successfully identified true crystalline phase structures with a high probability from 50 candidate crystalline phase structures. Further, the mixing ratios of selected crystalline phase structures were estimated with a high degree of accuracy. This study provided reasonable results for well-crystallized samples that clearly identified the crystalline phase structures.
IMPACT STATEMENT
We developed a method to automatically estimate crystalline phase structures from diffraction patterns using Bayesian inference. Our method successfully identified true crystalline phase structures with a high probability from 50 candidate crystalline phase structures.
1. Introduction
Crystalline phase structure is essential for understanding the performance and properties of a material. Therefore, this study identified and quantified the crystalline phase structure of a sample based on the diffraction pattern observed when the crystalline sample was irradiated with electromagnetic waves such as X-rays. The measurement of the diffraction patterns using X-rays as probes is known as X-ray diffraction (XRD). The crystal structure of a material can be understood by analyzing the diffraction peaks in the XRD data.
A typical XRD data analysis method involves a simple comparison of the measured XRD data with a database. This method first detects the diffraction peaks in the measured XRD data by the smoothed derivative [Citation1–3]. Thereafter, the diffraction angles of the detected peaks are compared with those of the diffraction patterns registered in the database and the similarity to the diffraction patterns in the database is calculated. The diffraction patterns ranked by similarity are suggested by an analyst. Thus, in a typical analysis, the experience and knowledge of the researcher are crucial to shorten the list from several candidate crystal structures. However, the typical diffraction pattern analysis is highly analyst-dependent and not objective. Additionally, there is no established method for discussing the confidence intervals of the analysis results. Consequently, the interpretation of the analysis results is highly dependent on the analysts. Diffraction pattern analysis methods have been proposed to solve such analytical problems [Citation4,Citation5].
In recent years, methods for diffraction-pattern analysis using Bayesian estimation have been proposed, allowing confidence intervals to be discussed [Citation6,Citation7]. Mikhalychev proposed a method for naive Bayesian estimation of crystalline phase structure using a simple model with intensity information only, without considering the peak profile, the background, and the Poisson noise model [Citation7]. In addition, black-box optimization methods have been proposed for hyperparameters that are subjectively determined by an analyst [Citation8]. The proposed method is effective for solving several problems in diffraction pattern analyses. However, this has not been sufficiently discussed from the perspective of automatic estimation of the crystal structure contained in a measured sample from the diffraction pattern. Identifying the crystalline phase structures contained in a diffraction pattern is challenging because the number of candidate crystalline phase structures can be in the order of tens or hundreds, leading to combination explosions. Moreover, this problem requires considerable computational time because the crystal structure contains dozens of diffraction peaks. Despite these challenges, it is necessary to establish a method for identifying crystalline phase structures from diffraction patterns with confidence intervals (probability).
This study aimed to establish a method for the automatic estimation of crystalline phase structures from diffraction patterns. The proposed method decomposes the measured diffraction patterns and automatically selects crystalline phase structures using the diffraction patterns measured at each institute associated with the crystal structures or obtained via simulations as basis functions. The proposed method makes three main contributions to literature.
Crystalline phase structures can be selected precisely and automatically.
Posterior distributions can be estimated (confidence intervals can be discussed).
A global solution is provided (no initial value dependence).
The proposed method, which extracts material descriptors corresponding to the crystal structure from measured diffraction patterns, is expected to play an important role in promoting the development of data-driven materials. Note that in this paper the term ‘crystal structure’ refers specifically to the crystalline phase structure.
In this study, we provide reasonable results that allow clearer identification of the crystalline phase structures for more crystalline structures. Surprisingly, our method successfully detected crystalline phase structures containing only 2 [%]. However, a crystalline phase structure that contained only 1 [%] of the sample could not be detected, although we proposed a similar crystalline phase structure with low probability, suggesting a reasonable result.
2. Concept
shows an observation process of XRD data and a conceptual diagram of the proposed method. We suppose a multitude of candidate crystal phases and structures when preparing the materials. The crystal structures contained in the material are selected by material synthesis, manufacturing processes, etc. This study treats the control variable dealing with crystal structure selection as the indicator variable . Ideally, the crystalline materials produced should have diffraction line spectra corresponding to the crystal phases and structures they contain. In practice, we observe diffraction peaks whose shapes are dependent on the profile parameters that correspond to the measurement environment. We considered a situation wherein only the observed diffraction data and candidate crystal structures were provided.
The purpose of this study is to develop an automated Bayesian XRD analysis linked a database. In advanced materials, the crystalline phase structure contained in the XRD data is often unknown. This requires a method to automatically identify the crystalline phase structures in the data from a huge number of crystal phase structures in databases. We have therefore developed a methodology to precisely estimate the crystalline phase structure from XRD data by using the Bayesian estimation. In this study, we aimed to inversely estimate the structural indicator and profile parameter set from the observed diffraction data (XRD data) shown in . The proposed method is a Bayesian inverse estimation method used to identify crystal structures for XRD analysis.
3. Model
3.1. Problem setting
The purpose is to estimate the profile parameters and the crystalline phase structures in the measured sample, considering the measured XRD data and the candidate crystal structure . Here, and denote the diffraction angle [] and the diffraction intensity [counts], respectively.
The candidate crystal structure factor set is expressed as:
where is the number of candidate crystal structures and is the -th crystal structure factor. The elements of the crystal structure factor and are the diffraction angle (peak position) [] and relative intensity of the -th diffraction peak in for a crystal structure . Further, denotes the number of peaks in . In this study, the candidate crystal structure factor set is provided.
3.2. Profile function
XRD data can be represented by a profile function , which is a linear sum of the signal spectrum and the background :
where denote the measured data points, the function denotes the signal spectrum based on the candidate crystal structures , and the function denotes the background. We set as the profile parameter set. In addition, the sets and are the signal spectrum and background parameter sets, respectively.
The signal spectrum is expressed as a linear sum of the profile function (peaks) in a crystal structure among the several candidates [Citation10]:
where denotes the signal intensity of crystal structure factor . The profile function of candidate crystal structure is defined as follows:
where are the peak shift and Gauss-Lorentz ratio at the peak of crystal structure , respectively, is the peak position of the peak function, and the function is a pseudo-Voigt function [Citation11]. In addition, and are Gaussian and Lorentz functions, respectively. and are the Gaussian and Lorentzian widths of the peak, respectively, as a function of the diffraction angle . The width functions and are expressed as
where and are the Gaussian and Lorentzian width parameter sets, respectively. Function is a function expressing the peak asymmetry, and is the asymmetry parameter for the peak function. Further, the function is the sign function and the trigonometric function is .
The optimization parameter set for the signal spectrum is expressed as:
Background is defined as follows:
where the background parameter set is .
3.3. Generation model
We assume that the observed data are stochastically distributed owing to statistical noise in the measurement. Next, we consider the joint distribution , which can be expanded to . Using Bayes’ theorem to swap the orders of and , we can expand . Hence, the posterior distribution is expressed as:
where and are the posterior and prior distributions, respectively, in the Bayesian inference. Further, is the conditional probability of given the model parameter set , which is a probability distribution explained by error theory.
To derive , we consider the observation process of at the observation data points. Assuming that the observed data are independent of each other, the conditional probability of the observed data can be expressed as:
As XRD spectra are count data, the conditional probability of the intensity for the diffraction angle follows a Poisson distribution :
The cost function is defined by the negative log-likelihood function and is expressed as follows:
Further, is expressed using the cost function and the prior distribution as follows:
3.4. Identification of crystalline phase structures
In the analysis of XRD spectra, the crystal structure contained in the measured sample is often unknown. Therefore, it is important to accurately estimate the true crystal structure contained in the candidate crystal structures . We introduce an indicator vector , which controls the existence of the crystal structure factors in EquationEquation (5)(5) (5) :
where indicates that the crystal structure factor is present in the sample. Conversely, implies that it is absent.
We now consider the joint distribution . can be expanded to . According to Bayes’ theorem, the posterior distribution is expressed as:
The cost function that introduces the indicator vector is expressed as:
Using the joint distribution presented above, the indicator vector is estimated from the marginal posterior distribution as follows:
We estimate the profile and background parameters using the posterior distribution on parameter set .
4. Algorithm
4.1. Replica exchange monte carlo method — REMC method
We perform posterior visualization and the maximum a posteriori (MAP) estimation through sampling from the posterior distribution. A popular sampling method is the Monte Carlo (MC) method, which may be bounded by local solutions for cases when the initial value is affected or the cost function landscape is complex.
Therefore, the replica exchange Monte Carlo (REMC) method [Citation12,Citation13] was used to estimate the global solution. For sampling using the REMC method, a replica was prepared with the inverse temperature introduced as follows:
where the inverse temperature is . For each replica, the parameters were sampled using the Monte Carlo method.
5. Technique
5.1. Tricks for high speeds
This sub-section describes the techniques used to realize Bayesian inference of XRD spectra. In XRD spectral analysis, the number of candidate crystal structure factors and the number of peaks for each crystal structure factor are enormous. Therefore, to calculate the cost function for each sample, multiple loops of must be computed. is an immutable value because it is inherently determined by the crystal structure. Although reduction in the number of data by downsampling is feasible, it is expected that the peak structure will be broken or the separation accuracy will be significantly reduced owing to sharp XRD peaks.
Herein, we focused on the number of candidate crystal structures . This study screened the candidate crystallographic structure factors. To calculate the cost function , the crystal structures with need not be calculated. In other words, only the selected crystal structures need to be considered. In the proposed method, we compute where .
5.2. Rough pre-screening
In this study, we screened candidate crystallographic structures as described in sub-Section 5.1. This subsection describes the screening procedure. The similarity between the observed XRD data and the crystal structure factor was calculated and screening was performed by thresholding the similarity.
presents a supplementary diagram of the similarity calculation procedure, where parts (a) and (b) show the observed XRD data and crystal structure factor , respectively. We resampled the data points close to of from the observed data . The resampled data points are indicated by the red points in .
The resampled data points are denoted by the vector . The intensity vector of the crystal structure is denoted by . Our method computed the similarity between the vectors and for each crystal structure . In this study, we used cosine similarity as the vector similarity.
6. Scope and limitations
This section presents the three limitations of the proposed method:
The proposed method cannot refine the structural parameters owing to only the crystal structure selection and profile parameters being used as probability variables. Therefore, for precise crystal structure analysis, Rietveld analysis [Citation14–17] must be performed with reference to the posterior distribution of the selected crystal structure and profile parameters.
The proposed method automatically selects the crystal structure contained in the measurement sample from the candidate crystal structures. Therefore, crystal structures that are not included in the candidates or unknown crystal structures cannot be analyzed.
The proposed method may provide confusing results when similar crystal structures are included in the candidates. We need to combine other measurement techniques to refine the crystalline phase structure.
7. Configuration
7.1. Configuration of prior distribution
We set the prior distribution over the parameter set of the profile function as follows:
In addition, we set the prior distribution of the background (BG) parameter as follows:
where the probability distribution is the gamma distribution and and are the shape and scale parameters, respectively. The probability distribution is a normal distribution, and and are the mean and standard deviation, respectively. Whereas, the probability distribution is a uniform distribution, with and being the maximum and minimum values, respectively. Further, the values are and , where .
7.2. Configuration of the sampling algorithm
For the exchange MC simulation, we performed 1000 steps of calculations and rejected 1000 of them as burn-in. The inverse temperature was set as follows:
where the proportion was set to , and the number of temperatures was set to . The exchange of parameter sets between replicates was performed at each step.
7.3. Calculator specification
The calculator specifications were AMD Ryzen Thread ripper 3990X (64 core, 128 thread), 256GB DDR4–3200/PC4-25600SD, Ubuntu 18.04.5 LTS. We performed sampling using the REMC method with 32 threads.
7.4. Configuration of candidate crystal structures
We prepared 50 candidate crystal structures from the AtomWork [Citation18], which is an inorganic material database containing data on the crystal structures, X-ray diffraction, properties, and state diagrams of inorganic materials extracted from scientific and technical literature. We selected 50 candidates based on the condition that they contained titanium (Ti) or oxygen (O) in their composition because this study analyzed the XRD data of the titanium dioxide samples. lists the 50 prepared candidate crystal structures.
8. Results and discussion
8.1. Fitting results in actual measurement data
We conducted a calculation experiment on the measured XRD data. The results for artificial data, for which the true values are known, are presented in Appendix A. The measurement sample was a mixture of multiple types of : Anatase, Brookite, and Rutile. The mixture ratios were equal (1/1/1 wt. %). We prepared measurement samples such that the crystalline phases were homogeneous. Consequently, we measured the XRD data by using monochromatic X-rays of . Further, a non-reflecting plate cut from a specific orientation of a single crystal of silicon was used as the sample plate. The diffraction angles were in the range of 10–60[], with of .
presents the selection results for each temperature obtained using the REMC method. The x- and y-axes denote the candidate crystal structures and inverse temperature index , respectively. This figure visualizes the probability of indicator [%]. The proposed method estimated the crystal structure of a sample from 50 candidates. Candidate crystal structures were obtained from AtomWork as described in Section 7.4. A high index corresponds to a lower temperature. The color scale indicates the probability of calculated from the sampling frequency. The dark red color indicates the presence of a crystal structure in the measured sample. The result for the lowest temperature () shows that our method could select the true crystal structures, that is, Anatase, Brookite, and Rutile, with 100 [%] probability. A computational time of approximately 3 h was required to obtain this result. Therefore, our method can be used to estimate the crystal structures of a sample by analyzing the full diffraction profile using Bayesian inference. Thus, the contribution of our method is the simultaneous identification of profile parameters and crystal structures and the provision of their posterior distributions.
As shown in , the selection probability of Brookite decreases at medium to high temperatures compared to those of Anatase and Rutile. This suggests that Brookite was more difficult to identify than Anatase and Rutile. In crystallography, Brookite is a low-temperature phase and is known to exhibit a poorer crystal structure than Rutile, which is high-temperature-stable. This difficulty in its determination is believed to originate from the low crystallinity of Brookite.
An analysis using all 50 candidates would require a considerable amount of time. Therefore, we performed prescreening using the cosine similarity described in Section 5. shows the cosine similarity between the measured XRD data and crystal structure factors during prescreening. In this figure, the red line denotes the threshold value, which was set to 0.5. The y-axis denotes cosine similarity. We performed the analysis using crystal structure factors with a cosine similarity greater than 0.5. This prescreening narrowed the list from 50 to 12 candidates. This is expected to result in significant reduction in the computational costs.
We analyzed the measured XRD data using the 12 candidates that were narrowed down by prescreening. presents the selection results for each temperature obtained using the REMC method. The x- and y-axes denote the candidate crystal structures and the index of the inverse temperature . The proposed method could select the true crystal structures of Anatase, Brookite, and Rutile with 100 [%] probability. The crystal structures and results of sampling all 50 candidates were successfully identified (shown in ). The computation required approximately 1 h, and pre-screening reduced the computational cost by a factor of three. These results indicate that prescreening can effectively improve the efficiency of the calculations. However, prescreening may exclude true crystal structures from the candidates.
shows the fitting results via the profile function in the measurement XRD data. In , the black and red lines indicate the measured XRD data and the fitting profile functions, respectively. shows the peak components of the three crystal structures of : Anatase, Brookite, and Rutile. The red, green, and blue lines indicate the peaks of Anatase, Brookite, and Rutile, respectively. As shown in this figure, the estimated profile function facilitated a good fit of the XRD data. The mean Poisson cost was 5.026.
shows the posterior distribution of the peak area ratio in the triangular diagram when the measurement XRD data were analyzed using the proposed method. In , the blue dots indicate the sampling results obtained using the REMC method. The units on the axes are percentages [%]. As evident, the results of the area ratio estimation exhibited extremely high precision.
shows an expanded view of the posterior distribution of the peak used to determine its shape of the posterior distribution. In , the red, green, and blue histograms correspond to the posterior distributions of Anatase, Brookite, and Rutile, respectively. As evident, the posterior distribution of Rutile, which has the best crystallinity, exhibited a sharper shape than Anatase and Brookite. The shape of the posterior distribution was similar to that of a quadratic function, where the y-axis represents a logarithmic scale. This implies that the posterior distribution exhibits a Gaussian probability distribution shape. The MAP estimate of the ratio was [%]. Because the structural ratio of the preparation is [%], the proposed method is considered a reasonable estimation.
shows the posterior distribution of the profile parameters when analyzing the measurement XRD data using the proposed method. The red, green, and blue histograms represent the posterior distributions of Anatase, Brookite, and Rutile, respectively. ) show the peak height , peak shift , Gaussian – Lorentz ratio , and asymmetry parameter , respectively. ) show the Gaussian width and the Lorentz width , where is [°]. As indicated in part (a) of this figure, the height can be estimated with high precision using the proposed method. The figure shows that the peak shifts for all three crystal structures were positive (). This may be attributed to minute calibration deviations in the measurement device such as eccentricity and zero-point errors. As shown in parts (e) and (f) of this figure, the peak width of Rutile was narrow, indicating good crystallinity. Furthermore, we confirmed that Rutile with good crystallinity exhibited a sharp posterior distribution shape for most of the profile parameters. By contrast, Brookite with poor crystallinity, tended to exhibit a broad posterior distribution. This indicates that a structure with good crystallinity provides a highly precise estimation.
8.2. Analysis in measured data of non-uniform mixing ratios
Here, we show the analysis results for the measured data of non-uniform mixing ratios. We measured the samples with two different mixing ratios: (Case-01) and (Case-02) . In addition to these ratios, the conditions were identical to those described in Section 8.1. The proposed method was applied to the measured data in both cases. The data for Case-01 and Case-02 are shown in and of Appendix B, respectively.
First, we present the analysis results for Case-01. The data for Case-01 were included in Anatase and Rutile. Prescreening was used to reduce the number of candidates from 50 to 8. presents the selection results for each temperature using the REMC method. The x- and y-axes denote the candidate crystalline phase structures and the index of the inverse temperature . The proposed method could select the true crystalline phase structures, which were Anatase and Rutile with a probability of 100 [%]. As shown in , Anatase was not sampled using the REMC method at high temperatures. As evident, the determination of Anatase is more challenging than that of Rutile. This indicates that estimating structures with small component ratios is difficult. The fitting results are presented in in Appendix B.
Second, we present the analysis results for Case-02. The data for Case-02 included Anatase, Brookite, and Rutile. These data have a significantly low ratio of Anatase and Rutile. In particular, Rutile accounted for only 1%. Prescreening was used to reduce the number of candidates from 50 to 15. presents the selection results for each temperature obtained using the REMC method. The x- and y-axes denote the candidate crystalline phase structures and the index of the inverse temperature . The proposed method selected Brookite and Anatase as the MAP estimation. The two selected structures were included in the true structures. Surprisingly, we succeeded in detecting Anatase, which accounted for only 2%. As shown in , the estimation of Anatase, which has a low mixing ratio, was difficult. However, it can be confirmed that Rutile was not selected by the proposed method. We consider this a reasonable result because Rutile accounted for only 1% of the measured sample. The figure also shows that the of structure type was selected with a probability of approximately 50%. lists the cell lattice parameters of and Rutile crystal structures in [Citation19,Citation21]. The of structure type was obtained from literature [Citation19]. As shown in the table, and Rutile crystalline phase structures are similar. Therefore, it was suggested that structure type should be sampled instead of Rutile. Notably, the of structure type was not selected as the MAP solution. Our method may suggest a crystal structure similar to the true crystal structure by presenting a low probability of existence if the candidate does not contain a true crystalline phase structure. It is important to confirm the behaviour when the true crystalline phase structure is not included and when the crystal lattice is scaled, and this will be addressed in future work. The fitting results are shown in in Appendix B.
9. Conclusion
The knowledge of the probability that a sample contains a candidate crystal structure from full-range XRD data considering both the diffraction angles of the peaks and the profile functions, is essential. This study aimed at the Bayesian estimation of the structure contained in a sample from a large number of crystal structure candidates in the analysis of XRD data. Therefore, indicator vectors were introduced into the profile function and the XRD data were analyzed by sampling the posterior distribution using the REMC method. Consequently, we succeeded in identifying the true crystal structures of 50 candidates with high probability. The proposed method also estimated the mixing ratio of the selected crystal structures with high precision.
In this study, we provide reasonable results that allow clearer identification of the crystal structure for more crystalline structures. Surprisingly, our method successfully detected crystal structures that accounted for only 2 [%] of the sample. However, a crystal structure that contained only 1 [%] of the sample could not be detected, although we proposed a similar crystal structure with low probability, suggesting a reasonable result. Our method is a highly sensitive and probabilistic analysis method that can automatically identify crystal structures from full-range XRD data.
Disclosure statement
No potential conflict of interest was reported by the author(s).
Additional information
Funding
References
- Press WH, Teukolsky SA. Savitzky-golay smoothing filters. Comput Phys. 1990;4(6):669–20. doi:10.1063/1.4822961
- Taupin D. Automatic peak determination in x-ray powder patterns. J Appl Crystallogr. 1973;6(4):266–273. doi:10.1107/S0021889873008666
- Huang TC. Precision peak determination in x-ray powder diffraction. Aust J Phys. 1988;41(2):201–212. doi:10.1071/PH880201
- Surdu V-A, Győrgy R. X-ray diffraction data analysis by machine learning methods—a review. Appl Sci. 2023;13(17):9992. doi: 10.3390/app13179992
- Greasley J, Hosein P. Exploring supervised machine learning for multi-phase identification and quantification from powder X-ray diffraction spectra. J Mater Sci. 2023;58:1–15. doi:10.1007/s10853-023-08343-4
- Fancher CM, Han Z, Levin I, et al. Use of Bayesian inference in crystallographic structure refinement via full diffraction profile analysis. Sci Rep. 2016;6:8. doi: 10.1038/srep31625
- Mikhalychev A, Ulyanenkov A. Bayesian approach to powder phase identification. J Appl Crystallogr. 2017 Jun;50(3):776–786. doi: 10.1107/S1600576717004393
- Suzuki Y. Automated data analysis for powder X-ray diffraction using machine learning. Synchrotron Radiat News. 2022;35(4):9–15. doi:10.1080/08940886.2022.2112496
- Momma K, Izumi F. VESTA: a three-dimensional visualization system for electronic and structural analysis. J Appl Crystallogr. 2008 Jun;41(3):653–658. doi: 10.1107/S0021889808012016
- Toraya H. Array-type universal profile function for powder pattern fitting. J Appl Crystallogr. 1990 Dec;23(6):485–491. doi: 10.1107/S002188989000704X
- Wertheim GK, Butler MA, West KW, et al. Determination of the gaussian and lorentzian content of experimental line shapes. Rev Sci Instrum. 1974;45(11):1369–1371. doi: 10.1063/1.1686503
- Hukushima K, Nemoto K. Exchange Monte Carlo method and application to spin glass simulations. J Phys Soc Jpn. 1996;65(6):1604–1608. doi:10.1143/JPSJ.65.1604
- Nagata K, Sugita S, Okada M. Bayesian spectral deconvolution with the exchange Monte Carlo method. Neural Netw. 2012;28:82–89. doi:10.1016/j.neunet.2011.12.001
- Rietveld HM, HM Rietveld. Line profiles of neutron powder-diffraction peaks for structure refinement. Acta Crystallogr. 1967;22(1):151–152. doi:10.1107/S0365110X67000234
- Rietveld HM. A profile refinement method for nuclear and magnetic structures. J Appl Crystallogr. 1969 Jun;2(2):65–71. doi: 10.1107/S0021889869006558
- Rodríguez-Carvajal J. Recent advances in magnetic structure determination by neutron powder diffraction. Phys B Condens Matter. 1993;192(1–2):55–69. doi:10.1016/0921-4526(93)90108-I
- Akselrud L, Grin Y. Wincsd: software package for crystallographic calculations (version 4). J Appl Crystallogr. 2014;47(2):803–805. doi:10.1107/S1600576714001058
- Xu Y, Yamazaki M, Villars P. Inorganic materials database for exploring the nature of material. Jpn J Appl Phys. 2011 Nov;50(11S):11RH02. doi: 10.1143/JJAP.50.11RH02
- Akimoto J, Gotoh Y, Oosawa Y, et al. Topotactic oxidation of ramsdellite-type li0.5tio2, a new polymorph of titanium dioxide: Tio2(r). J Solid State Chem. 1994;113(1):27–36. doi: 10.1006/jssc.1994.1337
- Latroche M, Brohan L, Marchand R, et al. New hollandite oxides: Tio2(h) and k0.06tio2. J Solid State Chem. 1989;81(1):78–82. doi: 10.1016/0022-4596(89)90204-1
- Kim D-W, Enomoto N, Nakagawa Z-E, et al. Molecular dynamic simulation in titanium dioxide polymorphs: rutile, brookite, and anatase. J Am Ceram Soc. 1996;79(4):1095–1099. doi: 10.1111/j.1151-2916.1996.tb08553.x
Appendix A.
Fitting results in artificial data
We performed a calculation experiment with the artificial data. It was assumed that the XRD data were measured for a sample containing the three crystal structures of : Anatase, Brookite, and Rutile. We generated artificial data based on a set of true parameters and the model defined in Section 3. lists the true parameters of the profile function for the artificial data. We set the x-axis between 10° and 60° in increments of 0.02[], such that is . The background parameters were set to , , , . We performed the analysis using the proposed method without knowing the crystal structure of the sample. The purpose of this experiment was to select the true crystal structures (Anatase, Brookite, and Rutile of ) from the XRD data.
shows the selection results obtained using the proposed method on artificial XRD data. This figure shows the indicator probability [%]. The proposed method estimates the crystal structure of a sample from 50 candidates. Candidate crystal structures were obtained from AtomWork as described in Section 7.4. The y- and x-axes indicate the crystal structure and indicator probability, respectively. As evident, the proposed method could select true crystal structures, that is, Anatase, Brookite, and Rutile, from 50 candidates with 100 [%] probability.
shows the fitting results obtained using the profile function for the artificial data. In , the black and red lines represent the artificial data and fitting profile function, respectively. shows the peak components of the three crystal structures of : Anatase, Brookite, and Rutile. The red, green, and blue lines represent peaks of Anatase, Brookite, and Rutile, respectively. As evident, the estimated profile function provides a good fit. The mean Poisson cost was 4.476, which is close to the true cost of 4.469 given by the true parameters.
shows the posterior distributions of the profile parameters. The red, green, and blue histograms represent the posterior distributions of Anatase, Brookite, and Rutile, respectively. show the peak height , peak shift , Gaussian–Lorentz ratio , and asymmetry parameter , respectively. show the Gaussian widths and Lorentz width , where is []. In , the black dot-dashed line represents the true parameter of the profile function. As evident, the expectation of the posterior distribution is in good agreement with the true values. In addition, the shape of the histogram is a quadratic function, where the y-axis represents the log scale. This implies that the posterior distribution exhibits a Gaussian probability distribution shape. Therefore, this experiment confirms that the proposed method works well on artificial data.
Appendix B.
Data and Fitting in Section 8.2
and show the fitting result of Case-01 and Case-02 in Section 8.2, respectively. In Figure (a), the black and red lines represent the measurement XRD data and the fitting profile function, respectively. Figure (b) shows the peak components. The red, green, and blue lines represent peaks of Anatase, Brookite, and Rutile, respectively.