376
Views
0
CrossRef citations to date
0
Altmetric
Chronic Kidney Disease and Progression

Noninvasive diagnosis of interstitial fibrosis in chronic kidney disease: a systematic review and meta-analysis

, , , &
Article: 2367021 | Received 01 May 2024, Accepted 06 Jun 2024, Published online: 28 Jun 2024

Abstract

Rationale and objectives

Researchers have delved into noninvasive diagnostic methods of renal fibrosis (RF) in chronic kidney disease, including ultrasound (US), magnetic resonance imaging (MRI), and radiomics. However, the value of these diagnostic methods in the noninvasive diagnosis of RF remains contentious. Consequently, the present study aimed to systematically delineate the accuracy of the noninvasive diagnosis of RF.

Materials and methods

A systematic search covering PubMed, Embase, Cochrane Library, and Web of Science databases for all data available up to 28 July 2023 was conducted for eligible studies.

Results

We included 21 studies covering 4885 participants. Among them, nine studies utilized US as a noninvasive diagnostic method, eight studies used MRI, and four articles employed radiomics. The sensitivity and specificity of US for detecting RF were 0.81 (95% CI: 0.76–0.86) and 0.79 (95% CI: 0.72–0.84). The sensitivity and specificity of MRI were 0.77 (95% CI: 0.70–0.83) and 0.92 (95% CI: 0.85–0.96). The sensitivity and specificity of radiomics were 0.69 (95% CI: 0.59–0.77) and 0.78 (95% CI: 0.68–0.85).

Conclusions

The current early noninvasive diagnostic methods for RF include US, MRI, and radiomics. However, this study demonstrates that US has a higher sensitivity for the detection of RF compared to MRI. Compared to US, radiomics studies based on US did not show superior advantages. Therefore, challenges still exist in the current radiomics approaches for diagnosing RF, and further exploration of optimized artificial intelligence (AI) algorithms and technologies is needed.

1. Introduction

Chronic kidney disease (CKD) is an irreversible and gradually progressive clinical syndrome resulting from definitive alterations in function and/or structure of the kidney. Adult patients are diagnosed with CKD when their glomerular filtration rate (GFR) stands at less than 60 mL/min/1.73 m2 for three months or longer, or, alternatively, evidence of renal structural injury is detected despite a GFR of over 60 mL/min/1.73 m2 [Citation1]. CKD features excessive extracellular matrix deposition and chronic inflammation and is quite common worldwide. The prevalence of CKD in adults hovers around 13% in the United States and 12% in China [Citation2]. Renal interstitial fibrosis is the pathological basis of end-stage renal disease [Citation3].

According to von Stillfried and Triantopoulou’s research, diagnosing CKD-associated renal fibrosis (RF) is still challenging in clinical practice. Ultrasound (US) elastography, CT, and magnetic resonance imaging (MRI), have emerged as potential diagnostic methods for the noninvasive diagnosis of RF [Citation4,Citation5]. Imagomics, employing big data and machine learning to analyze medical image data, facilitates the development of personalized medicine and precision medicine. With the advances in technology and the deepening of application, imagomics holds the potential to furnish more dependable and precise information for disease diagnosis, treatment, and prognosis assessment. A systematic review and meta-analysis has introduced fresh perspectives for imaging diagnosis of RF. The association between interstitial fibrosis and tubular atrophy (IFTA) and the severity of CKD was closely intertwined [Citation6]. Moderate and severe IFTA alongside glomerulosclerosis escalated the risk of declined renal function by three- and fourfold, respectively, in comparison to mild IFTA [Citation7,Citation8]. Existing methods for monitoring RF are currently inefficient [Citation9]. Kidney biopsy is regarded as the gold standard for diagnosis of CKD and grading of fibrosis. The primary complications associated with native kidney biopsies predominantly encompass hemorrhagic events, which manifest as pain, hematuria, peri-nephric bleeding, resulting in a self-contained hematoma, or active bleeding requiring red blood cell transfusions or interventions to manage the hemorrhage. In severe cases, these complications may even lead to fatal outcomes [Citation10]. Renal biopsy may cause complications, such as pain, hematomas, macroscopic hematuria, and in severe cases, bleeding or even death [Citation11], while certain sampling bias reduces the accuracy of pathological diagnosis. Thus, noninvasive diagnostic methods for fibrosis are urgently needed to avoid these adverse events and facilitate dynamic diagnosis of patients with CKD. Therefore, exploring noninvasive diagnosis of RF has far-reaching clinical significance.

The noninvasive diagnosis of RF predominantly relies primarily on imaging methods [Citation12,Citation13]. In recent years, radiomics has been increasingly applied in the diagnosis and treatment of clinical diseases. Some studies have ventured into applying radiomics methods to assist in the noninvasive diagnosis of RF [Citation14–16]. However, radiomics-based diagnosis is challenging due to the over-configuration of equipment, segmentation specificity of the region of interest (ROI), differences in extracted features, and diversity of models. Consequently, the accuracy of radiomics remains elusive. Thus, the present study aims to ascertain whether radiomics is more efficient than other noninvasive diagnostic techniques.

2. Methods and materials

2.1. Study registration

This study adhered to the PRISMA extension for Diagnostic Test Accuracy (DTA) studies. Furthermore, the study protocol has been duly registered in the PROSPERO database for systematic reviews (ID: CRD42023465028).

2.2. Inclusion criteria and eligibility criteria

Inclusion criteria:

  1. Studies that used CT/MRI/US to assess the diagnostic performance in evaluating RF;

  2. Studies that confirmed the staging of RF with renal biopsy as the gold standard;

  3. Studies that provided data on the sensitivity and specificity of the diagnostic tests, as well as the number of patients at different stages of fibrosis, allowing the calculation of true positives (TPs), false positives (FPs), false negatives (FNs), and true negatives (TNs), thereby enabling the construction of at least one 2 × 2 test performance table for each stage of fibrosis based on threshold values.

Exclusion criteria:

  1. Studies involving subjects with a history of kidney transplantation;

  2. Unpublished conference abstracts;

  3. Insufficient original data to calculate TP, FP, FN, and TN;

  4. Animal experiments and basic research;

  5. Studies not reported in English.

2.3. Data sources and search strategy

We thoroughly retrieved PubMed, Cochrane, Embase, and Web of Science databases, up to 28 July 2023. The literature search has been updated at the time of submission. MESH + free terms were adopted for searching. The search strategies are depicted in Table S1.

2.4. Study selection and data extraction

The retrieved records were imported into EndNote, where duplicates were identified automatically and manually. The titles or abstracts of the remaining studies were read to preliminarily select qualified original research, followed by the full-text review. The full texts of the remaining studies were downloaded and reviewed to select original research that meets the criteria for this review.

Before data extraction, a standardized data extraction spreadsheet was developed. The data to be extracted included: (1) study features: author, country, publication date, study design, and patient recruitment period; (2) patient demographics: age, number of patients, gender ratio, and body mass index (BMI); (3) indicators of performance of imaging examinations: threshold values, sensitivity, specificity, and the area under the ROC curve (AUROC); (4) technical features: SWE or MRI mode, system used, US probe, array transducer, effective number of measurements, measurement depth, and ROI; (5) histological staging of fibrosis. TP, FP, FN, and TN were computed based on the sensitivity and specificity of the diagnostic tests reported in each study.

The literature screening and data extraction were independently conducted by two researchers, Wan Shanshan (a doctor with 5 years of experience in imaging diagnosis) and Wang Jiaping (a doctor with 15 years of experience in imaging diagnosis), and the results were cross-checked upon completion. Any dissents were addressed with the assistance of a third researcher, Xinyu He.

2.5. Risk of bias in study

Two independent researchers leveraged the Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) tool to appraise the risk of bias in the included studies [Citation17,Citation18]. The QUADAS-2 consists of 14 questions in four domains (patient selection, index test, gold standard, flow, and timing) to assess two core issues: risk of bias and applicability. Each question is rated as ‘yes’, ‘no’, or ‘unclear’. A domain is considered to have a high or unclear risk of bias if at least one question within it is rated as ‘no’ or ‘unclear’. Any discrepancies between the assessments of the two researchers will be resolved by a third researcher.

2.6. Synthesis methods

Data analysis was executed by Stata 15.0 (StataCorp LLC, College Station, TX). A bivariate mixed-effects model was used for the meta-analysis. The meta-analysis for sensitivity and specificity was based on the diagnostic 2 × 2 tables. Nonetheless, most original studies did not provide these tables. In such cases, the diagnostic 2 × 2 tables were calculated using the following two approaches: (1) using sensitivity, specificity, and precision in combination with the number of cases; (2) extracting sensitivity and specificity based on the optimal Youden’s index and then combining them with the number of cases for calculation.

For the evaluation of the diagnostic performance of DECT for acute VFs, a bivariate mixed-effects model was employed for meta-analysis. The sensitivity, specificity, positive likelihood ratio (PLR), negative likelihood ratio (NLR), diagnostic odds ratio (DOR), and their 95% confidence interval (95% CI), and the area under the curve of comprehensive subject working characteristics (SROC) were estimated. In addition, we only pooled results from the independent validation sets during the meta-analysis of radiomics. Deek’s funnel plot was leveraged to discern publication bias, and p < .05 was indicative of a statistically significant difference.

3. Results

3.1. Study selection

A total of 4885 articles were retrieved from the database. EndNote was used to screen the literature. After 495 duplicates were expunged, ineligible literature was eliminated through a meticulous review of abstracts and titles. Ultimately, 26 studies were deemed potentially eligible for the systematic review. Then, five studies were further excluded due to inadequate data. Finally, 21 studies [Citation7,Citation19–38] were incorporated into the meta-analysis. The detailed literature screening progress is depicted in .

Figure 1. Literature screening process.

Figure 1. Literature screening process.

3.2. Study characteristics

A total of 21 studies with 4885 participants were incorporated into this meta-analysis. All studies adopted a single-center design. Sixteen studies [Citation19–23,Citation25,Citation27–30,Citation32–37] were based on the populations from China, and nine studies [Citation20,Citation22,Citation24,Citation25,Citation29,Citation32,Citation33,Citation37,Citation38] used noninvasive US to assess the severity of RF. The studies indicated that US could accurately distinguish between normal and moderate-to-severe fibrosis. However, for the diagnosis of early and mild fibrosis, using SWE as an example, there is not yet a consensus among centers on the diagnostic threshold of parameters. Eight studies [Citation7,Citation23,Citation26,Citation28,Citation31,Citation34–36] utilized MRI techniques to detect RF, and commonly used techniques, such as diffusion-weighted imaging/apparent diffusion coefficient (DWI/ADC), intravoxel incoherent motion (IVIM), diffusion tensor imaging (DTI), and gadolinium-enhanced imaging, were compared with pathological diagnoses to assess the severity of fibrosis. The studies suggested that MRI was more effective in assessing moderate to severe fibrosis. However, the diagnostic accuracy for mild fibrosis needs to be improved. There were slight differences in MRI imaging parameters and techniques used across different centers, preventing a lateral comparison of the advantages and disadvantages of each technique. Four studies used US radiomics. It is a recently emerging artificial intelligence (AI) method for evaluating subtle differences in images, aiming at identifying imaging-based criteria for diagnosing RF that are imperceptible to the naked eye ().

Table 1. Fundamental features of the included literature.

3.3. Risk of bias studies

In the studies included in our analysis, 17 studies utilized non-radiomic approaches for diagnosing kidney fibrosis. Consequently, we employed QUADAS-2 for quality assessment. Among these, one study was a case-control study, indicating a high risk of bias in case selection (). On the other hand, four studies employed radiomic methods for diagnosing kidney fibrosis, for which we utilized RQS for quality evaluation. Notably, none of these studies conducted repeat measurements on the same study subjects using different parameters, time points, or devices. Additionally, in terms of statistical methods, there was a lack of outcome evaluation using resampling techniques, and no external validation was conducted based on multicenter approaches. Furthermore, codes were not made publicly available. Consequently, the average score for the four studies was nine points.

Figure 2. The quality assessment of 21 included studies by QUADAS-2 tool.

Figure 2. The quality assessment of 21 included studies by QUADAS-2 tool.

3.4. Meta-analysis

3.4.1. Ultrasound and SWE

Nine studies used US to detect RF. The meta-analysis results showed a sensitivity of 0.81 (95% CI: 0.76–0.86), specificity of 0.79 (95% CI: 0.72–0.84), DOR of 16 (95% CI: 9–28), and an SROC curve of 0.87 (95% CI: 0.16–1.00). Deek’s funnel plot for US indicated no significant publication bias (p = .65). With a prior probability of 35%, the PLR was 3.8 (95% CI: 2.8–5.2) and the NLR was 0.24 (95% CI: 0.18–0.32), as shown in .

Figure 3. US statistical results. (a) Meta-analysis Forest plot for sensitivity and specificity of US in detecting RF. (b) SROC for meta-analysis based on US in detecting RF. (c) Deek’s funnel plot for meta-analysis based on US in detecting RF. (d) Meta-analysis line diagram based on US in detecting RF.

Figure 3. US statistical results. (a) Meta-analysis Forest plot for sensitivity and specificity of US in detecting RF. (b) SROC for meta-analysis based on US in detecting RF. (c) Deek’s funnel plot for meta-analysis based on US in detecting RF. (d) Meta-analysis line diagram based on US in detecting RF.

Seven studies used SWE to detect RF. The meta-analysis results showed a sensitivity of 0.77 (95% CI: 0.71–0.82), specificity of 0.76 (95% CI: 0.67–0.83), DOR of 11 (95% CI: 5–21), and an SROC curve of 0.83 (95% CI: 0.14–0.99). Deek’s funnel plot for SWE demonstrated no significant publication bias (p = .83). With a prior probability of 35%, the PLR was 3.2 (95% CI: 2.2–4.8) and the NLR was 0.30 (95% CI: 0.22–0.41), as illustrated in .

Figure 4. SWE statistical results. (a) Meta-analysis Forest plot for sensitivity and specificity of US in detecting RF. (b) SROC for meta-analysis based on SWE in detecting RF. (c) Deek’s funnel plot for meta-analysis based on SWE in detecting RF. (d) Meta-analysis line diagram based on SWE in detecting RF.

Figure 4. SWE statistical results. (a) Meta-analysis Forest plot for sensitivity and specificity of US in detecting RF. (b) SROC for meta-analysis based on SWE in detecting RF. (c) Deek’s funnel plot for meta-analysis based on SWE in detecting RF. (d) Meta-analysis line diagram based on SWE in detecting RF.

3.4.2. MRI

Eight studies used MRI to detect RF. The meta-analysis results showed a sensitivity of 0.77 (95% CI: 0.70–0.83), specificity of 0.92 (95% CI: 0.85–0.96), DOR of 40 (95% CI: 15–106), and an SROC curve of 0.90 (95% CI: 0.17–1.00) ( and ). Deek’s funnel plot for MRI revealed no significant publication bias (p = .28). With a prior probability of 35%, the PLR was 9.8 (95% CI: 4.9–19.7) and the NLR was 0.25 (95% CI: 0.18–0.35), as illustrated in .

Figure 5. MRI statistical results. (a) Meta-analysis Forest plot for sensitivity and specificity of MRI in detecting RF. (b) SROC for meta-analysis based on MRI in detecting RF. (c) Deek’s funnel plot for meta-analysis based on MRI in detecting RF. (d) Meta-analysis line diagram based on MRI in detecting RF.

Figure 5. MRI statistical results. (a) Meta-analysis Forest plot for sensitivity and specificity of MRI in detecting RF. (b) SROC for meta-analysis based on MRI in detecting RF. (c) Deek’s funnel plot for meta-analysis based on MRI in detecting RF. (d) Meta-analysis line diagram based on MRI in detecting RF.

Figure 6. Radiomics statistical results. (a) Meta-analysis forest plot for sensitivity and specificity of radiomics in detecting RF. (b) SROC for meta-analysis based on radiomics in detecting RF. (c) Deek’s funnel plot for meta-analysis based on radiomics in detecting RF. (d) Meta-analysis line diagram based on radiomics in detecting RF

Figure 6. Radiomics statistical results. (a) Meta-analysis forest plot for sensitivity and specificity of radiomics in detecting RF. (b) SROC for meta-analysis based on radiomics in detecting RF. (c) Deek’s funnel plot for meta-analysis based on radiomics in detecting RF. (d) Meta-analysis line diagram based on radiomics in detecting RF

3.4.3. Radiomics

Four studies [Citation19,Citation21,Citation27,Citation30] used US radiomics to detect RF. The meta-analysis results showed a sensitivity of 0.69 (95% CI: 0.59–0.77), specificity of 0.78 (95% CI: 0.68–0.85), DOR of eight (95% CI: 4–17), and an SROC curve of 0.79 (95% CI: 0.58–0.91) ( and ). Deek’s funnel plot for radiomics indicated no significant publication bias (p = .90). With a prior probability of 35%, the PLR was 3.1 (95% CI: 2.0–4.9) and the NLR was 0.40 (95% CI: 0.28–0.56), as illustrated in .

4. Discussion

4.1. Summary of the main findings

Our meta-analysis of 21 articles revealed that nine studies utilized US, eight used MRI, and four applied radiomics to assess RF. The meta-analysis results indicated that each group was able to accurately assess the level of fibrosis, suggesting that there is no statistically significant difference in accuracy between radiomics and the direct imaging assessment methods of US and MRI. Current additional research methods include PET-CT and CT for assessing fibrosis. However, these methods are constrained by sample size limitations and the absence of comprehensive outcome indicators. Certain researchers have restricted their exploration solely to animals.

4.2. Diagnosis of RF with US

Routine renal US examinations and laboratory tests can reflect the patient’s condition through aspects such as kidney size and shape, cortical thickness, echo of the renal medulla, renal vascular blood flow, as well as plasma albumin, urea, and creatinine levels. These methods exhibit a certain level of subjectivity and fall short in providing a precise evaluation of the extent of renal damage. Laboratory test results for renal function are also relatively delayed; the kidneys have a strong compensatory ability, and abnormalities in renal function indicators often indicate irreversible kidney damage [Citation39].

Previous meta-imaging studies have shown that SWE could be a potential tool for evaluating pathological changes in the kidneys. However, Cè et al. [Citation40] reported that the relationship between USE values and eGFR in CKD patients is still elusive. Our study did not unravel a significant difference in shear wave velocity between healthy individuals and CKD patients. Perfusion alterations may play a pivotal role in the early stage of renal injury, especially in certain subgroups of patients, such as those with diabetes, due to microvascular changes. The studies by Leong et al. and Cao et al. suggested that SWE is effective in diagnosing mild and severe RF but less effective for moderate RF [Citation24,Citation41]. In a study to discern the performance of SWE in detecting renal parenchymal stiffness in patients with CKD [Citation42], SWE is precise in diagnosing RF.

SWE has been shown to be an effective method for evaluating RF [Citation19,Citation29,Citation30]. SWE enables real-time, noninvasive, and quantitative assessment of the renal cortical elasticity modulus in patients with primary nephrotic syndrome (PNS), and can also provide certain reference values for the efficacy evaluation and disease development and outcome of PNS patients diagnosed by biopsy after treatment [Citation43]. SWE exhibits substantial promise in assessing RF in patients with CKD, which holds promise as a noninvasive and cost-effective imaging alternative to renal biopsy in the future. Nevertheless, the application of SWE in the kidneys remains contentious due to the influences of tissue anisotropy, viscoelasticity, and renal hemodynamics. The diversity in US systems, even those produced by the same manufacturer, may lead to variations in SWE values. It is recommended to establish more extensive datasets for each machine model to establish baseline or SWE levels indicative of different kidney stiffness. In comparison to SWE, 2D SWE with elastography allows for better selection of the ROI within the available range of SWE measurements, thereby providing more reproducible results. Given the attenuation of ARFI push pulses and tracking waves with increased skin-to-ROI depth, SWE is not suitable for renal imaging in overweight or obese patients, and variable transducer force may also influence the reproducibility and accuracy of SWE [Citation24]. Therefore, the introduction of more advanced SWE technology, the implementation of larger-scale or multi-center studies to establish methodological standards for renal elastometry, and the establishment of reference values for normal renal elasticity are crucial to increase the reliability of SWE in the diagnosis of kidney diseases [Citation6,Citation32].

4.3. Diagnosis of RF with MRI

MRI has the advantage of noninvasive evaluation with high soft tissue resolution. The included studies used techniques such as DWI-ADC, gadolinium-enhanced imaging, IVIM, and DTI to assess the level of RF. DWI is an imaging method that studies the diffusion movement of water molecules in living tissues. IVIM addresses the limitations of traditional DWI and more accurately describes different tissue characteristics, especially in tissues with complex water molecule dynamics [Citation44]. IVIM-DWI, based on a bi-exponential model, can separately evaluate water molecule diffusion of renal tissues, renal capillary perfusion, and tubular flow, and can more accurately reflect the microstructural changes in renal pathophysiology [Citation45]. IVIM can reflect the density of capillaries within the tissue [Citation46]. The decrease in the D value is mainly due to the increase in cell number from tissue swelling, inflammatory cell infiltration during the fibrosis process, and the deposition of collagen fibers [Citation47,Citation48]. The infiltration of inflammatory cells leads to an increased cellular density, causing an increase in intracellular water molecules and a decrease in extracellular water molecules, along with a decrease in extracellular water molecules due to the deposition of renal collagen fibers, causing restricted diffusion of water molecules inside and outside the cell, thereby leading to a decrease in the D value. During the fibrosis process in the kidney, both perfusion and diffusion decrease simultaneously, making IVIM-DWI a potential quantitative indicator for assessing RF. In previous studies, magnetic resonance elastography (MRE) technology was used to assess liver fibrosis [Citation49,Citation50]; recent research has found that MRE technology is also applicable to organs such as the kidneys [Citation51]. Studies have shown that MRE can detect and assess the degree of RF. When the kidney is subjected to external forces, causing particle displacement, imaging occurs in the magnetic field. During this process, multiple alterations in the kidney occur, such as kidney stiffness, changes in renal blood flow, expansion of the collecting system, and edema. MRE can reflect changes in these indicators [Citation52,Citation53].

4.4. Diagnosis of RF with radiomics

The directionality of US speckle patterns may be detected via wavelet transformation-based radiomics characteristics, which can be utilized to differentiate between individuals with and without CKD [Citation54]. AI-assisted medical imaging approaches provide useful information on features that are difficult for the human eye to detect for the diagnosis and treatment of CKD [Citation55]. AI-assisted medical image analysis as a clinical support tool, and radiomics and deep learning algorithms, can enhance the early detection and prognostic evaluation of CKD. Research has evaluated the feasibility and accuracy of radiomic features of phenotype apparent diffusion coefficient (ADC) maps, which assists in the clinical classification of participants [Citation56]. Yu et al. used CT to diagnose calcification in patients with CKD, based on a radiomic approach [Citation57]. Radiomics is an emerging technical means of precision medicine, and the process of radiomics research is relatively complex.

The present study unveiled that the diagnostic accuracy of traditional influential diagnostics was not significantly different from that of radiomics, suggesting that the study process can be simplified and the degree of RF can be reliably assessed based on large samples of original data. Our radiomics analysis is based on a US-based model. The meta-analysis of US showed a sensitivity of 0.81 (95% CI: 0.76–0.86), and the sensitivity was 0.69 (95% CI: 0.59–0.77) for radiomics based on US. The results of US radiomics studies did not show superior sensitivity compared to direct diagnosis using US. The current US radiomics in diagnosing RF is suboptimal, underscoring the imperative to cultivate a more refined algorithm or AI assessment model for evaluating fibrosis levels in CKD.

4.5. Analysis of the sources of heterogeneity in different diagnostic methods

Indeed, there is heterogeneity among US, MRI, and radiomics, with moderate heterogeneity (30–60%) observed in this study. The heterogeneity in US arises from variations in diagnostic experience and procedural skills among physicians, differences in equipment diagnostic performance, and diagnostic accuracy. The diagnostic accuracy of MRI is influenced by the field strength of the equipment and the scanning techniques employed by technicians, while the accuracy of image interpretation is linked to the knowledge and diagnostic experience of physicians. Moreover, the quality of radiomics research is affected by the modeling methods utilized and the precision of the images. In this study, subgroup classification was conducted based on the characteristics of various examinations to maintain heterogeneity within a reasonable range. Factors such as the frequency of the US transducer, the strength of the MRI magnet, and the pulse sequences utilized can all impact the appearance of fibrosis. Ideally, a subgroup analysis should be performed to assess the influence of these technical factors. However, we did not obtain sufficient granular data to report this influence. Consequently, the summarized estimates of diagnostic accuracy may obscure important differences related to specific acquisition techniques, and the results of this study reflect the average values of different methods for each modality.

4.6. Clinical applicability analysis

US is cheaper than MRI and is more frequently utilized in clinical practice for patients CKD patients. This preference is not solely due to cost considerations but also because US-guided biopsy serves as a crucial pathological method for diagnosing CKD. Under the visualization provided by the US probe, the biopsy needle can precisely position the renal site, thereby mitigating the risks of puncture injury, bleeding, and infection by avoiding renal blood vessels and other organs. Consequently, patients diagnosed pathologically with CKD typically undergo US examination. However, MRI is not a standard diagnostic procedure for CKD in clinical practice, resulting in fewer data acquisitions. MRI, leveraging its advantages in soft tissue imaging, can integrate functional imaging and enhanced scan perfusion data to evaluate renal function, blood flow, and fibrosis levels. Nevertheless, MRI is encumbered by drawbacks, such as high costs, challenges in equipment procurement and dissemination, slow imaging speed, prolonged examination durations, the necessity for multiple sequences, and extended information analysis times, which constrain its widespread adoption and application. In recent years, however, interventional surgical treatments guided by MRI have progressively been applied in clinical practice. In the future, MRI monitoring for visualized diagnosis and treatment of CKD may become feasible, potentially necessitating updated data for analysis and evaluation. The diagnostic accuracies reported in this study represent the average values of different methods for each modality. Nonetheless, in recent years, interventional surgical treatments under MRI have gradually entered clinical practice. In the future, MRI monitoring for the visualization, diagnosis, and treatment of CKD may become possible, at which point data will be updated for analysis and evaluation. The diagnostic accuracies indicated in this study represent the average values of different methods for each modality.

4.7. Strengths and limitations

AI-based medical technologies are rapidly evolving, and machine learning is a sub-area of AI, broadly referring to the process of fitting predictive models to data or identifying groupings of information contained in data [Citation58]. With the increasing use of electronic medical records, the development of patient-generated health data (PGHD) [Citation59], and the normalization of digital pathology, there is a growing demand for the processing and analysis of large datasets and high-dimensional data. The unprecedented progress in machine learning has made the collaborative integration of AI and digital pathology a feasible reality [Citation60]. Imaging models built using MRI and US, aside from inherent imaging factors such as imaging principles and resolution, follow a similar technical route from data annotation and feature extraction to modeling analysis. Thus, radiomics provides a means of comparing imaging data modeled from two different imaging modalities, but proving the effectiveness of certain models requires multi-center data modeling evaluations and external validations within radiomics studies. Our study is the first to comprehensively summarize the evidence for noninvasive diagnosis of RF, which offered important references for the field’s development from an evidence-based medical perspective. However, this study also faces limitations: (1) despite a comprehensive and systematic database search, the literature collected was very limited, and the imaging diagnostic devices and techniques at each center may not allow for a complete lateral comparison for evaluating RF in CKD. Conclusions drawn from a limited set of literature may restrict our interpretation of the findings. (2) In the included studies, the diagnosis of RF was confirmed by biopsy, albeit with slight variations in defining its severity. Our study discussed the diagnosis of RF through US, MRI, and radiomics; however, due to the restricted number of included studies, we were unable to delve deeper into its severity. (3) The imaging parameters for the various diagnostic methods (MRI, US, US radiomics) were not clearly noted in the articles, making it difficult to assess the consistency of the results. (4) This study included only four studies on radiomics, and conclusions drawn from this limited literature require cautious interpretation.

5. Conclusions

Currently, the main diagnostic methods for RF include US, MRI, and radiomics. However, our results unravel that US has a higher sensitivity for detecting RF compared to MRI. When comparing radiomics studies based on US, direct US imaging diagnosis showed better accuracy, suggesting that the current imagomics methods are not perfect and there is still a need to continue exploring more optimized AI algorithms and technologies. The comparison between radiomics studies based on US and direct US imaging diagnosis suggested that direct US imaging is more accurate, indicating that current radiomics methods are not only imperfect but also labor-intensive. Therefore, there is still a need to continue exploring more optimized AI algorithms and technologies. The accuracy differences between radiomics and direct imaging assessment methods like US and MRI are not statistically significant. Therefore, noninvasive diagnosis of RF continues to pose a substantial challenge.

In recent years, radiomics has garnered extensive interest from researchers in clinical practice. Its utility extends beyond tumor diagnosis and treatment prognosis, suggesting a promising trend for the future. In this context, very few researchers have focused on the noninvasive diagnosis of RF. The inclusion of only four radiomics studies in our research may bring some limitations in the analysis process and interpretation of results. This limitation impacts the reliability of comparisons between radiomics and other methods, such as US and MRI. The summarized estimates of diagnostic accuracy for various imaging methods may obscure important differences associated with specific acquisition techniques. Therefore, it is worth noting that our results reflect the averages of different methods for each modality. Our study has reviewed this noninvasive diagnostic approach for RF; however, further research is warranted to incorporate additional data for a more comprehensive assessment of the study’s validity and to broaden the scope of noninvasive imaging diagnostic methods for evaluating RF.

Author contributions

Conception and design: Shanshan Wan and Jiaping Wang; administrative support: Jiaping Wang; collection and assembly of data: Shanshan Wan and Chao Song; data analysis and interpretation: Shanshan Wan; manuscript writing: all authors; final approval of manuscript: all authors.

Supplemental material

Supplemental Material

Download PDF (94.7 KB)

Acknowledgements

Special acknowledgment to Dr. Chunjun Xia.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

The data will not be disclosed to the public but can be shared with another research group upon a reasonable request.

Additional information

Funding

The research project was conducted under the supervision of Professor Jiaping Wang, MD, Professor Chao Song, and funded by the National Natural Science Foundation of China (No. 8186030172), the Kunming Health and Health Commission Health Science and Technology Project (2022-02-05-003) by Xinyu He, Yunnan Xingdian Talent Support Project-Famous Doctor Project (XDYC-MY-2022-0071) by Chao Song, but funding institutions have no role in the study design or implementation of the manuscript. And also funded by the Kunming Medical Joint Special Fund (No. 202401AY070001-339) and the Yunnan Provincial Department of Education Scientific Research Fund Project (No. 2024J0226).

References

  • Drawz P, Rahman M. Chronic kidney disease. Ann Intern Med. 2015;162(11):1–15. doi: 10.7326/AITC201506020.
  • Wu Q, Sun S, Wei L, et al. Twist1 regulates macrophage plasticity to promote renal fibrosis through galectin-3. Cell Mol Life Sci. 2022;79(3):137. doi: 10.1007/s00018-022-04137-0.
  • Luyckx VA, Cherney DZI, Bello AK. Preventing CKD in developed countries. Kidney Int Rep. 2019;5(3):263–277. doi: 10.1016/j.ekir.2019.12.003.
  • von Stillfried S, Apitzsch JC, Ehling J, et al. Contrast-enhanced CT imaging in patients with chronic kidney disease. Angiogenesis. 2016;19(4):525–535. doi: 10.1007/s10456-016-9524-7.
  • Triantopoulou C, Rizos S, Bourli A, et al. Localized unilateral perirenal fibrosis: CT and MRI appearances. Eur Radiol. 2002;12(11):2743–2746. doi: 10.1007/s00330-001-1184-2.
  • Weitzel WF, Kim K, Rubin JM, et al. Feasibility of applying ultrasound strain imaging to detect renal transplant chronic allograft nephropathy. Kidney Int. 2004;65(2):733–736. doi: 10.1111/j.1523-1755.2004.00435.x.
  • Nassar MK, Khedr D, Abu-Elfadl HG, et al. Diffusion tensor imaging in early prediction of renal fibrosis in patients with renal disease: functional and histopathological correlations. Int J Clin Pract. 2021;75(4):e13918. doi: 10.1111/ijcp.13918.
  • Andersen UB, Haddock B, Asmar A. Multiparametric magnetic resonance imaging: a robust tool to test pathogenesis and pathophysiology behind nephropathy in humans. Clin Physiol Funct Imaging. 2023;43(4):207–210. doi: 10.1111/cpf.12818.
  • Grams ME, Sang Y, Ballew SH, et al. A meta-analysis of the association of estimated GFR, albuminuria, age, race, and sex with acute kidney injury. Am J Kidney Dis. 2015;66(4):591–601. doi: 10.1053/j.ajkd.2015.02.337.
  • Poggio ED, McClelland RL, Blank KN, et al. Systematic review and meta-analysis of native kidney biopsy complications. Clin J Am Soc Nephrol. 2020;15(11):1595–1602. doi: 10.2215/CJN.04710420.
  • Corapi KM, Chen JL, Balk EM, et al. Bleeding complications of native kidney biopsy: a systematic review and meta-analysis. Am J Kidney Dis. 2012;60(1):62–73. doi: 10.1053/j.ajkd.2012.02.330.
  • Jaber BL. Clinical nephrology. Curr Opin Nephrol Hypertens. 2017;26(2):105. doi: 10.1097/MNH.0000000000000307.
  • Maurer MH, Härmä KH, Thoeny H. Diffusion-weighted genitourinary imaging. Radiol Clin North Am. 2017;55(2):393–411. doi: 10.1016/j.rcl.2016.10.014.
  • Saritas T, Kramann R. Kidney allograft fibrosis: diagnostic and therapeutic strategies. Transplantation. 2021;105(10):e114–e130. doi: 10.1097/TP.0000000000003678.
  • Li J, An C, Kang L, et al. Recent advances in magnetic resonance imaging assessment of renal fibrosis. Adv Chronic Kidney Dis. 2017;24(3):150–153. doi: 10.1053/j.ackd.2017.03.005.
  • Li S, Wang F, Sun D. The renal microcirculation in chronic kidney disease: novel diagnostic methods and therapeutic perspectives. Cell Biosci. 2021;11(1):90. doi: 10.1186/s13578-021-00606-4.
  • McInnes MDF, Moher D, Thombs BD, et al. Preferred reporting items for a systematic review and meta-analysis of diagnostic test accuracy studies: the PRISMA-DTA statement. JAMA. 2018;319(4):388–396. doi: 10.1001/jama.2017.19163.
  • Whiting PF, Rutjes AW, Westwood ME, et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. 2011;155(8):529–536. doi: 10.7326/0003-4819-155-8-201110180-00009.
  • Chen Z, Chen J, Chen H, et al. A nomogram based on shear wave elastography for assessment of renal fibrosis in patients with chronic kidney disease. J Nephrol. 2023;36(3):719–729. doi: 10.1007/s40620-022-01521-8.
  • Chen Z, Chen J, Chen H, et al. Evaluation of renal fibrosis in patients with chronic kidney disease by shear wave elastography: a comparative analysis with pathological findings. Abdom Radiol. 2022;47(2):738–745. doi: 10.1007/s00261-021-03351-x.
  • Chen Z, Ying MTC, Wang Y, et al. Ultrasound-based radiomics analysis in the assessment of renal fibrosis in patients with chronic kidney disease. Abdom Radiol. 2023;48(8):2649–2657. doi: 10.1007/s00261-023-03965-3.
  • Yang X, Hou FL, Zhao C, et al. The role of real-time shear wave elastography in the diagnosis of idiopathic nephrotic syndrome and evaluation of the curative effect. Abdom Radiol. 2020;45(8):2508–2517. doi: 10.1007/s00261-020-02460-3.
  • He L, Dan G, Yuanbo S, et al. The diagnostic efficacy of diffusion tensor imaging generated by gadolinium-based magnetic resonance imaging for patients with chronic kidney disease. Medicine. 2022;101(27):e29291. doi: 10.1097/MD.0000000000029291.
  • Leong SS, Wong JHD, Md Shah MN, et al. Shear wave elastography accurately detects chronic changes in renal histopathology. Nephrology. 2021;26(1):38–45. doi: 10.1111/nep.13805.
  • Cui G, Yang Z, Zhang W, et al. Evaluation of acoustic radiation force impulse imaging for the clinicopathological typing of renal fibrosis. Exp Ther Med. 2014;7(1):233–235. doi: 10.3892/etm.2013.1377.
  • Mohamed Osman NM, Abdel Kader M, Aziz Nasr TAEL, et al. The role of diffusion-weighted MRI and apparent diffusion coefficient in assessment of diabetic kidney disease: preliminary experience study. Int J Nephrol Renovasc Dis. 2021;14:1–10. doi: 10.2147/IJNRD.S254022.
  • Chen Z, Ying TC, Chen J, et al. Using elastography-based multilayer perceptron model to evaluate renal fibrosis in chronic kidney disease. Ren Fail. 2023;45(1):2202755. doi: 10.1080/0886022X.2023.2202755.
  • Hua C, Qiu L, Zhou L, et al. Value of multiparametric magnetic resonance imaging for evaluating chronic kidney disease and renal fibrosis. Eur Radiol. 2023;33(8):5211–5221. doi: 10.1007/s00330-023-09674-1.
  • Samir AE, Allegretti AS, Zhu Q, et al. Shear wave elastography in chronic kidney disease: a pilot experience in native kidneys. BMC Nephrol. 2015;16(1):119. doi: 10.1186/s12882-015-0120-7.
  • Zhu M, Ma L, Yang W, et al. Elastography ultrasound with machine learning improves the diagnostic performance of traditional ultrasound in predicting kidney fibrosis. J Formos Med Assoc. 2022;121(6):1062–1072. doi: 10.1016/j.jfma.2021.08.011.
  • Friedli I, Crowe LA, de Perrot T, et al. Comparison of readout-segmented and conventional single-shot for echo-planar diffusion-weighted imaging in the assessment of kidney interstitial fibrosis. J Magn Reson Imaging. 2017;46(6):1631–1640. doi: 10.1002/jmri.25687.
  • Radulescu D, Peride I, Petcu LC, et al. Supersonic shear wave ultrasonography for assessing tissue stiffness in native kidney. Ultrasound Med Biol. 2018;44(12):2556–2568. doi: 10.1016/j.ultrasmedbio.2018.07.001.
  • Wang L, Xia P, Lv K, et al. Assessment of renal tissue elasticity by acoustic radiation force impulse quantification with histopathological correlation: preliminary experience in chronic kidney disease. Eur Radiol. 2014;24(7):1694–1699. doi: 10.1007/s00330-014-3162-5.
  • Wei CG, Zeng Y, Zhang R, et al. Native T1 mapping for non-invasive quantitative evaluation of renal function and renal fibrosis in patients with chronic kidney disease. Quant Imaging Med Surg. 2023;13(8):5058–5071. doi: 10.21037/qims-22-1304.
  • Wu J, Shi Z, Zhang Y, et al. Native T1 mapping in assessing kidney fibrosis for patients with chronic glomerulonephritis. Front Med. 2021;8:772326. doi: 10.3389/fmed.2021.772326.
  • Liu Y, Zhang GM, Peng X, et al. Diffusion kurtosis imaging as an imaging biomarker for predicting prognosis in chronic kidney disease patients. Nephrol Dial Transplant. 2022;37(8):1451–1460. doi: 10.1093/ndt/gfab229.
  • Yang X, Yu N, Yu J, et al. Virtual touch tissue quantification for assessing renal pathology in idiopathic nephrotic syndrome. Ultrasound Med Biol. 2018;44(7):1318–1326. doi: 10.1016/j.ultrasmedbio.2018.02.012.
  • Bob F, Grosu I, Sporea I, et al. Ultrasound-based shear wave elastography in the assessment of patients with diabetic kidney disease. Ultrasound Med Biol. 2017;43(10):2159–2166. doi: 10.1016/j.ultrasmedbio.2017.04.019.
  • Barr RG. US-targeted microbubbles to assess liver fibrosis. Radiology. 2022;304(2):483–484. doi: 10.1148/radiol.220595.
  • Cè M, Felisaz PF, Alì M, et al. Ultrasound elastography in chronic kidney disease: a systematic review and meta-analysis. J Med Ultrason (2001). 2023;50(3):381–415. doi: 10.1007/s10396-023-01304-z.
  • Cao H, Ke B, Lin F, et al. Shear wave elastography for assessment of biopsy-proven renal fibrosis: a systematic review and meta-analysis. Ultrasound Med Biol. 2023;49(5):1037–1048. doi: 10.1016/j.ultrasmedbio.2023.01.003.
  • Mo XL, Meng HY, Wu YY, et al. Shear wave elastography in the evaluation of renal parenchymal stiffness in patients with chronic kidney disease: a meta-analysis. J Clin Med Res. 2022;14(2):95–105. doi: 10.14740/jocmr4621.
  • Sasaki Y, Hirooka Y, Kawashima H, et al. Measurements of renal shear wave velocities in chronic kidney disease patients. Acta Radiol. 2018;59(7):884–890. doi: 10.1177/0284185117734417.
  • Ozturk A, Olson MC, Samir AE, et al. Liver fibrosis assessment: MR and US elastography. Abdom Radiol. 2022;47(9):3037–3050. doi: 10.1007/s00261-021-03269-4.
  • Yu YM, Wang W, Wen J, et al. Detection of renal allograft fibrosis with MRI: arterial spin labeling outperforms reduced field-of-view IVIM. Eur Radiol. 2021;31(9):6696–6707. doi: 10.1007/s00330-021-07818-9.
  • Mao W, Zhou J, Zeng M, et al. Chronic kidney disease: pathological and functional evaluation with intravoxel incoherent motion diffusion-weighted imaging. J Magn Reson Imaging. 2018;47(5):1251–1259. doi: 10.1002/jmri.25861.
  • Wang Y, Zhang X, Wang B, et al. Evaluation of renal pathophysiological processes induced by an iodinated contrast agent in a diabetic rabbit model using intravoxel incoherent motion and blood oxygenation level-dependent magnetic resonance imaging. Korean J Radiol. 2019;20(5):830–843. doi: 10.3348/kjr.2018.0757.
  • Zhang JL, Lee VS. Renal perfusion imaging by MRI. J Magn Reson Imaging. 2020;52(2):369–379. doi: 10.1002/jmri.26911.
  • Selvaraj EA, Mózes FE, Jayaswal ANA, et al. Diagnostic accuracy of elastography and magnetic resonance imaging in patients with NAFLD: a systematic review and meta-analysis. J Hepatol. 2021;75(4):770–785. doi: 10.1016/j.jhep.2021.04.044.
  • Sangha K, Chang ST, Cheung R, et al. Cost-effectiveness of MRE versus VCTE in staging fibrosis for nonalcoholic fatty liver disease (NAFLD) patients with advanced fibrosis. Hepatology. 2023;77(5):1702–1711. doi: 10.1097/HEP.0000000000000262.
  • Ajmera V, Cepin S, Tesfai K, et al. A prospective study on the prevalence of NAFLD, advanced fibrosis, cirrhosis and hepatocellular carcinoma in people with type 2 diabetes. J Hepatol. 2023;78(3):471–478. doi: 10.1016/j.jhep.2022.11.010.
  • Korsmo MJ, Ebrahimi B, Eirin A, et al. Magnetic resonance elastography noninvasively detects in vivo renal medullary fibrosis secondary to swine renal artery stenosis. Invest Radiol. 2013;48(2):61–68. doi: 10.1097/RLI.0b013e31827a4990.
  • Zhang J, Yu Y, Liu X, et al. Evaluation of renal fibrosis by mapping histology and magnetic resonance imaging. Kidney Dis. 2021;7(2):131–142. doi: 10.1159/000513332.
  • Bandara MS, Gurunayaka B, Lakraj G, et al. Ultrasound based radiomics features of chronic kidney disease. Acad Radiol. 2022;29(2):229–235. doi: 10.1016/j.acra.2021.01.006.
  • Zhao D, Wang W, Tang T, et al. Current progress in artificial intelligence-assisted medical image analysis for chronic kidney disease: a literature review. Comput Struct Biotechnol J. 2023;21:3315–3326. doi: 10.1016/j.csbj.2023.05.029.
  • Li LP, Leidner AS, Wilt E, et al. Radiomics-based image phenotyping of kidney apparent diffusion coefficient maps: preliminary feasibility & efficacy. J Clin Med. 2022;11(7):1972. doi: 10.3390/jcm11071972.
  • Yu Q, Liu Y, Xie X, et al. Radiomics-based method for diagnosis of calciphylaxis in patients with chronic kidney disease using computed tomography. Quant Imaging Med Surg. 2021;11(11):4617–4626. doi: 10.21037/qims-20-1211.
  • Meng N, Fang T, Feng P, et al. Amide proton transfer-weighted imaging and multiple models diffusion-weighted imaging facilitates preoperative risk stratification of early-stage endometrial carcinoma. J Magn Reson Imaging. 2021;54(4):1200–1211. doi: 10.1002/jmri.27684.
  • Li J, Lin L, Gao X, et al. Amide proton transfer weighted and intravoxel incoherent motion imaging in evaluation of prognostic factors for rectal adenocarcinoma. Front Oncol. 2021;11:783544. doi: 10.3389/fonc.2021.783544.
  • Nishie A, Asayama Y, Ishigami K, et al. Amide proton transfer imaging to predict tumor response to neoadjuvant chemotherapy in locally advanced rectal cancer. J Gastroenterol Hepatol. 2019;34(1):140–146. doi: 10.1111/jgh.14315.