693
Views
8
CrossRef citations to date
0
Altmetric
Extraocular Structures

Tear Production Levels and Dry Eye Disease Severity in a Large Norwegian Cohort

ORCID Icon, , , , , , , & show all
Pages 1465-1470 | Received 30 Mar 2018, Accepted 16 Aug 2018, Published online: 05 Sep 2018

ABSTRACT

Purpose

To determine if the Schirmer I test (without anesthesia) cut-off value is a predictor of dry eye severity in a large Norwegian cohort of dry eye disease (DED) patients, which are grouped into six levels of tear production.

Methods

Patients (n = 1090) with DED of different etiologies received an extensive dry eye work-up: osmolarity (Osm), tear meniscus height (TMH), tear film break-up time (TFBUT), ocular protection index (OPI), ocular surface staining (OSS), Schirmer I test (ST), meibum expressibility (ME), and meibum quality (MQ). Classification of dry eye severity level (DESL) and diagnosis of meibomian gland dysfunction (MGD) were also included. The cohort was divided into six groups: below and above cut-off values of 5 (groups 1 and 2), 10 (groups 3 and 4), and 15 mm (groups 5 and 6) of ST. Mann-Whitney test and Chi-Square test were used for group comparison of parameters (≤ 0.05).

Results

The groups 1, 3, and 5 had values indicating more severe DED than the groups 2, 4, 6 with significant difference in DESL, Osm, TFBUT, OPI, OSS, and TMH. Regardless of the choice of cut-off values, there was no statistically significant difference in ME, MQ, and MGD between groups below and above selected cut-off value. When gender difference was considered in each group, significant difference was only observed for DESL (groups 2, 4, and 5), TFBUT (groups 2, 4, and 5), OPI (groups 2 and 6), and ME (group1).

Conclusions

Schirmer I is a robust discriminator for DESL, Osm, TFBUT, OPI, OSS, and TMH, but not for ME, MQ, and MGD. Patients with lower tear production levels presented with more severe DED at all three defined cut-off values. Interestingly, the differences in the mean values of DESL were minimal although statistically significant. Thus, the clinical value of different Schirmer levels appears to be limited.

Introduction

Dry eye disease (DED) is a multifactorial disorder, which can be caused by an alteration in the quality or quantity of tear film’s three layers.Citation1 DED can result from primary factors such as lacrimal gland atrophy and secondary factors such as pathological changes in the eyelids, conjunctiva, or cornea along with other influencing factors, including immunological processes, neurotransmitters, hormones, pharmaceuticals, contact lenses and environmental pollution.Citation2Citation4 In addition to these causes, several risk factors have been proposed as contributing to the development of the disease: gender (i.e., female), increasing age, therapies (e.g., postmenopausal estrogen and radiation), dietary deficiencies (e.g., vitamin A and omega-3 fatty acids), and systemic diseases (e.g., hepatitis C).Citation5Citation8

Several clinical tests are available for diagnostic evaluations, clinical trials, and follow-up examinations, including Schirmer I test (ST), measurement of tear osmolarity (Osm), tear film break-up time (TFBUT), ocular surface staining (OSS), tear meniscus height (TMH), meibum expressibility (ME), and meibum quality (MQ). Among them, ST is the most widely used method for assessment of aqueous tear production. It was first introduced by Schirmer in 1903 as installed piece of striped and marked (35 by 5 mm) blotting paper on lower eyelid margin with or without anesthesia to collect tears. ST suffers from several drawbacks such as poor reproducibility, sensitivity and specificity, long testing time (i.e., 5 min), potential for evaporative loss, high variability, discomfort, possibility of uneven absorption of tear by paper strip, and no well-defined cut-off value.Citation9Citation13 Nevertheless, ST without topical anesthesia is considered a valid ophthalmological test for patients with severe DED.Citation14 To assess whether the ST can be utilized as a discriminator for other objective tests, we investigated clinical parameters in a large Norwegian cohort of 1090 dry eye disease (DED) patients grouped into six levels of tear production. The six groups were below and above cut-off values of 5 mm (groups 1 and 2), 10 mm (groups 3 and 4), and 15 mm (groups 5 and 6) of the ST.

Materials and methods

Patients

One thousand and ninety subjects (average age: 52.86 ± 16.03 years, range: 8–95, 796 females and 294 males) diagnosed with DED of different etiologies were consecutively included in the study at the Norwegian Dry Eye Clinic between 2012 and 2016. All subjects received an extensive dry eye work-up.

All examinations were carried out at the same clinic by the same ophthalmologist during normal working hours between 9 AM and 3 PM. The use of the data for the study has been reviewed by The Regional Committee for Medical & Health Research Ethics, Section C, South East Norway (REC). REC found the research project “Evaluation of data from the Norwegian Dry Eye Clinic” to be outside the remit of the Act on Medical and Health Research (2008) and therefore could be implemented without its approval. Prior to data collection, informed consent was obtained from all participants. All procedures performed in this study were in compliance with the Declaration of Helsinki.

Clinical evaluation

All subjects underwent a comprehensive ophthalmic examination by the following stepwise tests: osmolarity (Osm), tear meniscus height (TMH), tear film break-up time (TFBUT), ocular protection index (OPI), ocular surface staining (OSS), Schirmer I test (ST), meibum expressibility (ME), meibum quality (MQ), diagnosis of meibomian gland dysfunction (MGD), and classification of dry eye severity level (DESL).

The tear quality evaluation was performed using TFBUT after instillation of 5 µl 2% fluorescein sodium. Ocular surface staining with fluorescein was recorded after Oxford Grading Scheme: 0–15; range of corneal staining: 0–5).Citation15 Schirmer test was performed without topical anesthesia in 5 min. The calculation of OPI was based on the ratio of the TFBUT divided by the blink interval.Citation16 TMH was measured using a slit lamp. ME was recorded based on the number of secreted glands of the lower eyelid viewed at the slit lamp when light pressure applied by cotton tips on central five MGs (0 = all five glands expressible; 1 = three to four glands expressible; 2 = one to two glands expressible; and 3 = no gland expressible). For MQ, the central eight glands of the lower eyelid were scored on a scale of 0 to 3 for each gland (0 = clear; 1 = cloudy; 2 = cloudy with debris (granular); and 3 = thick, like toothpaste). The sum of the central eight glands was then calculated (total score range, 0–24). DESL was assessed according to the Behrens et al.,Citation17 modified by 2007 International Dry Eye Workshop criteriaCitation18 (), and diagnosis of MGD was based on the suggestions by the international workshop on MGD.Citation19 Only 520 subjects were examined for tear film osmolarity measurement using TearLab Osmolarity System (TearLab Corp, San Diego, CA).

Table 1. Dry eye severity grading scheme.

Statistical analysis

Data from right eye were used for statistical analysis. The cohort was divided into six groups: below and above cut-off values (5, 10, and 15 mm) of ST for comparisons. Mann-Whitney test and Chi-Square test were used for group comparison of parameters using SPSS version 21 (SPSS Inc., Chicago, IL, USA). Chi-Square test was used to determine if females were different from males in each group. Data are presented as mean and standard error. P values less than or equal to 0.05 were considered significant.

Results

presents the subject demographics (average age: 52.86 ± 16.03 years, range: 8–95). The number of females (n = 796) was higher than males (n = 294). In all groups, except age group 0–19, females outnumbered males. The highest number of patients belonged to age group 40–59 (n = 430). In contrast, age groups 80–99 (n = 34) and 0–19 (n = 9) presented the lowest number of patients, respectively.

Table 2. Subject demographics: The age and gender distribution of 1090 subjects with test for gender difference in each group (Chi-square test, ≤ 0.05).

shows the distribution of Schirmer test wetting length. The overall average was 14.67 ± 0.308 mm. Among participants, 20%, 45%, and 62% had Schirmer test of < 5 mm, ≤ 10 mm, and ≤ 15 mm, respectively. The percentage of fully soaked strip (≥ 30 mm) was 12.7%.

Table 3. Distribution of Schirmer I test wetting length.

The analysis revealed that groups 1, 3, and 5 (i.e., individuals with values below cut-off levels of ≤ 5 mm, ≤ 10 mm, and ≤ 15 mm, respectively) had values indicating more severe DED with significant difference of ≤ 0.05 for DESL, Osm, TFBUT, OPI, OSS and TMH compared to the corresponding groups (2, 4, and 6) (). Regardless of the choice of cut-off values, there was no statistically significant difference in ME, MQ and MGD between the groups. The Chi-Square test indicated significant difference between six tear production level groups for TMH (< 0.001) in all three defined cut-off values but there was no significant difference for MGD ().

Table 4. Comparison of clinical parameter results between groups based on three defined cut-off values (5, 10, and 15) for pathological DED measured by Schirmer I test (Mann-Whitney test and Chi-Square test, ≤ 0.05).

When gender difference was considered in each group (), Chi-Square test indicated significant difference only for DESL (groups 2, 4, and 5), TFBUT (groups 2, 4, and 5), OPI (groups 2 and 6), and ME (group1).

Table 5. Comparison of gender difference in each group below and above Schirmer I test cut-off values (5, 10, and 15 mm) for clinical parameter results (Chi-Square test, ≤ 0.05).

Discussion

This study revealed that patients with lower tear production levels measured by ST presented with more severe DED than patients with higher tear production. All the three defined cut-off values (≤ 5, ≤ 10, and ≤ 15 mm) gave rise to significant difference for DESL, Osm, TFBUT, OPI, OSS, and TMH, but not for ME, MQ, and MGD, in the six tear production level groups. There was no clear pattern in the gender difference of each group and significant difference was only observed for DESL (groups 2, 4, and 5), TFBUT (groups 2, 4, and 5), OPI (groups 2 and 6), and ME (group1).

An association between ST and other commonly used clinical DED tests has previously been published. For example, three out of four statistical analyses performed by Nichols et al.Citation20 revealed a significant relationship between the ST (cut-off ≤ 5 mm) and TFBUT (cut-off ≤ 10 s) in DED patients (n = 75) compared to other objective tests. These two parameters were significantly correlated (= 0.40) and lower ST results were found to predict lower TFBUT in the stepwise, multivariate logistic regression models. Also, Wilcoxon rank sum analysis showed that subjects with an abnormal TFBUT had lower mean Schirmer scores.

Sullivan, et al.Citation21 determined correlation coefficients between ST (cut-off < 7 mm) and other tests: Osm (0.05), TFBUT (0.08), fluorescein corneal staining (0.14), lissamine green conjunctival staining (0.13) and Bron/Foulks meibomian gland grading (0.05) in 344 subjects with (262) and without (82) DED. In order to compare their results with an independent data set, the authors used the data collected from a previously published study in Germany with 200 subjects (184 DED and 16 health controls).Citation22 Accordingly, correlation coefficients were reported between ST and other clinical parameters, such as Osm (0.00), TFBUT (0.06), fluorescein corneal staining (0.03), lissamine green conjunctival staining (0.03), and Bron/Foulks meibomian gland grading (0.03). These results indicate that this test of aqueous deficiency can be utilized as a discriminator for some other objective tests.

Among all the parameters investigated in the current study, ST was unable to significantly discriminate only those related to meibomian gland function (ME, MQ and MGD). Patients with MGD have an unstable tear lipid layer but are generally considered to have normal tear production. This instability in the lipid layer may cause a decrease in tear volume, which in turn results in higher rates of evaporations and eventually damage to the ocular surface.Citation23 In contrast, Tung et al.Citation24 observed a different pattern in MGD wherein higher tear volume correlates with worse ocular surface disease. Thus, the authors hypothesized that changes in tear composition such as hyperosmolarity, lactoferrin concentration and inflammatory mediators may play a more important role in causing corneal epithelial damage in MGD than increased evaporation. This may also be the case in our study, as ST could not significantly determine meibomian gland parameters in DED patients. Therefore, measuring tear production may not be an efficient discriminator for ME, MQ, and MGD tests.

Historically, a gradual decrease in cut-off value for ST has been observed. For example, Schirmer,Citation25 De Roetth,Citation26 and Van BijsterveldCitation27 used 15, 10, and 5.5 mm as normal cut-off values in their studies, respectively. The lack of well-defined cut-off values for the available DED tests such as ST complicates research making it difficult to compare results from different studies. The choice of cut-off value by researchers has been based on the purpose of their studies, for example, increased sensitivity for screening and specificity for therapeutic purposes. Irrespective of whether the cut-off value is defined as 5, 10, or 15 mm, the present study showed that ST is a robust discriminator for some clinical DED tests in a cohort of 1090 DED individuals.

For ST without anesthesia, the application of 10 mm cut-off value was considered sufficiently high not to overlook DED for screening purposes, but a proper value should be used to increase specificity in comparing inter-institutional data and evaluating efficacy of drugs even though sensitivity decreases to some extent.Citation28 However, Lee and HyunCitation29 suggested that there is no available normal cut-off value for ST due to the high number of false positive or false negative values, leading to the ratio of 48.4% misdiagnosis in their study. They concluded that ST was not a reproducible test to differentiate DED patients (n = 15) from normal individuals (n = 110) and therefore analogized ST to the toss of a coin in the diagnosis of DED. In contrast, Van Bijsterveld’s (1969) cut-off value of 5.5 mm was suggested to be acceptable because of impossibility in ruling out both false positivity and false negativity in ST test due to its inaccuracy.Citation28,Citation30,Citation31 In the current study, a possible explanation for obtaining significant differences using all three defined cut-off values might be related to the sample size. We recruited 1090 subjects in contrast to other studies with a significantly lower number of individuals. Clearly, larger sample size increases statistical robustness.Citation32 Thus, by increasing the number of subjects, the conclusions may change and researchers should be mindful of this when designing their studies.

In response to the limitations of ST, such as invasive nature, unavoidable variability, poor reproducibility, long testing time (i.e., 5 min), low sensitivity and specificity, alternative strategies have been proposed by researchers. A study evaluating the diagnostic usefulness of ST by using the Japanese diagnostic criteria on DED concluded that the combination of ST and TFBUT increased the diagnostic predictability.Citation28 Other authors prefer to use less invasive tests, for example, phenol red thread testCitation33 or noninvasive methods, such as thermography, keratography, and Hartmann-Shack wavefront sensor.Citation34Citation36 Nevertheless, based on our results, ST should be chosen as a clinical standard for measuring tear production until a more accurate test appears.

Conclusion

Schirmer test is a robust discriminator for DESL, Osm, TFBUT, OPI, OSS, and TMH, but not for ME, MQ, and MGD. Our results demonstrate that patients with lower tear production levels presented with more severe DED, irrespective of different cut-off levels (5, 10, or 15 mm). Interestingly, the differences in the mean values of DESL were minimal although statistically significant. Thus, the clinical value of different Schirmer levels appears to be limited. No clear pattern of sex differences of results were observed in the groups.

Acknowledgments

This work was supported by Oslo University Hospital, Norway, and the Norwegian Dry Eye Clinic, Oslo, Norway.

Disclosure statement

We declare the authors have no competing interests as defined by Current Eye Research, or other interests that might be perceived to influence the interpretation of the article. The authors alone are responsible for the content and writing of the paper.

References