1,246
Views
3
CrossRef citations to date
0
Altmetric
Special Articles

Identifying cases of spinal cord injury or disease in a primary care electronic medical record database

ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon show all

Abstract

Objective

To identify cases of spinal cord injury or disease (SCI/D) in an Ontario database of primary care electronic medical records (EMR).

Design

A reference standard of cases of chronic SCI/D was established via manual review of EMRs; this reference standard was used to evaluate potential case identification algorithms for use in the same database.

Setting

Electronic Medical Records Primary Care (EMRPC) Database, Ontario, Canada.

Participants

A sample of 48,000 adult patients was randomly selected from 213,887 eligible patients in the EMRPC database.

Interventions

N/A.

Main Outcome Measure(s)

Candidate algorithms were evaluated using sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and F-score.

Results

126 cases of chronic SCI/D were identified, forming the reference standard. Of these, 57 were cases of traumatic spinal cord injury (TSCI), and 67 were cases of non-traumatic spinal cord injury (NTSCI). The optimal case identification algorithm used free-text keyword searches and a physician billing code, and had 70.6% sensitivity (61.9–78.4), 98.5% specificity (97.3–99.3), 89.9% PPV (82.2–95.0), 94.7% NPV (92.8–96.3), and an F-score of 79.1.

Conclusions

Identifying cases of chronic SCI/D from a database of primary care EMRs using free-text entries is feasible, relying on a comprehensive case definition. Identifying a cohort of patients with SCI/D will allow for future study of the epidemiology and health service utilization of these patients.

Spinal cord injury or disease (SCI/D) is damage to the spinal cord resulting in neurological impairment; it can be caused either by external injury, known as traumatic spinal cord injury (TSCI), or by other causes such as congenital conditions or disease processes, known as non-traumatic spinal cord injury (NTSCI).Citation1 Historically, health services research in the field of “spinal cord injury” has focused only on TSCI, but recent efforts to classify the potential causes of NTSCI have made it feasible to identify and include non-traumatic cases.Citation2 This is important as the incidence of NTSCI is expected to increase with an aging population.Citation3

Although prevalence estimates are sparse and inconsistent, SCI/D is typically considered a rare disorder;Citation4 however, it has a disproportionately large impact on healthcare utilizationCitation5 due to the many associated impairments and co-morbiditiesCitation6 experienced by patients during the decades of life post-injury.Citation7 Most research in SCI/D has focused on the early experience of patients, during acute in-hospital care and initial specialized rehabilitation, while their experience of the years of life post-discharge is not well described.Citation8–10 In recent years, administrative datasets have been used to address this gap;Citation11–14 however, these data sources typically capture only incident cases over a particular time period and cannot be used to identify prevalent cases of chronic SCI/D. Lack of a clear case definition for SCI/D adds complexity; this is particularly so for NTSCI, which comprises a wide range of rare conditions.Citation2

Following the widespread adoption of electronic medical records (EMR), databases of EMRs aggregated from primary care practices have become available for research.Citation15,Citation16 Methods for identifying various conditions of interest within these databases have been described, employing evidence such as disease classification codes, specific medications, and key diagnostic test or lab values.Citation17–21 However, when implementing a similar case identification method in SCI/D, the types of evidence (such as disease codes) used in other conditions may be unreliable,Citation22,Citation23 so that the use of additional material from free-text fields in the EMR may be necessary.

An advantage of using primary care rather than hospital records is the possibility of identifying prevalent cases, creating a cohort that is more representative of the SCI/D population overall. Additionally, the capture of mild cases of NTSCI can be improved, since these cases may not result in hospitalization.Citation24 Once a cohort is reliably identified, it can be linked to other datasets and used to study the long-term experience of people living with SCI/D.

Our objectives were to (i) develop a method for identifying cases of chronic SCI/D in a primary care EMR database using a comprehensive case definition; (ii) conduct a detailed manual chart review in order to establish a reference standard cohort of cases of chronic SCI/D; and (iii) use the reference standard cohort to validate case finding algorithms that can be used to identify all cases of SCI/D in the EMR database.

Methods

EMR data were extracted from the Electronic Medical Record Primary Care (EMRPC) database (formerly known as EMRALD). EMRPC aggregates data from the EMR systems of participating family physicians across Ontario, Canada, a province with a population of approximately 14.5 million and a single-payer healthcare system. Data were collected semi-annually from primary care clinics that volunteer to participate. At the time of data extraction for this study (December 31, 2016), EMRPC comprised 376 family physicians (approximately 3% of the family physicians in Ontario) in 43 clinics, and 443,038 patients. EMRPC is among the largest EMR databases in the world (those representing more than 1% of their reference population)Citation25 and is unusual in that it contains large amounts of free-text data.Citation16 EMRPC contains all clinically relevant information aggregated from EMRs (see ).

Table 1 Components of EMRPC database.

EMRPC is maintained by ICES, an independent, non-profit research institute funded by an annual grant from the Ontario Ministry of Health (MOH) and the Ministry of Long-Term Care (MLTC).Citation26 As a prescribed entity under Ontario’s privacy legislation, ICES is authorized to collect and use health care data for the purposes of health system analysis, evaluation and decision support. Secure access to these data is governed by policies and procedures that are approved by the Information and Privacy Commissioner of Ontario. The use of the data in this project is authorized under section 45 of Ontario’s Personal Health Information Protection Act (PHIPA) and does not require review by a Research Ethics Board.

The study cohort included patients who had a valid date of birth and a valid health insurance number, were assigned to an active primary care physician, and were more than 14 years old as of the date of the data extraction. We excluded the pediatric population to facilitate chart review, with the age cut-off of 14 years chosen to align with other major studies of this population.Citation3,Citation4 Patients were excluded if they had less than one year of entries in the EMR, or if their physician had been using the EMR for less than 2 years before data extraction, in order to ensure accuracy and completeness of data. After application of the inclusion and exclusion criteria, 213,887 patients comprised the study cohort. A random sample of 48,000 patients was selected from the study cohort to generate a feasible number of charts for manual review. The study was conducted and reported according to recommended guidelines for reporting of case definition validation studies.Citation17

Keyword search strategy

A preliminary keyword search was used to identify potential cases of SCI/D for manual review from the 48,000 patient study sample. Care was taken to ensure that the keyword search strategy was inclusive of all potential cases. Previous studies employing the same database to identify other conditions of interest used a similar keyword search approach.Citation18,Citation27,Citation28

A list of 17 search terms related to SCI/D including associated impairments and syndromes () was developed in consultation with a primary care physician (with expertise in treating SCI/D patients) and used to conduct an initial keyword search in certain free-text components of patient records: the cumulative patient profile (CPP), progress notes (PN), and consultation letters (CONS). Within the CPP, the problem list and medical history sections were considered.

Table 2 Initial search terms.

It was hypothesized that these terms would risk missing some NTSCI cases because primary care physicians may identify them by the etiology rather than the impairment; this was confirmed when a preliminary review of 100 patients identified using a list of NTSCI-related etiologies revealed 5 possible cases. Therefore, in order to maximize capture of potential NTSCI cases, an exhaustive list of NTSCI-related keywords including etiologies was compiled using authoritative sources of NTSCI-related terms.Citation2,Citation29 The inclusion of an additional 126 search terms in the CPP problem list was considered warranted as problem lists maintained by primary care physicians have been shown to be well-maintained and accurate.Citation30–32

For the list of terms in the keyword search strategy, see Supplemental Table 1.

Case definition

A comprehensive case definition for identifying chronic SCI/D for chart review was developed (), based on prior work to characterize TSCI and the diagnostic criteria for NTSCI.Citation22,Citation24

Table 3 Case definition: criteria for chronic TSCI and NTSCI case identification.

This definition was designed to be maximally comprehensive, and includes spina bifida, which is sometimes treated as a separate entity.Citation4 All cases of multiple sclerosis were excluded.

Chart review

During the preliminary chart review process, the reliability of different chart components was noted. A mention of a keyword in the CPP problem list was more likely to identify a case accurately than a mention in progress notes (PN), which contained varied information and were therefore more likely to generate spurious keyword matches resulting from, for example, queries, differential diagnoses, and family histories. The use of diagnostic terms in consultation letters (CONS) tended to be clear and unambiguous, and consultation letters from certain specialties such as neurosurgery, neurology, and physiatry often contained additional evidence to validate the diagnosis, including reports of neurological examinations. Consultation letters from urology often clearly documented neurogenic bladder. The chart review process was designed to take these observations into account in the evidentiary value accorded to different EMR components.

EMRPC chart data was reviewed via a custom online abstraction platform which has previously been used in similar studies.Citation27,Citation33 The chart review process is shown in .

Figure 1 Chart review process.

Figure 1 Chart review process.

All charts were reviewed by a principal reviewer (JS), while a secondary reviewer (JC) validated a random sample of 10% of the charts, and reviewed all cases rated “possible” by the first reviewer, reclassifying them where possible. Cohen’s kappa for the 10% validation sample indicated strong agreement at 87.37%. Complex cases were referred to a senior clinical expert (CC) for adjudication.

Algorithm development

The reference standard cohort contained all definite cases of chronic SCI/D in the 48,000-patient study sample. In order to identify cases of chronic SCI/D in the larger study cohort, algorithms were developed, and their performance was evaluated using the reference standard. These algorithms were composed of text strings (keywords, names of medications) and physician billing codes that, if found when searching the EMRPC database, could accurately identify cases of SCI/D.

Given the different significance and evidentiary value of keywords depending on where they appeared in the chart, these were considered separately in the CPP and in the PN and CONS. In both cases, to determine the value of each keyword for case identification, occurrences in the reference standard charts were analyzed. Based on this analysis, keyword lists were constructed for use in algorithm development.

In order to identify medications commonly used in the SCI/D population, we consulted the available literature on pharmacological use in SCI/D.Citation34–38 However, this was of limited value due to the non-specific nature of most medications commonly used in this population, such as bowel agents, analgesics, antidepressants, and antibiotics. As a second step, a database of indication-medication pairsCitation39 was consulted to determine which drugs were most commonly prescribed for SCI/D. Of these, the reverse pairs (medication-indication) were then examined to determine which drugs were prescribed more commonly for SCI/D than for other indications. From this analysis, three medications were included as an algorithm component: oxybutynin, baclofen, and dantrolene (the brand names Ditropan, Lioresal, and Dantrium were also included).

ICD-9 disease classification codes related to SCI/D were identified in the literatureCitation22,Citation23 and were used as another algorithm component. The three ICD-9 codes (344, 806, and 952) that corresponded to physician billing codes in EMRPC were used.

Using the reference standard cohort as the criterion, confusion matrix values (true positives, true negatives, false positives, false negatives) were tabulated for each algorithm component, and the sensitivity, specificity, PPV, NPV, and F-scores were calculated using Microsoft SQL and Excel. Individual algorithm components were then combined in various permutations to create case identification algorithms, and the performance of these combination algorithms was similarly evaluated. For this study, sensitivity and PPV were considered to be primary outcomes (to select algorithms able to identify all cases of SCI/D, i.e. sensitivity, and only cases of SCI/D, i.e. PPV). F-score, the harmonic mean of sensitivity and PPV, was included as a single summary metric of performance. Performance metrics for all algorithms (including specificity and NPV) are reported in Supplemental Table 2.

Results

Chart review and case identification

Through the manual review of electronic medical records, this study identified 126 cases of chronic SCI/D. Using this cohort as a reference standard, case identification algorithms were developed and evaluated. The overall process is depicted, and key results summarized, in . The 126 validated SCI/D cases in the shaded box became the reference standard cohort (RSC).

Figure 2 Flowchart for SCI/D case identification.

Figure 2 Flowchart for SCI/D case identification.

The demographic characteristics of the RSC compared with the study cohort are presented in ; results are shown for the RSC as a whole, and for TSCI and NTSCI cases separately (two cases could not be classified as either TSCI or NTSCI).

Table 4 Demographic characteristics.

Algorithm development

Keyword analysis

The case identification algorithms developed made use of different types of information in the EMR database. The CPP component identified charts with a keyword match in the CPP (specifically in the problem list and medical history portions). Similarly, the P/C component identified charts with a keyword match in the progress notes (PN) or consultation letters (CONS).

In order to determine which keywords to use in these algorithm components, the occurrence of keywords in the charts of the RSC was analyzed. Because a given keyword (or keyword phrase) can occur at most once in the CPP, it was possible to calculate the sensitivity and PPV of each individual CPP keyword. This information, shown in , was used to construct keyword lists for use in case identification algorithms. The first list of CPP keyword terms, called CPP1, contains eight keywords all of which have a PPV of 100%, meaning that these keywords appeared only in the CPP of true positive cases. The next two lists added further terms in descending order of PPV, using 70% as a breakpoint; grouping the terms in this way facilitated the evaluation of different algorithm permutations with respect to the tradeoff between sensitivity and predictive value.

Figure 3 CPP keyword analysis.

Figure 3 CPP keyword analysis.

In the case of keywords occurring in the P/C components, it was possible for a given keyword to occur multiple times in the same record, making it infeasible to use the same analysis as with CPP keywords. Rather, the total numbers of keyword occurrences were tabulated, and the occurrences in cases were compared with those in non-cases to determine the value of each keyword for case identification. This algorithm component could specify either a certain number of unique terms from the list of keywords (e.g. three different terms) or a certain number of occurrences of any of the terms from the list (e.g. three occurrences of any term, including multiple occurrences of the same term).

The complete lists of inclusion and exclusion terms for CPP and P/C are provided in .

Table 5 Terms included in algorithm components.

Algorithm component evaluation

Algorithm components using the three CPP keyword lists illustrate the tradeoff between sensitivity and PPV. The best overall CPP algorithm component is CPP2 with an F-score of 67.4%. This component has an excellent PPV of 94.3% but the sensitivity of 52.4% is inadequate.

Algorithm components using keywords found in progress notes and consultation letters demonstrated a similar tradeoff between sensitivity and PPV. Overall, the best-performing of these achieved F-scores comparable to the best-performing CPP algorithm components, which is a significant result since this method (using keywords from progress notes and consultation letters in case identification algorithms) has not been documented in previous studies.

The performance of algorithm components using medication names and physician billing codes was also evaluated. The medication name algorithm component performed poorly, with a sensitivity of only 29.4% and PPV of 47.4%. Occurrences of physician billing codes were analyzed using the same method as in other studies: looking for single or multiple (“x2,” “x3,” etc.) occurrences, and also looking for multiple occurrences in a single year (“x2in1”). The only ICD-9 code that identified a significant number of cases was 806 (“fracture of vertebral column with spinal cord injury”), with 20 true positives and 0 false positives. Since there were no false positives, there was no improvement in performance from requiring multiple occurrences.

The other code that identified SCI/D cases was 344 (“other paralytic syndromes”), with three true positives and zero false positives; however, all three of the true positive cases also used the 806 code, so there was no incremental value in using both rather than using 806 alone. For the sake of parsimony ICD-9 code 344 was dismissed in further analysis. ICD-9 code 952 (“spinal cord injury without evidence of spinal bone injury”) did not identify any true positive cases and was dismissed in further analysis.

Algorithm performance evaluation

Using various permutations of the algorithm components, 125 candidate algorithms were created, and their performance evaluated. By combining algorithm components, a synergistic improvement in performance was obtained, increasing true positives and hence sensitivity without commensurately increasing false negatives and diminishing PPV. While the highest F-score of any single component is 68.1%, an optimal combination of components can yield an F-score as high as 79.1%.

The optimal algorithm selects cases that have any of three components: (i) a CPP keyword on the CPP2 list; or (ii) three or more occurrences of any of the keywords in the P/C list; or (iii) an occurrence of ICD-9 code 806. This algorithm has a sensitivity of 70.6% and a PPV of 89.9%.

presents the performance of selected algorithm components and algorithms. For a list of all algorithms considered, see Supplemental Table 2.

Table 6 Performance of selected algorithms.

Discussion

This study has demonstrated that it is possible to reliably identify cases of chronic SCI/D in a primary care EMR database using a detailed case definition, a comprehensive keyword search strategy, and a rigorous manual chart review process. With respect to the development of algorithms for use in the automated identification of cases of SCI/D in EMRPC (and potentially in other similar EMR databases), the precise methods demonstrated in previous studies (using disease codes and prescriptions) proved infeasible in SCI/D. However, it was possible to construct satisfactory case identification algorithms by making more extensive use of free-text keyword searching in the unstructured portions of the EMRPC database.

Unlike many other chronic conditions where diagnosis or billing codes (using a standard classification like ICD-9 or ICD-10) can be used, the relevant codes for SCI/D were found to be unreliable, with only 15.9% of the RSC showing an SCI/D-related physician billing code. This is perhaps because primary care visits by patients with SCI/D do not typically result in an SCI/D-related billing code. When working with a different sample, it would be prudent to re-examine the validity of disease classification codes.

Furthermore, prescriptions and lab test values were similarly unhelpful, although they have been shown to be reliable in identifying cases of other conditions such as MS and diabetes.Citation18,Citation20 With respect to the three medications identified as being both relatively widely used within and specific to SCI/D, only 29.4% of the RSC cases included a prescription for at least one of these; furthermore, 89 false positives were identified, for a PPV of 47.4% for the medication algorithm component.

In order to improve algorithm performance, additional unstructured free-text elements of the EMRPC database such as progress notes and consultation letters were used. Occurrences in these fields of highly specific terms such as “tetraplegia,” “neurogenic bowel,” and “spinal cord injury” proved to be a valuable addition to the contents of the CPP for the purpose of case identification. This approach leveraged the unique inclusion in EMRPC of these substantial free-text elements.

With respect to previous case identification studies in SCI/D,Citation22,Citation24,Citation40,Citation41 this is the first to use EMRs as a data source, and the first to use the information other than diagnostic codes (such as ICD-9 or ICD-10). The algorithms developed in this study performed similarly to those developed to identify other disease populations using the EMRPC database.Citation18,Citation19,Citation27,Citation28 The sensitivity of the optimal algorithm (70.6%) was lower than in some of these studies, which may be attributed to the difficulty of capturing every instance of a diagnosis (SCI/D) that can be designated in many different ways, particularly for cases of NTSCI. The PPV of the optimal algorithm (89.9%), however, is compared favorably. When compared with a broader sample of 40 case-finding studies covering 47 different conditions, using a variety of primary care databases,Citation17 the sensitivity was well within the range observed, with the PPV at the higher end of the range. Although there is no definitive threshold for the performance of classification algorithms, a summary of reviews of health algorithm studies proposed classifying PPVs over 70% as high, 50% to 70% as moderate, and below 50% as low.Citation42

The prevalence of chronic SCI/D in the study sample (calculated using the number of cases in the RSC) is: 262.5 per 100,000 overall; 118.8 per 100,000 for TSCI; and 139.6 per 100,000 for NTSCI. These results agree with the most recent estimate of SCI/D prevalence in Canada:Citation3 252.5 per 100,000 overall; 129.8 per 100,000 for TSCI; and 122.7 per 100,000 for NTSCI.

An important strength of this study is the use of an EMR database (EMRPC) that has been shown to be fairly representative of the overall Ontario SCI/D population.Citation15 Furthermore, EMRPC uniquely includes a large amount of unstructured free-text information which, in this study, proved important in case identification. Another strength is the inclusive and evidence-based case definition of SCI/D, which includes both TSCI and NTSCI. Finally, the rigorous and detailed chart review method helped ensure that cases were identified comprehensively and reliably.

There are several important limitations to this study. First, the emphasis on comprehensiveness in the case definition may have resulted in the inclusion of some patients (for example those with a temporary impairment lasting >48 h) that would not be considered cases of chronic SCI/D according to more restrictive definitions. Second, the use of only two reviewers imposed a limitation on the number of charts that could feasibly be manually reviewed (each chart review took on average approximately 15 min). Third, there is insufficient evidence in EMRPC to reliably determine important information such as level, severity, and completeness of SCI/D. Fourth, the date of onset of SCI/D is not clearly recorded in EMR, making identification of incident cases infeasible. Fifth, it is possible that greater precision may be achieved by considering progress notes and consultation letters separately, rather than combining them into a single component. Finally, it is important to note that the results of this study may not be translatable to other EMR databases that do not contain the same free-text entries.

The novel work using free text keywords documented in this study may be useful to other researchers looking to identify cases of SCI/D in free text or unstructured data sources including EMRs. For such researchers, lessons learned from this study include the importance of including the many potential etiologies of NTSC and the necessity of a precise, detailed case definition. Going forward, data from EMR databases can help address longstanding research gaps with respect to post-rehabilitation health system surveillance for this population.Citation8 The SCI/D cohort identified in EMRPC can be linked with other databases to develop an understanding of the health care utilization, outcomes, cost and care patterns across the entire healthcare system of the SCI/D population, creating a fuller picture of patient journeys throughout their life course, and helping to improve the quality of care.

Abbreviations

CONS: consultation letters; CPP: cumulative patient profile; EMR: electronic medical record; EMRPC: Electronic Medical Records Primary Care database; NPV: negative predictive value; NTSCI: non-traumatic spinal cord injury; P/C: progress notes and consultation letters; PN: progress notes; PPV: positive predictive value; RSC: reference standard cohort; SCI/D: spinal cord injury and disease; TSCI: traumatic spinal cord injury

Disclaimer statements

Contributors None.

Funding This study was supported by the ICES, which is funded by an annual grant from the Ontario Ministry of Health (MOH) and the Ministry of Long-Term Care (MLTC).

Conflicts of interest The authors have no conflicts of interest to report.

Supplemental material

Supplemental Material

Download Zip (60.5 KB)

Acknowledgements

Dr Craven wishes to acknowledge the support of the Toronto Rehab Foundation for the Chair in SCI Rehabilitation. The analyses, conclusions, opinions and statements expressed herein are solely those of the authors and do not reflect those of the funding or data sources; no endorsement is intended nor should be inferred.

References

  • Chhabra HS. ISCoS Textbook on Comprehensive Management of Spinal Cord Injuries. Chhabra HS, (ed.). New Delhi: Wolters Kluwer (India); 2015. 1209 p.
  • New PW, Marshall R. International spinal cord injury data sets for non-traumatic spinal cord injury. Spinal Cord 2014;52(2):123–32.
  • Noonan VK, Fingas M, Farry A, Baxter D, Singh A, Fehlings MG, et al. Incidence and prevalence of spinal cord injury in Canada: a national perspective. Neuroepidemiology 2012;38(4):219–26.
  • Bickenbach J, Boldt I, Brinkhof M, Chamberlain J, Cripps R, Fitzharris M, et al. A global picture of spinal cord injury. In: Bickenbach JE, (ed.) International Perspectives on Spinal Cord Injury. Geneva: WHO-ISCOS; 2013. p. 11–41.
  • Dryden DM, Saunders LD, Rowe BH, May LA, Yiannakoulias N, Svenson LW, et al. Utilization of health services following spinal cord injury: a 6-year follow-up study. Spinal Cord 2004;42(9):513–25.
  • Jensen MP, Truitt AR, Schomer KG, Yorkston KM, Baylor C, Molton IR. Frequency and age effects of secondary health conditions in individuals with spinal cord injury: a scoping review. Spinal Cord 2013;51(12):882–92.
  • DeVivo MJ, Maetz HM, Fine PR, Stover SL. Prevalence of spinal cord injury: a reestimation employing life Table techniques. Arch Neurol. 1980;37(11):707–8.
  • Dvorak MF, Cheng CL, Fallah N, Santos A, Atkins D, Humphreys S, et al. Spinal cord injury clinical registries: improving care across the SCI care continuum by identifying knowledge gaps. J Neurotrauma 2017;34(20):2924–33.
  • Gerber LH, Deshpande R, Prabhakar S, Cai C, Garfinkel S, Morse L, et al. Narrative review of clinical practice guidelines for rehabilitation of people with spinal cord injury. Am J Phys Med Rehabil. 2020;100(5):501–12.
  • Rowan CP, Chan BCF, Jaglal SB, Catharine Craven B. Describing the current state of post-rehabilitation health system surveillance in Ontario – an invited review. J Spinal Cord Med. 2019;42(sup1):21–33.
  • Jaglal SB, Munce SEP, Guilcher SJ, Couris CM, Fung K, Craven BC, et al. Health system factors associated with rehospitalizations after traumatic spinal cord injury: a population-based study. Spinal Cord 2009;47(8):604–9.
  • Guilcher SJT. Physician utilization among adults with traumatic spinal cord injury in Ontario. Spinal Cord 2009;47:470–476.
  • Munce SEP, Wodchis WP, Guilcher SJT, Couris CM, Verrier M, Fung K, et al. Direct costs of adult traumatic spinal cord injury in Ontario. Spinal Cord 2013;51(1):64–9.
  • Guilcher SJT, Munce SEP, Couris CM, Fung K, Craven BC, Verrier M, et al. Health care utilization in non-traumatic and traumatic spinal cord injury: a population-based study. Spinal Cord 2010;48(December 2008):45–50.
  • Tu K, Widdifield J, Young J, Oud W, Ivers NM, Butt DA, et al. Are family physicians comprehensively using electronic medical records such that the data can be used for secondary purposes? A Canadian perspective. BMC Med Inform Decis Mak. 2015;15(1):67.
  • Tu K, Mitiku TF, Ivers NM, Guo H, Lu H, Jaakkimainen L, et al. Evaluation of electronic medical record administrative data linked database (EMRALD). Am J Manag Care. 2014;20(1):15–21.
  • McBrien KA, Souri S, Symonds NE, Rouhi A, Lethebe BC, Williamson TS, et al. Identification of validated case definitions for medical conditions used in primary care electronic medical record databases: a systematic review. J Am Med Inform Assoc. 2018;25:1567–78.
  • Krysko KM, Ivers NM, Young J, O’Connor P, Tu K. Identifying individuals with multiple sclerosis in an electronic medical record. Mult Scler J. 2015;21(2):217–24.
  • Ivers N, Pylypenko B, Tu K. Identifying patients with ischemic heart disease in an electronic medical record. J Prim Care Community Health 2011;2(1):49–53.
  • Tu K, Manuel D, Lam K, Kavanagh D, Mitiku TF, Guo H. Diabetics can be identified in an electronic medical record using laboratory tests and prescriptions. J Clin Epidemiol. 2011;64(4):431–5.
  • Lee TM, Tu K, Wing LL, Gershon AS. Identifying individuals with physician-diagnosed chronic obstructive pulmonary disease in primary care electronic medical records: a retrospective chart abstraction study. npj Prim Care Respir Med. 2017;27(1):34. https://doi.org/http://doi.org/10.1038/s41533-017-0035-9.
  • Hagen E, Rekand T, Gilhus N, Gronning M. Diagnostic coding accuracy for traumatic spinal cord injuries. Spinal Cord 2008;47:367–71.
  • St. Germaine-Smith C, Metcalfe A, Pringsheim T, Roberts JI, Beck CA, Hemmelgarn BR, et al. Recommendations for optimal ICD codes to study neurologic conditions a systematic review. Neurology 2012;79:1049–55.
  • Ho C, Guilcher SJT, McKenzie N, Mouneimne M, Williams A, Voth J, et al. Validation of algorithm to identify persons with non-traumatic spinal cord dysfunction in Canada using administrative health data. Top Spinal Cord Inj Rehabil. 2017;23(4):333–42.
  • Gentil ML, Cuggia M, Fiquet L, Hagenbourger C, Le Berre T, Banâtre A, et al. Factors influencing the development of primary care data collection projects from electronic health records: a systematic review of the literature. BMC Med Inform Decis Mak. 2017;17(1):139. https://doi.org/http://doi.org/10.1186/s12911-017-0538-x.
  • Schull MJ, Azimaee M, Marra M, Cartagena R, Vermulen M, Ho M, et al. ICES: data, discovery. Better Health. Int J Popul Data Sci. 2020;4(2):1–9.
  • Butt DA, Tu K, Young J, Green D, Wang M, Ivers N, et al. A validation study of administrative data algorithms to identify patients with Parkinsonism with prevalence and incidence trends. Neuroepidemiology 2014;43(1):28–37.
  • Breiner A, Young J, Green D, Katzberg HD, Barnett C, Bril V, et al. Canadian administrative health data can identify patients with myasthenia gravis. Neuroepidemiology 2015;44:108–13.
  • New PW, Delafosse V. What to call spinal cord damage not due to trauma? Implications for literature searching. J Spinal Cord Med. 2012;35(2):89–95.
  • Wright A, Feblowitz J, Maloney FL, Henkin S, Bates DW. Use of an electronic problem list by primary care providers and specialists. J Gen Intern Med. 2012;27(8):968–73.
  • Luna D, Franco M, Plaza C, Otero C, Wassermann S, Gambarte ML, et al. Accuracy of an electronic problem list from primary care providers and specialists. Stud Health Technol Inform. 2013;192(1–2):417–21.
  • Zhou X, Zheng K, Ackerman MS, Hanauer DA. Cooperative documentation: the patient problem list as a nexus in electronic health records. In: Proceedings of the ACM 2012 conference on Computer Supported Cooperative Work (CSCW ‘12). 2012. p. 911–20.
  • Weisman A, Tu K, Young J, Kumar M, Austin PC, Jaakkimainen L, et al. Validation of a type 1 diabetes algorithm using electronic medical records and administrative healthcare data to study the population incidence and prevalence of type 1 diabetes in Ontario, Canada. BMJ Open Diabetes Res Care 2020;8(1):1–11.
  • Patel T, Milligan J, Lee J. Medication-related problems in individuals with spinal cord injury in a primary care-based clinic. J Spinal Cord Med. 2017;40(1):54–61.
  • Hwang M, Zebracki K, Vogel LC. Medication profile and polypharmacy in adults with pediatric-onset spinal cord injury. Spinal Cord 2015;53(9):673–8.
  • Høgholen H, Storhaug A, Kvernrød K, Kostovski E, Viktil KK, Mathiesen L. Use of medicines, adherence and attitudes to medicines among persons with chronic spinal cord injury. Spinal Cord 2017;56:35–40.
  • Guilcher SJT, Hogan ME, Calzavara A, Hitzig SL, Patel T, Packer T, et al. Prescription drug claims following a traumatic spinal cord injury for older adults: a retrospective population-based study in Ontario, Canada. Spinal Cord 2018;56(11):1059–68.
  • Rouleau P, Guertin PA. Traumatic and nontraumatic spinal-cord-injured patients in Quebec, Canada. Part 3: pharmacological characteristics. Spinal Cord 2011;49(2):186–95.
  • Wei W-Q, Cronin RM, Xu H, Lasko TA, Bastarache L, Denny JC. Development and evaluation of an ensemble resource linking medications to their indications. J Am Med Inform Assoc. 2013;20(5):954–61.
  • Noonan V, Thorogood NP, Fingas M, Batke J, Belanger LM, Kwon BK, et al. The validity of administrative data to classify patients with spinal column and cord injuries. J Neurotrauma 2012;30:173–80.
  • Welk B, Loh E, Shariff SZ, Liu K, Siddiqi F. An administrative data algorithm to identify traumatic spinal cord injured patients: a validation study. Spinal Cord 2014;52(1):34–8.
  • Carnahan RM. Mini-Sentinel’s systematic reviews of validated methods for identifying health outcomes using administrative data: summary of findings and suggestions for future research. Pharmacoepidemiol Drug Saf. 2012;21(S1):90–9.