Clinical Epidemiology, Volume 9, 2017

Open access

Perspectives

Clinical epidemiology in the era of big data: new opportunities, familiar challenges

Vera Ehrenstein, Department of Clinical Epidemiology, Aarhus University Hospital, Aarhus N, Denmark (corresponding author)

Henrik Nielsen, Department of Clinical Epidemiology, Aarhus University Hospital, Aarhus N, Denmark

Alma B Pedersen, Department of Clinical Epidemiology, Aarhus University Hospital, Aarhus N, Denmark

Søren P Johnsen, Department of Clinical Epidemiology, Aarhus University Hospital, Aarhus N, Denmark

Lars Pedersen, Department of Clinical Epidemiology, Aarhus University Hospital, Aarhus N, Denmark

Pages 245-250 | Published online: 27 Apr 2017

Abstract

Routinely recorded health data have evolved from mere by-products of health care delivery or billing into a powerful research tool for studying and improving patient care through clinical epidemiologic research. Big data in the context of epidemiologic research means large interlinkable data sets within a single country or networks of multinational databases. Several Nordic, European, and other multinational collaborations are now well established. Advantages of big data for clinical epidemiology include improved precision of estimates, which is especially important for reassuring (“null”) findings; the ability to conduct meaningful analyses in subgroups of patients; and rapid detection of safety signals. Big data will also provide new possibilities for research by enabling access to linked information from biobanks, electronic medical records, patient-reported outcome measures, automatic and semiautomatic electronic monitoring devices, and social media. The sheer amount of data, however, does not eliminate and may even amplify systematic error. Therefore, methodologies addressing systematic error, clinical knowledge, and underlying hypotheses are more important than ever to ensure that the signal is discernible behind the noise.

Keywords:

electronic health records
healthcare administrative claims
medical record linkage
multicenter studies
validation studies

Introduction

Big data has firmly established itself in health research,^{Citation1,Citation2} as illustrated by publications in high-ranking general-interest biomedical journals, including The New England Journal of Medicine,^Citation3 JAMA,^Citation4 Journal of Internal Medicine,^Citation5 Science,^Citation6^–^Citation9 and Nature.^Citation10^–^Citation13 A basic definition of big data includes “the 3 Vs”: variety (linkage of many data sets from heterogeneous independent sources in a single data set); volume (large number of observations and variables per observation from different sources); and/or velocity (real-time or frequent data updates, often fully or partially automated).^Citation14 Other definitions encompass three additional Vs: value (clinically relevant information); variability (eg, seasonal or secular disease trends); and veracity (data quality).^Citation2 Routinely recorded health data are large automated data sets stemming from day-to-day activities of health care, such as hospital admissions or claims.^Citation15^–^Citation18 These data have evolved from mere by-products of health care delivery or billing into a powerful tool for improving patient care through preventive, etiologic, and prognostic epidemiologic research.^Citation4 A recent article summarizes the 46 most influential studies conducted with big data in health care,^Citation1 while a review from 2015 provides multiple examples of the “variety” V in big data for health.^Citation2

The notion of applying lessons from the clinical past to the clinical future is “as old as medicine.”^Citation19 In a simplified form, evidence-based medical care means that a clinician can use research results in making treatment decisions in his or her clinical practice, often through explicit literature-based treatment guidelines. For a clinician, this means answers to questions such as: “How likely is my patient with atrial fibrillation on oral anticoagulants to develop major bleeding? Does the risk vary by type of anticoagulant or patient characteristics?” or “To what extent does comorbidity affect mortality of patients with hip fracture?” To be answered, a clinical question must first be translated into a precise research question and then back-translated and interpreted for clinical decision making. Therefore, it is essential for clinicians and epidemiologists to understand each other’s language. For an epidemiologist, an answer to a research question should be a precise and valid estimate of an underlying population parameter such as a mean, risk, incidence rate, or odds ratio. Big data, via the “volume” V, often addresses the precision component but does little to address validity (the “veracity” V in the big-data vocabulary). Plausible hypotheses, expert knowledge, and accurate measurement tools must be available to ensure validity of research findings, since a highly precise biased result, especially one perceived as credible based on precision alone, is more dangerous when translated into clinical practice than an imprecise biased result.^{Citation20,Citation21} This paper, using primarily case studies from the Nordic countries, provides a brief overview and examples of the use of big data in clinical epidemiology and outlines associated advantages and challenges.
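
The contrast between precision and validity can be illustrated numerically. The sketch below is a minimal illustration and not an analysis from any study cited here: `TRUE_RISK` and `OBSERVED_RISK` are hypothetical values, and the gap between them stands in for a fixed systematic error such as outcome misclassification. It uses a simple Wald interval for a proportion to show that a larger sample narrows the interval around the biased value until the interval no longer contains the truth.

```python
import math


def wald_ci(p_hat: float, n: int, z: float = 1.96) -> tuple[float, float]:
    """Approximate 95% Wald confidence interval for a proportion."""
    half_width = z * math.sqrt(p_hat * (1.0 - p_hat) / n)
    return p_hat - half_width, p_hat + half_width


TRUE_RISK = 0.030      # hypothetical true risk in the source population
OBSERVED_RISK = 0.036  # hypothetical estimate under a fixed systematic error


def ci_covers_truth(n: int) -> bool:
    """Does the interval around the biased estimate still contain the true risk?"""
    low, high = wald_ci(OBSERVED_RISK, n)
    return low <= TRUE_RISK <= high


for n in (1_000, 10_000, 1_000_000):
    low, high = wald_ci(OBSERVED_RISK, n)
    print(f"n={n:>9,}: 95% CI {low:.4f} to {high:.4f}, covers truth: {ci_covers_truth(n)}")
```

With these assumed values the interval still covers the true risk at n = 1,000 but not at n = 10,000 or beyond, which is the sense in which a precise biased estimate can mislead more than an imprecise one.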

Examples of big data collaborations in epidemiology

Some say that the digitalization of medical records revolutionized the usability of big data in medical research.^Citation4 Whether or not this claim is accepted, it is important to be aware that the current development follows a long evolution of using register data for medical research. This evolution started with the establishment of the first National Leprosy Register, in Norway, in 1856 (Figure 1),^{Citation22,Citation23} and of the Danish Cancer Registry, in 1943.^Citation24 Other Nordic registries followed, most of them established between the 1960s and the early 2000s.^{Citation25,Citation26} Researchers in the Nordic countries were using the volume component of big data before the term was invented: for decades, epidemiologists have been conducting studies based on linkage of routinely collected data from multiple administrative, health, and demographic registries, and the potential of these registries has been recognized at least since the 1990s,^Citation27 if not earlier.^Citation28

Figure 1 Building that used to house the Norwegian Leprosy Registry, currently home of the Department of Global Public Health and Primary Care, University of Bergen, Norway.

Note: Photo courtesy of Dr Astrid Lunde.

Estimates of association with narrow confidence intervals often stem from big data analyses of common health outcomes in population-based registry data spanning several decades. When the intervention or the outcome of interest is rare, even data from an entire country may be insufficient, requiring that data from different countries be combined. Several formal or ad hoc collaborative networks in observational epidemiology have arisen, often from the need to study benefits and risks of relatively uncommon pharmacological^{Citation16,Citation29}^–^Citation31 or surgical^{Citation32,Citation33} interventions, or vaccines.^{Citation3,Citation30} Examples of pan-Nordic collaborations using combined data from Denmark, Finland, Iceland, Norway, and Sweden^{Citation31,Citation34,Citation35} include studies on prenatal exposure to antidepressants and adverse effects in the offspring^{Citation31,Citation34,Citation35} and the Nordic Arthroplasty Register Association (NARA) database of about 1 million primary hip and knee replacement procedures performed since 1995 in Denmark, Finland, Norway, and Sweden.^Citation36 NARA enabled studies of rare risk factors and outcomes, for which single-country data are too sparse.^{Citation32,Citation33} One clinically relevant question is whether the type of fixation used in total hip replacement (THR) is associated with risk of subsequent revision in patients younger than 55 years of age, since these patients may differ from older patients in mobility, post-THR life expectancy, and compliance with treatment. Only 5% of THR procedures are performed in patients younger than 55 years, and previous studies, including those based on national hip registries, had insufficient sample size to address the fixation issue in younger patients. Pedersen et al^Citation37 used NARA to assemble a study population of ~30,000 patients younger than 55 years undergoing THR, with each fixation technique represented by more than 3,000 observations. 
The study yielded a clinically relevant message that uncemented implants are associated with a lower long-term risk of aseptic loosening but a higher short-term risk of revision. Thus, the purpose of uncemented implants has been achieved in the long term, but technical issues causing dislocation, periprosthetic fracture, and infection had previously been overlooked in patients younger than 55 years.

Use of routinely collected data for epidemiologic research has also been possible outside the Nordic countries, for example with general practice-based data in the UK or claims-based databases and database networks in the USA. In contrast to the typical European health care databases, which are established to fulfill administrative (health services), clinical quality, or surveillance needs, the US claims databases (eg, Medicare, Medicaid, and commercial insurance records) are by-products of medical accounting. Several European database networks, including those encompassing the Nordic data, have been successfully established and have found ways to overcome challenges of differences in the underlying health care systems, languages, data-sharing laws, record-generating mechanisms, and classifications.^{Citation5,Citation16,Citation30,Citation38,Citation39} Medical data in the Nordic countries are coded using a common basic set of standard classifications (International Classification of Diseases, Nordic Medico-Statistical Committee classification for procedures and causes of injury,^{Citation40,Citation41} or Anatomical Therapeutic Chemical codes for medications), which makes it easier to establish common algorithms. In the USA, Medicare and Medicaid provide financial incentives for “meaningful use” of electronic health records.^Citation3 The most prominent big data collaborative models in the USA have been the Mini-Sentinel project and the Observational Medical Outcomes Partnership (OMOP).^Citation3 The differences between routine records accumulated in systems like Mini-Sentinel or OMOP and those in Europe lie in the structure of the health care system, linkage possibilities, and the availability of lifelong complete follow-up. 
Thus, certain aspects of big data in Nordic countries are more diverse than those in many other databases (the “volume” V and the “variety” V of the big data), thanks to individual-level linkage to both medical and nonmedical data, including education, income, and residence, and because of lifelong follow-up. In 2013, the Mini-Sentinel project covered 360 million person-years of observation representing 150 million lives.^Citation3 In 2014, the Danish Civil Registration System, with its linkable network of national registries, covered 400 million person-years of observation from 9.5 million lives.^Citation25 Asian countries are building a linkable registry infrastructure with individual-level linkage modeled on that of the Nordic countries.^Citation42
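
The person-year figures quoted above imply very different average observation times per person. As a rough back-of-the-envelope check (a crude ratio that ignores how follow-up is distributed across individuals; the function name is illustrative only), person-years can be divided by lives:

```python
def mean_follow_up_years(person_years: float, lives: float) -> float:
    """Crude average observation time per person, in years."""
    return person_years / lives


# Figures as quoted in the text: Mini-Sentinel (2013) and the
# Danish Civil Registration System (2014).
mini_sentinel_years = mean_follow_up_years(360e6, 150e6)
danish_crs_years = mean_follow_up_years(400e6, 9.5e6)

print(f"Mini-Sentinel: {mini_sentinel_years:.1f} years per person")
print(f"Danish Civil Registration System: {danish_crs_years:.1f} years per person")
```

This works out to about 2.4 years per person in Mini-Sentinel and about 42 years per person in the Danish system, in line with the contrast in length of follow-up described above.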

The “variety” V of the big data is developing rapidly, whereby previously unused or underused types of data are incorporated into medical research, including electronic medical records, imaging, biobanks, and patient-reported data (including social media and wearables).^{Citation2,Citation43} Individual linkage may not always be necessary: in a classical ecologic study, hostility of language on Twitter was associated with county-level mortality from heart disease.^Citation44 Pharmacovigilance with social media is already a reality.^Citation45 Mobile phones can be used to test and subsequently deliver behavioral interventions such as smoking cessation aid^Citation46 or adherence support.^Citation47 The type of bias associated with certain types of data may change over time. For example, in the early days of epidemiologic research, random landline phone surveys tended to select the relatively more affluent, the employed, and the young. Today, these groups are more likely to be accessed via social networks and mobile telephony,^Citation2 while use of landline phones may select for older or disadvantaged population segments.

Assembling database networks carries with it technical, logistical, ethical, and legal challenges.^Citation48 The last two are often the hardest to overcome because of issues of data access, patient privacy, and potential conflicts of interest. Even in large studies, one has to remain vigilant about patient privacy and the possibility of inadvertently identifying individuals based on a set of rare characteristics. Gini et al^Citation16 provide a practical guide to the different models of data networking, defined by the degree of centralization and harmonization of the different analytic processes. It seems practical to designate a single network partner, with adequate resources, as the coordinating analytic hub. The process starts with raw data from each participating database and ends with the statistical output combining results of individual patients from all databases. Between the starting and the end points, there exist different models for the extent of process automation, autonomy, and control enjoyed by each data partner. A global protocol, with flexibility for local adaptations, is usually followed. Depending on the aims of the study, the analysis may entail as little sharing as contributing country-specific odds ratios for a meta-analysis or as much sharing as harmonization and pooling of individual-level data sets.^Citation16 Harmonization involves transformations, whereby each partner creates standard input data sets according to an exact specification, a common data model (CDM), which dictates the data set types and structure, variable names and attributes, and definitions of derived variables. A single statistical analytic program is then run on the CDM-conforming files either by each network partner locally (“one analyst, many outputs”) or centrally by the hub on the combined data set (“one analyst, one output”). By contrast, the “many analysts, many outputs” approach is discouraged because it is prone to error and duplicates work. 
Whether there is one analyst or many, quality control of programming by another analyst is always necessary.
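
The “one analyst, many outputs” model with minimal sharing can be sketched as follows. This is a toy illustration under stated assumptions and not the implementation used by any network named above: the `CdmRecord` fields, the two local formats, and all counts are hypothetical, the shared analysis is a crude odds ratio, and the hub combines partner estimates by inverse-variance fixed-effect pooling.

```python
import math
from dataclasses import dataclass


@dataclass
class CdmRecord:
    """One row of a hypothetical common data model (CDM) input data set."""
    person_id: str
    exposed: bool
    outcome: bool


# Hypothetical local formats: each partner maps its own field names and codes to the CDM.
def harmonize_partner_a(row: dict) -> CdmRecord:
    return CdmRecord(str(row["id_a"]), row["drug"] == "Y", row["dx"] == "Y")


def harmonize_partner_b(row: dict) -> CdmRecord:
    return CdmRecord(str(row["pid"]), row["exposure_flag"] == 1, row["event_flag"] == 1)


def log_odds_ratio(records: list[CdmRecord]) -> tuple[float, float]:
    """The single shared analysis: crude log odds ratio and its standard error.

    Assumes no empty cell in the 2x2 table; a real program would handle that case.
    """
    a = sum(r.exposed and r.outcome for r in records)
    b = sum(r.exposed and not r.outcome for r in records)
    c = sum(not r.exposed and r.outcome for r in records)
    d = sum(not r.exposed and not r.outcome for r in records)
    return math.log(a * d / (b * c)), math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)


def pool_fixed_effect(estimates: list[tuple[float, float]]) -> tuple[float, float]:
    """Inverse-variance fixed-effect pooling of (log odds ratio, standard error) pairs."""
    weights = [1.0 / se ** 2 for _, se in estimates]
    pooled = sum(w * est for w, (est, _) in zip(weights, estimates)) / sum(weights)
    return pooled, math.sqrt(1.0 / sum(weights))


def toy_cells(a: int, b: int, c: int, d: int) -> list[tuple[bool, bool]]:
    """Expand 2x2 cell counts into (exposed, outcome) pairs; toy data only."""
    cells = [(True, True, a), (True, False, b), (False, True, c), (False, False, d)]
    return [(e, o) for e, o, n in cells for _ in range(n)]


raw_a = [{"id_a": i, "drug": "Y" if e else "N", "dx": "Y" if o else "N"}
         for i, (e, o) in enumerate(toy_cells(20, 80, 10, 90))]
raw_b = [{"pid": i, "exposure_flag": int(e), "event_flag": int(o)}
         for i, (e, o) in enumerate(toy_cells(30, 170, 20, 180))]

# Each partner harmonizes and runs the shared program locally; only estimates reach the hub.
local_estimates = [
    log_odds_ratio([harmonize_partner_a(r) for r in raw_a]),
    log_odds_ratio([harmonize_partner_b(r) for r in raw_b]),
]
pooled_log_or, pooled_se = pool_fixed_effect(local_estimates)
print(f"Pooled odds ratio: {math.exp(pooled_log_or):.2f}")
```

In this sketch no individual-level rows leave a partner. Pooling individual-level data at the hub (“one analyst, one output”) would instead concatenate the harmonized records before calling `log_odds_ratio` once.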

Health outcomes measured by health care professionals might differ from the outcomes subjectively experienced by patients, and the latter also affect the outcome of treatment. To fill this gap, patient-reported outcome measures (PROMs) are being used increasingly.^Citation49 An example of incorporation of PROMs in a single-country setting, while capitalizing on unique data linkage capabilities common to the Nordic settings, is the generic infrastructure for collecting PROM data, AmbuFlex, developed in Denmark by Hjollund et al.^Citation50 The researchers have successfully implemented flexible paper-based and electronic collection of PROM data in more than 20 projects since 2004. Group-level aggregated PROM data, linked with data from routine registries and clinical databases, can be used to monitor national and regional hospital performance in oncology and cardiology care, psychiatry, neurology, and orthopedics. Patient-level PROM data collected at the clinic level, in combination with electronic health records, can be used to facilitate screening, clinical decisions, patient–doctor communication, and efficient use of resources in cardiology, rheumatology, and oncology. Response rates exceeded 75% in all and 90% in most cases. A clinical decision support function of PROMs can save clinicians’ time by using an algorithm-based initial identification of patients in need of immediate attention, while presenting data on other patients in a decision-supporting format for clinical judgment.^Citation50 AmbuFlex is a unique example of implementation in routine care: a generic system integrated with electronic medical records and used for longitudinal collection of detailed PROM data on an individual level to personalize care for the individual patient. This allows the collection of PROM data on large cohorts of chronically ill patients over many years, similar to the systems currently in place for administrative data.

Big data in epidemiology: benefits and challenges

Precision of results is not the only benefit of big data. Observations from large numbers of individuals allow rapid detection of potential risk signals associated with newly marketed therapies, for which risks of rare adverse events are rarely known from Phase III preapproval trials (the velocity “V” of the big data).^Citation51 A thought experiment showed that having records of 100 million patients for safety monitoring would have allowed the detection of adverse cardiovascular effects of rofecoxib (Merck, Kenilworth, NJ, USA) in 3 months instead of 5 years.^{Citation5,Citation52} On the other hand, large data sets help convincingly rule out harmful associations, in the so-called “null studies.” One example is the abovementioned Nordic collaboration on safety of antidepressant use in pregnancy. Fewer than 2% of pregnant women use selective serotonin reuptake inhibitors (SSRIs) in pregnancy, while birth defects affect about 3% of live births. Therefore, it took a pan-Nordic study to assemble a study population of >1.5 million pregnancies with ~73,000 malformation cases, including ~33,000 SSRI-exposed pregnancies with >1,300 cases exposed to SSRIs.^Citation34 The study convincingly showed a null association between maternal use of SSRIs and major birth defects, providing reassurance to pregnant women with depression and their physicians. Finally, in analyses based on large data sets, estimates are likely to be “highly statistically significant,” ie, associated with P-values <0.05. This “universal statistical significance” could finally lay to rest reliance on P-values for interpretation of study results, allowing researchers to focus on clinical significance instead.^Citation53^–^Citation55
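
The point about “universal statistical significance” can be illustrated with a small calculation. The sketch below assumes a hypothetical, clinically negligible difference in risk (3.1% versus 3.0%) and equal group sizes, and applies a standard pooled two-proportion z-test; the numbers are illustrative and are not taken from the studies cited.

```python
import math


def two_proportion_p_value(risk_exposed: float, risk_unexposed: float,
                           n_per_group: int) -> float:
    """Two-sided P-value from a pooled two-proportion z-test with equal group sizes."""
    pooled = (risk_exposed + risk_unexposed) / 2.0
    se = math.sqrt(2.0 * pooled * (1.0 - pooled) / n_per_group)
    z = abs(risk_exposed - risk_unexposed) / se
    # For a standard normal Z, P(|Z| > z) equals erfc(z / sqrt(2)).
    return math.erfc(z / math.sqrt(2.0))


# The same hypothetical risk difference evaluated at increasing sample sizes.
for n in (10_000, 1_000_000, 100_000_000):
    p = two_proportion_p_value(0.031, 0.030, n)
    print(f"n per group = {n:>11,}: P = {p:.3g}")
```

The difference itself never changes; only the P-value does, which is why the clinical relevance of the effect size, rather than its statistical significance, has to carry the interpretation in very large data sets.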

The perks of big data should not go to our collective heads. Big data does not address the usual epidemiologic challenges related to validity, and may even amplify them.^{Citation15,Citation56} Accurate measurement of study variables remains imperative in big-data settings. An advantage of multinational databases is that estimates originating from different databases to address the same research question amount to reproducibility checks of results under varying assumptions about the record-generating mechanisms and the effects of the underlying health care and social structures. At the same time, in multinational database studies, validity concerns increase in proportion to the number of databases, with the need for several valid operational definitions of the same clinical characteristic or event, to avoid propagating a systematic error on a large scale.^{Citation53,Citation56} Validation of algorithms in large secondary databases remains imperative for valid inference.^{Citation15,Citation56,Citation57} The NARA collaboration has contributed to improvement of data validity in all four participating countries through regular meetings, where differences in registration practice have been discussed. Also, through different research projects, a number of differences in data quality between registries have been pointed out and discussed, and changes in national registries have subsequently been made to achieve uniform data definition, collection, and interpretation.
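
Validation of a case-finding algorithm typically comes down to comparing the algorithm's classification with a reference standard, such as medical record review, in a sample. The sketch below computes the usual validity measures from such a 2×2 comparison; the counts are hypothetical, and in practice the estimable measures depend on how the validation sample was drawn.

```python
def validation_metrics(tp: int, fp: int, fn: int, tn: int) -> dict[str, float]:
    """Validity measures of an algorithm against a reference standard.

    tp/fp: algorithm-positive records confirmed / not confirmed by the reference.
    fn/tn: algorithm-negative records that are true cases / true non-cases.
    """
    return {
        "ppv": tp / (tp + fp),
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "npv": tn / (tn + fn),
    }


# Hypothetical validation sample: algorithm result versus medical record review.
metrics = validation_metrics(tp=180, fp=20, fn=60, tn=740)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

In a multinational study, the same table would be needed for each database's operational definition, since an algorithm that performs well in one setting may not do so in another.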

Large amounts of missing data may cause selection bias and undermine gains in precision afforded by big data, since in multiple regression models, standard statistical software removes observations with missing values. Reverse causation, immortal time bias,^Citation58 and healthy user/healthy adherer bias^Citation59 are likewise not remedied by large amounts of data and need to be addressed in big-data and small-data studies alike. On a pragmatic level, delay of data delivery and changes in coding practice present additional challenges.
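
The loss of observations under complete-case analysis compounds quickly as covariates are added. As a simple illustration, assume that each covariate is missing in 5% of records and that missingness is independent across covariates (a simplifying assumption that rarely holds exactly); the cohort size is hypothetical.

```python
def complete_case_fraction(missing_per_variable: float, n_variables: int) -> float:
    """Expected share of records with no missing value, assuming independent missingness."""
    return (1.0 - missing_per_variable) ** n_variables


N_RECORDS = 1_000_000  # hypothetical cohort size
for k in (1, 5, 10, 20):
    kept = complete_case_fraction(0.05, k)
    print(f"{k:>2} covariates: {kept:.1%} complete, "
          f"about {round(N_RECORDS * kept):,} records analyzed")
```

Under these assumptions, a model with ten such covariates would analyze only about 60% of the records, and the excluded 40% need not resemble those retained.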

Conclusion

Epidemiologic research, including database research, is an “exercise in measurement,”^Citation60 in an effort to maximize signal-to-noise ratio. The results of big data-based medical research represent a dividend to the public on its investment in the form of contribution to routine databases with data and with tax money. The advantages of big data are precision of results, including precise “null” findings, ability to address clinical questions in patient subgroups, and rapid detection of risk signals. In the Nordic countries, big data is collected and maintained by public institutions and operates in the setting of income-independent access to health care and lifelong follow-up. In other settings, such as US claims databases, demographic or economic disadvantages are better represented, while follow-up is not lifelong and health care access may be interrupted. Combining evidence from different settings and countries creates multiple-informant settings, providing built-in cross-validation and addressing a wide array of clinical questions in a single study. A formal requirement of big data is that the size, complexity, and velocity of the data are too intense for processing and interpretation with existing tools. In the Nordic settings, the volume has been available for some decades, and the variety is increasing rapidly to include data on imaging, behavior, geo-location, ecology, genetics, and patient-reported outcomes. Velocity has not yet reached the real-time update stage, but it is improving, and its value is obvious. Veracity (familiar to epidemiologists as validity) needs to be assured before data can be interpreted. The large amount of data, thus, does not eliminate and may amplify sources of systematic error. To that end, technical expertise, clinical knowledge, and underlying hypotheses are more important than ever to ensure that the signal is not drowned out by noise.

Acknowledgments

We thank Professor Olaf M Dekkers for helpful comments on the early drafts of this manuscript and Dr Astrid Lunde for providing the photo for Figure 1. This paper was funded by the Program for Clinical Research Infrastructure established by the Lundbeck Foundation and the Novo Nordisk Foundation and administered by the Danish Regions.

Disclosure

The authors report no conflicts of interest in this work.

References

1. de la Torre Díez I, Cosgaya HM, Garcia-Zapirain B, López-Coronado M. Big Data in health: a literature review from the year 2005. J Med Syst. 2016;40(9):209. PMID 27520614.
2. Andreu-Perez J, Poon CC, Merrifield RD, Wong ST, Yang GZ. Big data for health. IEEE J Biomed Health Inform. 2015;19(4):1193-1208. PMID 26173222.
3. Psaty BM, Breckenridge AM. Mini-Sentinel and regulatory science–big data rendered fit and functional. N Engl J Med. 2014;370(23):2165-2167. PMID 24897081.
4. Murdoch TB, Detsky AS. The inevitable application of big data to health care. JAMA. 2013;309(13):1351-1352. PMID 23549579.
5. Trifirò G, Coloma PM, Rijnbeek PR. Combining multiple healthcare databases for postmarketing drug and vaccine safety surveillance: why and how? J Intern Med. 2014;275(6):551-561. PMID 24635221.
6. Broniatowski DA, Paul MJ, Dredze M. Twitter: big data opportunities. Science. 2014;345(6193):148.
7. Fung IC, Tse ZT, Fu KW. Converting Big Data into public health. Science. 2015;347(6222):620.
8. Khoury MJ, Ioannidis JP. Medicine. Big data meets public health. Science. 2014;346(6213):1054-1055. PMID 25430753.
9. Lazer D, Kennedy R, King G, Vespignani A. Big data. The parable of Google Flu: traps in big data analysis. Science. 2014;343(6176):1203-1205. PMID 24626916.
10. Reardon S. US big-data health network launches aspirin study. Nature. 2014;512(7512):18. PMID 25100465.
11. Savage N. Bioinformatics: big data versus the big C. Nature. 2014;509(7502):S66-S67. PMID 24870826.
12. Sejdić E. Medicine: adapt current tools for handling big data. Nature. 2014;507(7492):306.
13. Wilson S. Data protection: big data held to privacy laws, too. Nature. 2015;519(7544):414.
14. Baro E, Degoul S, Beuscart R, Chazard E. Toward a literature-driven definition of big data in healthcare. Biomed Res Int. 2015;2015:639021. PMID 26137488.
15. Gange SJ, Golub ET. From smallpox to big data: the next 100 years of epidemiologic methods. Am J Epidemiol. 2016;183(5):423-426. PMID 26443419.
16. Gini R, Schuemie M, Brown J. Data extraction and management in networks of observational health care databases for scientific research: a comparison of EU-ADR, OMOP, Mini-Sentinel and MATRICE strategies. EGEMS. 2016;4(1):1189. PMID 27014709.
17. Hernán MA, Savitz DA. From “big epidemiology” to “colossal epidemiology”: when all eggs are in one basket. Epidemiology. 2013;24(3):344-345. PMID 23549177.
18. Toh S, Platt R. Is size the next big thing in epidemiology? Epidemiology. 2013;24(3):349-351. PMID 23549179.
19. Last JM. What is “clinical epidemiology”? J Public Health Policy. 1988;9(2):159-163. PMID 3417857.
20. Hernán MA, Robins JM. Using big data to emulate a target trial when a randomized trial is not available. Am J Epidemiol. 2016;183(8):758-764. PMID 26994063.
21. Ioannidis JP. Why most published research findings are false. PLoS Med. 2005;2(8):e124. PMID 16060722.
22. Irgens LM. The origin of registry-based medical research and care. Acta Neurol Scand Suppl. 2012;195:4-6. PMID 23278649.
23. Irgens LM, Bjerkedal T. Epidemiology of leprosy in Norway: the history of The National Leprosy Registry of Norway from 1856 until today. Int J Epidemiol. 1973;2(1):81-89. PMID 4590337.
24. Gjerstorff ML. The Danish Cancer Registry. Scand J Public Health. 2011;39(7 Suppl):42-45. PMID 21775350.
25. Schmidt M, Pedersen L, Sørensen HT. The Danish Civil Registration System as a tool in epidemiology. Eur J Epidemiol. 2014;29(8):541-549. PMID 24965263.
26. Furu K, Wettermark B, Andersen M, Martikainen JE, Almarsdottir AB, Sørensen HT. The Nordic countries as a cohort for pharmacoepidemiological research. Basic Clin Pharmacol Toxicol. 2010;106(2):86-94. PMID 19961477.
27. Sørensen HT. Regional administrative health registries as a resource in clinical epidemiology. A study of options, strengths, limitations and data quality provided with examples of use. Int J Risk Saf Med. 1997;10(1):1-22. PMID 23511270.
28. Baksaas I, Fugelli P, Halvorsen IK, Lunde PKM, Næss K. Prescription of hypotensives in general practice. Eur J Clin Pharmacol. 1978;14(5):309-317. PMID 729624.
29. FitzHenry F, Resnic FS, Robbins SL. Creating a common data model for comparative effectiveness with the observational medical outcomes partnership. Appl Clin Inform. 2015;6(3):536-547. PMID 26448797.
30. Avillach P, Coloma PM, Gini R. Harmonization process for the identification of medical events in eight European healthcare databases: the experience from the EU-ADR project. J Am Med Inform Assoc. 2013;20(1):184-192. PMID 22955495.
31. Kieler H, Artama M, Engeland A. Selective serotonin reuptake inhibitors during pregnancy and risk of persistent pulmonary hypertension in the newborn: population based cohort study from the five Nordic countries. BMJ. 2012;344:d8012. PMID 22240235.
32. Havelin LI, Fenstad AM, Salomonsson R. The Nordic Arthroplasty Register Association: a unique collaboration between 3 national hip arthroplasty registries with 280,201 THRs. Acta Orthop. 2009;80(4):393-401. PMID 19513887.
33. Robertsson O, Bizjajeva S, Fenstad AM. Knee arthroplasty in Denmark, Norway and Sweden. A pilot study from the Nordic Arthroplasty Register Association. Acta Orthop. 2010;81(1):82-89. PMID 20180723.
34. Furu K, Kieler H, Haglund B. Selective serotonin reuptake inhibitors and venlafaxine in early pregnancy and risk of birth defects: population based cohort study and sibling design. BMJ. 2015;350:h1798. PMID 25888213.
35. Stephansson O, Kieler H, Haglund B. Selective serotonin reuptake inhibitors during pregnancy and risk of stillbirth and infant mortality. JAMA. 2013;309(1):48-54. PMID 23280224.
36. Havelin LI, Robertsson O, Fenstad AM, Overgaard S, Garellick G, Furnes O. A Scandinavian experience of register collaboration: the Nordic Arthroplasty Register Association (NARA). J Bone Joint Surg Am. 2011;93(Suppl 3):13-19. PMID 22262418.
37. Pedersen AB, Mehnert F, Havelin LI. Association between fixation technique and revision risk in total hip arthroplasty patients younger than 55 years of age. Results from the Nordic Arthroplasty Register Association. Osteoarthritis Cartilage. 2014;22(5):659-667. PMID 24631923.
38. Coloma PM, Schuemie MJ, Trifirò G. Combining electronic healthcare databases in Europe to allow for large-scale drug safety monitoring: the EU-ADR Project. Pharmacoepidemiol Drug Saf. 2011;20(1):1-11. PMID 21182150.
39. Patadia VK, Coloma P, Schuemie MJ. Using real-world healthcare data for pharmacovigilance signal detection – the experience of the EU-ADR project. Expert Rev Clin Pharmacol. 2015;8(1):95-102. PMID 25487079.
40. NOMESCO. Nordic Medico-Statistical Committee (NOMESCO) Classification of Surgical Procedures. Available from: http://nowbase.org/Publikationer/~/media/Projekt%20sites/Nowbase/Publikationer/NCSP/NCSP%201_14.ashx. Accessed May 17, 2015.
41. Nordic Medico-Statistical Committee’s (NOMESCO) Classification of External Causes of Injuries (NCECI). Copenhagen: Nordic Medico-Statistical Committee; 1990. Available from: http://www.nordclass.se/ncsp_e.htm. Accessed July 27, 2011.
42. Hsing AW, Ioannidis JP. Nationwide population science: lessons from the Taiwan National Health Insurance Research Database. JAMA Intern Med. 2015;175(9):1527-1529. PMID 26192815.
43. Lo BPL, Ip H, Yang GZ. Transforming Health Care: body sensor networks, wearables, and the Internet of things. Pulse EMBS. 2016;7(1):4-8.
44. Eichstaedt JC, Schwartz HA, Kern ML. Psychological language on Twitter predicts county-level heart disease mortality. Psychol Sci. 2015;26(2):159-169. PMID 25605707.
45. Sarker A, Ginn R, Nikfarjam A. Utilizing social media data for pharmacovigilance: a review. J Biomed Inform. 2015;54:202-212. PMID 25720841.
46. Vodopivec-Jamsek V, de Jongh T, Gurol-Urganci I, Atun R, Car J. Mobile phone messaging for preventive health care. Cochrane Database Syst Rev. 2012;12:CD007457. PMID 23235643.
47. Sarfo FS, Treiber F, Jenkins C. Phone-based intervention under nurse guidance after stroke (PINGS): study protocol for a randomized controlled trial. Trials. 2016;17(1):436. PMID 27596244.
48. Ludvigsson JF, Håberg SE, Knudsen GP. Ethical aspects of registry-based research in the Nordic countries. Clin Epidemiol. 2015;7:491-508. PMID 26648756.
49. Nelson EC, Eftimovska E, Lind C, Hager A, Wasson JH, Lindblad S. Patient reported outcome measures in practice. BMJ. 2015;350:g7818. PMID 25670183.
50. Hjollund NH, Larsen LP, Biering K, Johnsen SP, Riiskjær E, Schougaard LM. Use of patient-reported outcome (PRO) measures at group and patient levels: experiences from the generic integrated PRO system, WestChronic. Interact J Med Res. 2014;3(1):e5. PMID 24518281.
51. Sørensen HT, Lash TL, Rothman KJ. Beyond randomized controlled trials: a critical comparison of trials with nonrandomized studies. Hepatology. 2006;44(5):1075-1082. PMID 17058242.
52. McClellan M. Drug safety reform at the FDA – pendulum swing or systematic improvement? N Engl J Med. 2007;356(17):1700-1702. PMID 17435081.
53. Chiolero A. Big data in epidemiology: too big to fail? Epidemiology. 2013;24(6):938-939. PMID 24077000.
54. Rothman KJ. Significance questing. Ann Intern Med. 1986;105(3):445-447. PMID 3740684.
55. Lang JM, Rothman KJ, Cann CI. That confounded P-value. Epidemiology. 1998;9(1):7-8. PMID 9430261.
56. Toh S, Platt R. Big data in epidemiology: too big to fail? Epidemiology. 2013;24(6):939.
57. Ehrenstein V, Petersen I, Smeeth L. Helping everyone do better: a call for validation studies of routinely recorded health data. Clin Epidemiol. 2016;8:49-51. PMID 27110139.
58. Suissa S. Immortal time bias in pharmacoepidemiology. Am J Epidemiol. 2008;167(4):492-499. PMID 18056625.
59. Shrank WH, Patrick AR, Brookhart MA. Healthy user and related biases in observational studies of preventive interventions: a primer for physicians. J Gen Intern Med. 2011;26(5):546-550. PMID 21203857.
60. Rothman KJ, Greenland S. Causation and causal inference in epidemiology. Am J Public Health. 2005;95(Suppl 1):S144-S150. PMID 16030331.
