4,441
Views
1
CrossRef citations to date
0
Altmetric
Research Article

The puzzle of sharing scientific data

, &

Figures & data

Figure 1. The research design: a mixed-methods approach to address RQ1and RQ2.

Figure 1. The research design: a mixed-methods approach to address RQ1and RQ2.

Table 1. Enablers and deterrents for data sharing in science according to prior literature and implications from collective action and epistemic cultures perspectives

Table 2. Factors related to data sharing

Table 3. Researcher willingness to share data for various reasons and experience of it

Figure 2. Data sharing by discipline.

The fields are ordered by their researchers’ average experience sharing data. The fields included are Maths (MATHS), Medicine and Allied Health (MED), Engineering (ENG), Materials Science (MAT SCI), Computer Science (COMP SCI), Social Sciences, Humanities & Economics (SOC HUM ECON), Chemistry (CHEM), Physics & Astronomy (PHY), Life Sciences (LIFE SCI) and Earth & Environmental Science (ENV). The responses of ‘N/A’ and ‘Neither agree nor disagree’ were left out for clarity. Width relates to sample size. + denote significant difference to at least one other field after posthoc Dunn test without adjustment (p-value<0.05). * denote fields that continue to have significant differences to at least one other field after posthoc dunn test with Holm adjustment (p-value <0.05).
Figure 2. Data sharing by discipline.

Figure 3. Data sharing outcomes across all fields.

Plots show percent responses across researchers from all fields. Asterisks denote p-value after Kruskal–Wallis test (* <0.05, ** < 0.01, ***<0.001) comparing different fields in their responses. The type of data shared is less than 100% due to researchers saying that it is not very important in their field.
Figure 3. Data sharing outcomes across all fields.

Figure 4. Comparison of data sharing among disciplines, highlighting life sciences and physics/astronomy.

For reference, we also show the mean response across all fields. For questions with 5-point Likert scale coded from 0 to 4, we divided the response by 4 for the plot. All the statistical analysis was done with the original (non-transformed) data through Kruskal–Wallis test. Asterisks denote p-value after Kruskal–Wallis test comparing the two fields of Life Science and Astrophysics in their responses. (* <0.05, ** < 0.01, ***<0.001).
Figure 4. Comparison of data sharing among disciplines, highlighting life sciences and physics/astronomy.

Table 4. Data collection sources for both Molecular Biology (MB) and High-Energy Physics (HEP)

Table 5. Theoretical progression of our analysis

Table 6. Similarities and differences observed between Reana (HEP) and open targets (MB)

Figure 5. Mechanisms that enable HEP and MB researchers to share data. The (+) show significant motivator/deterrent of data sharing from our study.

Figure 5. Mechanisms that enable HEP and MB researchers to share data. The (+) show significant motivator/deterrent of data sharing from our study.