2,269
Views
5
CrossRef citations to date
0
Altmetric
Original Article

Analyzing Behavioral Big Data: Methodological, practical, ethical, and moral issues

References

  • Adar, E. 2015. The two cultures and big data research. A Journal of Law and Policy for the Information Society, 10 (3):765–781.
  • Agarwal, D. K. and B.-C. Chen. 2016. Statistical Methods for Recommender Systems. New York, NY: Cambridge University Press.
  • Allen, G. N., D. L. Burk, and G. B. Davis. 2006. Academic data collection in electronic environments: Defining acceptable use of internet resources. MIS Quarterly 30 (3):599–610.
  • Armistead, T. 2014. Resurrecting the third variable: Critique of pearl’s causal analysis of simpson’s paradox. The American Statistician 68 (2):1–7.
  • Bapna, R., J. Ramaprasad, G. Shmueli, and A. Umyarov. 2016. One-way mirrors in online dating: A randomized field experiment. Management Science. http://dx.doi.org/10.1287/mnsc.2015.2301
  • Bapna, R. and A. Umyarov. 2015. Do your online friends make you pay? a randomized field experiment on peer influence in online social networks. Management Science 61 (8):1902–1920.
  • Bell, R. M., Y. Koren, and C. Volinsky. 2010. All together now: A perspective on the netflix prize. Chance 23 (1):24–29.
  • Belo, R., P. Ferreira, and R. Telang. 2013. Broadband in school: Impact on student performance. Management Science 60 (2):265–282.
  • Cascavilla, G., M. Conti, D. G. Schwartz, and I. Yahav. 2015. Revealing censored information through comments and commenters in online social networks. In Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015, ASONAM ’15, pages 675–680, New York, NY, USA. ACM.
  • Chetty, R., J. N. Friedman, and J. E. Rockoff. 2014. Measuring the impacts of teachers ii: Teacher value-added and student outcomes in adulthood. American Economic Review 104 (9):2633–2679.
  • Dhar, V., T. Geva, G. Oestreicher-Singer, and A. Sundararajan. 2014. Prediction in economic networks. Information Systems Research 25 (2):264–284.
  • Eggers, D. 2013. The Circle. New York, NY: Random House LLC.
  • Fawcett, T. 2016. Mining the quantified self: Personal knowledge discovery as a challenge for data science. Big Data 3 (4):249–266.
  • Fienberg, S. 2006. Privacy and confidentiality in an e-commerce world: Data mining, data warehousing, matching and disclosure limitation. Statistical Science 21 (2):143–154.
  • Fienberg, S. 2011. The challenge of privacy protection for statistical network data. In Proceedings of the 58th World Statistics Congress 2011.
  • Fienberg, S. 2015. The promise and perils of big data for statistical inference. Israel Statistical Association Annual Meeting.
  • Gao, G., B. N. Greenwood, J. McCullough, and R. Agarwal. 2014. Vocal minority and silent majority: How do online ratings reflect population perceptions of quality?. MIS Quarterly 39 (3):565–589.
  • Glymour, C., D. Madigan, D. Pregibon, and P. Smyth. 1997. Statistical themes and lessons for data mining. Data Mining and Knowledge Discovery, 1:11–28.
  • Hardoon, D. R. and G. Shmueli. 2013. Getting Started With Business Analytics: Insightful Decision Making. Boca Raton, FL: Taylor & Francis.
  • Heckman, J. 1979. Sample selection bias as a specification error. Econometrica 47 (1):153–161.
  • Hill, S., F. Provost, and C. Volinsky. 2006. Network-based marketing: Identifying likely adopters via consumer networks. Statistical Science 21 (2):256–276.
  • Hinz, O., M. Spann, and I.-H. Hahn. 2015. Can’t buy me love...or can i? social capital attainment through conspicuous consumption in virtual environments. Information Systems Research 26 (4):849–870.
  • Hoerl, R., R. Snee, and R. De Veaux. 2014. Applying statistical thinking to “big data” problems. WIREs Computer Statistics, 6:222–232.
  • Junque de Fortuny, E., D. Martens, and F. Provost. 2014. Predictive modeling with big data: Is bigger really better? Big Data 1 (4):215–226.
  • Keiding, N. and T. A. Louis. 2016. Perils and potentials of self-selected entry to epidemiological studies and surveys. Journal of the Royal Statistical Society, Series A 179 (2):1–28.
  • Kenett, R. 2015. Statistics: A life cycle view. Quality Engineering, 27:111–121.
  • Kenett, R. S. and G. Shmueli. 2014. On information quality. Journal of the Royal Statistical Society, Series A 177 (1):3–38.
  • Kenett, R. S. and G. Shmueli. 2016. Information Quality: The Potential of Data and Analytics to Generate Knowledge. John Wiley and Sons, Inc., Chichester, UK.
  • Kramer, A., J. Guillory, and J. T. Hancock. 2014. Experimental evidence of massive-scale emotional contagion through social networks. Proceedings of the National Academies of Sciences 111 (24):8788–8790.
  • Lambert, D. and D. Pregibon. 2007. More bang for their bucks: Assessing new features for online advertisers. In Proceedings of the 1st International Workshop on Data Mining and Audience Intelligence for Advertising, ADKDD ’07, pages 7–15, New York, NY, USA. ACM.
  • Lin, M., H. Lucas Jr, and G. Shmueli. 2013. Too big to fail: Large samples and the p-value problem. Information Systems Research 24 (4):906–917.
  • Marcus, J. 2014. Amazonia. Five years at the epicentre of the dot-com juggernaut. NY: The New Press.
  • Mason, W. and S. Suri. 2012. Conducting behavioral research on amazon’s mechanical turk. Behavior Research 44 (1):647–651.
  • Moussa, M. 2015. Monitoring employee behavior through the use of technology and issues of employee privacy in america. SAGE Open. http://dx.doi.org/10.1177/2158244015580168
  • Muchnik, L., S. Aral, and S. Taylor. 2014. Social influence bias: A randomized experiment. Science 341 (6146):647–651.
  • Rosenbaum, P. and D. Rubin. 1983. The central role of the propensity score in observational studies for causal effects. Biometrika 70 (1):41–55.
  • Schwartz, D., I. Yahav, and G. Silverman. 2016. News censorship in online social networks: A study of circumvention in the commentsphere. Journal of the Association for Information Science and Technology, forthcoming. doi: 10.1002/asi.23698
  • Shmueli, G. 2010. To explain or to predict? Statistical Science 25 (3):289–310.
  • Shmueli, G., P. C. Bruce, and N. R. Patel. 2016. Data Mining for Business Analytics: Concepts, Techniques, and Applications with XLMiner. 3rd edit., Hoboken, NJ: Wiley & Sons.
  • Shmueli, G. and O. Koppius. 2011. Predictive analytics in information systems research. MIS Quarterly 35 (3):553–572.
  • Shmueli, G. and I. Yahav. 2014. The forest or the trees? tackling simpson’s paradox with classification and regression trees. Indian School of Business Research Paper Series. Available at SSRN: http://ssrn.com/abstract=2392953
  • Steinberg, D. M. 2016. Industrial statistics: The challenges and the research. Quality Engineering 28 (1):45–59.
  • Watts, D. J. 2013. Computational social science: Exciting progress and future directions. The Bridge on Frontiers of Engineering 43 (4):5–10.
  • Xiao, B. and I. Benbasat. 2015. Designing warning messages for detecting biased online product recommendations: An empirical investigation. MIS Quarterly 26 (4):793–811.
  • Yahav, I., G. Shmueli, and D. Mani. 2016. A tree-based approach for addressing self-selection in impact studies with big data. MIS Quarterly, forthcoming. doi: 10.2139/ssrn.2276770

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.