459
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Deep Learning-Based Imputation Method to Enhance Crowdsourced Data on Online Business Directory Platforms for Improved Services

, &

References

  • Acia, I.; Mohamed, W.; and Abderrahim, G. Modeling the structure of social networks by using the Pythagorean Spiral. Journal of Theoretical and Applied Information Technology, 97, 6 (2019), 1856–1869
  • Anderson, C. The Long Tail: Why the Future of Business Is Selling Less of More. Hachette UK: Hachette Books, 2006.
  • Azur, M.J.; Stuart, E.A.; Frangakis, C.; and Leaf, P.J. Multiple imputation by chained equations: What is it and how does it work? International Journal of Methods in Psychiatric Research, 20, 1 (2011), 40–49.
  • Baesens, B.; Bapna, R.; Marsden, J.R.; Vanthienen, J.; and Zhao, J.L. Transformational issues of big data and analytics in networked business. MIS Quarterly, 40, 4 (2016), 807–818.
  • Bandura A. and Walters R.H. Social Learning Theory. Englewood Cliffs, NJ: Prentice-Hall, 1977.
  • Batini, C.; Cappiello, C.; Francalanci, C.; and Maurino, A. Methodologies for data quality assessment and improvement. ACM Computing Surveys, (CSUR), 41, 3 (2009), 1–52.
  • Beaulieu-Jones, B.K.; Lavage, D.R.; Snyder, J.W.; Moore, J.H.; Pendergrass, S.A.; and Bauer, C.R. Characterizing and managing missing structured data in electronic health records: Data analysis. JMIR Medical Informatics, 6, 1 (2018), e11.
  • Beaulieu-Jones, B.K.; and Moore, J.H. Missing data imputation in the electronic health record using deeply learned autoencoders. In Pacific Symposium on Biocomputing, Vol. 0 (2017), pp. 207–218.
  • Beckert, J. Institutional isomorphism revisited: Convergence and divergence in institutional change. Sociological Theory, 28, 2 (2010), 150–166.
  • Bhattacharyya, S.; Banerjee, S.; Bose, I.; and Kankanhalli, A. Temporal effects of repeated recognition and lack of recognition on online community contributions. Journal of Management Information Systems, 37, 2 (2020), 536–562.
  • Brás, L.P.; and Menezes, J.C. Improving cluster-based missing value estimation of DNA microarray data. Biomolecular Engineering, 24, 2 (2007), 273–282.
  • Cao, Z.; Wang, L.; and De Melo, G. Link prediction via subgraph embedding-based convex matrix completion. In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, (2018), pp. 2803–2810.
  • Dacin, M.T. Isomorphism in context: The power and prescription of institutional norms. Academy of Management Journal, 40, 1 (1997), 46–81.
  • Deephouse, D.L. Does isomorphism legitimate? Academy of Management Journal, 39, 4 (1996), 1024–1039.
  • Deng, Y.; Chang, C.; Ido, M.S.; and Long, Q. Multiple imputation for general missing data patterns in the presence of high-dimensional data. Scientific Reports, 6, 1 (2016), 1–10.
  • DiMaggio, P.J.; and Powell, W.W. The iron cage revisited: Institutional isomorphism and collective rationality in organizational fields. American Sociological Review, (1983), 147–160.
  • Ebrahimi, S.; Ghasemaghaei, M.; and Benbasat, I. The impact of trust and recommendation quality on adopting interactive and non-interactive recommendation agents: A meta-analysis. Journal of Management Information Systems, 39, 3 (2022), 733–764.
  • Fang, L. The effects of online review platforms on restaurant revenue, survival rate, consumer learning and welfare. (2019).
  • Fang, X.; Gao, Y.; and Hu, P.J.-H. A Prescriptive analytics method for cost reduction in clinical decision making. MIS Quarterly, 45, 1 (2021), 83–115.
  • Fang, X.; and Hu, P.J. Top persuader prediction for social networks. MIS Quarterly, 42, 1 (2018), 63–82.
  • Fang, X.; Hu, P.J.; Chau, M.; Hu, H.; Yang, Z.; and Liu, Sheng, O. A data-driven approach to measure web site navigability. Journal of Management Information Systems, 29, 2 (2012), 173–212.
  • Ghose, A.; Ipeirotis, P.G.; and Li, B. Examining the impact of ranking on consumer behavior and search engine revenue. Management Science, 60, 7 (2014), 1632–1654.
  • Ghoshal, A.; Mookerjee, V.S.; and Sarkar, S. Recommendations and cross-selling: Pricing strategies when personalizing firms cross-sell. Journal of Management Information Systems, 38, 2 (2021), 430–456.
  • Glynn, M.A.; and Abzug, R. Institutionalizing identity: Symbolic isomorphism and organizational names. Academy of Management Journal, 45, 1 (2002), 267–280.
  • Goodfellow, I., Bengio, Y., Courville, A., and Bengio, Y. Deep Learning. Cambridge, MA: MIT Press, 2016.
  • Gupta, N.; and Singh, S. Collective factorization for relational data: An evaluation on the Yelp datasets. Technical Report, (2015), Yelp Dataset Challenge, Round 4.
  • Hastie, T.; Mazumder, R.; Lee, J.D.; and Zadeh, R. Matrix completion and low-rank SVD via fast alternating least squares. Journal of Machine Learning Research, 16, 1 (2015), 3367–3402.
  • Hastie, T.; Tibshirani, R.; Sherlock, G.; Eisen, M.; Brown, P., and Botstein, D. Imputing missing data for gene expression arrays. California: Stanford University Statistics Department Technical Report, (1999).
  • Haunschild, P.R. Interorganizational imitation: The impact of interlocks on corporate acquisition activity. Administrative Science Quarterly, (1993), 564–592.
  • Haunschild, P.R., and Miner, A.S. Modes of interorganizational imitation: The effects of outcome salience and uncertainty. Administrative Science Quarterly, 42, 3 (1997), 472–500.
  • He, J.; Fang, X.; Liu, H.; and Li, X.D. Mobile app recommendation: An involvement-enhanced approach. MIS Quarterly, 43, 3 (2019), 827–849.
  • He, X.; Liao, L.; Zhang, H.; Nie, L.; Hu, X.; and Chua, T.-S. Neural collaborative filtering. In Proceedings of the 26th International Conference on World Wide Web, (2017), pp. 173–182.
  • Hinton, G.E.; and Salakhutdinov, R.R. Reducing the dimensionality of data with neural networks. Science, 313, 5786 (2006), 504–507.
  • Hu, B.; and Ester, M. Spatial topic modeling in online social media for location recommendation. In Proceedings of the 7th ACM Conference on Recommender Systems, (2013), pp. 25–32.
  • Hu, P.J.-H.; Hu, H.; and Fang, X. Examining the mediating roles of cognitive load and performance outcomes in user satisfaction with a website: A field quasi-experiment. MIS Quarterly, 41, 3 (2017), 975–987.
  • Jannach, D.; and Jugovac, M. Measuring the business value of recommender systems. ACM Transactions on Management Information Systems, 10, 4 (2019), 1–23.
  • Jin, Y.; Lee, H.C.B.; Ba, S; and Stallaert, J. Winning by learning? Effect of knowledge sharing in crowdsourcing contests. Information Systems Research, 32, 3 (2021), 836–859.
  • Kim, H.; Golub, G.H.; and Park, H. Missing value estimation for DNA microarray gene expression data: Local least squares imputation. Bioinformatics, 21, 2 (2005), 187–198.
  • Kingma, D.P.; and Ba, J. Adam: A method for stochastic optimization. ArXiv Preprint ArXiv: 1412.6980, (2014).
  • Kittur, A.; Nickerson, J.V.; Bernstein, M.; Gerber, E.; Shaw, A.; Zimmerman, J.; Lease, M.; and Horton, J. The future of crowd work. In Proceedings of the 2013 Conference on Computer Supported Cooperative Work, (2013), pp. 1301–1318.
  • Kondra, A.Z.; and Hinings, C.R. Organizational diversity and change in institutional theory. Organization Studies, 19, 5 (1998), 743–767.
  • Lall, R.; and Robinson, T. The MIDAS Touch: Accurate and scalable missing-data imputation with deep learning. Political Analysis, 30, 2 (2022), 179–196.
  • Levitt, B.; and March, J.G. Organizational learning. Annual Review of Sociology, 14, 1 (1988), 319–338.
  • Li, G.; Wang, J.; Zheng, Y.; and Franklin, M.J. Crowdsourced data management: A survey. IEEE Transactions on Knowledge and Data Engineering, 28, 9 (2016), 2296–2319.
  • Little, R., and Donald, B.R. Statistical Analysis with Missing Data. Hoboken, NJ: John Wiley & Sons, vol. 793, 2019.
  • Liu, Y.; Dillon, T.; Yu, W.; Rahayu, W.; and Mostafa F. Missing value imputation for industrial IoT sensor data with large gaps. IEEE Internet of Things Journal, 7, 8 (2020), 6855–6867.
  • Lu, J.; Wu, D.; Mao, M.; Wang, W.; and Zhang, G. Recommender system application developments: A survey. Decision Support Systems, 74 (2015), 12–32.
  • Lukyanenko, R.; and Parsons, J. Is traditional conceptual modeling becoming obsolete? Lecture Notes in Computer Science, 8217, LNCS (2013), 61–73.
  • Lukyanenko, R.; Parsons, J., and Wiersma, Y.F. The impact of conceptual modeling on dataset completeness: A field experiment. In Proceedings of the 35th International Conference on Information Systems, ICIS 2014 , 2014, pp. 1–19.
  • Lukyanenko, R.; Parsons, J.; and Wiersma, Y.F. The IQ of the crowd: Understanding and improving information quality in structured user-generated content. Information Systems Research, 25, 4 (2014), 669–689.
  • Lukyanenko, R.; Parsons, J.; Wiersma, Y.F.; and Maddah, M. Expecting the unexpected: Effects of data collection design choices on the quality of crowdsourced user-generated content. MIS Quarterly, 43, 2 (2019), 623–648.
  • March, J.G. Exploration and exploitation in organizational. Organization Science, 2, 1 (1991), 71–87.
  • Marimont, R.B.; and Shapiro, M.B. Nearest neighbour searches and the curse of dimensionality. IMA Journal of Applied Mathematics, 24, 1 (1979), 59–70.
  • Mazumder, R.; Hastie, T.; and Tibshirani, R. Spectral regularization algorithms for learning large incomplete matrices. Journal of Machine Learning Research, 11 (2010), 2287–2322.
  • Meyer, J.W.; and Rowan, B. Institutionalized organizations: Formal structure as myth and ceremony. American Journal of Sociology, 83, 2 (1977), 340–363.
  • Mizruchi, M.S.; and Fein, L.C. The social construction of organizational knowledge: A study of the uses of coercive, mimetic, and normative isomorphism. Administrative Science Quarterly, 44, 4 (1999), 653–683.
  • Nikolaeva, R. Interorganizational imitation heuristics arising from cognitive frames. Journal of Business Research, 67, 8 (2014), 1758–1765.
  • Nikulin, V. Hybrid recommender system for prediction of the Yelp users’ preferences. In Proceedings of the Industrial Conference on Data Mining, (2014), pp. 85–99.
  • Oba, S.; Sato, M.A.; Takemasa, I.; Monden, M.; Matsubara, K.I.; and Ishii, S. A Bayesian missing value estimation method for gene expression profile data. Bioinformatics, 19, 16 (2003), 2088–2096.
  • Qian, X.; Feng, H.; Zhao, G.; and Mei, T. Personalized recommendation combining user interest and social circle. IEEE Transactions on Knowledge and Data Engineering, 26, 7 (2014), 1763–1777.
  • Pfaffel, O. Clustimpute: An R Package for K-Means Clustering with Build-In Missing Data Imputation.
  • Raghunathan, T.; Lepkowski, J.; Van Hoewyk, J.; and Solenberger, P. A multivariate technique for multiply imputing missing values using a sequence of regression models. Survey Methodology, 27, 1 (2001), 85–96.
  • Reagans, R.; and McEvily, B. Network structure and knowledge transfer: The effects of cohesion and range. Administrative Science Quarterly, 48, 2 (2003), 240–267.
  • Rejeb, S.; Duveau, C.; and Rebafka, T. Self-organizing maps for exploration of partially observed data and imputation of missing values. arXiv:2202.07963. (2022).
  • Ren, X.; Yu, C.M.; Yu, W.; Yang, S.; Yang, X.; McCann, J.A.; and Philip, S.Y. LoPub: High-dimensional crowdsourced data publication with local differential privacy. IEEE Transactions on Information Forensics and Security, 13, 9 (2018), 2151–2166.
  • Rubin, D.B. Multiple Imputation for Nonresponse in Surveys. New York, NY: Wiley Series in Probability and Statistics, Wiley, 2004.
  • Scot, W., and Meyer, W. The organization of societal sector. In Organizational Environments: Ritual and Rationality. Beverly Hills: CA: Sage, 1983, pp. 129–153.
  • Scott, W.R. The adolescence of institutional theory. Administrative Science Quarterly, (1987), 493–511.
  • Silva-Ramírez, E.L.; Pino-Mejías, R.; and López-Coello, M. Single imputation with multilayer perceptron and multiple imputation combining multilayer perceptron and K-nearest neighbours for monotone patterns. Applied Soft Computing, 29 (2015), 65–74.
  • Srivastava, N.; Hinton, G.; Krizhevsky, A.; Sutskever, I.; and Salakhutdinov, R. Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 15, 1 (2014), 1929–1958.
  • Stekhoven, D.J. MissForest: Nonparametric missing value imputation using random forest. Astrophysics Source Code Library, (2015), ascl–1505.
  • Tang, Q.; Gu, B.; and Whinston, A.B. Content contribution for revenue sharing and reputation in social media: A dynamic structural model. Journal of Management Information Systems, 29, 2 (2012), 41–76.
  • Teo, H.H.; Wei, K.K.; and Benbasat, I. Predicting intention to adopt interorganizational linkages: An institutional perspective. MIS Quarterly, 27, 1 (2003), 19–49.
  • Troyanskaya, O.; Cantor, M.; Sherlock, G.; Brown, P.; Hastie, T.; Tibshirani, R.; Botstein, D.; and Altman, R.B. Missing value estimation methods for DNA microarrays. Bioinformatics, 17, 6 (2001), 520–525.
  • Tsai, C.-F.; Li, M.-L.; and Lin, W.-C. A class center based approach for missing value imputation. Knowledge-Based Systems, 151 (2018), 124–135.
  • Van Buuren, S., and Groothuis-Oudshoorn, K. Mice: Multivariate imputation by chained equations in R. Journal of Statistical Software, (2011), 1–67.
  • Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; and Polosukhin, I. Attention is all you need. Advances in Neural Information Processing Systems, 30 (2017), 5998–6008.
  • Vincent, P.; Larochelle, H.; Lajoie, I.; Bengio, Y.; Manzagol, P.A.; and Bottou, L. Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion. Journal of Machine Learning Research, 11, 12 (2010).
  • Wang, R.Y.; and Strong, D.M. Beyond accuracy: What data quality means to data consumers. Journal of Management Information Systems, 12, 4 (1996), 5–33.
  • Wells, B.J.; Chagin, K.M.; Nowacki, A.S.; and Kattan, M.W. Strategies for handling missing data in electronic health record derived data. EGEMS, 1, 3 (2013), 1035. 1–7.
  • Xu, D.; Hu, P.J.-H.; Huang, T.S.; Fang, X.; and Hsu, C.C. A deep learning–based, unsupervised method to impute missing values in electronic health records for improved patient management. Journal of Biomedical Informatics, 111, 103576 (2020).
  • Yang, C.; Bai, L.; Zhang, C.; Yuan, Q.; and Han, J. Bridging collaborative filtering and semi-supervised learning: A neural approach for POI recommendation. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, (2017), pp. 1245–1254.
  • Yang, M.; and Hyland, M. Who do firms imitate? a multilevel approach to examining sources of imitation in the choice of mergers and acquisitions. Journal of Management, 32, 3 (2006), 381–399.
  • Yin, J.; Luo, J.; and Brown, S.A. Learning from crowdsourced multi-labeling: A variational Bayesian approach. Information Systems Research, 32, 3 (2021), 752–773.
  • Zhang, S.; Zhang, J.; Zhu, X.; Qin, Y.; and Zhang, C. Missing value imputation based on data clustering. In Transactions on Computational Science I. Berlin, Heidelberg: Springer-Verlag, (2008), pp. 128–138.
  • Zhu, H.; Chen, E.; Xiong, H.; Yu, K.; Cao, H.; and Tian, J. Mining mobile user preferences for personalized context-aware recommendation. ACM Transactions on Intelligent Systems and Technology, 5, 4 (2014), 1–27.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.