144
Views
2
CrossRef citations to date
0
Altmetric
Computers & Computing

Feature Extraction to Filter Out Low-Quality Answers from Social Question Answering Sites

ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon

References

  • G. Blanco, R. Prez-Lpez, F. Fdez-Riverola, and A. M. G. Loureno, “Understanding the social evolution of the java community in stack overflow: A 10-year study of developer interactions,” Future Gener. Comput. Syst., Vol. 105, pp. 446–454, 2020. doi:10.1016/j.future.2019.12.021
  • J. Herrera, D. Parra, and B. Poblete, “Social QA in non-CQA platforms,” Future Gener. Comput. Syst., Vol. 105, pp. 631–649, 2020. doi:10.1016/j.future.2019.12.023
  • J. Yin, W. X. Zhao, and X. M. Li, “Type-aware question answering over knowledge base with attention-based tree-structured neural networks,” J. Comput. Sci. Technol., Vol. 32, pp. 805–813, 2017. doi:10.1007/s11390-017-1761-8
  • J. Surowiecki, M. P. Silverman, “The wisdom of crowds,” Am. J. Phys., Vol. 75, no. 2, pp. 190–192, Feb. 2007. doi:10.1119/1.2423042
  • A. Y. Chua, and S. Banerjee, “So fast so good: An analysis of answer quality and answer speed in community question-answering sites,” J. Am. Soc. Inf. Sci. Technol., Vol. 64, pp. 2058–2068, 2013. doi:10.1002/asi.22902
  • A. Y. Chua, and S. Banerjee, “Measuring the effectiveness of answers in yahoo! answers,” Online Inf. Rev., Vol. 39, pp. 104–118, 2015. doi:10.1108/OIR-10-2014-0232
  • V. Kitzie, and C. Shah, “Faster, better, or both? looking at both sides of online question-answering coin,” Proceedings of the American Society for Information Science and Technology, Vol. 48, pp. 1–4, 2011. doi:10.1002/meet.2011.14504801180
  • L. Mamykina, B. Manoim, M. Mittal, G. Hripcsak, and B. Hartmann, “Design lessons from the fastest Q&A site in the west,” in Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, 2011, pp. 2857–2866. ACM
  • T.-L. Wong, “Answering reachability queries on incrementally updated graphs by hierarchical labeling schema,” J. Comput. Sci. Technol., Vol. 31, pp. 381–399, 2016. doi:10.1007/s11390-016-1633-7
  • I. Srba, and M. Bielikova, “Why is stack overflow failing? preserving sustainability in community question answering,” IEEE Softw., Vol. 33, pp. 80–89, 2016b. doi:10.1109/MS.2016.34
  • Y. Zhang, D. Lo, X. Xia, and J.-L. Sun, “Multi-factor duplicate question detection in stack overflow,” J. Comput. Sci. Technol., Vol. 30, pp. 981–997, 2015. doi:10.1007/s11390-015-1576-4
  • B. Yang, and S. Manandhar, “Tag-based expert recommendation in community question answering,” in Advances in Social Networks Analysis and Mining (ASONAM), 2014 IEEE/ACM International Conference on, 2014, pp. 960–963. IEEE.
  • P. K. Roy, J. P. Singh, A. M. Baabdullah, H. Kizgin, and N. P. Rana, “Identifying reputation collectors in community question answering (CQA) sites: exploring the dark side of social media,” Int. J. Inf. Manage., Vol. 42, pp. 25–35, 2018b. doi:10.1016/j.ijinfomgt.2018.05.003
  • X. Wang, C. Huang, L. Yao, B. Benatallah, and M. Dong, “A survey on expert recommendation in community question answering,” J. Comput. Sci. Technol., Vol. 33, pp. 625–653, 2018. doi:10.1007/s11390-018-1845-0
  • R. Ren, H. Duan, W. Liu, and J. Liu, “Aunet: An unsupervised method for answer reliability evaluation in community QA systems,” in Asia-Pacific Web (APWeb) and Web-Age Information Management (WAIM) Joint International Conference on Web and Big Data, 2018, pp. 281–292. Springer.
  • M. A. Suryanto, E. P. Lim, A. Sun, and R. H. Chiang, “Quality-aware collaborative question answering: methods and evaluation,” in Proceedings of the Second ACM International Conference on web Search and Data Mining, 2009, pp. 142–151. ACM.
  • X. He, L. Wang, W. Zhang, and P. Zhang, “Research on the quality prediction of online Chinese question answering community answers based on comments,” in Proceedings of the 2nd International Conference on Big Data Technologies, 2019, pp. 114–120.
  • B. Qu, G. Cong, C. Li, A. Sun, and H. Chen, “An evaluation of classification models for question topic categorization,” J. Am. Soc. Inf. Sci. Technol., Vol. 63, pp. 889–903, 2012. doi:10.1002/asi.22611
  • M. J. Blooma, A. Y. Chua, and D. H.-L. Goh, “A predictive framework for retrieving the best answer,” in Proceedings of the 2008 ACM Symposium on Applied Computing, 2008, pp. 1107–1111. ACM
  • C. Chen, K. Wu, V. Srinivasan, and R. K. Bharadwaj, “The best answers? think twice: identifying commercial campaigns in the CQA forums,” J. Comput. Sci. Technol., Vol. 30, pp. 810–828, 2015. doi:10.1007/s11390-015-1562-x
  • H. Toba, Z.-Y. Ming, M. Adriani, and T.-S. Chua, “Discovering high quality answers in community question answering archives using a hierarchy of classifiers,” Inf. Sci. (Ny), Vol. 261, pp. 101–115, 2014. doi:10.1016/j.ins.2013.10.030
  • Y. Yao, H. Tong, T. Xie, L. Akoglu, F. Xu, and J. Lu, “Detecting high quality posts in community question answering sites,” Inf. Sci. (Ny), Vol. 302, pp. 70–82, 2015. doi:10.1016/j.ins.2014.12.038
  • Xin-Qi Bao, and Yun-Fang Wu, “A Tensor Neural Network with Layerwise Pretraining: Towards Effective Answer Retrieval,” Journal of Computer Science and Technology, Vol. 31, no. 6, pp. 1151–1160, 2016. http://doi.org/10.1007/s11390-016-1689-4.
  • E. Agichtein, C. Castillo, D. Donato, A. Gionis, and G. Mishne, “Finding high quality content in social media,” in Proceedings of the 2008 International Conference on web Search and Data Mining, 2008, pp. 183–194. ACM
  • M. J. Blooma, A. Y.-K. Chua, and D. H.-L. Goh, “Selection of the best answer in CQA services,” in Information Technology: New Generations (ITNG), 2010 Seventh International Conference, 2010, pp. 534–539. IEEE
  • L. Xu, J. Xiang, Y. Wang, and F. Ni, “Data-driven approach for quality evaluation on knowledge sharing platform,” in 2019 International Conference on Machine Learning and Cybernetics (ICMLC); 2019, pp. 1–6. IEEE.
  • B. M. John, A. Y.-K. Chua, and D. H.-L. Goh, “What makes a high-quality user-generated answer?,” IEEE Internet Comput., Vol. 15, pp. 66–71, 2011. doi:10.1109/MIC.2011.23
  • C. T. Lee, E. M. Rodrigues, G. Kazai, N. Milic-Frayling, and A. Ignjatovic, “Model for voter scoring and best answer selection in community Q&A services. In Web Intelligence and intelligent agent technologies, 2009. WI-IAT’09,” in IEEE/WIC/ACM International Joint Conferences on, 2009, pp. 116–123. IEEE volume 1.
  • C. Shah, and J. Pomerantz, “Evaluating and predicting answer quality in community QA,” in Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010, pp. 411–418. ACM.
  • T. P. Sahu, N. K. Nagwani, and S. Verma, “Selecting best answer: An empirical analysis on community question answering sites,” IEEE. Access., Vol. 4, pp. 4797–4808, 2016. doi:10.1109/ACCESS.2016.2600622
  • M. J. Blooma, D. Hoe-LianGoh, and A. Yeow-Kuan Chua, “Predictors of high-quality answers,” Online Inf. Rev., Vol. 36, pp. 383–400, 2012. doi:10.1108/14684521211241413
  • J. Lou, Y. Fang, K. H. Lim, and J. Z. Peng, “Contributing high quantity and quality knowledge to online CQA communities,” J. Am. Soc. Inf. Sci. Technol., Vol. 64, pp. 356–371, 2013. doi:10.1002/asi.22750
  • P. K. Roy, Z. Ahmad, J. P. Singh, M. A. A. Alryalat, N. P. Rana, and Y. K. Dwivedi, “Finding and ranking high quality answers in community question answering sites,” Global Journal of Flexible Systems Management, Vol. 19, pp. 53–68, 2018a. doi:10.1007/s40171-017-0172-6
  • D. Hoogeveen, A. Bennett, Y. Li, K. M. Verspoor, and T. Baldwin, “Detecting mis-flagged duplicate questions in community question answering archives,” in Tvwelfth International AAAI Conference on Web and Social Media, 2018, pp. 112–120.
  • O. Kucuktunc, B. B. Cambazoglu, I. Weber, and H. Ferhatosmanoglu, “A large-scale sentiment analysis for yahoo! answers,” in Proceedings of the Fifth ACM International Conference on Web Search and Data Mining, 2012, pp. 633–642.
  • M. Li, Y. Li, Q. Peng, and J. Wang, “A hybrid MCDM model combining DANP with TODIM to evaluate the information quality of community question answering in a two-dimensional linguistic environment,” Expert Syst., Vol. 38, no. 2, pp. e12619, 2021.
  • D. Elalfy, W. Gad, and R. Ismail, “A hybrid model to predict best answers in question answering communities,” Egyptian Informatics Journal, Vol. 19, no. 1, pp. 21–31, 2018. doi:10.1016/j.eij.2017.06.002
  • Y. Zhang, M. Zhang, N. Luo, Y. Wang, and T. Niu, “Understanding the formation mechanism of high-quality knowledge in social question and answer communities: A knowledge co-creation perspective,” Int. J. Inf. Manage., Vol. 48, pp. 72–84, 2019. doi:10.1016/j.ijinfomgt.2019.01.022
  • D. Palomera, and A. Figueroa, “Leveraging linguistic traits and semi-supervised learning to single out informational content across how-to community question-answering archives,” Inf. Sci. (Ny), Vol. 381, pp. 20–32, 2017. doi:10.1016/j.ins.2016.11.006
  • J. E. Van Engelen, and H. H. Hoos, “A survey on semi-supervised learning,” Mach. Learn., Vol. 109, no. 2, pp. 373–440, 2019.
  • H. Fu, and S. Oh, “Quality assessment of answers with user-identified criteria and data-driven features in social Q&A,” Inf. Process. Manag., Vol. 56, no. 1, pp. 14–28, 2019. doi:10.1016/j.ipm.2018.08.007
  • A. Tang, P. Ren, and Z. Sun, “Multi-feature based question–answerer model matching for predicting response time in CQA,” Knowl. Based. Syst., Vol. 182, pp. 104794, 2019. doi:10.1016/j.knosys.2019.06.002
  • O. Chergui, A. Begdouri, and D. Groux-Leclet, “Integrating a Bayesian semantic similarity approach into CBR for knowledge reuse in community question answering,” Knowl. Based. Syst., Vol. 185, pp. 104919, 2019. doi:10.1016/j.knosys.2019.104919
  • Q. Tian, P. Zhang, and B. Li, “Towards predicting the best answers in community-based question-answering services,” in ICWSM, 2013, pp. 725–728.
  • I. Rish, “An empirical study of the naive Bayes classifier,” in IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence, 2001, pp. 41–46. IBM volume 3.
  • J. H. Friedman, “Greedy function approximation: a gradient boosting machine,” Ann. Stat., Vol. 29, no. 5, pp. 1189–1232, 2001.
  • L. Breiman, “Random forests,” Mach. Learn., Vol. 45, pp. 5–32, 2001. doi:10.1023/A:1010933404324
  • N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, “Smote: synthetic minority over-sampling technique,” J. Artif. Intell. Res., Vol. 16, pp. 321–357, 2002. doi:10.1613/jair.953
  • H. He, Y. Bai, E. A. Garcia, and S. Li, “ADASYN: Adaptive synthetic sampling approach for imbalanced learning,” in 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence), 2008, pp. 1322–1328. IEEE.
  • J. Davis, and M. Goadrich, “The relationship between precision-recall and roc curves,” in Proceedings of the 23rd International Conference on Machine Learning ICML ‘06, 2006, pp. 233–240. ACM
  • L. Ponzanelli, A. Mocci, A. Bacchelli, M. Lanza, and D. Fullerton, “Improving low quality stack overflow post detection,” in Software Maintenance and Evolution (ICSME), 2014 IEEE International Conference on, 2014, pp. 541–544. IEEE.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.