What is Quora 112

Multilayer Convolutional Neural Network to Filter Low Quality Content from Quora

tip

Call up other articles in this issue by swiping

06/21/2020 | Issue 1/2020

Magazine:
Neural Processing Letters> Issue 1/2020
Author:
Pradeep Kumar Roy

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Abstract

Question answering (QA) websites now play a crucial role in meeting Internet users ’information needs. Quora is a growing QA platform where users get quick answers to their questions from their peers. Nonetheless, it is noted that a significant number of questions remained unanswered for a long time. Questions that have long been unable to receive any answer, opinion-based, need a debate to get the answers, or a valid answer does not exist, fall under Insincere question group. It is therefore important to weed out Insincere questions in order to maintain the integrity of the site. Quora have a huge number of such questions that can not be filtered manually. To overcome this problem, this paper proposes a multi-layer convolutional neural network model that helps to minimize insincere questions from the website. Two embeddings were created from Quora dataset: (i) using Skipgram, and (ii) using Continuous Bag of Word model. The created embeddings and a pre-trained glove embedding vector were used for system development. The proposed model needs only the question text to predict the question is Insincere question or not and hence free from manual feature engineering. The experimental results indicated that the proposed multilayer CNN model outperforming over the earlier works by achieving the F1 score of 0.98 for the best case.

Would you like to get access to this content? Then find out more about our products now:

Springer Professional "Business + Technology"

With Springer Professional "Business + Technology" you get access to:

  • above 69,000 books
  • above 500 magazines

from the following fields:

  • Automobile + engines
  • Construction + real estate
  • Business IT + informatics
  • Electrical engineering + electronics
  • Energy + environment
  • Finance + Banking
  • Management + leadership
  • Marketing + sales
  • Mechanical engineering + materials
  • Insurance + risk

Try now for 30 days free of charge.

Springer Professional "Technology"

With Springer Professional "Technology" you get access to:

  • above 50,000 books
  • above 380 magazines

from the following fields:

  • Automobile + engines
  • Construction + real estate
  • Business IT + informatics
  • Electrical engineering + electronics
  • Energy + environment
  • Mechanical engineering + materials



Try now for 30 days free of charge.

Springer Professional "Economy"

With Springer Professional "Economy" you get access to:

  • above 58,000 books
  • above 300 magazines

from the following fields:

  • Construction + real estate
  • Business IT + informatics
  • Finance + Banking
  • Management + leadership
  • Marketing + sales
  • Insurance + risk



Try now for 30 days free of charge.

literature
Go back to reference Anderson A, Huttenlocher D, Kleinberg J, Leskovec J (2012) Discovering value from community activity on focused question answering sites: a case study of stack overflow. In: Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining, ACM, pp 850–858 Anderson A, Huttenlocher D, Kleinberg J, Leskovec J (2012) Discovering value from community activity on focused question answering sites: a case study of stack overflow. In: Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining, ACM, pp 850–858
Go back to reference Tian Y, Kochhar PS, Lim EP, Zhu F, Lo D (2013) Predicting best answerers for new questions: an approach leveraging topic modeling and collaborative voting. In: Workshops at the international conference on social informatics, Springer, Berlin, pp 55–68 Tian Y, Kochhar PS, Lim EP, Zhu F, Lo D (2013) Predicting best answerers for new questions: an approach leveraging topic modeling and collaborative voting. In: Workshops at the international conference on social informatics, Springer, Berlin, pp 55–68
Go back to reference Wang G, Gill K, Mohanlal M, Zheng H, Zhao BY (2013) Wisdom in the social crowd: an analysis of quora. In: 22nd international world wide web conference, WWW '13, Rio de Janeiro, Brazil, May 13–17, 2013, pp 1341–1352 Wang G, Gill K, Mohanlal M, Zheng H, Zhao BY (2013) Wisdom in the social crowd: an analysis of quora. In: 22nd international world wide web conference, WWW ’13, Rio de Janeiro, Brazil, May 13–17, 2013, pp 1341–1352
Go back to reference Ahasanuzzaman M, Asaduzzaman M, Roy CK, Schneider KA (2016) Mining duplicate questions of stack overflow. In: IEEE / ACM 13th working conference on mining software repositories (MSR), 2016, IEEE, pp 402-412 Ahasanuzzaman M, Asaduzzaman M, Roy CK, Schneider KA (2016) Mining duplicate questions of stack overflow. In: IEEE / ACM 13th working conference on mining software repositories (MSR), 2016, IEEE, pp 402-412
Go back to reference Hoogeveen D, Bennett A, Li Y, Verspoor KM, Baldwin T (2018) Detecting misflagged duplicate questions in community question-answering archives. In: ICWSM, pp 112-120 Hoogeveen D, Bennett A, Li Y, Verspoor KM, Baldwin T (2018) Detecting misflagged duplicate questions in community question-answering archives. In: ICWSM, pp 112-120
Go back to reference Collobert R, Weston J, Bottou L, Karlen M, Kavukcuoglu K, Kuksa P (2011) Natural language processing (almost) from scratch. J Mach Learn Res 12 (Aug): 2493-2537 MATH Collobert R, Weston J, Bottou L, Karlen M, Kavukcuoglu K, Kuksa P (2011) Natural language processing (almost) from scratch. J Mach Learn Res 12 (Aug): 2493-2537 MATH
Go back to reference Roy PK, Ahmad Z, Singh JP, Alryalat MAA, Rana NP, Dwivedi YK (2018) Finding and ranking high-quality answers in community question answering sites. Glob J Flex Syst Manag 19 (1): 53-68 Roy PK, Ahmad Z, Singh JP, Alryalat MAA, Rana NP, Dwivedi YK (2018) Finding and ranking high-quality answers in community question answering sites. Glob J Flex Syst Manag 19 (1): 53-68
Wang XJ, Tu X, Feng D, Zhang L (2009) Ranking community answers by modeling question – answer relationships via analogical reasoning. In: Proceedings of the 32nd international ACM SIGIR conference on research and development in information retrieval, ACM, pp 179–186 Wang XJ, Tu X, Feng D, Zhang L (2009) Ranking community answers by modeling question – answer relationships via analogical reasoning . In: Proceedings of the 32nd international ACM SIGIR conference on research and development in information retrieval, ACM, pp 179-186
Go back to reference Blooma MJ, Chua AYK, Goh DHL (2010) Selection of the best answer in CQA services. In: Seventh international conference on information technology: new generations (ITNG), 2010, IEEE, pp 534-539 Blooma MJ, Chua AYK, Goh DHL (2010) Selection of the best answer in CQA services. In: Seventh international conference on information technology: new generations (ITNG), 2010, IEEE, pp 534-539
Go back to reference Abishek K, Hariharan BR, Valliyammai C (2019) An enhanced deep learning model for duplicate question pairs recognition. In: Soft computing in data analytics, Springer, Berlin, pp 769-777 Abishek K, Hariharan BR, Valliyammai C (2019) An enhanced deep learning model for duplicate question pairs recognition. In: Soft computing in data analytics, Springer, Berlin, pp 769-777
Go back to reference Saedi C, Rodrigues J, Silva J, Branco A, Maraev V (2017) Learning profiles in duplicate question detection. In: 2017 IEEE international conference on information reuse and integration (IRI), pp 544-550. https: // doi. org / 10. 1109 / IRI. 2017. 39 Saedi C, Rodrigues J, Silva J, Branco A, Maraev V (2017) Learning profiles in duplicate question detection. In: 2017 IEEE international conference on information reuse and integration (IRI), pp 544-550. https: // doi. org / 10. 1109 / IRI. 2017. 39
Go back to reference Dror G, Maarek Y, Szpektor I (2013) Will my question be answered? predicting “question answerability” in community question-answering sites. In: Blockeel H, Kersting K, Nijssen S, Železný F (eds) Machine learning and knowledge discovery in databases. Springer, Heidelberg, pp 499-514 Dror G, Maarek Y, Szpektor I (2013) Will my question be answered? predicting “question answerability” in community question-answering sites. In: Blockeel H, Kersting K, Nijssen S, Železný F (eds) Machine learning and knowledge discovery in databases. Springer, Heidelberg, pp 499-514
Go back to reference Yang L, Bao S, Lin Q, Wu X, Han D, Su Z, Yu Y (2011) Analyzing and predicting not-answered questions in community-based question answering services. In: AAAI, pp 1273-1278 Yang L, Bao S, Lin Q, Wu X, Han D, Su Z, Yu Y (2011) Analyzing and predicting not-answered questions in community-based question answering services. In: AAAI, pp 1273-1278
Go back to reference Wang G, Gill K, Mohanlal M, Zheng H, Zhao BY (2013) Wisdom in the social crowd: an analysis of quora. In: Proceedings of the 22nd international conference on world wide web, ACM, pp 1341–1352 Wang G, Gill K, Mohanlal M, Zheng H, Zhao BY (2013) Wisdom in the social crowd: an analysis of quora. In: Proceedings of the 22nd international conference on world wide web, ACM, pp 1341-1352
Go back to reference Singh JP, Irani S, Rana NP, Dwivedi YK, Saumya S, Roy PK (2017) Predicting the “helpfulness” of online consumer reviews. J Bus Res 70: 346-355 Singh JP, Irani S, Rana NP, Dwivedi YK, Saumya S, Roy PK (2017) Predicting the “helpfulness” of online consumer reviews. J Bus Res 70: 346-355
Go back to reference Wen S, Liu W, Yang Y, Zhou P, Guo Z, Yan Z, Chen Y, Huang T (2020) Multilabel image classification via feature / label co-projection. IEEE Trans Syst Man Cybern Syst. https: // doi. org / 10. 1109 / TSMC. 2020. 2967071 Wen S, Liu W, Yang Y, Zhou P, Guo Z, Yan Z, Chen Y, Huang T (2020) Multilabel image classification via feature / label co-projection. IEEE Trans Syst Man Cybern Syst. https: // doi. org / 10. 1109 / TSMC. 2020. 2967071
Go back to reference Wen S, Dong M, Yang Y, Zhou P, Huang T, Chen Y (2019a) End-to-end detection-segmentation system for face labeling. IEEE Trans Emerg Top Comput Intell 1–11 Wen S, Dong M, Yang Y, Zhou P, Huang T, Chen Y (2019a) End-to-end detection-segmentation system for face labeling. IEEE Trans Emerg Top Comput Intell 1–11
Go back to reference Wen S, Wei H, Yan Z, Guo Z, Yang Y, Huang T, Chen Y (2019b) Memristor-based design of sparse compact convolutional neural network. IEEE Transactions on Network Science and Engineering pp 1–11 Wen S, Wei H, Yan Z, Guo Z, Yang Y, Huang T, Chen Y (2019b) Memristor-based design of sparse compact convolutional neural network. IEEE Transactions on Network Science and Engineering pp 1-11
Go back to reference Yan Z, Piramuthu R, Jagadeesh V, Di W, Decoste D (2019) Hierarchical deep convolutional neural network for image classification. US Patent 10,387,773 Yan Z, Piramuthu R, Jagadeesh V, Di W, Decoste D (2019) Hierarchical deep convolutional neural network for image classification. U.S. Patent 10,387,773
Go back to reference Ponzanelli L, Mocci A, Bacchelli A, Lanza M, Fullerton D (2014) Improving low quality stack overflow post detection. In: IEEE international conference on software maintenance and evolution (ICSME), 2014, IEEE, pp 541-544 Ponzanelli L, Mocci A, Bacchelli A, Lanza M, Fullerton D (2014) Improving low quality stack overflow post detection. In: IEEE international conference on software maintenance and evolution (ICSME), 2014, IEEE, pp 541-544
Go back to reference Mizobuchi Y, Takayama K (2017) Two improvements to detect duplicates in stack overflow. In: IEEE 24th international conference on software analysis, evolution and reengineering (SANER), 2017, IEEE, pp 563-564 Mizobuchi Y, Takayama K (2017) Two improvements to detect duplicates in stack overflow. In: IEEE 24th international conference on software analysis, evolution and reengineering (SANER), 2017, IEEE, pp 563-564
Go back to reference Zhang WE, Sheng QZ, Lau JH, Abebe E (2017a) Detecting duplicate posts in programming qa communities via latent semantics and association rules. In: Proceedings of the 26th international conference on world wide web, international world wide web conferences steering committee, pp 1221–1229 Zhang WE, Sheng QZ, Lau JH, Abebe E (2017a) Detecting duplicate posts in programming qa communities via latent semantics and association rules. In: Proceedings of the 26th international conference on world wide web, international world wide web conferences steering committee, pp 1221-1229
Go back to reference Zhang WE, Sheng QZ, Shu Y, Nguyen VK (2017b) Feature analysis for duplicate detection in programming qa communities. In: International conference on advanced data mining and applications, Springer, Berlin, pp 623–638 Zhang WE, Sheng QZ, Shu Y, Nguyen VK (2017b) Feature analysis for duplicate detection in programming qa communities. In: International conference on advanced data mining and applications, Springer, Berlin, pp 623–638
About this article
title
Multilayer Convolutional Neural Network to Filter Low Quality Content from Quora
Publication date
21.06.2020
DOI
https://doi.org/10.1007/s11063-020-10284-x