How could the Quora bookmark be improved?

Quora Question Answer Dataset

tip

Access other chapters in this book by swiping

2017 | OriginalPaper | Book chapter

Abstract

We report on a progressing work for compiling Quora Question Answer dataset. Quora dataset is composed of questions which are posed in Quora Question Answering site. It is the only dataset which provides sentence-level and word-level answers at the same time. Moreover, the questions in the dataset are authentic which is much more realistic for Question Answering systems. We test the performance of a state-of-the-art Question Answering system on the dataset and compare it with human performance to establish an upper bound.

Would you like to get access to this content? Then find out more about our products now:

Springer Professional "Business + Technology"

With Springer Professional "Business + Technology" you get access to:

  • above 69,000 books
  • above 500 magazines

from the following fields:

  • Automobile + engines
  • Construction + real estate
  • Business IT + informatics
  • Electrical engineering + electronics
  • Energy + environment
  • Finance + Banking
  • Management + leadership
  • Marketing + sales
  • Mechanical engineering + materials
  • Insurance + risk

Try it now for 30 days free of charge.

Springer Professional "Technology"

With Springer Professional "Technology" you get access to:

  • above 50,000 books
  • above 380 magazines

from the following fields:

  • Automobile + engines
  • Construction + real estate
  • Business IT + informatics
  • Electrical engineering + electronics
  • Energy + environment
  • Mechanical engineering + materials



Try it now for 30 days free of charge.

Springer Professional "Economy"

With Springer Professional "Economy" you get access to:

  • above 58,000 books
  • above 300 magazines

from the following fields:

  • Construction + real estate
  • Business IT + informatics
  • Finance + Banking
  • Management + leadership
  • Marketing + sales
  • Insurance + risk



Try now for 30 days free of charge.

Footnotes
123
literature
Go back to reference Bollacker, K., Tufts, P., Pierce, T., Robert, C .: A platform for scalable, collaborative, structured information integration. In: Proceedings of the Sixth International Workshop on Information Integration on the Web (2007) Bollacker, K., Tufts, P., Pierce, T., Robert, C .: A platform for scalable, collaborative, structured information integration. In: Proceedings of the Sixth International Workshop on Information Integration on the Web (2007)
Go back to reference Bordes, A., Usunier, N., Chopra, S., Weston, J .: Large-scale simple question answering with memory networks. arxiv: 1506. 02075 (2015) Bordes, A., Usunier, N., Chopra, S., Weston, J .: Large-scale simple question answering with memory networks. arxiv: 1506. 02075 (2015)
Go back to reference Hermann, K.M., Kocisky, T., Grefenstette, E., Espeholt, L., Kay, W., Suleyman, M., Blunsom, P .: Teaching machines to read and comprehend. In: Advances in Neural Information Processing Systems (2015) Hermann, KM, Kocisky, T., Grefenstette, E., Espeholt, L., Kay, W., Suleyman, M., Blunsom, P .: Teaching machines to read and comprehend. In: Advances in Neural Information Processing Systems (2015)
Go back to reference Hill, F., Bordes, A., Chopra, S., Weston, J .: The goldilocks principle: reading children’s books with explicit memory representations. arxiv: 1511. 02301 (2015) Hill, F., Bordes, A., Chopra, S., Weston, J .: The goldilocks principle: reading children’s books with explicit memory representations. arxiv: 1511.02301 (2015)
Go back to reference Kadlec, R., Schmid, M., Bajgar, O., Kleindienst, J .: Text understanding with the attention sum reader network. In: Proceedings of the Association for Computational Linguistics (2016) Kadlec, R., Schmid, M., Bajgar, O., Kleindienst, J .: Text understanding with the attention sum reader network. In: Proceedings of the Association for Computational Linguistics (2016)
Go back to reference Rajpurkar, P., Zhang, J., Lopyrev, K., Liang, P .: Squad: 100,000+ questions for machine comprehension of text. arxiv: 1606.05250 (2016) Rajpurkar, P., Zhang, J., Lopyrev, K., Liang, P .: Squad: 100,000+ questions for machine comprehension of text. arxiv: 1606.05250 (2016)
Rao, J., He, H., Lin, J .: Noise-contrastive estimation for answer selection with deep neural networks. In: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management (2016) Rao, J., He, H., Lin, J .: Noise-contrastive estimation for answer selection with deep neural networks. In: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management (2016)
Go back to reference Richardson, M., Burges, J.C., C., Erin, R .: MCTest: a challenge dataset for the open-domain machine comprehension of text. In: Empirical Methods in Natural Language Processing (EMNLP) (2013) Richardson, M., Burges, J.C., C., Erin, R .: MCTest: a challenge dataset for the open-domain machine comprehension of text. In: Empirical Methods in Natural Language Processing (EMNLP) (2013)
Go back to reference Santos, C.D., Tan, M., Xiang, B., Zhou, B .: Attentive pooling networks. arXiv: 1602.03609v1 (2016) Santos, C.D., Tan, M., Xiang, B., Zhou, B .: Attentive pooling networks. arXiv: 1602.03609v1 (2016)
Go back to reference Yang, Y., Yih, S.W.T., Meek, C .: WikiQA: a challenge dataset for open-domain question answering. In: Empirical Methods in Natural Language Processing (EMNLP) (2015) Yang, Y., Yih, S.W.T., Meek, C .: WikiQA: a challenge dataset for open-domain question answering. In: Empirical Methods in Natural Language Processing (EMNLP) (2015)
About this chapter
title
Quora Question Answer Dataset
book
Text, Speech, and Dialogue

Print ISBN: 978-3-319-64205-5

Electronic ISBN: 978-3-319-64206-2

Copyright Year: 2017

https://doi.org/10.1007/978-3-319-64206-2

DOI
https://doi.org/10.1007/978-3-319-64206-2_8
Author:
Ahmad Aghaebrahimian

premium partner