International journal of

ADVANCED AND APPLIED SCIENCES

EISSN: 2313-3724, Print ISSN:2313-626X

Frequency: 12

line decor
  
line decor

 Volume 6, Issue 8 (August 2019), Pages: 23-31

----------------------------------------------

 Original Research Paper

 Title: Automatic classification of product reviews into interrogative and noninterrogative: Generating real time answer

 Author(s): Sheikh Muhammad Saqib 1, *, Fazal Masud Kundi 1, Shakeel Ahmad 2, Tariq Naeem 1

 Affiliation(s):

 1Institute of Computing and Information Technology, Gomal University, Dera Ismail Khan, Pakistan
 2Faculty of Computing and Information Technology in Rabigh (FCITR), King Abdul Aziz University (KAU), Jeddah, Saudi Arabia

  Full Text - PDF          XML

 * Corresponding Author. 

  Corresponding author's ORCID profile: https://orcid.org/0000-0002-4647-1698

 Digital Object Identifier: 

 https://doi.org/10.21833/ijaas.2019.08.004

 Abstract:

Posted reviews on the relevant webpages about a product not only motivate the company to enhance quality but also it helps users to decide in favor of (or against) purchasing the product. These reviews are classified by different researchers through subjectivity based, entity based, or aspect based to find the polarity using the supervised or unsupervised technique. However, classification based on interrogatives and non-interrogatives is not handled yet. Datasets of interrogatives are analyzed as identifying Answer Seeking questions from Arabic tweets, question conveying and not conveying Information, Rhetorical Questions while here classifying the sentences into interrogatives and non-interrogatives is the preliminary step, which is a core contribution of proposed work. If detected questions are answered and moreover real time, it could not only motivate a user positively to buy the product but also users feel full duplex communication. In this work, we formulated this problem proposing linguistic and heuristic rules that automatically senses the interrogative and answer promptly based on the aforementioned aspect. If there is no aspect in an asked question, then LSI (Latent Semantic Indexing) generate answer using classified non-interrogatives. LSI is an efficient information retrieval algorithm, which finds the closest document to a given query. Experimental results using two publically available datasets show a precision of 95% and 96% which has 10% increased performance than alternatives machine learning methods Meta Filtered Classifier and Naive Bayes. 

 © 2019 The Authors. Published by IASE.

 This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

 Keywords: Sentiment classification, Preprocessing, Text mining, Sentiment orientation

 Article History: Received 12 March 2019, Received in revised form 3 June 2019, Accepted 4 June 2019

 Acknowledgement:

No Acknowledgement.

 Compliance with ethical standards

 Conflict of interest:  The authors declare that they have no conflict of interest.

 Citation:

 Saqib SM, Kundi FM, and Ahmad S et al. (2019). Automatic classification of product reviews into interrogative and noninterrogative: Generating real time answer. International Journal of Advanced and Applied Sciences, 6(8): 23-31

 Permanent Link to this page

 Figures

 Fig. 1 Fig. 2

 Tables

 Table 1 Table 2 Table 3 Table 4 Table 5 Table 6 Table 7 Table 8 Table 9 Table 10

----------------------------------------------

 References (48) 

  1. Ahmad S, Saqib SM, Almagrabi AO, and Alotaibi FM (2017). LSI based search technique: Using extracted keywords and key-sentences. VAWKUM Transactions on Computer Sciences, 14(2): 1-8. https://doi.org/10.21015/vtcs.v14i2.471   [Google Scholar]
  2. Alghunaim A (2015). A vector space approach for aspect-based sentiment analysis. Ph.D. Dissertation, Massachusetts Institute of Technology, Cambridge, USA. https://doi.org/10.3115/v1/W15-1516   [Google Scholar]
  3. Archak N, Ghose A, and Ipeirotis PG (2007). Show me the money: Deriving the pricing power of product features by mining consumer reviews. In the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, San Jose, USA: 56-65. https://doi.org/10.1145/1281192.1281202   [Google Scholar]
  4. Batra S and Rao D (2010). Entity based sentiment analysis on twitter. Science, 9(4): 1-12.   [Google Scholar]
  5. Blair-Goldensohn S, Hannan K, McDonald R, Neylon T, Reis G, and Reynar J (2008). Building a sentiment summarizer for local service reviews. In the NLP Challenges in the Information Explosion Era, Beijing, China: 1-10.   [Google Scholar]
  6. Blei D, Carin L, and Dunson D (2010). Probabilistic topic models: A focus on graphical model design and applications to document and image analysis. IEEE Signal Processing Magazine, 27(6): 55-65. https://doi.org/10.1109/MSP.2010.938079   [Google Scholar] PMid:25104898 PMCid:PMC4122269
  7. Blei DM, Ng AY, and Jordan MI (2003). Latent dirichlet allocation. Journal of Machine Learning Research, 3: 993-1022.   [Google Scholar]
  8. Brun C, Popa DN, and Roux C (2014). XRCE: Hybrid classification for aspect-based sentiment analysis. In the 8th International Workshop on Semantic Evaluation, Dublin, Ireland: 838-842. https://doi.org/10.3115/v1/S14-2149   [Google Scholar]
  9. Chen Y and Xie J (2008). Online consumer review: Word-of-mouth as a new element of marketing communication mix. Management Science, 54(3): 477-491. https://doi.org/10.1287/mnsc.1070.0810   [Google Scholar]
  10. Deerwester S, Dumais ST, Furnas GW, Landauer TK, and Harshman R (1990). Indexing by latent semantic analysis. Journal of the American Society for Information Science, 41(6): 391–407. https://doi.org/10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9   [Google Scholar]
  11. Ding X and Liu B (2010). Resolving object and attribute coreference in opinion mining. In the 23rd International Conference on Computational Linguistics, Association for Computational Linguistics, Beijing, China: 268-276.   [Google Scholar]
  12. Engonopoulos N, Lazaridou A, Paliouras G, and Chandrinos K (2011). ELS: A word-level method for entity-level sentiment analysis. In the International Conference on Web Intelligence, Mining and Semantics, ACM, Sogndal, Norway: 1-9.   [Google Scholar]
  13. Greene S and Resnik P (2009). More than words: Syntactic packaging and implicit sentiment. In the Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Association for Computational Linguistics, Boulder, Colorado, USA: 503-511. https://doi.org/10.3115/1620754.1620827   [Google Scholar]
  14. Griffiths TL and Steyvers M (2003). Prediction and semantic association. In: Sahani M and Linden JF (Eds.), Advances in neural information processing systems: 11-18. MIT Press, Cambridge, USA.   [Google Scholar]
  15. Grossman DA and Frieder O (2012). Information retrieval: Algorithms and heuristics. Springer Science and Business Media, Berlin, Germany.   [Google Scholar]
  16. Hasanain M, Bagdouri M, Elsayed T, and Oard DW (2016). What questions do journalists ask on twitter?. In the 10th International AAAI Conference on Web and Social Media, Association for the Advancement of Artificial Intelligence (AAAI), Cologne, Germany: 127-134.   [Google Scholar]
  17. Hasanain M, Elsayed T, and Magdy W (2014). Identification of answer-seeking questions in Arabic microblogs. In the 23rd ACM International Conference on Conference on Information and Knowledge Management, ACM, Shanghai, China: 1839-1842. https://doi.org/10.1145/2661829.2661959   [Google Scholar]
  18. Jeyapriya A and Selvi CK (2015). Extracting aspects and mining opinions in product reviews using supervised learning algorithm. In the 2nd International Conference on Electronics and Communication Systems, IEEE, Coimbatore, India: 548-552. https://doi.org/10.1109/ECS.2015.7124967   [Google Scholar]
  19. Kharche SE and Mante RV (2017). An efficient way of answering the questions asked on social sites by understanding user intent. In the 2017 International Conference on Recent Trends in Electrical, Electronics and Computing Technologies, IEEE, Warangal, India: 159-163. https://doi.org/10.1109/ICRTEECT.2017.22   [Google Scholar]
  20. Kirange DK, Deshmukh RR, and Kirange MDK (2014). Aspect based sentiment analysis SemEval-2014 task 4. Asian Journal of Computer Science {and} Information Technology, 4(8): 72-75. https://doi.org/10.15520/ajcsit.v4i8.9   [Google Scholar]
  21. Kwong H and Yorke-Smith N (2012). Detection of imperative and declarative question–answer pairs in email conversations. AI Communications, 25(4): 271-283.   [Google Scholar]
  22. Lee L (1999). Measures of distributional similarity. In the 37th Annual Meeting of the Association for Computational Linguistics on Computational Linguistics, Association for Computational Linguistics, College Park, Maryland, USA: 25-32. https://doi.org/10.3115/1034678.1034693   [Google Scholar] PMCid:PMC1232645
  23. Lerman K and McDonald R (2009). Contrastive summarization: An experiment with consumer reviews. In the Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Boulder, USA: 113-116. https://doi.org/10.3115/1620853.1620886   [Google Scholar]
  24. Li B, Si X, Lyu MR, King I, and Chang EY (2011a). Question identification on twitter. In the 20th ACM International Conference on Information and Knowledge Management, ACM, Glasgow, Scotland, UK: 2477-2480. https://doi.org/10.1145/2063576.2063996   [Google Scholar] PMCid:PMC3116284
  25. Li FH, Huang M, Yang Y, and Zhu X (2011b). Learning to identify review spam. In the Twenty-Second International Joint Conference on Artificial Intelligence, Barcelona, Spain: 2488-2493.   [Google Scholar]
  26. Li P and Zhao H (2013). NLP-oriented study on the imperative sentence with interrogative mood. In the 2013 International Conference on Asian Language Processing, IEEE, Urumqi, China: 54-56. https://doi.org/10.1109/IALP.2013.18   [Google Scholar]
  27. Liu B (2012). Sentiment analysis and opinion mining. Synthesis Lectures on Human Language Technologies, 5(1): 1-167. https://doi.org/10.2200/S00416ED1V01Y201204HLT016   [Google Scholar]
  28. Liu B (2015). Sentiment analysis: Mining opinions, sentiments, and emotions. Cambridge University Press, Cambridge, UK. https://doi.org/10.1017/CBO9781139084789   [Google Scholar]
  29. Liu Q, Gao Z, Liu B, and Zhang Y (2013). A logic programming approach to aspect extraction in opinion mining. In the 2013 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies, IEEE, Atlanta, USA, 1: 276-283. https://doi.org/10.1109/WI-IAT.2013.40   [Google Scholar]
  30. Liu Z and Jansen BJ (2015). Subjective versus objective questions: Perception of question subjectivity in social Q&A. In the International Conference on Social Computing, Behavioral-Cultural Modeling, and Prediction, Springer, Washington, USA: 131-140. https://doi.org/10.1007/978-3-319-16268-3_14   [Google Scholar]
  31. Liu Z and Jansen BJ (2016). Understanding and predicting question subjectivity in social question and answering. IEEE Transactions on Computational Social Systems, 3(1): 32-41. https://doi.org/10.1109/TCSS.2016.2564400   [Google Scholar]
  32. Long C, Zhang J, and Zhut X (2010). A review selection approach for accurate feature rating estimation. In the 23rd International Conference on Computational Linguistics: Posters, Association for Computational Linguistics, Beijing, China: 766-774.   [Google Scholar]
  33. Ott M, Choi Y, Cardie C, and Hancock JT (2011). Finding deceptive opinion spam by any stretch of the imagination. In the 49th Annual Meeting of the Association for Computational Linguistics: Human language Technologies, Association for Computational Linguistics, Portland, Oregon, USA, 1: 309-319.   [Google Scholar]
  34. Pablos AG, Cuadros M, and Rigau G (2014). V3: Unsupervised generation of domain aspect terms for aspect based sentiment analysis. In the 8th International Workshop on Semantic Evaluation, Dublin, Ireland: 833-837.   [Google Scholar]
  35. Pantel P, Crestan E, Borkovsky A, Popescu AM, and Vyas V (2009). Web-scale distributional similarity and entity set expansion. In the 2009 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Singapore, 2: 938-947. https://doi.org/10.3115/1699571.1699635   [Google Scholar]
  36. Pontiki M, Galanis D, Papageorgiou H, Androutsopoulos I, Manandhar S, Mohammad AS, and Hoste V (2016). Semeval-2016 task 5: Aspect based sentiment analysis. In the 10th International Workshop on Semantic Evaluation, Association for Computational Linguistics, San Diego, California, USA: 19-30.   [Google Scholar]
  37. Ranganath S, Hu X, Tang J, Wang S, and Liu H (2016). Identifying rhetorical questions in social media. In the 10th International AAAI Conference on Web and Social Media, AAAI, Cologne, Germany: 667-670.   [Google Scholar]
  38. Rubini R and Chezian RM (2014). An Analysis on search engines: Techniques and tools. International Journal of Advanced Research in Computer and Communication Engineering, 3(9): 7860–7864.   [Google Scholar]
  39. Saqib SM, Ahmad S, Syed AH, Naeem T, Alotaibi FM (2019). Grouping of aspects into relevant category based on WordNet definitions. International Journal of Computer Science and Network Security, 19(2): 113–119.   [Google Scholar]
  40. Saqib SM, Kundi FM, and Ahmad S (2018a). Unsupervised learning method for sorting positive and negative reviews using LSI (latent semantic indexing) with automatic generated queries. International Journal of Computer Science and Network Security, 18(1): 56-62.   [Google Scholar]
  41. Saqib SM, Kundi FM, Syed AH, and Ahmad S (2018b). Semi supervised method for detection of ambiguous word and creation of sense: Using WordNet. International Journal of Advanced Computer Science and Applications, 9(11): 353-359. https://doi.org/10.14569/IJACSA.2018.091149   [Google Scholar]
  42. Saqib SM, Mahmood K, and Naeem T (2016a). Comparison of LSI algorithms without and with pre-processing: using text document based search. ACCENTS Transactions on Information Security, 1(4): 44-51.   [Google Scholar]
  43. Saqib SM, Naeem T, and Mahmood K (2016b). Extraction of key/ title/ aspect words from document using WordNet. ACCENTS Transactions on Information Security, 1(3): 36-43.   [Google Scholar]
  44. Teli S and Biradar S (2014). Effective spam detection method for email. In the International Conference on Advances in Engineering and Technology, Singapore: 68-72.   [Google Scholar]
  45. Vijayarani S and Muthulakshmi M (2013). Comparative study on classification meta algorithms. International Journal of Innovative Research in Computer and Communication Engineering, 1(8): 1768-1774.   [Google Scholar]
  46. Wang SH, Ding Y, Zhao W, Huang YH, Perkins R, Zou W, and Chen JJ (2016). Text mining for identifying topics in the literatures about adolescent substance use and depression. BMC Public Health, 16(1): 279-286. https://doi.org/10.1186/s12889-016-2932-1   [Google Scholar] PMid:26993983 PMCid:PMC4799597
  47. Zhang L and Liu B (2011). Identifying noun product features that imply opinions. In the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: Association for Computational Linguistics, Portland, Oregon, USA, 2: 575-580.   [Google Scholar]
  48. Zhao Z and Mei Q (2013). Questions about questions: An empirical analysis of information needs on twitter. In the 22nd International Conference on World Wide Web, ACM, Rio de Janeiro, Brazil: 1545-1556. https://doi.org/10.1145/2488388.2488523   [Google Scholar] PMCid:PMC3971240