Estudo sobre Métricas para Definir Reputação do Autor de Comentários em Sites de Vendas de Produtos
DOI:
https://doi.org/10.5753/isys.2019.595Keywords:
Reputação do autor, Mineração de opinião, Redes neurais artificiaisAbstract
Conhecer a reputação do autor de textos opinativos na Web é de suma importância para o desenvolvimento de sistemas baseados em dados abertos. Este artigo apresenta um estudo sobre medidas usadas no processo de avaliação da reputação do autor em sites de vendas de produtos. Realizou-se dois experimentos com as redes neurais Multilayer Perceptron (MLP) e Radial Basis Function (RBF), sendo que a rede MLP obteve melhor desempenho. Em um terceiro experimento, comparou-se a abordagem TOP(X) original, usada para inferir os melhores comentários, com um novo modelo que utiliza rede MLP na dimensão da reputação do autor. Considerando os comentários excelentes e bons, a nova abordagem apresentou resultados significativamente superiores. Adicionalmente, foi realizado um quarto experimento com outros algoritmos de aprendizagem de máquina (AM) para observar o comportamento dos dados.
Downloads
Referências
[2] Adler, B. T., De Alfaro, L., and Pye, I. (2010). “Detecting wikipedia vandalism using wikitrust”. Notebooks papers of CLEF.
[3] Aggarwal, A. and Kumaraguru, P. (2014). “Followers or phantoms? an anatomy of purchased twitter followers”. CoRR.
[4] Bruce, R. F. and Wiebe, J. M. (1999). “Recognizing subjectivity: A case study of manual tagging”. Natural Language Engineering, v. 5, p. 187–205.
[5] Cappelletti, R. and Sastry, N. (2012). “IARank: Ranking users on twitter in near real-time, based on their information amplification potential”. In SocialInformatics, pages 70–77.
[6] Carvalho, P., Sarmento, L., Silva, M. J., and de Oliveira, E. (2009). “Clues for detecting irony in user-generated contents: Oh...!! it’s ”so easy”;-)”. In Proc. of the Int. Workshop on Topic-sentiment Analysis for Mass Opinion, pages 53–56.
[7] De Sousa, R. F., Rabelo, R. A. L., and Moura, R. S. (2015). “A fuzzy system-based approach to estimate the importance of online customer reviews”. In International Conference on Fuzzy Systems (FUZZ-IEEE), pages 1–8.
[8] George-Nektarios, T. (2013). "Weka Classifiers Summary", [online] Available: www.academia.eduI5167325IWeka_Classifiers_Summary.
[9] Gonçalves, P., Dalip, D., Reis, J., Messias, J., Ribeiro, F., Melo, P., Araújo, L., Gonçalves, M., and Benevenuto, F. (2015). “Bazinga! caracterizando e detectando sarcasmo e ironia no Twitter”. In Proc. of the Brazilian Workshop on Social Network Analysis and Mining (BraSNAM).
[10] Halfaker, A., Kittur, A., Kraut, R. and Riedl, J. (2009). “A jury of your peers: Quality, experience and ownership in Wikipedia”. In Proc. of the Int. Symposium on Wikis and Open Collaboration, pages 15:1–15:10. ACM.
[11] Hamilton, R., Vohs, K. D. and McGill, A. L. (2014). “We’ll be honest, this won’t be the best article you’ll ever read: The use of dispreferred markers in word-of-mouth communication”. Journal of Consumer Research, 41(1):197 – 212.
[12] Hartmann, N., Avanço, L., Balage, P., Duran, M., Nunes, M., Pardo, T. and Aluísio, S. (2014). “A large corpus of product reviews in portuguese: Tackling out-of-vocabulary words”. In Proc. of the Int. Conference on Language Resources and Evaluation (LREC’14).
[13] Hoisl B., Aigner W., Miksch S. (2007) “Social Rewarding in Wiki Systems – Motivating the Community”. In: Int. Conf. on Online Communities and Social Computing. Springer.
[14] Jackson, P. and Moulinier, I. (2007). “Natural language processing for online applications: Text retrieval, extraction and categorization”. John Benjamins, Amsterdam.
[15] Jones, C., Hesterly, W. S. and Borgatti, S. P. “A general theory of network governance: Exchange conditions and social mechanisms”. The Academy of Management Review, Academy of Management, v. 22, n. 4, p. 911–945, 1997.
[16] Kohavi, R. (1995). “The Power of Decision Tables”. In: 8th European Conference on Machine Learning,174-189, 1995.
[17] Ku, L.-W., Liang, Y.-T. and Chen, H.-H. (2006) “Opinion extraction, summarization and tracking in news and blog Corpora”. In: Proceedings of AAAI-CAAW-06, the Spring Symposia on Computational Approaches to Analyzing Weblogs.
[18] Kwak, H., Lee, C., Park, H. and Moon, S. (2010). “What is Twitter, a social network or a news media?” In Proc. of the Int. Conf. on World Wide Web. ACM.
[19] Li, B., Li, RH, King, I., Lyu MR and Yu, JX (2014). “A topic-biased user reputation model in rating systems”. Knowledge and Information Systems.
[20] Li, M., Huang, L., Tan, C. and Wei, K. (2013). “Helpfulness of online product reviews as seen by consumers: Source and content features”. Int. J. Electronic Commerce, 17(4):101–136.
[21] Liu, B. (2011). “Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data”. Springer-Verlag New York, Inc., Secaucus, NJ, USA.
[22] Mackay, D. (1998). “Introduction to Gaussian Processes”. Dept. of Physics, Cambridge University, UK.
[23] Pang, B., Lee, L. and Vaithyanathan, S. (2002) “Thumbs up?: Sentiment classification using machine learning techniques”. In: Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing.
[24] Powers, D. M. W. (2011). “Evaluation: From precision, recall and f-measure to roc., informedness, markedness & correlation”. Journal of Machine Learning Tech- nologies, 2(1):37–63.
[25] Shevade, S., Keerthi, S., Bhattacharyya, C., Murthy, K. (1999). “Improvements to the SMO Algorithm for SVM Regression”. In: IEEE Transactions on Neural Networks.
[26] SPSS (2007). SPSS for Windows, Version 16.0. Chicago USA: [s.n.].
[27] Wang, A. H. (2010). “Don’t follow me: Spam detection in Twitter”. In Proc. of the Int. Conference on Security and Cryptography (SECRYPT), pages 1–10.
[28] Weitzel, L., de Oliveira, J. P. and Quaresma, P. (2014). “Measuring the reputation in user-generated-content systems based on health information”. Procedia Computer Science, 29:364 – 378.
[29] Wiebe, J., Wilson, T. and Cardie, C. (2005) “Annotating expressions of opinions and emotions in language”. In: Language Resources and Evaluation, 39 2-3.
[30] Weng, J., Lim, E.-P., Jiang, J. and He, Q. (2010). “Twitterrank: Finding topic-sensitive influential Twitterers”. In Proc. of the Int. Conference on Web Search and Data Mining, pages 261–270. ACM.
[31] Wöhner, T., Köhler, S., and Peters, R. (2011). “Automatic reputation assessment in Wikipedia”. In Proc. of the Int. Conference on Information Systems.
[32] Zhao, H., Ye, S., Bhattacharyya, P., Rowe, J., Gribble, K. and Wu, S. F. (2010). “Socialwiki: Bring order to wiki systems with social context”. In: Int. Conf. on Social Informatics.