Advanced search
Start date

Machine learning for WebSensors: algorithms and applications

Grant number: 14/08996-0
Support type:Regular Research Grants
Duration: August 01, 2014 - July 31, 2016
Field of knowledge:Physical Sciences and Mathematics - Computer Science
Principal Investigator:Solange Oliveira Rezende
Grantee:Solange Oliveira Rezende
Home Institution: Instituto de Ciências Matemáticas e de Computação (ICMC). Universidade de São Paulo (USP). São Carlos , SP, Brazil
Assoc. researchers: Bruno Magalhães Nogueira ; Gustavo Enrique de Almeida Prado Alves Batista ; Ricardo Marcondes Marcacini ; Veronica Oliveira de Carvalho


The popularization of textual content published in web platforms has motivated the development of methods for automatic knowledge extraction from texts. Particularly, a new range of applications and studies have been proposed to use the web as a powerful "social sensor". This allows to identify and monitor events published in news portals and social networks as epidemics detection, sentiment analysis, and political and economic indicators. Currently, the construction of a websensor is a complex task, since it depends of domain specialists to define the parameters of the sensors, i.e., search queries, filters and monitoring textual content from web. Moreover, for some problems there is no comprehension about the phenomenons to monitor, which limits the application of websensors. In this research project we investigate the use of machine learning methods to support the building of websensors. The basic idea is to use a sample of textual document from a problem and apply semi/non supervised learning methods to extract patterns from texts and support the generation of websensors. Thus, we hope to reduce the dependency of specialist domains to define parameters for websensor. Besides, each websensor represents a phenomenon related to a problem and it can be monitored during the time to be used as support to decision making. (AU)

Articles published in Agência FAPESP Newsletter about the research grant
Algorithms facilitate automated classification of web texts 

Scientific publications (6)
(References retrieved automatically from Web of Science and SciELO through information on FAPESP grants and their corresponding numbers as mentioned in the publications by the authors)
MARCACINI, RICARDO MARCONDES; ROSSI, RAFAEL GERALDELI; MATSUNO, IVONE PENQUE; REZENDE, SOLANGE OLIVEIRA. Cross-domain aspect extraction for sentiment analysis: A transductive learning approach. DECISION SUPPORT SYSTEMS, v. 114, p. 70-80, OCT 2018. Web of Science Citations: 3.
ROSSI, RAFAEL GERALDELI; LOPES, ALNEU DE ANDRADE; REZENDE, SOLANGE OLIVEIRA. Using bipartite heterogeneous networks to speed up inductive semi-supervised learning and improve automatic text categorization. KNOWLEDGE-BASED SYSTEMS, v. 132, p. 94-118, SEP 15 2017. Web of Science Citations: 1.
SOUZA, VINICIUS M. A.; ROSSI, RAFAEL G.; BATISTA, GUSTAVO E. A. P. A.; REZENDE, SOLANGE O. Unsupervised active learning techniques for labeling training sets: An experimental evaluation on sequential data. Intelligent Data Analysis, v. 21, n. 5, p. 1061+, 2017. Web of Science Citations: 0.
MANZATO, MARCELO G.; DOMINGUES, MARCOS A.; FORTES, ARTHUR C.; SUNDERMANN, CAMILA V.; D'ADDIO, RAFAEL M.; CONRADO, MERLEY S.; REZENDE, SOLANGE O.; PIMENTEL, MARIA G. C. Mining unstructured content for recommender systems: an ensemble approach. INFORMATION RETRIEVAL JOURNAL, v. 19, n. 4, p. 378-415, AUG 2016. Web of Science Citations: 3.
ROSSI, RAFAEL GERALDELI; LOPES, ALNEU DE ANDRADE; REZENDE, SOLANGE OLIVEIRA. Optimization and label propagation in bipartite heterogeneous networks to improve transductive classification of texts. INFORMATION PROCESSING & MANAGEMENT, v. 52, n. 2, p. 217-257, MAR 2016. Web of Science Citations: 19.
CORREA, GERALDO N.; MARCACINI, RICARDO M.; HRUSCHKA, EDUARDO R.; REZENDE, SOLANGE O. Interactive textual feature selection for consensus clustering. PATTERN RECOGNITION LETTERS, v. 52, p. 25-31, JAN 15 2015. Web of Science Citations: 4.

Please report errors in scientific publications list by writing to: