Advanced search
Start date
(Reference retrieved automatically from Web of Science through information on FAPESP grant and its corresponding number as mentioned in the publication by the authors.)

Mining unstructured content for recommender systems: an ensemble approach

Full text
Manzato, Marcelo G. [1] ; Domingues, Marcos A. [2] ; Fortes, Arthur C. [1] ; Sundermann, Camila V. [1] ; D'Addio, Rafael M. [1] ; Conrado, Merley S. [1] ; Rezende, Solange O. [1] ; Pimentel, Maria G. C. [1]
Total Authors: 8
[1] Univ Sao Paulo, Inst Math & Comp Sci, Sao Carlos, SP - Brazil
[2] Univ Estadual Maringa, Dept Informat, Maringa, Parana - Brazil
Total Affiliations: 2
Document type: Journal article
Source: INFORMATION RETRIEVAL JOURNAL; v. 19, n. 4, p. 378-415, AUG 2016.
Web of Science Citations: 3

Recommendation of textual documents requires indexing mechanisms to extract structured metadata for attribute-aware recommender systems. Applying a variety of text mining algorithms has the advantage of capturing different aspects of unstructured content, resulting in richer descriptions. However, it is difficult to integrate them into a unique model so that these descriptions can efficiently improve recommendation accuracy. This article proposes a generic model based on ensemble learning that combines simple text mining methods in a post-processing approach. After executing each text mining technique, each set of metadata of a particular type is applied to the recommender module, which generates attribute-specific rankings. Then, the resulting recommendations are ensembled to generate a final personalized ranking to the user. We evaluated our ensemble technique with two attribute-aware collaborative recommenders (k-Nearest Neighbors and BPR-Mapping) and we demonstrate its generality by means of comparisons among different types of ensembles. We used two datasets from different domains, the first is from the Brazilian Embrapa Agency of Technology Information website, whose documents are written in Portuguese language, and the second is the HetRec MovieLens 2k, published by the GroupLens Research Group, whose movies' storylines are written in English. The experiments show that, particularly to the k-NN recommender, better accuracy can be obtained when multiple metadata types are combined. The proposed approach is extensible and flexible to new indexing and recommendation techniques. (AU)

FAPESP's process: 14/08996-0 - Machine learning for WebSensors: algorithms and applications
Grantee:Solange Oliveira Rezende
Support type: Regular Research Grants
FAPESP's process: 13/10756-5 - Content-based filtering supported by collaborative indexing methods
Grantee:Rafael Martins Daddio
Support type: Scholarships in Brazil - Master
FAPESP's process: 12/13830-9 - Automatic Acquisition of Contextual Information for Context-Aware Recommender Systems
Grantee:Marcos Aurelio Domingues
Support type: Scholarships in Brazil - Post-Doctorate
FAPESP's process: 13/22547-1 - Exploring collaborative annotations in hibrid Recommender systems
Grantee:Marcelo Garcia Manzato
Support type: Regular Research Grants
FAPESP's process: 13/16039-3 - Exploration of text mining techniques for automatic acquisition of contextual information for Context-Aware recommendation systems
Grantee:Camila Vaccari Sundermann
Support type: Scholarships in Brazil - Master