| Grant number: | 21/14591-7 |
| Support Opportunities: | Scholarships in Brazil - Doctorate |
| Start date: | July 01, 2022 |
| End date: | October 25, 2026 |
| Field of knowledge: | Physical Sciences and Mathematics - Computer Science |
| Principal Investigator: | Tiago Agostinho de Almeida |
| Grantee: | Pedro Reis Pires |
| Host Institution: | Centro de Ciências em Gestão e Tecnologia (CCGT). Universidade Federal de São Carlos (UFSCAR). Campus de Sorocaba. Sorocaba , SP, Brazil |
| Associated scholarship(s): | 24/15919-4 - Time-aware exploration and state representation for incremental recommender systems, BE.EP.DR |
Abstract With the constant popularization of technology, recommendation systems have become increasingly important within digital media. Its main purpose is to recommend a subset of items relevant to a specific user, helping them to discover new interests. Since the beginning of the field, a static and non-incremental approach has been common, in which algorithms are trained with a fixed database, captured in the past. However, the practical recommendation scenario operates sequentially: the system generates recommendations to the user, who immediately provides feedback. Recent studies in the area are developing models that take advantage of this continuous feature of the problem. Based on recent reinforcement learning strategies, these methods learn in an incremental manner, thus generating a recommending agent that automatically adapts to the users' interests over time. As this area of research is quite recent, it is still maturing and has many open challenges. For example, models capable of generating top-N recommendations, i.e., a list of ordered items, are practically scarce. Additionally, although reinforcement learning has, by definition, sequential temporal knowledge of interactions, little is known about the explicit use of time during training, as is done in time-aware recommenders. This research project seeks to study these open questions and propose new recommendation models based on reinforcement learning for top-N recommendation tasks. The main strategy will be the balance between exploration and diversity of the list of recommended items, in addition to the consumption of temporal attributes to increase the agents' knowledge. (AU) | |
| News published in Agência FAPESP Newsletter about the scholarship: | |
| More itemsLess items | |
| TITULO | |
| Articles published in other media outlets ( ): | |
| More itemsLess items | |
| VEICULO: TITULO (DATA) | |
| VEICULO: TITULO (DATA) | |