Abstract
The project aims to develop different Reinforcement Learning techniques (Proximal Policy Optimization, Recurrent Reinforcement Learning and Deep-Q-Learning) in a comparative way for the application to the problem of automatic investment on the Brazilian stock market - specifically for daily operations (swing trade ). Therefore, it is proposed: the development of a database with historical…