Abstract
In this project, we are interested in identifying the most important topics present in a given dataset; finding sub-themes within the data (structures) that are of interest according to a given query; determining whether or not two pieces of information (visual and textual) refer to the same subject; determining involved actors (who is present in the mentioned information pi…