Abstract
This research plan addresses temporal and spatial signals such as speech, audio and images to be processed for learning features based on differentiable losses or objective functions. In the process, it is important to convert the signals from their original domains to other ones, resulting in spectral and time-frequency representations in order to disclose features that are more signific…