Busca avançada
Ano de início
Entree
(Referência obtida automaticamente do Web of Science, por meio da informação sobre o financiamento pela FAPESP e o número do processo correspondente, incluída na publicação pelos autores.)

ight-Field Imaging Reconstruction Using Deep Learning Enabling Intelligent Autonomous Transportation Syste

Texto completo
Autor(es):
Silva, Juan Casavilca [1] ; Saadi, Muhammad [2] ; Wuttisittikulkij, Lunchakorn [3] ; Militani, Davi Ribeiro [4] ; Rosa, Renata Lopes [4] ; Rodriguez, Demostenes Zegarra [4] ; Al Otaibi, Sattam [5]
Número total de Autores: 7
Afiliação do(s) autor(es):
[1] Pontificia Univ Catolica Peru, Dept Sci, Lima 15088 - Peru
[2] Univ Cent Punjab, Dept Elect Engn, Fac Engn, Lahore 54000 - Pakistan
[3] Chulalongkorn Univ, Dept Elect Engn, Wireless Commun Ecosyst Res Unit, Bangkok 10330 - Thailand
[4] Univ Fed Lavras, Dept Comp Sci, BR-37200000 Lavras - Brazil
[5] Taif Univ, Dept Elect Engn, Coll Engn, At Taif 21944 - Saudi Arabia
Número total de Afiliações: 5
Tipo de documento: Artigo Científico
Fonte: IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS; v. 23, n. 2 MAY 2021.
Citações Web of Science: 1
Resumo

Light-field (LF) cameras, also known as plenoptic cameras, permit the recording of the 4D LF distribution of target scenes. However, many times, surface errors of a microlens array (MLA) are responsible for degradation in the images captured by a plenoptic camera. Additionally, the limited pixel count of the sensor can cause missing parallax information. The aforementioned issues are crucial for creating accurate maps for Intelligent Autonomous Transport System (IATS), because they cause loss of LF information, and need to be addressed. To tackle this problem, a learning-based framework by directly simulating the LF distribution is proposed. A high-dimensional convolution layer with densely sampled LFs in 4D space and considering a soft activation function based on ReLU segmentation correction is used to generate a superresolution (SR) LF image, improving the convergence rate in the deep learning network. Experimental results show that our proposed LF image reconstruction framework outperforms the existing state-of-the-art approaches; specifically, it is effective for learning the LF distribution and generating high-quality LF images. Different image quality assessment methods are used to evaluate the performance of the proposed framework, such as PSNR, SSIM, IWSSIM, FSIM, GFM, MDFM, and HDR-VDP. Additionally, the computational efficiency was evaluated in terms of number of parameters and FLOPs, and experimental results demonstrated that our proposed framework reached the highest performance in most of the datasets used. (AU)

Processo FAPESP: 18/26455-8 - Processamento Audiovisual de Voz por Aprendizagem de Máquina
Beneficiário:Miguel Arjona Ramírez
Modalidade de apoio: Auxílio à Pesquisa - Regular