Follow
Lorenzo Baraldi
Title
Cited by
Cited by
Year
Predicting human eye fixations via an LSTM-based saliency attentive model
M Cornia, L Baraldi, G Serra, R Cucchiara
IEEE Transactions on Image Processing, 2018
4662018
Meshed-Memory Transformer for Image Captioning
M Cornia, M Stefanini, L Baraldi, R Cucchiara
IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
3682020
A deep multi-level network for saliency prediction
M Cornia, L Baraldi, G Serra, R Cucchiara
23rd International Conference on Pattern Recognition. ICPR, 2016
3242016
Hierarchical Boundary-Aware Neural Encoder for Video Captioning
L Baraldi, C Grana, R Cucchiara
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017
1802017
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions
M Cornia, L Baraldi, R Cucchiara
IEEE/CVF International Conference on Computer Vision and Pattern Recognition, 2019
1032019
Gesture recognition in ego-centric videos using dense trajectories and hand segmentation
L Baraldi, F Paci, G Serra, L Benini, R Cucchiara
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2014
922014
A deep siamese network for scene detection in broadcast videos
L Baraldi, C Grana, R Cucchiara
Proceedings of the 23rd ACM international conference on Multimedia, 1199-1202, 2015
812015
Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-to-Image Translation
M Tomei, M Cornia, L Baraldi, R Cucchiara
IEEE/CVF International Conference on Computer Vision and Pattern Recognition, 2019
802019
Paying More Attention to Saliency: Image Captioning with Saliency and Context Attention
M Cornia, L Baraldi, G Serra, R Cucchiara
ACM Transactions on Multimedia Computing Communications and Applications, 2017
762017
Hand segmentation for gesture recognition in ego-vision
G Serra, M Camurri, L Baraldi, M Benedetti, R Cucchiara
Proceedings of the 3rd ACM international workshop on Interactive multimedia …, 2013
702013
Shot and scene detection via hierarchical clustering for re-using broadcast video
L Baraldi, C Grana, R Cucchiara
International conference on computer analysis of images and patterns, 801-811, 2015
622015
Gesture Recognition using Wearable Vision Sensors to Enhance Visitors' Museum Experiences
L Baraldi, F Paci, G Serra, L Benini, R Cucchiara
IEEE Sensors 15 (5), 2705 - 2714, 2015
512015
Recognizing and presenting the storytelling video structure with deep multimodal networks
L Baraldi, C Grana, R Cucchiara
IEEE Transactions on Multimedia 19 (5), 955-968, 2016
452016
YACCLAB-yet another connected components labeling benchmark
C Grana, F Bolelli, L Baraldi, R Vezzani
2016 23rd International Conference on Pattern Recognition (ICPR), 3109-3114, 2016
372016
Modeling multimodal cues in a deep learning-based framework for emotion recognition in the wild
S Pini, OB Ahmed, M Cornia, L Baraldi, R Cucchiara, B Huet
Proceedings of the 19th ACM International Conference on Multimodal …, 2017
362017
Toward reliable experiments on the performance of connected components labeling algorithms
F Bolelli, M Cancilla, L Baraldi, C Grana
Journal of Real-Time Image Processing 17 (2), 229-244, 2020
342020
Perceive, transform, and act: Multi-modal attention networks for vision-and-language navigation
FLLBM Cornia, MCR Cucchiara
33*2019
Optimized connected components labeling with pixel prediction
C Grana, L Baraldi, F Bolelli
International Conference on Advanced Concepts for Intelligent Vision Systems …, 2016
302016
Spaghetti labeling: Directed acyclic graphs for block-based connected components labeling
F Bolelli, S Allegretti, L Baraldi, C Grana
IEEE Transactions on Image Processing 29, 1999-2012, 2019
272019
LAMV: Learning to align and match videos with kernelized temporal layers
L Baraldi, M Douze, R Cucchiara, H Jégou
Proceedings of the IEEE conference on computer vision and pattern …, 2018
272018
The system can't perform the operation now. Try again later.
Articles 1–20