Artificial Intelligence - Publications
- Fini, Enrico; Brutti, Alessio; IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); Supervised Online Diarization with Sample Mean Loss for Multi-Domain Data; in «ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)»; 2020; pp. 7134-7138
- Cerutti, Gianmarco; Prasad, Rahul; Brutti, Alessio; Farella, Elisabetta; Compact recurrent neural networks for acoustic event detection on low-energy low-complexity platforms; in «IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING»; Vol. 14>; 2020; pp. 654-664
- Cerutti, Gianmarco; Prasad, Rahul; Brutti, Alessio; Farella, Elisabetta; Interspeech; Neural Network Distillation on IoT Platforms for Sound Event Detection; in «Interspeech 2019»; 2019; pp. 3609-3613
- Qian, Xinyuan; Brutti, Alessio; Lanz, Oswald; Omologo, Maurizio; Cavallaro, Andrea; Multi-speaker tracking from an audio-visual sensing device; in «IEEE TRANSACTIONS ON MULTIMEDIA»; Vol. 21>; 2019; pp. 2576-2588
- Rajan, Vandana; Brutti, Alessio; Cavallaro, Andrea; ConflictNET: End-to-End Learning for Speech-based Conflict Intensity Estimation; in «IEEE SIGNAL PROCESSING LETTERS»; Vol. 26>; 2019; pp. 1668-1672
- Lanz, Oswald; Brutti, Alessio; Xompero, Alessio; Qian, Xinyuan; Omologo, Maurizio; Cavallaro, Andrea; IEEE International Conference on Acoustics, Speech, and Signal Processing; Accurate Target Annotation in 3D from Multimodal Streams; in «Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)»; 2019; pp.
- Pertilä, Pasi; Brutti, Alessio; Svaizer, Piergiorgio; Omologo, Maurizio; Multichannel Source Activity Detection, Localization, and Tracking; in «Audio Source Separation and Speech Enhancement»; 2018; pp. 47-64
- Cavallaro, Andrea; Brutti, Alessio; Chapter 5 - Audio-visual learning for body-worn cameras; in «Multimodal Behavior Analysis in the Wild: advances and challenges»; Academic Press; 2018; pp. 103-119
- Qian, Xinyuan; Xompero, Alessio; Brutti, Alessio; Lanz, Oswald; Omologo, Maurizio; Cavallaro, Andrea; International Conference on Acoustics, Speech and Signal Processing (ICASSP); 3D mouth tracking from a compact microphone array co-located with a camera; in «Proc. 2018 International Conference on Acoustics, Speech and Signal Processing (ICASSP)»; 2018; pp. 3071-3075
- Alessio, Brutti; Andrea Cavallaro; 2nd International Workshop on “Computer Vision for Audio-Visual Media” (CVAVM) – ICCV 2017; Unsupervised cross-modal deep-model adaptation for audio-visual re-identification with wearable cameras; in «2nd International Workshop on “Computer Vision for Audio-Visual Media” (CVAVM) – ICCV 2017»; 2017; pp. 438-445