The Multimedia team is pursuing its research on the representation, compression and transmission of multimedia data, with a particular focus on video and on emerging and immersive content formats.
The main research themes are:
- Linear video coding
- Rich-media scene representation
- Multimedia content adaptation
- Multimedia distribution networks
- Frugal and Efficient AI
- Geometric deep learning
- Multimodal learning
Team members
- Enzo Tartaglione, Associate Professor, team leader
- Jhony Giraldo, Associate Professor
- Stéphane Lathuilière, Associate Professor
- Jean Le Feuvre, Directeur d’Etudes/Professor
- Jean-Claude Moissinac, Associate Professor
Keywords
- Image and video compression
- Transport and orchestration of multimedia content
- Deep neural network compression
- Graph Neural Networks
- AI-based generative models
- Frugal and Efficient AI
- Geometric deep learning
- Multimodal learning
News
[Nov 24] Article accepted for publication:
Higher-Order GNNs Meet Efficiency: Sparse Sobolev Graph Neural Networks. Jhony H. Giraldo, Aref Einizade, Andjela Todorovic, Jhon A. Castro-Correa, Mohsen Badiey, Thierry Bouwmans, Fragkiskos D. Malliaros. IEEE Transactions on Signal and Information Processing over Networks.
[Oct 24] We have 4x articles accepted at the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2025):
-
WiGNet: Windowed Vision Graph Neural Network. Gabriele Spadaro, Marco Grangetto, Attilio Fiandrotti, Enzo Tartaglione, Jhony H. Giraldo
-
ELMGS: Enhancing memory and computation scaLability through coMpression for 3D Gaussian Splatting. Muhammad Salman Ali, Sung-Ho Bae, Enzo Tartaglione
- CATALOG: A Camera Trap Language-guided Contrastive Learning Model. Julian Santamaria, Claudia Isaza, Jhony H. Giraldo
-
Efficient Progressive Image Compression with Variance-aware Masking. Alberto Presta, Enzo Tartaglione, Attilio Fiandrotti, Marco Grangetto, Pamela Cosman
[Sep 24] We have 3x articles accepted at the 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024):
- An eye for an ear: zero-shot audio description leveraging an image captioner with audio-visual token distribution matching. Hugo Malard, Michel Olvera, Stéphane Lathuilière, Slim Essid
- Continuous Product Graph Neural Networks. Aref Einizade, Fragkiskos D. Malliaros, Jhony H. Giraldo
- Activation Map Compression through Tensor Decomposition for Deep Learning. Le-Trung Nguyen, Aël Quélennec, Enzo Tartaglione, Samuel Tardieu, Van-Tam Nguyen
——–