Godkända
Maskininlärningsbaserad Multimodal Datakomprimering
Jacob Forsell () och Yuyang Jin ()
Start
2024-01-25
Presentation
2024-06-10 09:15
Plats:
E:2349
Avslutat:
2024-06-27
Examensrapport:
Sammanfattning
The field of learned image compression has been experiencing rapid development and increased research engagement. In this thesis, we aim to contribute to the field by extending a state-of-the-art learned image compression architecture, called LIC-TCM, by incorporating a depth map as a second complementary modality to further enhance image compression. Additionally, we explore the inverse approach, where we primarily compress a depth map (which can be represented as an image) using LIC-TCM and incorporate the corresponding image frame as a secondary complementary modality. Based on these explorations, we propose three unique multimodal compression architectures. Our experimental results demonstrate overall improvements in compression performance and indicate a positive direction for future research.
Handledare: Saeed Bastani (Ericsson) och Alexander Ekman (Lund University, Particle and nuclear physics) och Amir Aminifar (EIT)
Examinator: Michael Lentmaier (EIT)