I’m an AI Scientist at Mistral in Paris. My research focuses on multimodal foundation models for vision and language. Previously, I was a postdoctoral researcher at FAIR in Meta AI, where I contributed to the development of DINOv3, a state-of-the-art self-supervised vision foundation model, and DINO-world, a latent video world model. During my PhD at KTH, Stockholm, I worked on the explainability of deep learning models, applied to computer vision and bioinformatics.
PhD in Deep Learning, 2018 - 2023
KTH - Royal Institute of Technology
MSc in Machine Learning, 2016 - 2018
KTH - Royal Institute of Technology
BSc in Computer Engineering, 2013 - 2016
UNIBO - University of Bologna