Federico Baldassarre
Postdoc
DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment
CVPR 2025
- Locked-image tuning for vision-language alignment using a DINOv2 backbone and a few tricks on top.
Cijo Jose, Théo Moutakanni, Dahyun Kang, Federico Baldassarre, Timothée Darcet, Hu Xu, Daniel Li, Marc Szafraniec, Michaël Ramamonjisoa, Maxime Oquab, Oriane Siméoni, Huy v. Vo, Patrick Labatut, Piotr Bojanowski
PDF
Cluster and Predict Latent Patches for Improved Masked Image Modeling
TMLR 2025
- Stable training of dense image representations using a clustering loss on ViT patch tokens.
Timothée Darcet, Federico Baldassarre, Maxime Oquab, Julien Mairal, Piotr Bojanowski
PDF
Code