Huy V. Vo
Paris, France

I am currently a Research Scientist at FAIR, Meta. I obtained my PhD in Computer Science from Ecole Normale Supérieure in November 2022. My thesis was prepared in the WILLOW team at INRIA and the Valeo.ai team under the supervision of Prof. Jean Ponce and Dr. Patrick Pérez. Prior to my PhD, I was a student in the Mathématique-Vision-Apprentissage (MVA) master at Ecole Normale Supérieure de Paris Saclay and the Ingénieur Polytechnicien Program of Ecole Polytechnique.

My research interests include self-supervised learning, automatic data curation, and any learning problem that requires less supervision, including object discovery, weakly supervised object detection/segmentation, and active learning.

Contacts:

News

  • 08/2025 We release DINOv3, a state-of-the-art computer vision model trained with self-supervised learning (SSL) that produces powerful, high-resolution image features. For the first time, a single frozen vision backbone outperforms specialized solutions on multiple long-standing dense prediction tasks.
  • 05/2024 We release our paper on Automatic data curation for self-supervised learning, a generic, principled approach to build large, diverse and balanced training datasets. VentureBeat wrote an article and Andrew Ng's letter gave a detailed account of it.
  • 04/2023 We release DINOv2, a family of foundation models producing universal features suitable for image-level visual tasks (image classification, instance retrieval, video understanding) as well as pixel-level visual tasks (depth estimation, semantic segmentation).
  • 11/2022 I am joining FAIR Labs, Meta as an AI Research Scientist.
  • 11/2022 I successfully defended my thesis on "Annotation-efficient learning for object discovery and detection". The thesis was reviewed by Prof. Andrew Zisserman and Prof. Tinne Tuytelaars, examined by Dr. Cordelia Schmid, Dr. Yannis Avrithis, Dr. Elena Sizikova (invited member) and Dr. Oriane Siméoni (invited member), and supervised by Prof. Jean Ponce and Dr. Patrick Pérez.
  • 07/2022 Our work on active and weakly-supervised object detection, BiB, is accepted to ECCV 2022.
  • 10/2021 Our work on unsupervised object discovery/detection, LOST, is accepted to BMVC 2021.
  • 09/2021 Our work on large-scale unsupervised object discovery, LOD, is accepted to NeurIPS 2021.
  • 03/2021 Our paper on weakly supervised lesion segmentation is accepted at MIDL 2021.
  • 07/2020 Our work on unsupervised object discovery, rOSD, is accepted at ECCV 2020.
  • 10/2019 I am visiting the Center for Data Science, New York University, for 3 months.
  • 03/2019 Our work on unsupervised object discovery, OSD, is accepted at CVPR 2019.
  • 10/2018 I start my PhD in INRIA's WILLOW team and Valeo.ai under the supervision of Prof. Jean Ponce and Dr. Patrick Pérez.
  • 04/2018 I start a 6-month research internship in the WILLOW team of INRIA and the Center for Data Science (New York University), working on object discovery under the supervision of Prof. Jean Ponce and Prof. Yann LeCun.
  • 04/2017 I start a 5-month internship at Technicolor, working on image inpainting under the supervision of Dr. Patrick Pérez.

Invited Talks

  • 10/2024 Automatic data curation, Vanderbilt Machine Learning Seminars.
  • 06/2023 Annotation-efficient learning for object discovery and detection, VinAI seminar.
  • 12/2021 Unsupervised Object Discovery, IMAGINE team, École des ponts ParisTech, Paris.

Publications

arXiv 2025 DINOv3
[project page]
Oriane Siméoni, Huy V. Vo, Maximilian Seitzer, Federico Baldassarre, Maxime Oquab, Cijo Jose, Vasil Khalidov, Marc Szafraniec, Seungeun Yi, Michaël Ramamonjisoa, Francisco Massa, Daniel Haziza, Luca Wehrstedt, Jianyuan Wang, Timothée Darcet, Théo Moutakanni, Leonel Sentana, Claire Roberts, Andrea Vedaldi, Jamie Tolan, John Brandt, Camille Couprie, Julien Mairal, Hervé Jégou, Patrick Labatut, Piotr Bojanowski
CVPR 2024 DINOv2 meets text: A unified framework for image- and pixel-level vision-language alignment
Cijo Jose, Théo Moutakanni, Dahyun Kang, Federico Baldassarre, Timothée Darcet, Hu Xu, Daniel Li, Marc Szafraniec, Michaël Ramamonjisoa, Maxime Oquab, Oriane Siméoni, Huy V. Vo, Patrick Labatut, Piotr Bojanowski
TMLR 2024 Automatic Data Curation for Self-Supervised Learning: A Clustering-Based Approach
[project page]
Huy V. Vo, Vasil Khalidov, Timothée Darcet, Théo Moutakanni, Nikita Smetanin, Marc Szafraniec, Hugo Touvron, Camille Couprie, Maxime Oquab, Armand Joulin, Hervé Jégou, Patrick Labatut, Piotr Bojanowski
TMLR 2024 DINOv2: Learning robust visual features without supervision
[project page]
Maxime Oquab, Timothée Darcet, Théo Moutakanni, Huy V. Vo, Marc Szafraniec, Vasil Khalidov, Patrick Labatut, Armand Joulin, Piotr Bojanowski et al.
Remote Sensing 2024 Sub-meter resolution canopy height maps using self-supervised learning and a vision transformer trained on Aerial and GEDI Lidar
[paper]
Jamie Tolan, Hung-I Yang, Ben Nosarzewski, Guillaume Couairon, Huy V. Vo, John Brandt, Justine Spore, Sayantan Majumdar, Daniel Haziza, Janaki Vamaraju, Théo Moutakanni, Piotr Bojanowski, Tracy Johns, Brian White, Tobias Tiecke, Camille Couprie
Thesis Annotation-efficient learning for object discovery and detection
[thesis]
Huy V. Vo
ECCV 2022 Active Learning Strategies for Weakly-Supervised Object Detection
[project page]
Huy V. Vo, Oriane Siméoni, Spyros Gidaris, Andrei Bursuc, Patrick Pérez, Jean Ponce
NeurIPS 2021 Large-Scale Unsupervised Object Discovery
[project page]
Huy V. Vo, Elena Sizikova, Cordelia Schmid, Patrick Pérez, Jean Ponce
BMVC 2021 Localizing Objects with Self-Supervised Transformers and no Labels
[project page]
Oriane Siméoni, Gilles Puy, Huy V. Vo, Simon Roburin, Spyros Gidaris, Andrei Bursuc, Patrick Pérez, Renaud Marlet, Jean Ponce
MIDL 2021 Improving Weakly Supervised Lesion Segmentation using Multi-Task Learning
[paper][code]
Tianshu Chu, Xinmeng Li, Huy V. Vo, Ronald M Summers, Elena Sizikova
ECCV 2020 Toward unsupervised, multi-object discovery in large-scale image collections
[project page]
Huy V. Vo, Patrick Pérez, Jean Ponce
CVPR 2019 Unsupervised image matching and object discovery as optimization
[project page]
Huy V. Vo, Francis Bach, Minsu Cho, Kai Han, Yann LeCun, Patrick Pérez, Jean Ponce
ACMMM 2018 Structural inpainting
[project page]
Huy V. Vo, Ngoc QK Duong, Patrick Pérez