Catalogue en ligne IGN

Détail de l'auteur

Auteur Thibault Groueix

Documents disponibles écrits par cet auteur (1)

Ajouter le résultat dans votre panier Affiner la recherche Interroger des sources externes

Learning 3D generation and matching / Thibault Groueix (2020)

Public

Titre : Learning 3D generation and matching
Type de document : Thèse/HDR
Auteurs : Thibault Groueix, Auteur ; Mathieu Aubry, Directeur de thèse
Editeur : Paris : Ecole Nationale des Ponts et Chaussées ENPC
Année de publication : 2020
Importance : 169 p.
Format : 21 x 30 cm
Note générale : bibliographie
A doctoral thesis in the domain of automated signal and image processing submitted to École Doctorale Paris-Est
Mathématiques et Sciences et Technologies de l’Information et de la Communication
Langues : Anglais (eng)
Descripteur : [Vedettes matières IGN] Traitement d'image optique
[Termes IGN] appariement de formes
[Termes IGN] appariement dense
[Termes IGN] apprentissage profond
[Termes IGN] classification par réseau neuronal convolutif
[Termes IGN] déformation de surface
[Termes IGN] isométrie
[Termes IGN] maillage
[Termes IGN] modélisation 3D
[Termes IGN] reconstruction 3D
[Termes IGN] reconstruction d'image
[Termes IGN] segmentation d'image
[Termes IGN] semis de points
[Termes IGN] voxel

Index. décimale : THESE Thèses et HDR
Résumé : (auteur) The goal of this thesis is to develop deep learning approaches to model and analyse 3D shapes. Progress in this field could democratize artistic creation of 3D assets which currently requires time and expert skills with technical software. We focus on the design of deep learning solutions for two particular tasks, key to many 3D modeling applications: single-view reconstruction and shape matching. A single-view reconstruction (SVR) method takes as input a single image and predicts the physical world which produced that image. SVR dates back to the early days of computer vision. In particular, in the 1960s, Lawrence G. Roberts proposed to align simple 3D primitives to the input image under the assumption that the physical world is made of cuboids. Another approach proposed by Berthold Horn in the 1970s is to decompose the input image in intrinsic images and use those to predict the depth of every input pixel. Since several configurations of shapes, texture and illumination can explain the same image, both approaches need to form assumptions on the distribution of images and 3D shapes to resolve the ambiguity. In this thesis, we learn these assumptions from large-scale datasets instead of manually designing them. Learning allows us to perform complete object reconstruction, including parts which are not visible in the input image. Shape matching aims at finding correspondences between 3D objects. Solving this task requires both a local and global understanding of 3D shapes which is hard to achieve explicitly. Instead we train neural networks on large-scale datasets to solve this task and capture this knowledge implicitly through their internal parameters.Shape matching supports many 3D modeling applications such as attribute transfer, automatic rigging for animation, or mesh editing.The first technical contribution of this thesis is a new parametric representation of 3D surfaces modeled by neural networks.The choice of data representation is a critical aspect of any 3D reconstruction algorithm. Until recently, most of the approaches in deep 3D model generation were predicting volumetric voxel grids or point clouds, which are discrete representations. Instead, we present an alternative approach that predicts a parametric surface deformation ie a mapping from a template to a target geometry. To demonstrate the benefits of such a representation, we train a deep encoder-decoder for single-view reconstruction using our new representation. Our approach, dubbed AtlasNet, is the first deep single-view reconstruction approach able to reconstruct meshes from images without relying on an independent post-processing, and can do it at arbitrary resolution without memory issues. A more detailed analysis of AtlasNet reveals it also generalizes better to categories it has not been trained on than other deep 3D generation approaches.Our second main contribution is a novel shape matching approach purely based on reconstruction via deformations. We show that the quality of the shape reconstructions is critical to obtain good correspondences, and therefore introduce a test-time optimization scheme to refine the learned deformations. For humans and other deformable shape categories deviating by a near-isometry, our approach can leverage a shape template and isometric regularization of the surface deformations. As category exhibiting non-isometric variations, such as chairs, do not have a clear template, we learn how to deform any shape into any other and leverage cycle-consistency constraints to learn meaningful correspondences. Our reconstruction-for-matching strategy operates directly on point clouds, is robust to many types of perturbations, and outperforms the state of the art by 15% on dense matching of real human scans.
Note de contenu : 1- Introduction
2 Related Work
3 AtlasNet: A Papier-Mache Approach to Learning 3D Surface Generation
4 3D-CODED : 3D Correspondences by Deep Deformation
5 Unsupervised cycle-consistent deformation for shape matching
6 Conclusion
Numéro de notice : 28310
Affiliation des auteurs : non IGN
Thématique : IMAGERIE
Nature : Thèse française
Note de thèse : Thèse de Doctorat : Automated signal and image processing : Paris-Est : 2020
Organisme de stage : LIGM
DOI : sans
En ligne : https://tel.archives-ouvertes.fr/tel-03127055v2/document
Format de la ressource électronique : URL
Permalink : https://documentation.ensg.eu/index.php?lvl=notice_display&id=98201