Catalogue en ligne IGN

Descripteur

Termes IGN > 1-Candidats > estimation de pose

estimation de pose

Documents disponibles dans cette catégorie (35)

Ajouter le résultat dans votre panier Visionner les documents numériques Affiner la recherche Interroger des sources externes

Etendre la recherche sur niveau(x) vers le bas

Apprentissage profond et IA pour l’amélioration de la robustesse des techniques de localisation par vision artificielle / Achref Elouni (2021)

Public

Titre : Apprentissage profond et IA pour l’amélioration de la robustesse des techniques de localisation par vision artificielle
Type de document : Thèse/HDR
Auteurs : Achref Elouni, Auteur ; Michel Dhome, Directeur de thèse
Editeur : Clermont-Ferrand : Université Clermont Auvergne
Année de publication : 2021
Importance : 121 p.
Format : 21 x 30 cm
Note générale : Bibliographie
Thèse pour obtenir le grade de Docteur d’Université, Spécialité Informatique
Langues : Français (fre)
Descripteur : [Vedettes matières IGN] Traitement d'image
[Termes IGN] apprentissage profond
[Termes IGN] estimation de pose
[Termes IGN] état de l'art
[Termes IGN] localisation basée vision
[Termes IGN] méthode robuste
[Termes IGN] points homologues
[Termes IGN] réalité augmentée
[Termes IGN] reconstruction 3D
[Termes IGN] réseau neuronal convolutif
[Termes IGN] segmentation sémantique

Index. décimale : THESE Thèses et HDR
Résumé : (Auteur) Le travail réalisé dans le cadre de ce doctorat se place dans le contexte d’un projet collaboratif ayant pour objectif la mise au point d’un casque de réalité augmenté. Afin de faire fonctionner un tel dispositif il s’avère nécessaire de calculer la position d’une caméra embarquée dans l’environnent d’intervention de l’utilisateur. Récemment, deux technologies dénommées SLAM (pour « Simultaneous Localization And Mapping ») et SfM (pour « Structure From Motion ») ont fait preuve de performances indéniables pour la reconstruction 3D d’un environnement à partir d’une collection d’images. Nous nous sommes intéressés à elles afin de résoudre le problème délicat de l’initialisation de notre dispositif ou de sa ré-initialisation en cas d’échec du suivi temps réel de la position. En effet, malgré les travaux de recherche réalisés ces dernières années, plusieurs limitations empêchent les système de localisation d’estimer une pose parfaite dans toutes les conditions. Ces conditions incluent les changements légers du contexte comme les variations de la luminosité, du point d’observation ou des modifications géométriques telles que l’ajout d’objets. Pour faire face à ces limitations et afin de proposer une solution facile à déployer,nous avons étudié la possibilité d’intégrer dans le processus de localisation des informations invariantes qui pourraient augmenter la probabilité d’avoir une pose précise. Deux types d’information invariante (sémantique et géométrique) ont été exploitées dans cette thèse pour aider le système de localisation à trouver sa position. Les solutions proposées ont été validées sur plusieurs jeux de données internes et externes (Dubrovnik, Rome, Oxford, Musée) grâce auxquels nous avons pu comparer nos résultats avec les travaux décrits dans l’état de l’art. Deux types d’images requêtes ont été étudiées dans cette thèse : celle composée d’une seule image et celle issue d’un dispositif stéréo. L’avantage d’utiliser une paire stéréo est de pouvoir trianguler des points homologues afin d’extraire leur hauteur et d’exploiter cette dernière dans le processus de localisation. L’autre approche envisagée consiste à utiliser comme invariant le label des pixels obtenu par un algorithme de segmentation sémantique basé sur un réseau de neurones convolutionnel. Dans les deux cas, les résultats obtenus montrent une amélioration sensible sur la précision des poses estimées.
Note de contenu : 1. Introduction générale
1.1 Contexte
2. État de l’art
2.1 Le problème de la reconstruction 3D
2.2 Localisation visuelle basée sur l’image
2.3 Conclusion
3. Descripteur géométrique-visuel pour une localisation améliorée ba?sée sur l’image utilisant un a priori sur la verticale
3.1 Introduction
3.2 Méthode proposée
3.3 Étude expérimentale
3.4 Conclusion
4. La segmentation sémantique pour améliorer la localisation visuelle
4.1 Introduction
4.2 Méthode proposée
4.3 Étude expérimentale
4.4 Conclusion
5. La segmentation sémantique pour améliorer la recherche d’image par le contenu
5.1 Introduction
5.2 Méthode proposée
5.3 Étude expérimentale
5.4 Conclusion
6. Conclusion et Perspectives

Numéro de notice : 26763
Affiliation des auteurs : non IGN
Thématique : IMAGERIE/INFORMATIQUE
Nature : Thèse française
Note de thèse : Thèse de Doctorat : Informatique : Clermont Ferrand : 2021
Organisme de stage : Institut Pascal
nature-HAL : Thèse
DOI : sans
Date de publication en ligne : 03/02/2022
En ligne : https://tel.hal.science/tel-03554182/
Format de la ressource électronique : URL
Permalink : https://documentation.ensg.eu/index.php?lvl=notice_display&id=99827

How to spatialize geographical iconographic heritage / Emile Blettery (2021)

Public

contenu dans SUMAC'21: Proceedings of the 3rd Workshop on Structuring and Understanding of Multimedia heritAge Contents / Valérie Gouet-Brunet (2021)

Titre : How to spatialize geographical iconographic heritage
Type de document : Article/Communication
Auteurs : Emile Blettery , Auteur ; Nelson Fernandes, Auteur ; Valérie Gouet-Brunet , Auteur
Editeur : New York [Etats-Unis] : Association for computing machinery ACM
Année de publication : 2021
Projets : Alegoria / Gouet-Brunet, Valérie
Conférence : SUMAC 2021, 3rd workshop on Structuring and Understanding of Multimedia heritAge Contents 20/10/2021 24/10/2021 Chengdu Chine
Importance : 10 p.
Note générale : bibliographie
Langues : Anglais (eng)
Descripteur : [Vedettes matières IGN] Traitement d'image optique
[Termes IGN] base de données d'images
[Termes IGN] estimation de pose
[Termes IGN] géolocalisation
[Termes IGN] géoréférencement indirect
[Termes IGN] image ancienne
[Termes IGN] image numérisée
[Termes IGN] patrimoine culturel
[Termes IGN] photographie aérienne oblique
[Termes IGN] photographie terrestre
[Termes IGN] recherche d'image basée sur le contenu

Résumé : (auteur) This article is dedicated to the spatialization of image contents, with a focus on geographical iconographic heritage, i.e. digitized or born-digital image collections, acquired at variable temporal periods and showing the territory and its human-made and natural visual landmarks. We present a panorama of the current solutions (manual, semi-automatic and fully automatic alternatives) that exist to spatialize a visual content, with respect to the data available and the level of spatialization targeted. In particular, we highlight the characteristics of the approaches dedicated to geographical iconographic heritage, and in some cases, we present tests and practical feedbacks that we had the opportunity to conduct for old photographic contents in oblique aerial and terrestrial imagery.
Numéro de notice : C2021-035
Affiliation des auteurs : UGE-LASTIG (2020- )
Thématique : IMAGERIE/INFORMATIQUE
Nature : Communication
nature-HAL : ComAvecCL&ActesPubliésIntl
DOI : 10.1145/3475720.3484444
En ligne : https://doi.org/10.1145/3475720.3484444
Format de la ressource électronique : URL article
Permalink : https://documentation.ensg.eu/index.php?lvl=notice_display&id=99054

Documents numériques

peut être téléchargé

How to spatialize geographical iconographic heritage - pdf éditeur
Adobe Acrobat PDF

Learning to map street-side objects using multiple views / Ahmed Samy Nassar (2021)

Public

Titre : Learning to map street-side objects using multiple views
Type de document : Thèse/HDR
Auteurs : Ahmed Samy Nassar, Auteur ; Sébastien Lefèvre, Directeur de thèse ; Jan Dirk Wegner, Directeur de thèse
Editeur : Vannes : Université de Bretagne Sud
Année de publication : 2021
Importance : 139 p.
Format : 21 x 30 cm
Note générale : bibliographie
Thèse de Doctorat de l'Université de Bretagne Sud, spécialité Informatique
Langues : Anglais (eng)
Descripteur : [Vedettes matières IGN] Traitement d'image optique
[Termes IGN] apprentissage profond
[Termes IGN] arbre urbain
[Termes IGN] cartographie par internet
[Termes IGN] classification par réseau neuronal convolutif
[Termes IGN] détection d'objet
[Termes IGN] données multisources
[Termes IGN] estimation de pose
[Termes IGN] géolocalisation
[Termes IGN] graphe
[Termes IGN] image Streetview
[Termes IGN] inventaire
[Termes IGN] mobilier urbain
[Termes IGN] vision par ordinateur

Index. décimale : THESE Thèses et HDR
Résumé : (auteur) Creating inventories of street-side objects and their monitoring in cities is a labor-intensive and costly process. Field workers are known to conduct this process on-site to record properties about the object. These properties can be the location, species, height, and health of a tree as an example. To monitor cities, gathering such information on a large scale becomes challenging. With the abundance of imagery, adequate coverage of a city is achieved from different views provided by online mapping services (e.g., Google Maps and Street View, Mapillary). The availability of such imagery allows efficient creation and updating of inventories of street-side objects status by using computer vision methods such as object detection and multiple object tracking. This thesis aims at detecting and geo-localizing street-side objects, especially trees and street signs, from multiple views using novel deep learning methods.
Note de contenu : 1- Introduction
2- Background
3- Multi-view instance matching with learned geometric soft-constraints
4- Simultaneous multi-view instance detection with learned geometric softconstraints
5- GeoGraphV2: Graph-based aerial & street view multi-view object detection with geometric cues end-to-end
6- Conclusion
Numéro de notice : 28674
Affiliation des auteurs : non IGN
Thématique : IMAGERIE/INFORMATIQUE
Nature : Thèse française
Note de thèse : Thèse de Doctorat : Informatique : Université de Bretagne Sud : 2021
Organisme de stage : IRISA
DOI : sans
En ligne : https://hal.science/tel-03523658
Format de la ressource électronique : URL
Permalink : https://documentation.ensg.eu/index.php?lvl=notice_display&id=99920

Robust and fast global image orientation / Xin Wang (2021)

Public

Titre : Robust and fast global image orientation
Type de document : Thèse/HDR
Auteurs : Xin Wang, Auteur ; Christian Heipke, Directeur de thèse
Editeur : Munich : Bayerische Akademie der Wissenschaften
Année de publication : 2021
Collection : DGK - C, ISSN 0065-5325 num. 871
Importance : 141 p.
Note générale : bibliographie
Diese Arbeit ist gleichzeitig veröffentlicht in: Wissenschaftliche Arbeiten der Fachrichtung Geodäsie und Geoinformatik der Leibniz Universität Hannover ISSN 0174-1454, Nr. 373, Hannover 2021
Langues : Anglais (eng)
Descripteur : [Vedettes matières IGN] Photogrammétrie numérique
[Termes IGN] appariement d'images
[Termes IGN] appariement dense
[Termes IGN] chaîne de traitement
[Termes IGN] estimation de pose
[Termes IGN] méthode robuste
[Termes IGN] orientation d'image
[Termes IGN] orientation relative
[Termes IGN] rotation
[Termes IGN] structure-from-motion
[Termes IGN] translation
[Termes IGN] valeur aberrante

Résumé : (auteur) The estimation of image orientation (also called pose) has always played a crucial role in the field of photogrammetry since it is a fundamental prerequisite for the subsequent works of multi-view dense matching, generating DEM and DSM, etc. In the community of computer vision, the task is also well known as Structure-from-Motion (SfM), which reveals that image pose, while positions of object points are determined interdependently. Despite a lot of efforts over the last decades, it has recently gained the photogrammetrists’ interests again due to the fast-growing number of different resources of images. New challenges are posed for accurately and efficiently orienting various image datasets (e.g., unordered datasets with a large number of images, or images compromised of critical stereo pairs). In this thesis, the relevant ambition is to develop a new fast and robust method for the estimation of image orientation which is capable of coping with different types of datasets. To achieve this goal, the two most time-consuming steps of image orientation are in particular taken care of: (a) image matching and (b) the estimation process. To accelerate the image matching process, a new method employing a random k-d forest is proposed to quickly obtain pairs of overlapping images from an unordered image set. After that, image matching and the estimation of relative orientation parameters are performed only for pairs found to be very likely overlapping. On the other hand, to estimate the image poses in a time efficient manner, a global image orientation strategy is advocated. Its basic idea is to first simultaneously solve all available images’ poses, before a final bundle adjustment is carried out once for refinement. The conventional two-step global approach is pursued in this work, separating the determination of rotation matrices and translation parameters; the former is solved by an existing popular method of Chatterjee and Govindu [2013], and the latter are estimated globally using a newly developed method: translation estimation integrating both the relative translations and tie points. Tie points within triplets are adopted to firstly calculate global unified scale factors for each available pairwise relative translation. Then, analogous to rotation estimation, translations are determined by performing an averaging operation on the scaled relative translations. In order to improve the robustness of the solution, efforts in this thesis are also focused on coping with outliers in the relative orientations (ROs), which global image orientation approaches are particularly sensitive to. A general method based on triplet compatibility with respect to loop closure errors of relative rotations and translations is presented for detecting blunders in relative orientations. Although this procedure eliminated many gross errors in the input ROs, it typically cannot sort out blunders which are caused by repetitive structures and critical configurations, such as inappropriate baselines (very short baseline or baselines parallel to the viewing direction). Therefore, another new method is proposed to eliminate wrong ROs which have resulted from repetitive structures and very short baselines. Two corresponding criteria that indicate the quality of ROs are introduced. Repetitive structure is detected based on counts of conjugate points of the various image pairs, while very short baselines are found by inspecting the intersection angles of corresponding image rays. By analyzing these two criteria, incorrect ROs are detected and eliminated. As correct ROs of image pairs with a wider baseline nearly parallel to both viewing directions can be valuable, a method to identify and keep these ROs is also a part of this research. The validation and evaluation of the proposed method are thoroughly conducted on various benchmarks including ordered and unordered sets of images, images with repetitive structures and inappropriate baselines, etc. In particular, robustness is investigated by demonstrating the efficacy of the corresponding RO outlier detection methods. The performance and time efficiency of determining image orientation are evaluated and compared with several state-of-the-art global image orientation approaches. In summary, based on the experimental results, the developed methods demonstrateto be able to accomplish the image orientation taskfast and robustlyon different kinds of datasets.
Numéro de notice : 17672
Affiliation des auteurs : non IGN
Thématique : IMAGERIE
Nature : Thèse étrangère
Note de thèse : PhD dissertation : Fachrichtung Geodäsie und Geoinformatik : Hanovre : 2021
En ligne : https://dgk.badw.de/fileadmin/user_upload/Files/DGK/docs/c-871.pdf
Format de la ressource électronique : URL
Permalink : https://documentation.ensg.eu/index.php?lvl=notice_display&id=97997

3D hand mesh reconstruction from a monocular RGB image / Hao Peng in The Visual Computer, vol 36 n° 10 - 12 (October 2020)

Public

[article]
inThe Visual Computer > vol 36 n° 10 - 12 (October 2020) . - pp pages2227 - 2239
Titre : 3D hand mesh reconstruction from a monocular RGB image
Type de document : Article/Communication
Auteurs : Hao Peng, Auteur ; Chuhua Xian, Auteur ; Yunbo Zhang, Auteur
Année de publication : 2020
Article en page(s) : pp pages2227 - 2239
Note générale : bibliographie
Langues : Anglais (eng)
Descripteur : [Vedettes matières IGN] Traitement d'image optique
[Termes IGN] classification par réseau neuronal convolutif
[Termes IGN] estimation de pose
[Termes IGN] image de synthèse
[Termes IGN] image RVB
[Termes IGN] maillage
[Termes IGN] modélisation 3D
[Termes IGN] réalité augmentée
[Termes IGN] réalité virtuelle
[Termes IGN] reconstruction 3D
[Termes IGN] reconstruction d'objet
[Termes IGN] vision monoculaire

Résumé : (auteur) Most of the existing methods for 3D hand analysis based on RGB images mainly focus on estimating hand keypoints or poses, which cannot capture geometric details of the 3D hand shape. In this work, we propose a novel method to reconstruct a 3D hand mesh from a single monocular RGB image. Different from current parameter-based or pose-based methods, our proposed method directly estimates the 3D hand mesh based on graph convolution neural network (GCN). Our network consists of two modules: the hand localization and mask generation module, and the 3D hand mesh reconstruction module. The first module, which is a VGG16-based network, is applied to localize the hand region in the input image and generate the binary mask of the hand. The second module takes the high-order features from the first and uses a GCN-based network to estimate the coordinates of each vertex of the hand mesh and reconstruct the 3D hand shape. To achieve better accuracy, a novel loss based on the differential properties of the discrete mesh is proposed. We also use professional software to create a large synthetic dataset that contains both ground truth 3D hand meshes and poses for training. To handle the real-world data, we use the CycleGAN network to transform the data domain of real-world images to that of our synthesis dataset. We demonstrate that our method can produce accurate 3D hand mesh and achieve an efficient performance for real-time applications.
Numéro de notice : A2020-596
Affiliation des auteurs : non IGN
Thématique : IMAGERIE
Nature : Article
DOI : 10.1007/s00371-020-01908-3
Date de publication en ligne : 14/07/2020
En ligne : https://doi.org/10.1007/s00371-020-01908-3
Format de la ressource électronique : URL article
Permalink : https://documentation.ensg.eu/index.php?lvl=notice_display&id=95936

[article]

Guided feature matching for multi-epoch historical image blocks pose estimation / Lulin Zhang in ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, vol V-2-2020 (August 2020)

Permalink
Towards structureless bundle adjustment with two- and three-view structure approximation / Ewelina Rupnik in ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, vol V-2-2020 (August 2020)

Permalink
Refractive two-view reconstruction for underwater 3D vision / François Chadebecq in International journal of computer vision, vol 128 n° 5 (May 2020)

Permalink
Automatic scale estimation of structure from motion based 3D models using laser scalers in underwater scenarios / Klemen Istenič in ISPRS Journal of photogrammetry and remote sensing, vol 159 (January 2020)

Permalink
Collaborative visual-inertial state and scene estimation / Marco Karrer (2020)

Permalink
Intelligent Imaging and Analysis / DaeEun Kim (2020)

Permalink
Probabilistic pose estimation and 3D reconstruction of vehicles from stereo images / Maximilian Alexander Coenen (2020)

Permalink
Robust pose estimation and calibration of catadioptric cameras with spherical mirrors / Sagi Filin in Photogrammetric Engineering & Remote Sensing, PERS, vol 86 n° 1 (January 2020)

Permalink
Simulation and analysis of photogrammetric UAV image blocks: influence of camera calibration error / Yilin Zhou in ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, vol IV-2/W5 (May 2019)

Permalink
BIM-PoseNet: Indoor camera localisation using a 3D indoor model and deep learning from synthetic images / Debaditya Acharya in ISPRS Journal of photogrammetry and remote sensing, vol 150 (April 2019)

Permalink

IGN

Centre de documentation
scientifique

Accueil

Sélection de la langue

Adresse

Se connecter

Actualité

L'actu ! Horaires du CDOS

Informations pratiques

Descripteur

estimation de pose

Documents disponibles dans cette catégorie (35)

Documents numériques

IGN / ENSG

L'IGN a pour vocation

Accès directs

2014-2022 IGN

IGN

Centre de documentationscientifique

Accueil

Sélection de la langue

Adresse

Se connecter

Actualité

L'actu ! Horaires du CDOS

Informations pratiques

Descripteur

estimation de pose

Documents disponibles dans cette catégorie (35)

Documents numériques

IGN / ENSG

L'IGN a pour vocation

Accès directs

2014-2022 IGN

Centre de documentation
scientifique