Descripteur
Termes IGN > mathématiques > statistique mathématique > analyse de données > classification > classification par réseau neuronal > classification par réseau neuronal convolutif
classification par réseau neuronal convolutifVoir aussi |
Documents disponibles dans cette catégorie (157)
Ajouter le résultat dans votre panier
Visionner les documents numériques
Affiner la recherche Interroger des sources externes
Etendre la recherche sur niveau(x) vers le bas
Titre : Deep learning based 3D reconstruction: supervision and representation Type de document : Thèse/HDR Auteurs : François Darmon, Auteur ; Pascal Monasse, Directeur de thèse ; Mathieu Aubry, Directeur de thèse Editeur : Champs-sur-Marne : Ecole des Ponts ParisTech Année de publication : 2022 Importance : 115 p. Format : 21 x 30 cm Note générale : Bibliographie
Thèse de doctorat de l'Ecole des Ponts ParisTech, spécialité informatiqueLangues : Anglais (eng) Descripteur : [Vedettes matières IGN] Traitement d'image optique
[Termes IGN] appariement d'images
[Termes IGN] carte de profondeur
[Termes IGN] classification par réseau neuronal convolutif
[Termes IGN] extraction
[Termes IGN] géométrie épipolaire
[Termes IGN] maillage
[Termes IGN] modèle stéréoscopique
[Termes IGN] point d'intérêt
[Termes IGN] Ransac (algorithme)
[Termes IGN] reconstruction 3D
[Termes IGN] reconstruction d'objet
[Termes IGN] semis de points
[Termes IGN] SIFT (algorithme)
[Termes IGN] structure-from-motion
[Termes IGN] voxelIndex. décimale : THESE Thèses et HDR Résumé : (auteur) 3D reconstruction is a long standing problem in computer vision. Yet, state-of-the-art methods still struggle when the images used have large illumination changes, many occlusions or limited textures. Deep Learning holds promises of improving 3D reconstruction in such setups, but classical methods still produce the best results. In this thesis we analyse the specificity of deep learning applied to multiview 3D reconstruction and introduce new deep learning based methods.The first contribution of this thesis is an analysis of the possible supervision for training Deep Learning models for sparse image matching. We introduce a two-step algorithm that first computes low resolution matches using deep learning and then matches classical local features inside the matches regions. We analyze several levels of supervision and show that our new epipolar supervision leads to the best results.The second contribution is also a study of supervision for Deep Learning but applied to another scenario: calibrated 3D reconstruction in the wild. We show that existing unsupervised methods do not work on such data and we introduce a new training technique that solves this issue. We then exhaustively compare unsupervised approach and supervised approaches with different network architectures and training data.Finally, our third contribution is about data representation. Neural implicit representation were recently used for image rendering. We adapt this representation to the multiview reconstruction problem and we introduce a new method that, similar to classical 3D reconstruction techniques, optimizes photo-consistency between projections of multiple images. Our approach outperforms state-of-the-art by a large margin. Note de contenu : 1- Introduction
2- Background
3- Deep learning for guiding keypoint matching
4- Deep Learning based Multi-View Stereo in the wild
5- Multi-view reconstruction with implicit surfaces and patch warping
6- ConclusionNuméro de notice : 24085 Affiliation des auteurs : non IGN Thématique : IMAGERIE/INFORMATIQUE Nature : Thèse française Note de thèse : Thèse de Doctorat : Informatique : Ponts ParisTech : 2022 Organisme de stage : Laboratoire d'Informatique Gaspard-Monge LIGM DOI : sans En ligne : https://www.theses.fr/2022ENPC0024 Format de la ressource électronique : URL Permalink : https://documentation.ensg.eu/index.php?lvl=notice_display&id=102473
Titre : Deep-learning based multiple land-cover map translation Type de document : Article/Communication Auteurs : Luc Baudoux , Auteur ; Jordi Inglada, Auteur ; Clément Mallet , Auteur Editeur : New York : Institute of Electrical and Electronics Engineers IEEE Année de publication : 2022 Projets : 1-Pas de projet / Conférence : IGARSS 2022, IEEE International Geoscience And Remote Sensing Symposium 17/07/2022 22/07/2022 Kuala Lumpur Malaysie Proceedings IEEE Importance : pp 1260 - 1263 Note générale : bibliographie Langues : Anglais (eng) Descripteur : [Vedettes matières IGN] Bases de données localisées
[Termes IGN] apprentissage profond
[Termes IGN] base de données d'occupation du sol
[Termes IGN] cadre conceptuel
[Termes IGN] carte d'occupation du sol
[Termes IGN] classification par réseau neuronal convolutif
[Termes IGN] segmentation sémantiqueRésumé : (auteur) This paper presents a framework for simultaneously translating multiple land-cover maps into a given one in a supervised way. Conversely to existing approaches working on 1–1 translation, we propose a multi-translation setup that increases the generalizability and translation performance, especially on land-cover maps covering restricted spatial extents. The proposed method mainly assumes that the map of interest spatially overlaps at least with one of the other maps. High performance translation is achieved with a Convolutional Neural Network (CNN) based encoder-decoder frame-work trained with three goals: (i) high-quality translation; (ii) self-reconstruction ability; (iii) mapping of all datasets into a common representation space. Country-scale experimental results show the method effectiveness in translating six highly heterogeneous land-cover maps, achieving significantly better results than the traditional semantic-based method and better results than CNN trained for a 1–1 translation task (+ 9.7% in Overall Accuracy (OA) and +12% in macro F1-score (mF1)). Numéro de notice : C2022-039 Affiliation des auteurs : UGE-LASTIG+Ext (2020- ) Autre URL associée : https://hal.science/hal-03983066v1/document Thématique : GEOMATIQUE Nature : Communication nature-HAL : ComAvecCL&ActesPubliésIntl DOI : 10.1109/IGARSS46834.2022.9883056 En ligne : https://doi.org/10.1109/IGARSS46834.2022.9883056 Format de la ressource électronique : URL article Permalink : https://documentation.ensg.eu/index.php?lvl=notice_display&id=101765
Titre : Deep learning based vehicle detection in aerial imagery Type de document : Monographie Auteurs : Lars Wilko Sommer, Éditeur scientifique Editeur : Karlsruhe [Allemagne] : KIT Scientific Publishing Année de publication : 2022 Importance : 276 p. Format : 15 x 21 cm ISBN/ISSN/EAN : 978-3-7315-1113-7 Note générale : bibliographie Langues : Anglais (eng) Descripteur : [Vedettes matières IGN] Traitement d'image optique
[Termes IGN] ancre
[Termes IGN] apprentissage profond
[Termes IGN] classification par réseau neuronal convolutif
[Termes IGN] détection d'objet
[Termes IGN] extraction de traits caractéristiques
[Termes IGN] filtre
[Termes IGN] image aérienne
[Termes IGN] véhiculeRésumé : (éditeur) This book proposes a novel deep learning based detection method, focusing on vehicle detection in aerial imagery recorded in top view. The base detection framework is extended by two novel components to improve the detection accuracy by enhancing the contextual and semantical content of the employed feature representation. To reduce the inference time, a lightweight CNN architecture is proposed as base architecture and a novel module that restricts the search area is introduced. Note de contenu : 1- Introduction
2- Related work
3- Concept
4- Experimental setup
5- Base framework
6- Integration of contextual knowledge
7- Runtime optimization
8- Evaluation
9- Conclusions and outlookNuméro de notice : 28685 Affiliation des auteurs : non IGN Thématique : IMAGERIE Nature : Recueil / ouvrage collectif DOI : 10.5445/KSP/1000135415 En ligne : https://doi.org/10.5445/KSP/1000135415 Format de la ressource électronique : URL Permalink : https://documentation.ensg.eu/index.php?lvl=notice_display&id=100015 Detection of windthrown tree stems on UAV-orthomosaics using U-Net convolutional networks / Stefan Reder in Remote sensing, vol 14 n° 1 (January-1 2022)
[article]
Titre : Detection of windthrown tree stems on UAV-orthomosaics using U-Net convolutional networks Type de document : Article/Communication Auteurs : Stefan Reder, Auteur ; J.P. Mund, Auteur ; Nicole Albert, Auteur ; et al., Auteur Année de publication : 2022 Article en page(s) : n° 75 Note générale : bibliographie Langues : Anglais (eng) Descripteur : [Vedettes matières IGN] Traitement d'image optique
[Termes IGN] apprentissage profond
[Termes IGN] branche (arbre)
[Termes IGN] classification par réseau neuronal convolutif
[Termes IGN] détection d'objet
[Termes IGN] dommage forestier causé par facteurs naturels
[Termes IGN] données d'entrainement (apprentissage automatique)
[Termes IGN] image captée par drone
[Termes IGN] orthophotoplan numérique
[Termes IGN] segmentation sémantique
[Termes IGN] tempête
[Termes IGN] troncRésumé : (auteur) The increasing number of severe storm events is threatening European forests. Besides the primary damages directly caused by storms, there are secondary damages such as bark beetle outbreaks and tertiary damages due to negative effects on the market. These subsequent damages can be minimized if a detailed overview of the affected area and the amount of damaged wood can be obtained quickly and included in the planning of clearance measures. The present work utilizes UAV-orthophotos and an adaptation of the U-Net architecture for the semantic segmentation and localization of windthrown stems. The network was pre-trained with generic datasets, randomly combining stems and background samples in a copy–paste augmentation, and afterwards trained with a specific dataset of a particular windthrow. The models pre-trained with generic datasets containing 10, 50 and 100 augmentations per annotated windthrown stems achieved F1-scores of 73.9% (S1Mod10), 74.3% (S1Mod50) and 75.6% (S1Mod100), outperforming the baseline model (F1-score 72.6%), which was not pre-trained. These results emphasize the applicability of the method to correctly identify windthrown trees and suggest the collection of training samples from other tree species and windthrow areas to improve the ability to generalize. Further enhancements of the network architecture are considered to improve the classification performance and to minimize the calculative costs. Numéro de notice : A2022-082 Affiliation des auteurs : non IGN Thématique : FORET/IMAGERIE Nature : Article DOI : 10.3390/rs14010075 En ligne : https://doi.org/10.3390/rs14010075 Format de la ressource électronique : URL article Permalink : https://documentation.ensg.eu/index.php?lvl=notice_display&id=99476
in Remote sensing > vol 14 n° 1 (January-1 2022) . - n° 75[article]Effective triplet mining improves training of multi-scale pooled CNN for image retrieval / Federico Vaccaro in Machine Vision and Applications, vol 33 n° 1 (January 2022)
[article]
Titre : Effective triplet mining improves training of multi-scale pooled CNN for image retrieval Type de document : Article/Communication Auteurs : Federico Vaccaro, Auteur ; Marco Bertini, Auteur ; Tiberio Uricchio, Auteur ; et al., Auteur Année de publication : 2022 Article en page(s) : n° 16 Note générale : bibliographie Langues : Anglais (eng) Descripteur : [Vedettes matières IGN] Traitement d'image optique
[Termes IGN] agrégation de données
[Termes IGN] analyse visuelle
[Termes IGN] architecture de réseau
[Termes IGN] classification par réseau neuronal convolutif
[Termes IGN] exploration de données
[Termes IGN] extraction de traits caractéristiques
[Termes IGN] recherche d'image basée sur le contenu
[Termes IGN] réseau neuronal siamois
[Termes IGN] tripletRésumé : (auteur) In this paper, we address the problem of content-based image retrieval (CBIR) by learning images representations based on the activations of a Convolutional Neural Network. We propose an end-to-end trainable network architecture that exploits a novel multi-scale local pooling based on the trainable aggregation layer NetVLAD (Arandjelovic et al in Proceedings of the IEEE conference on computer vision and pattern recognition CVPR, NetVLAD, 2016) and bags of local features obtained by splitting the activations, allowing to reduce the dimensionality of the descriptor and to increase the performance of retrieval. Training is performed using an improved triplet mining procedure that selects samples based on their difficulty to obtain an effective image representation, reducing the risk of overfitting and loss of generalization. Extensive experiments show that our approach, that can be effectively used with different CNN architectures, obtains state-of-the-art results on standard and challenging CBIR datasets. Numéro de notice : A2022-237 Affiliation des auteurs : non IGN Thématique : IMAGERIE Nature : Article DOI : 10.1007/s00138-021-01260-z Date de publication en ligne : 06/01/2022 En ligne : https://doi.org/10.1007/s00138-021-01260-z Format de la ressource électronique : URL article Permalink : https://documentation.ensg.eu/index.php?lvl=notice_display&id=100153
in Machine Vision and Applications > vol 33 n° 1 (January 2022) . - n° 16[article]Éléments pour l'analyse et le traitement d'images : application à l'estimation de la qualité du bois / Rémy Decelle (2022)PermalinkGénération d’un jeu de données d’entraînement et mise en oeuvre d’une architecture de détection par deep learning des numéros de parcelles sur les plans du cadastre Napoléonien / Tiecoumba Ibrahim Tamela (2022)PermalinkA GIS-based landslide susceptibility mapping and variable importance analysis using artificial intelligent training-based methods / Pengxiang Zhao in Remote sensing, vol 14 n° 1 (January-1 2022)PermalinkGlobal canopy height regression and uncertainty estimation from GEDI LIDAR waveforms with deep ensembles / Nico Lang in Remote sensing of environment, vol 268 (January 2022)PermalinkHistograms of oriented mosaic gradients for snapshot spectral image description / Lulu Chen in ISPRS Journal of photogrammetry and remote sensing, vol 183 (January 2022)PermalinkMLMT-CNN for object detection and segmentation in multi-layer and multi-spectral images / Majedaldein Almahasneh in Machine Vision and Applications, vol 33 n° 1 (January 2022)PermalinkMulti-criteria geographic analysis for automated cartographic generalization / Guillaume Touya in Cartographic journal (the), vol 59 n° 1 (February 2022)PermalinkA novel unmixing-based hypersharpening method via convolutional neural network / Xiaochen Lu in IEEE Transactions on geoscience and remote sensing, vol 60 n° 1 (January 2022)PermalinkPedestrian trajectory prediction with convolutional neural networks / Simone Zamboni in Pattern recognition, vol 121 (January 2022)PermalinkPermalink