Catalogue en ligne IGN

Descripteur

Termes IGN > mathématiques > statistique mathématique > analyse de données > classification > classification par réseau neuronal > classification par réseau neuronal convolutif

classification par réseau neuronal convolutif

Voir aussi

réseau neuronal convolutif

Documents disponibles dans cette catégorie (157)

Ajouter le résultat dans votre panier Visionner les documents numériques Affiner la recherche Interroger des sources externes

Etendre la recherche sur niveau(x) vers le bas

Deep learning based 3D reconstruction: supervision and representation / François Darmon (2022)

Public

Titre : Deep learning based 3D reconstruction: supervision and representation
Type de document : Thèse/HDR
Auteurs : François Darmon, Auteur ; Pascal Monasse, Directeur de thèse ; Mathieu Aubry, Directeur de thèse
Editeur : Champs-sur-Marne : Ecole des Ponts ParisTech
Année de publication : 2022
Importance : 115 p.
Format : 21 x 30 cm
Note générale : Bibliographie
Thèse de doctorat de l'Ecole des Ponts ParisTech, spécialité informatique
Langues : Anglais (eng)
Descripteur : [Vedettes matières IGN] Traitement d'image optique
[Termes IGN] appariement d'images
[Termes IGN] carte de profondeur
[Termes IGN] classification par réseau neuronal convolutif
[Termes IGN] extraction
[Termes IGN] géométrie épipolaire
[Termes IGN] maillage
[Termes IGN] modèle stéréoscopique
[Termes IGN] point d'intérêt
[Termes IGN] Ransac (algorithme)
[Termes IGN] reconstruction 3D
[Termes IGN] reconstruction d'objet
[Termes IGN] semis de points
[Termes IGN] SIFT (algorithme)
[Termes IGN] structure-from-motion
[Termes IGN] voxel

Index. décimale : THESE Thèses et HDR
Résumé : (auteur) 3D reconstruction is a long standing problem in computer vision. Yet, state-of-the-art methods still struggle when the images used have large illumination changes, many occlusions or limited textures. Deep Learning holds promises of improving 3D reconstruction in such setups, but classical methods still produce the best results. In this thesis we analyse the specificity of deep learning applied to multiview 3D reconstruction and introduce new deep learning based methods.The first contribution of this thesis is an analysis of the possible supervision for training Deep Learning models for sparse image matching. We introduce a two-step algorithm that first computes low resolution matches using deep learning and then matches classical local features inside the matches regions. We analyze several levels of supervision and show that our new epipolar supervision leads to the best results.The second contribution is also a study of supervision for Deep Learning but applied to another scenario: calibrated 3D reconstruction in the wild. We show that existing unsupervised methods do not work on such data and we introduce a new training technique that solves this issue. We then exhaustively compare unsupervised approach and supervised approaches with different network architectures and training data.Finally, our third contribution is about data representation. Neural implicit representation were recently used for image rendering. We adapt this representation to the multiview reconstruction problem and we introduce a new method that, similar to classical 3D reconstruction techniques, optimizes photo-consistency between projections of multiple images. Our approach outperforms state-of-the-art by a large margin.
Note de contenu : 1- Introduction
2- Background
3- Deep learning for guiding keypoint matching
4- Deep Learning based Multi-View Stereo in the wild
5- Multi-view reconstruction with implicit surfaces and patch warping
6- Conclusion
Numéro de notice : 24085
Affiliation des auteurs : non IGN
Thématique : IMAGERIE/INFORMATIQUE
Nature : Thèse française
Note de thèse : Thèse de Doctorat : Informatique : Ponts ParisTech : 2022
Organisme de stage : Laboratoire d'Informatique Gaspard-Monge LIGM
DOI : sans
En ligne : https://www.theses.fr/2022ENPC0024
Format de la ressource électronique : URL
Permalink : https://documentation.ensg.eu/index.php?lvl=notice_display&id=102473

Deep-learning based multiple land-cover map translation / Luc Baudoux (2022)

Public

Titre : Deep-learning based multiple land-cover map translation
Type de document : Article/Communication
Auteurs : Luc Baudoux , Auteur ; Jordi Inglada, Auteur ; Clément Mallet , Auteur
Editeur : New York : Institute of Electrical and Electronics Engineers IEEE
Année de publication : 2022
Projets : 1-Pas de projet /
Conférence : IGARSS 2022, IEEE International Geoscience And Remote Sensing Symposium 17/07/2022 22/07/2022 Kuala Lumpur Malaysie Proceedings IEEE
Importance : pp 1260 - 1263
Note générale : bibliographie
Langues : Anglais (eng)
Descripteur : [Vedettes matières IGN] Bases de données localisées
[Termes IGN] apprentissage profond
[Termes IGN] base de données d'occupation du sol
[Termes IGN] cadre conceptuel
[Termes IGN] carte d'occupation du sol
[Termes IGN] classification par réseau neuronal convolutif
[Termes IGN] segmentation sémantique

Résumé : (auteur) This paper presents a framework for simultaneously translating multiple land-cover maps into a given one in a supervised way. Conversely to existing approaches working on 1–1 translation, we propose a multi-translation setup that increases the generalizability and translation performance, especially on land-cover maps covering restricted spatial extents. The proposed method mainly assumes that the map of interest spatially overlaps at least with one of the other maps. High performance translation is achieved with a Convolutional Neural Network (CNN) based encoder-decoder frame-work trained with three goals: (i) high-quality translation; (ii) self-reconstruction ability; (iii) mapping of all datasets into a common representation space. Country-scale experimental results show the method effectiveness in translating six highly heterogeneous land-cover maps, achieving significantly better results than the traditional semantic-based method and better results than CNN trained for a 1–1 translation task (+ 9.7% in Overall Accuracy (OA) and +12% in macro F1-score (mF1)).
Numéro de notice : C2022-039
Affiliation des auteurs : UGE-LASTIG+Ext (2020- )
Autre URL associée : https://hal.science/hal-03983066v1/document
Thématique : GEOMATIQUE
Nature : Communication
nature-HAL : ComAvecCL&ActesPubliésIntl
DOI : 10.1109/IGARSS46834.2022.9883056
En ligne : https://doi.org/10.1109/IGARSS46834.2022.9883056
Format de la ressource électronique : URL article
Permalink : https://documentation.ensg.eu/index.php?lvl=notice_display&id=101765

Deep learning based vehicle detection in aerial imagery / Lars Wilko Sommer (2022)

Public

Titre : Deep learning based vehicle detection in aerial imagery
Type de document : Monographie
Auteurs : Lars Wilko Sommer, Éditeur scientifique
Editeur : Karlsruhe [Allemagne] : KIT Scientific Publishing
Année de publication : 2022
Importance : 276 p.
Format : 15 x 21 cm
ISBN/ISSN/EAN : 978-3-7315-1113-7
Note générale : bibliographie
Langues : Anglais (eng)
Descripteur : [Vedettes matières IGN] Traitement d'image optique
[Termes IGN] ancre
[Termes IGN] apprentissage profond
[Termes IGN] classification par réseau neuronal convolutif
[Termes IGN] détection d'objet
[Termes IGN] extraction de traits caractéristiques
[Termes IGN] filtre
[Termes IGN] image aérienne
[Termes IGN] véhicule

Résumé : (éditeur) This book proposes a novel deep learning based detection method, focusing on vehicle detection in aerial imagery recorded in top view. The base detection framework is extended by two novel components to improve the detection accuracy by enhancing the contextual and semantical content of the employed feature representation. To reduce the inference time, a lightweight CNN architecture is proposed as base architecture and a novel module that restricts the search area is introduced.
Note de contenu : 1- Introduction
2- Related work
3- Concept
4- Experimental setup
5- Base framework
6- Integration of contextual knowledge
7- Runtime optimization
8- Evaluation
9- Conclusions and outlook

Numéro de notice : 28685
Affiliation des auteurs : non IGN
Thématique : IMAGERIE
Nature : Recueil / ouvrage collectif
DOI : 10.5445/KSP/1000135415
En ligne : https://doi.org/10.5445/KSP/1000135415
Format de la ressource électronique : URL
Permalink : https://documentation.ensg.eu/index.php?lvl=notice_display&id=100015

Detection of windthrown tree stems on UAV-orthomosaics using U-Net convolutional networks / Stefan Reder in Remote sensing, vol 14 n° 1 (January-1 2022)

Public

[article]
inRemote sensing > vol 14 n° 1 (January-1 2022) . - n° 75
Titre : Detection of windthrown tree stems on UAV-orthomosaics using U-Net convolutional networks
Type de document : Article/Communication
Auteurs : Stefan Reder, Auteur ; J.P. Mund, Auteur ; Nicole Albert, Auteur ; et al., Auteur
Année de publication : 2022
Article en page(s) : n° 75
Note générale : bibliographie
Langues : Anglais (eng)
Descripteur : [Vedettes matières IGN] Traitement d'image optique
[Termes IGN] apprentissage profond
[Termes IGN] branche (arbre)
[Termes IGN] classification par réseau neuronal convolutif
[Termes IGN] détection d'objet
[Termes IGN] dommage forestier causé par facteurs naturels
[Termes IGN] données d'entrainement (apprentissage automatique)
[Termes IGN] image captée par drone
[Termes IGN] orthophotoplan numérique
[Termes IGN] segmentation sémantique
[Termes IGN] tempête
[Termes IGN] tronc

Résumé : (auteur) The increasing number of severe storm events is threatening European forests. Besides the primary damages directly caused by storms, there are secondary damages such as bark beetle outbreaks and tertiary damages due to negative effects on the market. These subsequent damages can be minimized if a detailed overview of the affected area and the amount of damaged wood can be obtained quickly and included in the planning of clearance measures. The present work utilizes UAV-orthophotos and an adaptation of the U-Net architecture for the semantic segmentation and localization of windthrown stems. The network was pre-trained with generic datasets, randomly combining stems and background samples in a copy–paste augmentation, and afterwards trained with a specific dataset of a particular windthrow. The models pre-trained with generic datasets containing 10, 50 and 100 augmentations per annotated windthrown stems achieved F1-scores of 73.9% (S1Mod10), 74.3% (S1Mod50) and 75.6% (S1Mod100), outperforming the baseline model (F1-score 72.6%), which was not pre-trained. These results emphasize the applicability of the method to correctly identify windthrown trees and suggest the collection of training samples from other tree species and windthrow areas to improve the ability to generalize. Further enhancements of the network architecture are considered to improve the classification performance and to minimize the calculative costs.
Numéro de notice : A2022-082
Affiliation des auteurs : non IGN
Thématique : FORET/IMAGERIE
Nature : Article
DOI : 10.3390/rs14010075
En ligne : https://doi.org/10.3390/rs14010075
Format de la ressource électronique : URL article
Permalink : https://documentation.ensg.eu/index.php?lvl=notice_display&id=99476

[article]

Effective triplet mining improves training of multi-scale pooled CNN for image retrieval / Federico Vaccaro in Machine Vision and Applications, vol 33 n° 1 (January 2022)

Public

[article]
inMachine Vision and Applications > vol 33 n° 1 (January 2022) . - n° 16
Titre : Effective triplet mining improves training of multi-scale pooled CNN for image retrieval
Type de document : Article/Communication
Auteurs : Federico Vaccaro, Auteur ; Marco Bertini, Auteur ; Tiberio Uricchio, Auteur ; et al., Auteur
Année de publication : 2022
Article en page(s) : n° 16
Note générale : bibliographie
Langues : Anglais (eng)
Descripteur : [Vedettes matières IGN] Traitement d'image optique
[Termes IGN] agrégation de données
[Termes IGN] analyse visuelle
[Termes IGN] architecture de réseau
[Termes IGN] classification par réseau neuronal convolutif
[Termes IGN] exploration de données
[Termes IGN] extraction de traits caractéristiques
[Termes IGN] recherche d'image basée sur le contenu
[Termes IGN] réseau neuronal siamois
[Termes IGN] triplet

Résumé : (auteur) In this paper, we address the problem of content-based image retrieval (CBIR) by learning images representations based on the activations of a Convolutional Neural Network. We propose an end-to-end trainable network architecture that exploits a novel multi-scale local pooling based on the trainable aggregation layer NetVLAD (Arandjelovic et al in Proceedings of the IEEE conference on computer vision and pattern recognition CVPR, NetVLAD, 2016) and bags of local features obtained by splitting the activations, allowing to reduce the dimensionality of the descriptor and to increase the performance of retrieval. Training is performed using an improved triplet mining procedure that selects samples based on their difficulty to obtain an effective image representation, reducing the risk of overfitting and loss of generalization. Extensive experiments show that our approach, that can be effectively used with different CNN architectures, obtains state-of-the-art results on standard and challenging CBIR datasets.
Numéro de notice : A2022-237
Affiliation des auteurs : non IGN
Thématique : IMAGERIE
Nature : Article
DOI : 10.1007/s00138-021-01260-z
Date de publication en ligne : 06/01/2022
En ligne : https://doi.org/10.1007/s00138-021-01260-z
Format de la ressource électronique : URL article
Permalink : https://documentation.ensg.eu/index.php?lvl=notice_display&id=100153

[article]

Éléments pour l'analyse et le traitement d'images : application à l'estimation de la qualité du bois / Rémy Decelle (2022)

Permalink
Génération d’un jeu de données d’entraînement et mise en oeuvre d’une architecture de détection par deep learning des numéros de parcelles sur les plans du cadastre Napoléonien / Tiecoumba Ibrahim Tamela (2022)

Permalink
A GIS-based landslide susceptibility mapping and variable importance analysis using artificial intelligent training-based methods / Pengxiang Zhao in Remote sensing, vol 14 n° 1 (January-1 2022)

Permalink
Global canopy height regression and uncertainty estimation from GEDI LIDAR waveforms with deep ensembles / Nico Lang in Remote sensing of environment, vol 268 (January 2022)

Permalink
Histograms of oriented mosaic gradients for snapshot spectral image description / Lulu Chen in ISPRS Journal of photogrammetry and remote sensing, vol 183 (January 2022)

Permalink
MLMT-CNN for object detection and segmentation in multi-layer and multi-spectral images / Majedaldein Almahasneh in Machine Vision and Applications, vol 33 n° 1 (January 2022)

Permalink
Multi-criteria geographic analysis for automated cartographic generalization / Guillaume Touya in Cartographic journal (the), vol 59 n° 1 (February 2022)

Permalink
A novel unmixing-based hypersharpening method via convolutional neural network / Xiaochen Lu in IEEE Transactions on geoscience and remote sensing, vol 60 n° 1 (January 2022)

Permalink
Pedestrian trajectory prediction with convolutional neural networks / Simone Zamboni in Pattern recognition, vol 121 (January 2022)

Permalink
Reshaping perception for autonomous driving with semantic keypoints / Lorenzo Bertoni (2022)

Permalink