Descripteur
Termes IGN > sciences humaines et sociales > linguistique > linguistique informatique > traitement du langage naturel
traitement du langage naturelSynonyme(s)traitement automatique du langage naturelVoir aussi |
Documents disponibles dans cette catégorie (64)



Etendre la recherche sur niveau(x) vers le bas
Deep learning method for Chinese multisource point of interest matching / Pengpeng Li in Computers, Environment and Urban Systems, vol 96 (September 2022)
![]()
[article]
Titre : Deep learning method for Chinese multisource point of interest matching Type de document : Article/Communication Auteurs : Pengpeng Li, Auteur ; Jiping Liu, Auteur ; An Luo, Auteur ; et al., Auteur Année de publication : 2022 Article en page(s) : n° 101821 Note générale : bibliographie Langues : Anglais (eng) Descripteur : [Vedettes matières IGN] Géomatique
[Termes IGN] appariement sémantique
[Termes IGN] apprentissage profond
[Termes IGN] classification par Perceptron multicouche
[Termes IGN] classification par réseau neuronal convolutif
[Termes IGN] extraction de traits caractéristiques
[Termes IGN] inférence sémantique
[Termes IGN] information sémantique
[Termes IGN] point d'intérêt
[Termes IGN] représentation vectorielle
[Termes IGN] traitement du langage naturelRésumé : (auteur) Multisource point of interest (POI) matching refers to the pairing of POIs that refer to the same geographic entity in different data sources. This also constitutes the core issue in geospatial data fusion and update. The existing methods cannot effectively capture the complex semantic information from a text, and the manually defined rules largely affect matching results. This study developed a multisource POI matching method based on deep learning that transforms the POI pair matching problem into a binary classification problem. First, we used three different Chinese word segmentation methods to segment the POI text attributes and used the segmentation results to train the Word2Vec model to generate the corresponding word vector representation. Then, we used the text convolutional neural network (Text-CNN) and multilayer perceptron (MLP) to extract the POI attributes' features and generate the corresponding feature vector representation. Finally, we used the enhanced sequential inference model (ESIM) to perform local inference and inference combination on each attribute to realize the classification of POI pairs. We used the POI dataset containing Baidu Map, Tencent Map, and Gaode Map from Chengdu to train, verify, and test the model. The experimental results show that the matching precision, recall rate, and F1 score of the proposed method exceed 98% on the test set, and it is significantly better than the existing matching methods. Numéro de notice : A2022-513 Affiliation des auteurs : non IGN Thématique : GEOMATIQUE Nature : Article DOI : 10.1016/j.compenvurbsys.2022.101821 Date de publication en ligne : 18/06/2022 En ligne : https://doi.org/10.1016/j.compenvurbsys.2022.101821 Format de la ressource électronique : URL article Permalink : https://documentation.ensg.eu/index.php?lvl=notice_display&id=101053
in Computers, Environment and Urban Systems > vol 96 (September 2022) . - n° 101821[article]GIS-KG: building a large-scale hierarchical knowledge graph for geographic information science / Jiaxin Du in International journal of geographical information science IJGIS, vol 36 n° 5 (May 2022)
![]()
[article]
Titre : GIS-KG: building a large-scale hierarchical knowledge graph for geographic information science Type de document : Article/Communication Auteurs : Jiaxin Du, Auteur ; Shaohua Wang, Auteur ; Xinyue Ye, Auteur ; et al., Auteur Année de publication : 2022 Article en page(s) : pp 873 - 897 Note générale : bibliographie Langues : Anglais (eng) Descripteur : [Vedettes matières IGN] Géomatique
[Termes IGN] apprentissage profond
[Termes IGN] approche hiérarchique
[Termes IGN] exploration de données
[Termes IGN] ingénierie des connaissances
[Termes IGN] ontologie
[Termes IGN] recherche d'information géographique
[Termes IGN] réseau sémantique
[Termes IGN] traitement du langage naturelRésumé : (auteur) An organized knowledge base can facilitate the exploration of existing knowledge and the detection of emerging topics in a domain. Knowledge about and around Geographic Information Science and its associated system technologies (GIS) is complex, extensive and emerging rapidly. Taking the challenge, we built a GIS knowledge graph (GIS-KG) by (1) merging existing GIS bodies of knowledge to create a hierarchical ontology and then (2) applying deep-learning methods to map GIS publications to the ontology. We conducted several experiments on information retrieval to evaluate the novelty and effectiveness of the GIS-KG. Results showed the robust support of GIS-KG for knowledge search of existing GIS topics and potential to explore emerging research themes. Numéro de notice : A2022-341 Affiliation des auteurs : non IGN Thématique : GEOMATIQUE Nature : Article nature-HAL : ArtAvecCL-RevueIntern DOI : 10.1080/13658816.2021.2005795 Date de publication en ligne : 26/11/2021 En ligne : https://doi.org/10.1080/13658816.2021.2005795 Format de la ressource électronique : URL article Permalink : https://documentation.ensg.eu/index.php?lvl=notice_display&id=100515
in International journal of geographical information science IJGIS > vol 36 n° 5 (May 2022) . - pp 873 - 897[article]Automated construction of a French Entity Linking dataset to geolocate social network posts in the context of natural disasters / Gaëtan Caillaut (2022)
![]()
Titre : Automated construction of a French Entity Linking dataset to geolocate social network posts in the context of natural disasters Type de document : Article/Communication Auteurs : Gaëtan Caillaut, Auteur ; Cécile Gracianne, Auteur ; Nathalie Abadie , Auteur ; Guillaume Touya
, Auteur ; Samuel Auclair, Auteur
Editeur : Tarbes [France] : ISCRAM proceedings Année de publication : 2022 Conférence : ISCRAM 2022, 19th International Conference on Information Systems for Crisis Response and Management 22/05/2022 25/05/2022 Tarbes France OA Proceedings Projets : RéSoCio / Auclair, Samuel Importance : 11 p. Note générale : Bibliographie Langues : Anglais (eng) Descripteur : [Vedettes matières IGN] Géomatique web
[Termes IGN] catastrophe naturelle
[Termes IGN] données issues des réseaux sociaux
[Termes IGN] données localisées des bénévoles
[Termes IGN] extraction automatique
[Termes IGN] géolocalisation
[Termes IGN] gestion de crise
[Termes IGN] traitement du langage naturel
[Termes IGN] TwitterRésumé : (Auteur) During natural disasters, automatic information extraction from Twitter posts is a valuable way to get a better overview of the field situation. This information has to be geolocated to support effective actions, but for the vast majority of tweets, spatial information has to be extracted from texts content. Despite the remarkable advances of the Natural Language Processing field, this task is still challenging for current state-of-the-art models because they are not necessarily trained on Twitter data and because high quality annotated data are still lacking for low resources languages. This research in progress address this gap describing an analytic pipeline able to automatically extract geolocatable entities from texts and to annotate them by aligning them with the entities present in Wikipedia/Wikidata resources. We present a new dataset for Entity Linking on French texts as preliminary results, and discuss research perspectives for enhancements over current state-of-the-art modeling for this task. Numéro de notice : C2022-005 Affiliation des auteurs : UGE-LASTIG+Ext (2020- ) Thématique : GEOMATIQUE/SOCIETE NUMERIQUE Nature : Communication nature-HAL : ComAvecCL&ActesPubliésIntl DOI : sans Date de publication en ligne : 05/04/2022 En ligne : https://hal.archives-ouvertes.fr/hal-03631387/document Format de la ressource électronique : URL Permalink : https://documentation.ensg.eu/index.php?lvl=notice_display&id=100410 A benchmark of named entity recognition approaches in historical documents : application to 19th century French directories / Nathalie Abadie (2022)
![]()
Titre : A benchmark of named entity recognition approaches in historical documents : application to 19th century French directories Type de document : Article/Communication Auteurs : Nathalie Abadie , Auteur ; Edwin Carlinet, Auteur ; Joseph Chazalon, Auteur ; Bertrand Duménieu
, Auteur
Editeur : Berlin, Heidelberg, Vienne, New York, ... : Springer Année de publication : 2022 Collection : Lecture notes in Computer Science, ISSN 0302-9743 num. 13237 Projets : SODUCO / Perret, Julien Conférence : DAS 2022, 5th IAPR International Workshop on Document Analysis Systems 22/05/2022 25/05/2022 La Rochelle France Proceedings Springer Importance : pp 445 - 460 Note générale : bibliographie Langues : Anglais (eng) Descripteur : [Vedettes matières IGN] Géomatique
[Termes IGN] classification par réseau neuronal convolutif
[Termes IGN] dix-neuvième siècle
[Termes IGN] données d'entrainement (apprentissage automatique)
[Termes IGN] exploration de texte
[Termes IGN] objet géohistorique
[Termes IGN] reconnaissance de noms
[Termes IGN] traitement du langage naturelRésumé : (auteur) Named entity recognition (NER) is a necessary step in many pipelines targeting historical documents. Indeed, such natural language processing techniques identify which class each text token belongs to, e.g. “person name”, “location”, “number”. Introducing a new public dataset built from 19th century French directories, we first assess how noisy modern, off-the-shelf OCR are. Then, we compare modern CNN- and Transformer-based NER techniques which can be reasonably used in the context of historical document analysis. We measure their requirements in terms of training data, the effects of OCR noise on their performance, and show how Transformer-based NER can benefit from unsupervised pre-training and supervised fine-tuning on noisy data. Results can be reproduced using resources available at https://github.com/soduco/paper-ner-bench-das22 and https://zenodo.org/record/6394464. Numéro de notice : C2022-030 Affiliation des auteurs : UGE-LASTIG+Ext (2020- ) Autre URL associée : vers HAL Thématique : GEOMATIQUE/INFORMATIQUE Nature : Communication nature-HAL : ComAvecCL&ActesPubliésIntl DOI : 10.1007/978-3-031-06555-2_30 En ligne : http://dx.doi.org/10.1007/978-3-031-06555-2_30 Format de la ressource électronique : URL article Permalink : https://documentation.ensg.eu/index.php?lvl=notice_display&id=101088 Generating geographical location descriptions with spatial templates: a salient toponym driven approach / Mark M. Hall in International journal of geographical information science IJGIS, vol 36 n° 1 (January 2022)
![]()
[article]
Titre : Generating geographical location descriptions with spatial templates: a salient toponym driven approach Type de document : Article/Communication Auteurs : Mark M. Hall, Auteur ; Christopher B. Jones, Auteur Année de publication : 2022 Article en page(s) : pp 55 - 85 Note générale : bibliographie Langues : Anglais (eng) Descripteur : [Vedettes matières IGN] Toponymie
[Termes IGN] image Flickr
[Termes IGN] OpenStreetMap
[Termes IGN] relation spatiale
[Termes IGN] répertoire toponymique
[Termes IGN] saillance
[Termes IGN] toponyme
[Termes IGN] traitement du langage naturelRésumé : (auteur) Natural language descriptions of geographical locations are used frequently in daily life and there is a motivation to create systems that generate such descriptions automatically, for purposes such as documentation of where events have taken place, where a person is located, where photos were taken and where plants and animals are located. Typically location descriptions combine references to named geographical features with vague spatial relational terms, such as near, north of and at that relate locations to the features. Here we describe a system for generating location descriptions, that combines spatial templates, that model the applicability of different spatial relations relative to a reference location, with toponyms in the vicinity of the described location that are selected according to aspects of salience. The toponyms are retrieved from a gazetteer service based on OpenStreetMap for which we create a hierarchical feature classification scheme to facilitate selection of toponyms according to distinctiveness of their feature types and other aspects of salience. The advantages of the approach are demonstrated in a user study, relative to an existing state of the art system and to other baseline approaches that include manually created captions and the automated methods of two widely used photo captioning systems. Numéro de notice : A2022-043 Affiliation des auteurs : non IGN Thématique : TOPONYMIE Nature : Article DOI : 10.1080/13658816.2021.1913498 Date de publication en ligne : 28/04/2021 En ligne : https://doi.org/10.1080/13658816.2021.1913498 Format de la ressource électronique : URL article Permalink : https://documentation.ensg.eu/index.php?lvl=notice_display&id=99402
in International journal of geographical information science IJGIS > vol 36 n° 1 (January 2022) . - pp 55 - 85[article]Le carrefour dont vous êtes le héros : description de carrefours pour les personnes déficientes visuelles / Jérémy Kalsron (2021)
![]()
PermalinkExtracting event-related information from a corpus regarding soil industrial pollution / Chuanming Dong (2021)
PermalinkPlace names in Spanish republican life stories: spatial patterns in locations and perceptions / Laurence Jolivet (2021)
PermalinkSocial media as passive geo-participation in transportation planning – how effective are topic modeling & sentiment analysis in comparison with citizen surveys? / Oliver Lock in Geo-spatial Information Science, vol 23 n° 4 (December 2020)
PermalinkA deep learning architecture for semantic address matching / Yue Lin in International journal of geographical information science IJGIS, vol 34 n° 3 (March 2020)
PermalinkA framework for extracting urban functional regions based on multiprototype word embeddings using points-of-interest data / Sheng Hu in Computers, Environment and Urban Systems, vol 80 (March 2020)
PermalinkComparing supervised learning algorithms for Spatial Nominal Entity recognition / Amine Medad (2020)
PermalinkExtraction de connaissances pour la description de l'environnement maritime côtier à partir de textes d'aide à la navigation / Léa Lamotte in Revue des Nouvelles Technologies de l'Information, E.36 (2020)
PermalinkPermalinkMapping urban fingerprints of odonyms automatically extracted from French novels / Ludovic Moncla in International journal of geographical information science IJGIS, vol 33 n° 12 (December 2019)
Permalink