Modeling and Computing Overlapping Aggregation of Large Data Sequences in Geographic Information Systems - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement Accéder directement au contenu
Article Dans Une Revue International Journal of Information System Modeling and Design Année : 2019

Modeling and Computing Overlapping Aggregation of Large Data Sequences in Geographic Information Systems

Résumé

Recently, in the field of information systems, the acquisition of geo-referenced data has made a huge leap forward in terms of technology. There is a real issue in terms of the data processing optimization, and different research works have been proposed to analyze large geo-referenced datasets based on multi-core approaches. In this paper, different methods based on GPGPU (General-Purpose logic on Graphics Processing Unit) are modelled and compared to parallelize overlapping aggregations of raster sequences. Our methods are tested on a sequence of rasters representing the evolution of temperature over time for the same region. Each raster corresponds to a different data acquisition time period, and each raster geo-referenced cell is associated with a temperature value. This paper proposes optimized methods to calculate the average temperature for the region for all the possible raster subsequences of a determined length, i.e., to calculate overlapping aggregated data summaries. In these aggregations, the same subsets of values are aggregated several times. For example, this type of aggregation can be useful in different environmental data analyses, e.g., to pre-calculate all the average temperatures in a database. The present paper highlights a significant increase in performance and shows that the use of GPGPU parallel processing enabled us to run the aggregations up to more than 50 times faster than the sequential method including data transfer cost and more than 200 times faster without data transfer cost.
Fichier non déposé

Dates et versions

hal-02087943 , version 1 (02-04-2019)

Identifiants

Citer

Driss En-Nejjary, François Pinet, Myoung-Ah Kang. Modeling and Computing Overlapping Aggregation of Large Data Sequences in Geographic Information Systems. International Journal of Information System Modeling and Design, 2019, 10 (1), pp.20-41. ⟨10.4018/IJISMD.2019010102⟩. ⟨hal-02087943⟩
88 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More