BENCHMARKING BIG SPATIAL DATA PROCESSING FRAMEWORKS

Abstract

Processing large volumes of spatial data in distributed systems now plays a crucial role in many areas of life. Big data are often unstructured and require special algorithms for processing. Spatial analysis is one method for analyzing such data, and the source of the data is often a geographic information system. This article presents a benchmark for evaluating frameworks that work with big spatial data, together with the results of applying it to three frameworks: GeoSpark, STARK, and SpatialSpark. The benchmark has two parts, a macrobenchmark and a microbenchmark. The paper also tests topological predicates on various topological data. The comparison is based on the DE-9IM model, which is used to determine the type of topological relationship between geometries, such as intersection or equality. The main difficulty in comparing the frameworks was that not all of them support every operation of the selected model; this shaped the microbenchmark and macrobenchmark scenarios, since not all DE-9IM relations could be compared.
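To make the role of DE-9IM concrete, the following is a minimal sketch of how a topological predicate can be read off a DE-9IM matrix by pattern matching. It is an illustration only, not code from the paper or from any of the benchmarked frameworks: the function names `matches` and `predicates` are hypothetical, the patterns are the standard ones from the OGC Simple Features specification, and the sample matrices are hand-written for two polygons rather than computed from real data.

```python
# Illustrative sketch only: names and sample matrices are assumptions,
# not taken from the paper or from GeoSpark/STARK/SpatialSpark.

# Standard DE-9IM patterns (OGC Simple Features). Each pattern has 9 cells:
# 'T' = intersection exists (dimension 0, 1 or 2), 'F' = no intersection,
# '*' = don't care, a digit = exactly that dimension.
PATTERNS = {
    "equals": ["T*F**FFF*"],
    "disjoint": ["FF*FF****"],
    "touches": ["FT*******", "F**T*****", "F***T****"],
    "within": ["T*F**F***"],
    "contains": ["T*****FF*"],
    "overlaps": ["T*T***T**"],  # form used for two areas (or two points)
}


def matches(matrix: str, pattern: str) -> bool:
    """Return True if a 9-character DE-9IM matrix satisfies a pattern."""
    if len(matrix) != 9 or len(pattern) != 9:
        raise ValueError("DE-9IM matrices and patterns have exactly 9 cells")
    for cell, want in zip(matrix, pattern):
        if want == "*":
            continue
        if want == "T":
            if cell not in "012":
                return False
        elif cell != want:
            return False
    return True


def predicates(matrix: str) -> list:
    """List the named predicates that hold for a DE-9IM matrix."""
    found = {
        name
        for name, pats in PATTERNS.items()
        if any(matches(matrix, p) for p in pats)
    }
    # 'intersects' is defined as the negation of 'disjoint'.
    if "disjoint" not in found:
        found.add("intersects")
    return sorted(found)


if __name__ == "__main__":
    samples = {
        "identical polygons": "2FFF1FFF2",
        "partially overlapping polygons": "212101212",
        "polygons sharing an edge": "FF2F11212",
        "separate polygons": "FF2FF1212",
    }
    for label, matrix in samples.items():
        print(label, matrix, predicates(matrix))
```

Because every predicate reduces to a pattern over the same nine cells, a framework that exposes only some predicates cannot be compared on the rest, which is consistent with the limitation noted in the abstract.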

Authors and Affiliations

Anastasia Garaeva, Airat Kabirov, Olga Tikhonova

  • EP ID EP508782
  • DOI 10.25559/SITITO.14.201801.126-137

How To Cite

Anastasia Garaeva, Airat Kabirov, Olga Tikhonova (2018). BENCHMARKING BIG SPATIAL DATA PROCESSING FRAMEWORKS. Современные информационные технологии и ИТ-образование, 14(1), 126-137. https://www.europub.co.uk/articles/-A-508782