BENCHMARKING BIG SPATIAL DATA PROCESSING FRAMEWORKS
Journal Title: Современные информационные технологии и ИТ-образование - Year 2018, Vol 14, Issue 1
Abstract
Today, the processing of large amounts of spatial data in distributed systems plays a crucial role in many areas of our life. Large data are often unstructured, and special algorithms are required for its processing. One of the methods for analyzing large data is a spatial analysis. The source of large data in this case is often the geographical information system. In this article, a benchmark is considered to evaluate the frameworks that work with such data. Also, the evaluation results of three frameworks according to developed benchmark are presented: GeoSpark, STARK, SpecialSpark. In the course of this paper, we considered a benchmark of two types: macrobenchmark and microbenchmark. In the paper, testing of topological predicates on various topological data is also considered. The comparison was made using the DE-9IM model. This model is used to determine the types of topological relationships, such as intersection, equality, etc. The main problem of comparing the data frameworks was that not all of them support the operations of the selected model, which influenced the formation of scenarios for the microbenchmark and macrobenchmark, since it was impossible to compare all the DE-9IM items.
Authors and Affiliations
Anastasia Garaeva, Airat Kabirov, Olga Tikhonova
THE SOFTWARE OF THE ADVANCED MIL-STD-1553B MULTIPLEX DATA BUS TESTER AND INTERFACE MODULE: FEATURES AND IMPLEMENTATION DETAILS
The UEM-MK is a new module of universal device and parametric tester of multiplex data bus, which meets all requirements for testing equipment for use in validation of devices against requirements of GOST R 52070-2003 (t...
THE FORMALIZED MATHEMATICAL CONTENT COGNITIVE MANAGEMENT
Problem of the formalized mathematical content management for any given subject domain is considered. The content represented by domain ontology as the unified variety of elementary knowledge classes and relations for su...
VECTORIZATION OF SMALL-SIZED SPECIAL-TYPE MATRICES MULTIPLICATION USING INSTRUCTIONS AVX-512
Modern software packages for supercomputer calculations require a large amount of computing resources. At the same time there are new hardware architectures that open up new opportunities for program code optimizing. The...
CONCEPT OF THE IMPROVED ARCHITECTURE OF VIRTUAL COMPUTER LABORATORY FOR EFFECTIVE TRAINING OF SPECIALISTS SKILLED IN DISTRIBUTED INFORMATION SYSTEMS AND DESIGN TOOLS
The article discusses the advanced architecture of the virtual computer laboratory, which is used in the innovative practice of training specialists in distributed information systems, as well as software developers skil...
PARTICULAR QUALITIES OF THE DEVELOPMENT AND APPLICATION OF FDM-TECHNOLOGY FOR CREATING AND PROTOTYPING 3D-OBJECTS
The article gives comparative analysis of additive technologies used in prototyping 3D-objects, and features of the FDM-technology are characterized. The parameters of the 3D printer using FDM-technology are described, a...