Optimization of ETL Process in Data Warehouse Through a Combination of Parallelization and Shared Cache Memory

Journal Title: Engineering, Technology & Applied Science Research - Year 2016, Vol 6, Issue 6

Abstract

Extraction, Transformation and Loading (ETL) is introduced as one of the notable subjects in optimization, management, improvement and acceleration of processes and operations in data bases and data warehouses. The creation of ETL processes is potentially one of the greatest tasks of data warehouses and so its production is a time-consuming and complicated procedure. Without optimization of these processes, the implementation of projects in data warehouses area is costly, complicated and time-consuming. The present paper used the combination of parallelization methods and shared cache memory in systems distributed on the basis of data warehouse. According to the conducted assessment, the proposed method exhibited 7.1% speed improvement to kattle optimization instrument and 7.9% to talend instrument in terms of implementation time of the ETL process. Therefore, parallelization could notably improve the ETL process. It eventually caused the management and integration processes of big data to be implemented in a simple way and with acceptable speed.

Authors and Affiliations

M. Faridi Masouleh, M. A. Afshar Kazemi, M. Alborzi, A. Toloie Eshlaghy

Keywords

Related Articles

Comparing the Thixotropic and Lightly Solidified Hardening Behavior of a Dredged Marine Clay

When a soil is disturbed upon remolding, it may lose part or all of its strength. As time passes, the structural arrangement of the soil particles would be restored to a stable form and the soil would regain hardness und...

Evaluating the Effects of Dam Construction on the Morphological Changes of Downstream Meandering Rivers (Case Study: Karkheh River)

The establishment of stability in rivers is dependent on a variety of factors, and yet the established stability can be interrupted at any moment or time. One factor that can strongly disrupt the stability of rivers is t...

VALUE ANALYSIS: Going into a Further Dimension

Value Analysis (VA), as it was originally conceived, was defined and applied as a cost cutting tool, in order to make products more competitive. That short scope was early identified as limiting further developments and...

Evaluation of window size in classification of epileptic short-term EEG signals using a Brain Computer Interface software

The complexity of epilepsy created a fertile ground for further research in automated methods, attempting to help the epileptologists’ task. Over the past years, great breakthroughs have emerged in computer-aided analysi...

Modeling and Analysis of a Multilevel Parallel Hybrid Active Power Filter

This paper introduces a new control approach for the Multilevel Parallel Hybrid Active Power Filter (M-PHAPF) which can compensate harmonics and variable reactive power demand of loads by controlling the DC link voltage...

Download PDF file
  • EP ID EP110817
  • DOI -
  • Views 246
  • Downloads 0

How To Cite

M. Faridi Masouleh, M. A. Afshar Kazemi, M. Alborzi, A. Toloie Eshlaghy (2016). Optimization of ETL Process in Data Warehouse Through a Combination of Parallelization and Shared Cache Memory. Engineering, Technology & Applied Science Research, 6(6), -. https://www.europub.co.uk/articles/-A-110817