Framework for systematization of data science methods

Authors

DOI:

https://doi.org/10.15276/aait.01.2021.7

Keywords:

Data science, framework, data preprocessing, data modeling, data visualization, case study

Abstract

The rapid development of data science has led to the accumulation of many models, methods, and techniques that had been
successfully applied. As the analysis of publications has shown, the systematization of data science methods and techniques is an
urgent task. However, in most cases, the results are relevant to applications in a particular problem domain. The paper develops the

framework for the systematization of data science methods, neither domain-oriented nor task-oriented. The metamodel-method-
technique hierarchy organizes the relationships between existing methods and techniques and reduces the complexity of their under-
standing. The first level of the hierarchy consists of metamodels of data preprocessing, data modeling, and data visualization. The

second level comprises methods corresponded to metamodels. The third level collects the main techniques grouped according to
methods. The authors describe the guiding principles of the framework use. It provides a possibility to define the typical process of
problem-solving with data science methods. A case study is used to verify the framework’s appropriateness. Four cases of applying
data science methods to solve practical problems described in publications are examined. It is shown that the described solutions are
entirely agreed with the proposed framework. The recommended directions for applying the framework are defined. The constraint of
the framework applying is structured or semi-structured data that should be analyzed. Finally, the ways of further research are given.

Downloads

Download data is not yet available.

Author Biographies

Vira V. Liubchenko, Odessa National Polytechnic University, Shevchenko Ave., 1, Odessa, Ukraine, 65044

Dr. Sci. (Eng.) (2014), PhD (Eng) (1997), Professor, Department of System Software

Nataliia O. Komleva, Odessa National Polytechnic University, Shevchenko Ave., 1, Odessa, Ukraine, 65044

PhD (Eng) (2006), Associate Prof., Department of System Software

Svitlana L. Zinovatna, Odessa National Polytechnic University, Shevchenko Ave., 1, Odessa, Ukraine, 65044

PhD (Eng) (2008), Associate Prof., Department of System Software

Katherine O. Pysarenko, Odessa National Polytechnic University, Shevchenko Ave., 1, Odessa, Ukraine, 65044

PhD (Eng) (2018), Associate Prof., Department of System Software

Downloads

Published

2021-03-15

How to Cite

[1]
Liubchenko V.V.., Komleva N.O., Zinovatna S.L.., Pysarenko K.O. “Framework for systematization of data science methods”. Applied Aspects of Information Technology. 2021; Vol. 4, No. 1: 80-90. DOI:https://doi.org/10.15276/aait.01.2021.7.

Most read articles by the same author(s)