Evaluating New Approaches of Big Data Analytics Frameworks
N. Spangenberg, M. Roth, and B. Franczyk. Business Information Systems, pages 28--37. Cham, Springer International Publishing, (2015)
Abstract
The big data topic will be one of the leading growth markets in information technology in the coming years. One problem in this area is the efficient computation of huge data volumes, especially for complex algorithms in data mining and machine learning tasks. This paper discusses new processing frameworks for big and smart data in distributed environments and presents a benchmark between two frameworks - Apache Flink and Apache Spark - based on a mixed workload with algorithms from different analytic areas with different real-world datasets.
%0 Conference Paper
%1 10.1007/978-3-319-19027-3_3
%A Spangenberg, Norman
%A Roth, Martin
%A Franczyk, Bogdan
%B Business Information Systems
%C Cham
%D 2015
%E Abramowicz, Witold
%I Springer International Publishing
%K imported
%P 28--37
%T Evaluating New Approaches of Big Data Analytics Frameworks
%X The big data topic will be one of the leading growth markets in information technology in the coming years. One problem in this area is the efficient computation of huge data volumes, especially for complex algorithms in data mining and machine learning tasks. This paper discusses new processing frameworks for big and smart data in distributed environments and presents a benchmark between two frameworks - Apache Flink and Apache Spark - based on a mixed workload with algorithms from different analytic areas with different real-world datasets.
%@ 978-3-319-19027-3
@inproceedings{10.1007/978-3-319-19027-3_3,
abstract = {The big data topic will be one of the leading growth markets in information technology in the coming years. One problem in this area is the efficient computation of huge data volumes, especially for complex algorithms in data mining and machine learning tasks. This paper discusses new processing frameworks for big and smart data in distributed environments and presents a benchmark between two frameworks - Apache Flink and Apache Spark - based on a mixed workload with algorithms from different analytic areas with different real-world datasets.},
added-at = {2024-10-02T10:38:17.000+0200},
address = {Cham},
author = {Spangenberg, Norman and Roth, Martin and Franczyk, Bogdan},
biburl = {https://puma.scadsai.uni-leipzig.de/bibtex/2f2265dcc76a251f4f913b15cb4890343/scadsfct},
booktitle = {Business Information Systems},
editor = {Abramowicz, Witold},
interhash = {83392423644bf98e72b0f6f2fd149a36},
intrahash = {f2265dcc76a251f4f913b15cb4890343},
isbn = {978-3-319-19027-3},
keywords = {imported},
pages = {28--37},
publisher = {Springer International Publishing},
timestamp = {2024-10-02T10:38:17.000+0200},
title = {Evaluating New Approaches of Big Data Analytics Frameworks},
year = 2015
}