IVML  
  about | r&d | publications | courses | people | links
   

S. Sioutas, Ph. Mylonas, A. Panaretos, P. Gerolymatos, D. Vogiatzis, E. Karavaras, T. Spitieris, A. Kanavos
Survey of machine learning algorithms on Spark over DHT-based Structures
2nd International Workshop on Algorithmic Aspects of Clouc Computing (ALGOCLOUD 2016), Aarhus, Denmark, August 2016
ABSTRACT
Over the past few years there have been proposed many solutions on data storage, data management and data retrieval systems. These solutions can process massive amount of data stored in relational or distributed database management systems. In addition, decision making analytics and predictive computational statistics are some of the most common and well studied fields in computer science. In this paper, we demonstrate the implementation of machine learning algorithms over an open-source distributed database management system that can run in parallel on a cluster. In order to accomplish that we propose a system architecture scheme such as Apache Spark over Apache Cassandra. This paper presents a survey of the most common machine learning algorithms and the results of the experiments performed over a point of sales data set.
22 August , 2016
S. Sioutas, Ph. Mylonas, A. Panaretos, P. Gerolymatos, D. Vogiatzis, E. Karavaras, T. Spitieris, A. Kanavos, "Survey of machine learning algorithms on Spark over DHT-based Structures", 2nd International Workshop on Algorithmic Aspects of Clouc Computing (ALGOCLOUD 2016), Aarhus, Denmark, August 2016
[ save PDF] [ BibTex] [ Print] [ Back]

© 00 The Image, Video and Multimedia Systems Laboratory - v1.12