DataLab as a Service: Distributed Computing Framework for Multi-Interactive Analysis Environments

During the current data era, data analysis across multiple disciplines has become a critical task for researchers to obtain meaningful insights and solve complex problems that are immeasurable using traditional technologies. Big Data has led to the development of state-of-the-art technologies that h...

Full description

Saved in:
Bibliographic Details
Main Authors: Aida Palacio Hoz, Andres Heredia Canales, Ezequiel Cimadevilla Alvarez, Marta Obregon Ruiz, Alvaro Lopez Garcia
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10858125/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1825207045482086400
author Aida Palacio Hoz
Andres Heredia Canales
Ezequiel Cimadevilla Alvarez
Marta Obregon Ruiz
Alvaro Lopez Garcia
author_facet Aida Palacio Hoz
Andres Heredia Canales
Ezequiel Cimadevilla Alvarez
Marta Obregon Ruiz
Alvaro Lopez Garcia
author_sort Aida Palacio Hoz
collection DOAJ
description During the current data era, data analysis across multiple disciplines has become a critical task for researchers to obtain meaningful insights and solve complex problems that are immeasurable using traditional technologies. Big Data has led to the development of state-of-the-art technologies that have revolutionized the process of experimentation. These innovations span from automating the setup of the infrastructure required for data analysis to providing user-friendly interfaces that simplify coding and result visualization. However, managing and scaling these resources for large-scale data processing remains a challenge. In this work, we introduce a novel framework called Datalab as a Service which integrates cutting-edge and open-source technologies to offer an online platform designed for both resource providers and researchers. The platform enables users to easily and automatically deploy interactive environments tailored for data analysis, thereby streamlining the process of managing computational resources. Through DLaaS, users gain access to cloud-based infrastructures and distributed computing resources, which are essential for performing compute-intensive tasks on massive datasets. The framework ensures scalability, resource management and optimization, and high availability, all within an accessible and user-friendly platform. Furthermore, this paper presents several use cases where researchers have successfully utilized DLaaS resources, demonstrating its practical applications in real-world scenarios.
format Article
id doaj-art-bcb2371c38ef48b4a0e6a985a792705a
institution Kabale University
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-bcb2371c38ef48b4a0e6a985a792705a2025-02-07T00:01:20ZengIEEEIEEE Access2169-35362025-01-0113225662257710.1109/ACCESS.2025.353663710858125DataLab as a Service: Distributed Computing Framework for Multi-Interactive Analysis EnvironmentsAida Palacio Hoz0https://orcid.org/0000-0003-2289-0254Andres Heredia Canales1Ezequiel Cimadevilla Alvarez2https://orcid.org/0000-0002-8437-2068Marta Obregon Ruiz3Alvaro Lopez Garcia4https://orcid.org/0000-0002-0013-4602Instituto de Física de Cantabria (IFCA, CSIC-UC), Santander, SpainInstituto de Física de Cantabria (IFCA, CSIC-UC), Santander, SpainInstituto de Física de Cantabria (IFCA, CSIC-UC), Santander, SpainInstituto de Física de Cantabria (IFCA, CSIC-UC), Santander, SpainInstituto de Física de Cantabria (IFCA, CSIC-UC), Santander, SpainDuring the current data era, data analysis across multiple disciplines has become a critical task for researchers to obtain meaningful insights and solve complex problems that are immeasurable using traditional technologies. Big Data has led to the development of state-of-the-art technologies that have revolutionized the process of experimentation. These innovations span from automating the setup of the infrastructure required for data analysis to providing user-friendly interfaces that simplify coding and result visualization. However, managing and scaling these resources for large-scale data processing remains a challenge. In this work, we introduce a novel framework called Datalab as a Service which integrates cutting-edge and open-source technologies to offer an online platform designed for both resource providers and researchers. The platform enables users to easily and automatically deploy interactive environments tailored for data analysis, thereby streamlining the process of managing computational resources. Through DLaaS, users gain access to cloud-based infrastructures and distributed computing resources, which are essential for performing compute-intensive tasks on massive datasets. The framework ensures scalability, resource management and optimization, and high availability, all within an accessible and user-friendly platform. Furthermore, this paper presents several use cases where researchers have successfully utilized DLaaS resources, demonstrating its practical applications in real-world scenarios.https://ieeexplore.ieee.org/document/10858125/Distributed computingKubernetesinteractive environmentscloud computingJupyter notebooks
spellingShingle Aida Palacio Hoz
Andres Heredia Canales
Ezequiel Cimadevilla Alvarez
Marta Obregon Ruiz
Alvaro Lopez Garcia
DataLab as a Service: Distributed Computing Framework for Multi-Interactive Analysis Environments
IEEE Access
Distributed computing
Kubernetes
interactive environments
cloud computing
Jupyter notebooks
title DataLab as a Service: Distributed Computing Framework for Multi-Interactive Analysis Environments
title_full DataLab as a Service: Distributed Computing Framework for Multi-Interactive Analysis Environments
title_fullStr DataLab as a Service: Distributed Computing Framework for Multi-Interactive Analysis Environments
title_full_unstemmed DataLab as a Service: Distributed Computing Framework for Multi-Interactive Analysis Environments
title_short DataLab as a Service: Distributed Computing Framework for Multi-Interactive Analysis Environments
title_sort datalab as a service distributed computing framework for multi interactive analysis environments
topic Distributed computing
Kubernetes
interactive environments
cloud computing
Jupyter notebooks
url https://ieeexplore.ieee.org/document/10858125/
work_keys_str_mv AT aidapalaciohoz datalabasaservicedistributedcomputingframeworkformultiinteractiveanalysisenvironments
AT andresherediacanales datalabasaservicedistributedcomputingframeworkformultiinteractiveanalysisenvironments
AT ezequielcimadevillaalvarez datalabasaservicedistributedcomputingframeworkformultiinteractiveanalysisenvironments
AT martaobregonruiz datalabasaservicedistributedcomputingframeworkformultiinteractiveanalysisenvironments
AT alvarolopezgarcia datalabasaservicedistributedcomputingframeworkformultiinteractiveanalysisenvironments