Author: Cook, N.S.
Paper Title Page
MOPHA160 Enabling Data Analytics as a Service for Large Scale Facilities 614
  • K. Woods, R.J. Clegg, N.S. Cook, R. Millward
    Tessella, Abingdon, United Kingdom
  • F. Barnsely, C. Jones
    STFC/RAL, Chilton, Didcot, Oxon, United Kingdom
  Funding: UK Research and Innovation - Science & Technology Facilities Council (UK SBS IT18160)
The Ada Lovelace Centre (ALC) at STFC is an integrated, cross-disciplinary data intensive science centre, for better exploitation of research carried out at large scale UK Facilities including the Diamond Light Source, the ISIS Neutron and Muon Facility, the Central Laser Facility and the Culham Centre for Fusion Energy. ALC will provide on-demand, data analysis, interpretation and analytics services to worldwide users of these research facilities. Using open-source components, ALC and Tessella have together created a software infrastructure to support the delivery of that vision. The infrastructure comprises a Virtual Machine Manager, for managing pools of VMs across distributed compute clusters; components for automated provisioning of data analytics environments across heterogeneous clouds; a Data Movement System, to efficiently transfer large datasets; a Kubernetes cluster to manage on demand submission of Spark jobs. In this paper, we discuss the challenges of creating an infrastructure to meet the differing analytics needs of multiple facilities and report the architecture and design of the infrastructure that enables Data Analytics as a Service.
poster icon Poster MOPHA160 [1.665 MB]  
DOI • reference for this paper ※  
About • paper received ※ 30 September 2019       paper accepted ※ 10 October 2019       issue date ※ 30 August 2020  
Export • reference for this paper using ※ BibTeX, ※ LaTeX, ※ Text/Word, ※ RIS, ※ EndNote (xml)