Image
Type: Virtual Research Environment (VRE)

ScienceMesh

Image
About

CERN runs CERNBox, a synchronisation and file-sharing platform integrated with the laboratory’s computing infrastructure and specialised applications, such as JupyterLab, HTC (HTCondor), and viewers/editors for ROOT and markdown formats.
As part of ScienceMesh, CERNBox enables seamless data sharing and collaboration across diverse platforms without requiring a common authentication infrastructure.
CERN has significantly contributed to ScienceMesh’s technology and standards, including
Open Cloud Mesh (OCM) and Cloud Storage Synchronisation and Sharing (CS3) APIs, and led the CS3Mesh4EOSC project.
By integrating ScienceMesh with EOSC Data Commons services, CERNBox will allow users to discover and directly import datasets via the Matchmaker, facilitated by the Data Player.
Processing can be done on integrated HPC resources, JupyterLab, and HTC (HTCondor). Users can easily share preliminary analyses, collaborate via Jupyter notebooks, and finalise results, which are then deposited and discoverable through CERNBox or project-specific repositories.

Use Case Status Before Joining EOSC Data Commons

Processing happens through integration with other systems and current examples include JupyterLab, HTC (HTCondor batch), and various viewers and editors for particular fi letypes, e.g. markdown and ROOT.

Objectives in the Project

  • Integration of HPC resources into ScienceMesh via CERNBox support for CephFS; making data directly available to processing resources (JupyterLab and HPC).
  • ScienceMesh integration into EOSC Data Commons services:
    • Integrated with discovery mechanisms.
    • Ingestion and processing of search results.
    • Publishing data to digital repositories.
  • Integration with the EOSC EU Node. The EOSC EU Node and the upcoming EOSC Federation are providing the same storage federation technology, based on Open Cloud Mesh (OCM). OCM v1.2 support is implemented in CERNBox, and its Jupyter integration is extended to fulfil the Data Commons workflows.

 

Integration with EOSC Data Commons Services and Components

Expected Results
  • Integration of ScienceMesh into the EOSC Matchmaker through a “Run on ScienceMesh” button, which will enable the processing analysis and datasets on ScienceMesh.
  • Integration of ScienceMesh into the EOSC Data Player will happen by the development of a ScienceMeshVRE class in the Dispatcher, which will dispatch packages for processing to a ScienceMesh node through OCM shares embedding a Package for Processing Datasets. ScienceMesh will have some interaction with the Package for Processing Datasets through format specification and feedback.

Discover EOSC Data Commons Use Cases

Loading...