EOSC Data Player combines deployment tools, orchestrators, Virtual Research Environments (VREs), cloud and container frameworks, and data access services across the full computing continuum (Cloud, HTC, HPC).
Its Dispatcher matches each Package for Processing Datasets with the platform best suited to run the corresponding analysis.
By integrating with existing engines and research community platforms, EOSC Data Player interprets the instructions in the Package for Processing Datasets (provided by the EOSC Matchmaker) and executes analyses automatically. The system hides technical complexity from users through an extensible plugin architecture that connects various compute engines and data access platforms.
The Dispatcher is the main entry point. It parses incoming packages, forwards them to the right compute engine, and coordinates data access through the Data Access layer. Depending on user needs, it can interact with existing engines or automatically deploy new ones.
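The routing role described above can be illustrated with a minimal sketch. This is not the EOSC Data Player API: the names `Package`, `EnginePlugin`, `EchoEngine`, and `Dispatcher`, as well as the package fields, are hypothetical and chosen only for illustration. The sketch assumes that a package names the engine it needs and lists dataset references, and that the Data Access layer is available as a simple `resolve` callable.

```python
from dataclasses import dataclass, field
from typing import Callable, Protocol


@dataclass
class Package:
    """Hypothetical stand-in for a Package for Processing Datasets."""
    engine: str                                   # name of the requested engine
    datasets: list[str] = field(default_factory=list)  # dataset references


class EnginePlugin(Protocol):
    """Interface every (hypothetical) compute engine plugin implements."""
    name: str

    def run(self, package: Package, data_paths: list[str]) -> str: ...


class EchoEngine:
    """Toy engine that only reports what it would execute."""
    name = "echo"

    def run(self, package: Package, data_paths: list[str]) -> str:
        return f"echo ran on {len(data_paths)} dataset(s)"


class Dispatcher:
    """Routes a package to a registered engine plugin."""

    def __init__(self, resolve: Callable[[str], str]):
        self._engines: dict[str, EnginePlugin] = {}
        self._resolve = resolve  # assumed hook into the Data Access layer

    def register(self, plugin: EnginePlugin) -> None:
        self._engines[plugin.name] = plugin

    def dispatch(self, package: Package) -> str:
        plugin = self._engines.get(package.engine)
        if plugin is None:
            raise LookupError(f"no engine plugin registered for {package.engine!r}")
        # Resolve every dataset reference before handing over to the engine.
        paths = [self._resolve(ref) for ref in package.datasets]
        return plugin.run(package, paths)


# Usage: an identity resolver stands in for real data access.
dispatcher = Dispatcher(resolve=lambda ref: ref)
dispatcher.register(EchoEngine())
result = dispatcher.dispatch(Package(engine="echo", datasets=["file:///tmp/a.csv"]))
print(result)
```

Keeping engines behind a small interface means a new engine can be added by registering another plugin, without changing the dispatch logic. Automatic deployment of new engines is not modelled here.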
EOSC Data Commons will provide plugins for widely used compute engines and data access tools. The Data Access layer offers libraries that resolve dataset references and make them accessible to the engines. It includes a lightweight file system (FUSE/PyFilesystem) that exposes data as an RO-Crate, enabling easy packaging and use. Data access is plugin-based and extensible, with current and planned support for local files, web URLs, S3 object stores, and other data management solutions as required by different use cases.
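As a rough illustration of plugin-based data access, the sketch below selects a plugin from the URL scheme of a dataset reference. All class names (`Location`, `LocalFileAccess`, `S3Access`, `DataAccessRegistry`) are hypothetical and not part of EOSC Data Player. No data is fetched and no file system is mounted; each plugin only normalises the reference into a location description, under the assumption that references arrive as plain paths or URLs.

```python
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass(frozen=True)
class Location:
    """Hypothetical description of where a dataset lives."""
    kind: str
    target: str


class LocalFileAccess:
    """Handles plain paths and file:// URLs."""
    schemes = ("", "file")

    def resolve(self, ref: str) -> Location:
        return Location("local", urlparse(ref).path)


class S3Access:
    """Handles s3://bucket/key references (no network access in this sketch)."""
    schemes = ("s3",)

    def resolve(self, ref: str) -> Location:
        parsed = urlparse(ref)
        return Location("s3", f"{parsed.netloc}/{parsed.path.lstrip('/')}")


class DataAccessRegistry:
    """Maps URL schemes to data access plugins."""

    def __init__(self):
        self._plugins = {}

    def register(self, plugin) -> None:
        for scheme in plugin.schemes:
            self._plugins[scheme] = plugin

    def resolve(self, ref: str) -> Location:
        scheme = urlparse(ref).scheme
        plugin = self._plugins.get(scheme)
        if plugin is None:
            raise ValueError(f"no data access plugin for scheme {scheme!r}")
        return plugin.resolve(ref)


# Usage: register the plugins, then resolve references of different kinds.
registry = DataAccessRegistry()
registry.register(LocalFileAccess())
registry.register(S3Access())
s3_location = registry.resolve("s3://my-bucket/data/a.csv")
file_location = registry.resolve("file:///tmp/a.csv")
print(s3_location, file_location)
```

Supporting another storage backend then amounts to registering one more plugin with its own schemes, which mirrors the extensibility described above.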