Image
Type: VRE + Data Repository

Modama: a User-Friendly Data Lake for Everyday Scientists

About

Modama is a “data lake” used to manage working data at the Ernst Ruska Centre for microscopy and spectroscopy with electrons (ER-C) at Forschungszentrum Jülich.

The ER-C Data Management System can handle large-scale data with sufficient performance and allows access from microscopes, PCs, processing workstations and mobile devices for institute members and collaborators.
The data is unpublished and largely controlled by the users themselves as part of their research activities, replacing e.g. personal storage devices or cloud storage. For that reason the data is only accessible with proper authentication and authorization.

Use Case Status Before Joining EOSC Data Commons

Modama is used to manage all kinds of working data for user facilities such as ER-C, for both members and visitors. It facilitates data exchange between projects and users, offering a wide range of interfaces for access. Together with its simple structure and without any restrictions on data formats or schemas, this enables applications to the widest range of workflows and projects, including new user-created ones. On the other hand, it provides very little guidance on data structures and the data is highly non-uniform.
Modama follows a modular approach where components are coupled with standards-based interfaces. This allows integration of new tools and interfaces without risking or interfering with existing functionality, as well as administering various parts collaboratively by a team within a large facility.
Modama is centred around a file system, as opposed to object storage, since an overwhelming majority of software tools and workflows in electron microscopy are file-based. The data in Modama is mutable by default, but selected parts can be made immutable to preserve e.g. valuable raw data.

Objectives in the Project

  • Make more tools available to modama users
  • Improved search
  • Improve interoperability with other repositories, such as Zenodo
  • Improve support for data formats that are specific to electron microscopy (EM) in the wider ecosystem
  • Enable federation with other repositories and VREs participating in EOSC Data Commons to facilitate collaboration and data exchange.
  • modama users should have access to a larger choice of tools while reducing the administrative effort to deploy them with an improved search and browsing interface that can likely be based on the discovery mechanisms and interfaces developed in EOSC Data Commons.

Integration with EOSC Data Commons Services and Components

Expected Results
  • The search engine and interface of EOSC Data Commons can likely also be used in modama.
  • OpenCloud, in use at CERN, may replace Nextcloud in modama to improve performance and compatibility with local file data.
  • Since data in modama is private and primarily targets local use, EOSC Data Commons services are likely to be deployed locally at ER-C as part of modama, which can possibly be integrated with an identity provider common to EOSC Data Commons.
  • Federation with other repositories can be explored as well to facilitate data exchange, with a mechanism for AAI to keep unpublished data private.

Discover EOSC Data Commons Use Cases

Loading...