Please use this identifier to cite or link to this resource:
http://dx.doi.org/10.25673/86229
Title: | Dissecting self-describing data formats to enable advanced querying of file metadata |
Author(s): | Duwe, Kira; Kuhn, Michael |
Publication date: | 2021 |
Type: | Conference object |
Language: | English |
URN: | urn:nbn:de:gbv:ma9:1-1981185920-881815 |
Keywords: | Information systems; Hierarchical storage management; Computer systems organization; Client-server architectures; Distributed storage |
Abstract: | In times of continuously growing data sizes, performing insightful analysis is increasingly difficult. I/O libraries such as NetCDF and ADIOS2 offer options to manage additional metadata to make data retrieval more efficient. However, queries on this metadata are difficult because it is currently stored inside the corresponding self-describing data formats. By replacing the file system underneath with the storage framework JULEA, we can use dedicated backends for key-value and object stores, as well as databases. Splitting the BP file content into file metadata and file data enables novel and highly efficient data management techniques without creating redundancy. We have kept our approach transparent to the application layer by implementing a custom ADIOS2 engine. Moreover, our data analysis interface allows speeding up metadata queries by a factor of up to 60,000 in comparison to the ADIOS2 API and data formats. |
URI: | https://opendata.uni-halle.de//handle/1981185920/88181 http://dx.doi.org/10.25673/86229 |
Open access: | Open-access publication |
License: | (CC BY 4.0) Creative Commons Attribution 4.0 International |
Sponsor/Funder: | Transformative agreement |
Publisher: | Association for Computing Machinery |
Place of publication: | New York |
Original publication: | 10.1145/3456727.3463778 |
Appears in collections: | Fakultät für Informatik (OA) |
Files in this resource:
File | Description | Size | Format | |
---|---|---|---|---|
Duwe et al._Dissecting self-describing_2021.pdf | Secondary publication | 1.01 MB | Adobe PDF | View/Open |