Repository | Book | Chapter

196905

(2018) Semantic applications, Dordrecht, Springer.

Variety management for big data

Wolfgang Mayer, Georg Grossmann, Matt Selway, Jan Stanek, Markus Stumptner

pp. 47-62

Of the core challenges originally associated with Big Data, namely Volume, Velocity, and Variety, the Variety aspect is the one that is least addressed by the standard analytics architectures. In this chapter, we analyze types and sources of variety and describe data- and metadata management principles for organizing data lakes. We discuss how semantic metadata can help describe and manage variety in structure, provenance, visibility and permitted use. Moreover, ontologies and metadata catalogs can aid discovery, navigation, exploration, and interpretation of heterogeneous data lakes, and can simplify interpretation, lift data quality, and simplify integration of multiple data sets. We present an application of these principles in a data architecture for the Law Enforcement domain in Australia.

Publication details

DOI: 10.1007/978-3-662-55433-3_4

Full citation:

Mayer, W. , Grossmann, G. , Selway, M. , Stanek, J. , Stumptner, M. (2018)., Variety management for big data, in T. Hoppe, B. Humm & A. Reibold (eds.), Semantic applications, Dordrecht, Springer, pp. 47-62.

This document is unfortunately not available for download at the moment.

Cannot connect to DataBase