Contact

R20/Consultancy

+31 252-514080

info@r20.nl

 

 

Title: Practical Guidelines for Designing Modern Data Architectures

Subtitle: Practical tips, experiences, guidelines, do’s and don’ts

Introduction

Should the new data architecture be based on a data lake, a more traditional data warehouse, a data hub, a data fabric, a data lakehouse, or a data mesh? Or should it be a combination? Should the architecture run in the cloud? Is it important to migrate to an analytical SQL database server, or to deploy data warehouse automation? How can data streaming be included? What are the new requirements with respect to anonymization and other data privacy aspects? In general, what should a data architecture document contain? And where do you start the design process? So many questions must be answered when designing a new data architecture.

New data architectures are needed because organizations want to do more with data. Data must be deployed more widely, more efficiently, and more effectively to improve business and decision-making processes and to increase competitiveness.

Technically, this implies that new forms of data usage need to be deployed, such as data science, real-time dashboarding, embedded BI, edge analytics, and customer-driven BI. Unfortunately, current IT systems, such as the data warehouse and the transactional systems, cannot cope with these new, more intensive, and resource-hungry forms of data usage. The current systems for data delivery are already overstretched. Additionally, because they have become static and inflexible, implementing new reports, changing existing applications, and executing new forms of analytics have become time-consuming exercises. In other words, the current data architecture cannot cope with today's demand to do more with data.

Many organizations consider designing a new data architecture a challenge. This two-day seminar answers the questions architects face when designing a modern data architecture. Guidelines, tips, and design rules are discussed. Concepts and technologies, such as data lakes, data hubs, data fabrics, big data, cloud, data virtualization, Hadoop, NoSQL, data catalogs, data warehouse automation, and anonymization of data, are explained. The seminar is based on practical experiences gained while designing and implementing modern data architectures. The relationship between a modern data architecture and organizational aspects is also addressed, including data quality, data governance, data strategy, and migration to the new architecture.

Subjects

Part 1: Introduction - What is a Data Architecture?

  • Why a new data architecture?
  • What are the key elements of a data architecture?
  • What are the differences between a data architecture and a solutions architecture?
  • Benefits, drawbacks, and shortcomings of well-known reference architectures, such as the classic data warehouse architecture, the data lake, data hub, and data mesh
  • The impact of new technology on data architectures – the holistic approach to designing data architectures
  • 10 steps to design a data architecture

Part 2: Initial Phases of the Project

  • Determine the real business motivations for a new data architecture: ICT cost reduction, competitive improvement, new business models, new laws and regulations, faster reaction to business demands, or more efficient exploitation of available data?
  • Relationship with business strategy and data strategy
  • Determine new requirements and constraints
  • Analyze the existing environment
  • Determine maturity level of the IT organization

Part 3: Overview of Technologies and Products that Influence Data Architectures

  • Data storage: analytical SQL, NoSQL, Hadoop, translytical SQL
  • Data integration: ETL, data virtualization, data replication, data warehouse automation, enterprise service bus, API gateway
  • Data streaming: messaging, Kafka, streaming SQL
  • Data documentation: data glossary, data catalog, metadata management
  • Reporting tools: self-service BI, dashboards, embedded BI
  • Data science tools: programming languages such as R and Python, machine learning automation tools, data science workbenches
  • Are all software products suitable for the cloud?
  • Data security: anonymization, authorization

Part 4: Design Principles for Data Architectures

  • Technology first or data architecture first?
  • The importance of data processing specifications, such as integration, filtering, correction, aggregation, masking, and transformation of data (a short illustrative sketch follows this list)
  • Why migrate to the cloud: unburdening, high performance, scalability, available software?
  • Data minimization: a new principle for designing data architectures that focuses on minimizing data copying, resulting in more data-on-demand architectures
  • The impact of cloud on data architectures
  • Design principles for dealing with data history and data cleansing
  • Modernization of a classic data warehouse architecture
  • Generating a data warehouse architecture with data warehouse automation tools
  • New requirements for transactional systems, such as storing historic data and continuous logging
  • The influence of GDPR: deleting customer data
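
The bullet on data processing specifications above mentions masking and transformation rules. Purely as an illustrative sketch, and not as material from the seminar itself, the Python fragment below shows one way such a rule could be written down: it pseudonymizes a direct identifier and drops a field that is not needed downstream (data minimization). All field names and the salt value are hypothetical.

  import hashlib

  # Hypothetical data processing specification: pseudonymize the e-mail
  # address and drop the birth date before the record is copied downstream.
  SALT = "example-salt"  # assumption: a per-environment secret, not a real value

  def pseudonymize(value: str) -> str:
      """Replace a direct identifier with a stable, non-reversible token."""
      return hashlib.sha256((SALT + value).encode("utf-8")).hexdigest()[:16]

  def apply_specification(record: dict) -> dict:
      """Apply the masking and filtering rules to one customer record."""
      masked = dict(record)
      masked["email"] = pseudonymize(record["email"])
      masked.pop("birth_date", None)  # data minimization: do not copy what is not needed
      return masked

  customer = {"id": 42, "email": "jan@example.com", "birth_date": "1980-05-01"}
  print(apply_specification(customer))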

Part 5: Innovative New Data Architectures, including Data Mesh, Data Fabric, and Data Lakehouse

  • The logical data warehouse architecture as an agile alternative
  • Design rules, do’s and don’ts for a logical data warehouse architecture
  • The changing role of the data lake: from a raw data lake to a business data lake
  • Processing and sharing operational data with a data hub
  • A data lakehouse to support the BI use case and the data science use case with one storage solution
  • Developing a data mesh to avoid a centralized, monolithic database; what do "domain-oriented" and "data product" mean? (an illustrative sketch follows this list)
  • The data fabric for frictionless access to data; dealing with transactional and analytical services
  • A data streaming architecture; when every microsecond counts
  • Operationalization of data science models
  • Merging data architectures into one unified data delivery platform
  • Differences between data lake, data hub, and data warehouse
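
The data mesh bullet above asks what a data product means. As a rough, hypothetical illustration (the class and field names are not taken from the seminar), the Python sketch below shows one possible shape of a domain-owned data product: the data and its descriptive metadata are published together, and consumers read through an output port rather than directly from the underlying store.

  from dataclasses import dataclass, field
  from typing import Callable, Dict, List

  @dataclass
  class DataProduct:
      """Hypothetical minimal data product: domain-owned data plus its metadata."""
      name: str
      owner_domain: str
      description: str
      fetch: Callable[[], List[dict]]  # output port that returns the data set
      schema: Dict[str, str] = field(default_factory=dict)

      def read(self) -> List[dict]:
          """Consumers read through the output port, never from the underlying store."""
          return self.fetch()

  # Example: the sales domain publishes a 'daily-orders' product.
  daily_orders = DataProduct(
      name="daily-orders",
      owner_domain="sales",
      description="Orders aggregated per day, refreshed nightly.",
      fetch=lambda: [{"day": "2024-01-01", "orders": 118}],
      schema={"day": "date", "orders": "integer"},
  )

  print(daily_orders.owner_domain, daily_orders.read())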

Part 6: Designing a New Data Architecture

  • Using track diagrams to design a new data architecture
  • Data processing specifications are key to the architecture; they are the intellectual property of an organization
  • Focus on the data processing specifications first, before data storage components such as data lakes, hubs, and warehouses are introduced
  • Breaking up the development of a data architecture into small steps – think big, act small
  • A metadata architecture is as important as a data architecture
  • Tips for selecting new products and technologies
  • Prepare the organization for the new data architecture

Part 7: Closing Remarks

Learning Objectives

  • What are the pros and cons of new data architectures, such as data hub, data mesh, data fabric, and data lakehouse?
  • What are the steps to take to come up with the right data architecture, from requirements analysis via proofs of concept to the final design?
  • What is the importance of a holistic approach to analyzing technology, organization, and architecture in conjunction?
  • What are real life examples of new data architectures?
  • How can new technology be used optimally within a new data architecture?
  • How do you develop a data architecture?
  • Which components make up a data architecture?
  • What are the use cases, pros and cons of new technologies and how do they influence data architectures?
  • What are the right criteria for a data architecture?

Related Books:

 Data Virtualization: Selected Writings by Rick F. van der Lans

 Data Virtualization for Business Intelligence Systems by Rick F. van der Lans

Related Articles and Blogs:

 Part 1: Drowning in Data Delivery Systems, May 2018

 Part 2: Key Benefits of a Unified Data Delivery Platform, June 2018

 Part 3: How Siloed Data Delivery Systems Were Born, June 2018

 Part 4: Big Data is Not the Biggest Change in IT, June 2018

 Part 5: Requirements for a Unified Data Delivery Platform, June 2018

 Part 6: A Unified Data Delivery Platform - A Summary, June 2018

Related Whitepapers:

 The Fusion of Distributed Data Lakes - Developing Modern Data Lakes; February 2019; sponsored by TIBCO Software

 Unifying Data Delivery Systems Through Data Virtualization; October 2018; sponsored by fraXses

 Architecting the Multi-Purpose Data Lake With Data Virtualization; April 2018; sponsored by Denodo Technologies

 The Next Wave of Analytics - At the Edge; December 2017; sponsored by Edge Intelligence Software

 Developing a Data Delivery Platform with Composite Information Server; June 2010; sponsored by Cisco (Composite Software)

Geared to: Business intelligence specialists; data analysts; data warehouse designers; business analysts; data scientists; technology planners; technical architects; enterprise architects; IT consultants; IT strategists; systems analysts; database developers; database administrators; solutions architects; data architects; IT managers.