Data Lakes and Data Mesh Architectures: Enabling Scalable and Decentralized Data Governance

Authors

  • David Jebasingh S. Data Analyst, LatentView, Chennai, India. Author

DOI:

https://doi.org/10.63282/3050-9246.IJETCSIT-V5I4P103

Keywords:

Data mesh, Data lake, Data architecture, Data governance, Decentralization, Scalability

Abstract

Data mesh and data lake are distinct approaches to data management within an organization. A data lake is a centralized repository where vast amounts of structured and unstructured data are stored in their raw format. Data mesh, however, is a decentralized architecture where data ownership is distributed across domain teams. The data mesh approach treats data as a product, managed by individual domain teams rather than a centralized data team, fostering data accountability and collaboration. Data lakes offer scalability but can lead to bottlenecks in data management and governance, while data mesh focuses on autonomy and governance across different teams. Data mesh dismantles traditional data silos by promoting collaboration and data sharing among domain teams, which facilitates cross-functional insights and cohesive decision-making. In contrast to data lakes, data mesh architecture facilitates self-service data usage and requires stricter data standards, including alignment on formatting, metadata fields, discoverability, and governance. Data governance in a data lake may face challenges such as excessive security and data accessibility. Data mesh improves scalability, data quality, and governance by decentralizing data management. It enables organizations to scale efficiently as data volumes grow by reducing the burden on a central team. A data lake can serve as the foundational storage layer in a data mesh ecosystem, providing scalable storage while enabling decentralized data ownership and governance

Downloads

Download data is not yet available.

References

[1] Atlan. Data mesh vs. data lake: Understanding the differences. https://atlan.com/data-mesh-vs-data-lake/

[2] AWS. What is a data mesh? https://aws.amazon.com/what-is/data-mesh/

[3] ChaosSearch. Data lake vs. data mesh: Key differences and use cases. https://www.chaossearch.io/blog/data-lake-vs-datamesh

[4] Collibra. Data mesh: Don't drown in your data lake. https://www.collibra.com/resources/data-mesh-dont-drown-in-your-datalake

[5] Dataversity. (2023). Data architecture trends in 2023. https://www.dataversity.net/data-architecture-trends-in-2023/

[6] FirstEigen. Data mesh vs. data lake: Key differences and benefits. https://firsteigen.com/blog/data-mesh-vs-data-lake/

[7] GeeksforGeeks. What is a data lake? https://www.geeksforgeeks.org/what-is-data-lake/

[8] Don Kaye (2024). 2024 predictions for data management, data discovery and cybersecurity. GroundLabs. https://www.groundlabs.com/blog/2024-predictions-data-management-cybersecurity/

[9] IBM. AI data management: The future of intelligent data processing. https://www.ibm.com/think/topics/ai-data-management

[10] Medium. What is a data mesh? https://medium.com/@cadarsh88/what-is-a-data-mesh-5cbad56c8621

[11] Monte Carlo Data. Data mesh vs. data lake: What’s the difference? https://www.montecarlodata.com/blog-data-mesh-vs-datalake-whats-the-difference/

[12] Starburst. Data mesh vs. data lake: Choosing the right approach. https://www.starburst.io/blog/data-mesh-vs-data-lake/

[13] XenonStack. Data warehouse vs. data lake vs. data mesh: A comparative study. https://www.xenonstack.com/blog/datawarehouse-vs-data-lake-vs-data-mesh

Published

2024-10-15

Issue

Section

Articles

How to Cite

1.
S. DJ. Data Lakes and Data Mesh Architectures: Enabling Scalable and Decentralized Data Governance. IJETCSIT [Internet]. 2024 Oct. 15 [cited 2025 Oct. 3];5(4):16-22. Available from: https://www.ijetcsit.org/index.php/ijetcsit/article/view/89

Similar Articles

51-60 of 240

You may also start an advanced similarity search for this article.