Data Catalog overview

KakaoCloud Data Catalog is a metadata management service that centrally manages diverse data assets across an organization. It collects metadata from each data source, stores it in a central repository, and systematically inventories data assets.

It also integrates with Hadoop Eco, enabling more efficient metadata-based data discovery and management across analytics workflows.

Use cases and purpose

When an organization manages large volumes of data spread across many locations, the data becomes difficult to locate. This leads to duplication, inconsistency, and reduced trustworthiness, which in turn hinder data sharing and collaboration between users. In addition, insufficient protection of sensitive data can result in data leaks with severe security consequences.
Data Catalog addresses these challenges through centralized data management. It provides metadata utilization, data search and sharing, security and access control, and data quality improvement, so that organizations can use their data efficiently and securely.

Features

Centralized data search without moving data or searching each source

  • Improves data management efficiency by letting you query data in Data Catalog without moving large-scale data or searching each source individually.

Integrated management of diverse and large-scale metadata

  • Consolidates the various types of metadata available in KakaoCloud and manages them from the console.

Fast data search and query

  • Lets you search and query data in Data Catalog without accessing each storage system or database individually.
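
To make the idea of a central metadata repository concrete, the following is a minimal conceptual sketch in Python. It is not the KakaoCloud Data Catalog API: the `TableMetadata` and `InMemoryCatalog` classes, the source names, and the sample tables are all hypothetical and exist only for illustration. The sketch registers metadata from several sources in one place and then searches that metadata by keyword, without reading from the underlying storage or databases.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass(frozen=True)
class TableMetadata:
    """Metadata describing one table; the data itself stays in its source."""
    source: str              # hypothetical source label, e.g. "object-storage"
    database: str
    table: str
    columns: tuple[str, ...]


@dataclass
class InMemoryCatalog:
    """Toy central repository keyed by (source, database, table)."""
    entries: dict = field(default_factory=dict)

    def register(self, meta: TableMetadata) -> None:
        # Store only the metadata; no data is copied or moved.
        self.entries[(meta.source, meta.database, meta.table)] = meta

    def search(self, keyword: str) -> list[TableMetadata]:
        # Case-insensitive match on table names and column names.
        kw = keyword.lower()
        return [
            m for m in self.entries.values()
            if kw in m.table.lower() or any(kw in c.lower() for c in m.columns)
        ]


# Hypothetical sample metadata collected from three sources.
catalog = InMemoryCatalog()
catalog.register(TableMetadata("object-storage", "sales", "orders",
                               ("order_id", "customer_id", "amount")))
catalog.register(TableMetadata("mysql", "crm", "customers",
                               ("customer_id", "email")))
catalog.register(TableMetadata("object-storage", "logs", "clicks",
                               ("session_id", "url")))

# One search covers every registered source.
matches = catalog.search("customer")
for m in matches:
    print(f"{m.source}: {m.database}.{m.table}")
```

Because the search runs against the registered metadata only, locating the tables related to "customer" does not require connecting to each source in turn. A managed catalog service applies the same principle at larger scale.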

Getting started

Detailed guides on using Data Catalog are available in How-to Guides.
If you are new to KakaoCloud, begin with the Start guide.