Key concepts

Data source

Data Query allows you to import data sources from KakaoCloud’s Data Catalog and MySQL for analysis and utilization.

All catalog lists created by users in Data Catalog are automatically loaded without creating separate connections.
For MySQL data, you can import data from the desired database through the data source creation procedure.

Query result storage location

All executed query result data is automatically saved in the Object Storage bucket set by the user.

Users can analyze and utilize query result data stored in the Object Storage bucket.
Query result storage paths can be set individually per user within a project to utilize query result data.
Query execution result data is stored as .metadata and .csv files.

Database

You can import and select all databases for Data Catalog and MySQL data sources.

All database information connected to the selected catalog is automatically loaded.
You can load the list of all databases connected to the MySQL instance group.

Table

You can load the list of all tables in the selected database from Data Catalog and MySQL.

You can check the list of all tables in the selected database.
You can check detailed column information for each table.
Table data preview helps with query writing.
You can generate the table's DDL to understand the structure in advance and assist with query writing.

Query editor

Users can load data sources, enter queries, and execute them on the query editor screen.

The query editor allows running queries and analyzing results for all connected data sources.
You can preview query results, and all query result data can be checked in the configured Object Storage bucket.

Execution flow chart

The execution flow chart visually shows the process of analyzing data with the executed query. Users can easily use it for execution plans and result analysis.

Real-time status monitoring: You can check the progress and performance metrics of each stage of query execution in real time.
Parallel processing visualization: Intuitively shows data distribution and parallel task structure to provide efficient data processing.
Detailed execution information: Provides detailed information such as data throughput and query execution time at each stage at a glance.

Query execution lifecycle

Query execution has the lifecycle shown below, allowing you to check the status of the query process.

Query lifecycle

Query execution status

Status	Description
Waiting	The task is in the queue, waiting for query execution
Running	The query is currently running
Success	The query completed successfully - When query result reuse is used, shown as `Success - Query result reused`
Canceled	The user stopped the query
Failed	The query execution did not complete due to an error

Data types

Supported data types

When writing queries, refer to the supported data types for each Data Catalog and MySQL source type.
Data types used in queries are automatically mapped according to the data source type.

caution

Unsupported types are not listed in the table.

Data Catalog types

Hive	Data Query
VARCHAR	BOOLEAN, TINYINT, SMALLINT, INTEGER, BIGINT, REAL, DOUBLE, TIMESTAMP, DATE, CHAR, VARCHAR
VARBINARY	VARCHAR
TINYINT	VARCHAR, SMALLINT, INTEGER, BIGINT, DOUBLE, DECIMAL
TIMESTAMP	VARCHAR, DATE
SMALLINT	VARCHAR, INTEGER, BIGINT, DOUBLE, DECIMAL
REAL	DOUBLE, DECIMAL
INTEGER	VARCHAR, BIGINT, DOUBLE, DECIMAL
DOUBLE	FLOAT, DECIMAL
DECIMAL	DOUBLE, REAL, VARCHAR, TINYINT, SMALLINT, INTEGER, BIGINT, DECIMAL
DATE	VARCHAR
CHAR	VARCHAR, CHAR
BOOLEAN	VARCHAR
BIGINT	VARCHAR, DOUBLE, DECIMAL

MySQL types

MySQL	Data Query
BIT	BOOLEAN
BOOLEAN	TINYINT
TINYINT	TINYINT
TINYINT UNSIGNED	SMALLINT
SMALLINT	VARCHAR, SMALLINT
SMALLINT UNSIGNED	INTEGER
INTEGER	INTEGER
INTEGER UNSIGNED	BIGINT
BIGINT	BIGINT
BIGINT UNSIGNED	DECIMAL(20, 0)
DOUBLE PRECISION	DECIMAL(20, 0)
FLOAT	REAL
REAL	REAL
DECIMAL(p, s)	DECIMAL(p, s)
CHAR(n)	CHAR(n)
VARCHAR(n)	VARCHAR(n)
TINYTEXT	VARCHAR(255)
TEXT	VARCHAR(65535)
MEDIUMTEXT	VARCHAR(16777215)
LONGTEXT	VARCHAR
ENUM(n)	VARCHAR(n)
BINARY, VARBINARY, TINYBLOB, BLOB, MEDIUMBLOB, LONGBLOB	VARBINARY
JSON	JSON
DATE	DATE
TIME(n)	TIME(n)
DATETIME(n)	TIMESTAMP(n)
TIMESTAMP(n)	TIMESTAMP(n) WITH TIME ZONE

Event collection

You can check automatically collected events on Data Query user activity using the Cloud Trail service.

Check Data Query events (Data Query events are collected per project.)
For more details on Cloud Trail, refer to KakaoCloud’s Cloud Trail documentation.

Data source​

Query result storage location​

Database​

Table​

Query editor​

Execution flow chart​

Query execution lifecycle​

Query execution status​

Data types​

Supported data types​

Data Catalog types​

MySQL types​

Event collection​