Key concepts
Data source
Data Query allows you to import data sources from KakaoCloud’s Data Catalog and MySQL for analysis and utilization.
- All catalog lists created by users in Data Catalog are automatically loaded without creating separate connections.
- For MySQL data, you can import data from the desired database through the data source creation procedure.
Query result storage location
All executed query result data is automatically saved in the Object Storage bucket set by the user.
- Users can analyze and utilize query result data stored in the Object Storage bucket.
- Query result storage paths can be set individually per user within a project to utilize query result data.
- Query execution result data is stored as
.metadata
and.csv
files.
Database
You can import and select all databases for Data Catalog and MySQL data sources.
- All database information connected to the selected catalog is automatically loaded.
- You can load the list of all databases connected to the MySQL instance group.
Table
You can load the list of all tables in the selected database from Data Catalog and MySQL.
- You can check the list of all tables in the selected database.
- You can check detailed column information for each table.
- Table data preview helps with query writing.
- You can generate the table's DDL to understand the structure in advance and assist with query writing.
Query editor
Users can load data sources, enter queries, and execute them on the query editor screen.
- The query editor allows running queries and analyzing results for all connected data sources.
- You can preview query results, and all query result data can be checked in the configured Object Storage bucket.
Execution flow chart
The execution flow chart visually shows the process of analyzing data with the executed query. Users can easily use it for execution plans and result analysis.
- Real-time status monitoring: You can check the progress and performance metrics of each stage of query execution in real time.
- Parallel processing visualization: Intuitively shows data distribution and parallel task structure to provide efficient data processing.
- Detailed execution information: Provides detailed information such as data throughput and query execution time at each stage at a glance.
Query execution lifecycle
Query execution has the lifecycle shown below, allowing you to check the status of the query process.
Query lifecycle
Query execution status
Status | Description |
---|---|
Waiting | The task is in the queue, waiting for query execution |
Running | The query is currently running |
Success | The query completed successfully - When query result reuse is used, shown as Success - Query result reused |
Canceled | The user stopped the query |
Failed | The query execution did not complete due to an error |
Data types
Supported data types
When writing queries, refer to the supported data types for each Data Catalog
and MySQL
source type.
Data types used in queries are automatically mapped according to the data source type.
Unsupported types are not listed in the table.
Data Catalog types
Hive | Data Query |
---|---|
VARCHAR | BOOLEAN, TINYINT, SMALLINT, INTEGER, BIGINT, REAL, DOUBLE, TIMESTAMP, DATE, CHAR, VARCHAR |
VARBINARY | VARCHAR |
TINYINT | VARCHAR, SMALLINT, INTEGER, BIGINT, DOUBLE, DECIMAL |
TIMESTAMP | VARCHAR, DATE |
SMALLINT | VARCHAR, INTEGER, BIGINT, DOUBLE, DECIMAL |
REAL | DOUBLE, DECIMAL |
INTEGER | VARCHAR, BIGINT, DOUBLE, DECIMAL |
DOUBLE | FLOAT, DECIMAL |
DECIMAL | DOUBLE, REAL, VARCHAR, TINYINT, SMALLINT, INTEGER, BIGINT, DECIMAL |
DATE | VARCHAR |
CHAR | VARCHAR, CHAR |
BOOLEAN | VARCHAR |
BIGINT | VARCHAR, DOUBLE, DECIMAL |
MySQL types
MySQL | Data Query |
---|---|
BIT | BOOLEAN |
BOOLEAN | TINYINT |
TINYINT | TINYINT |
TINYINT UNSIGNED | SMALLINT |
SMALLINT | VARCHAR, SMALLINT |
SMALLINT UNSIGNED | INTEGER |
INTEGER | INTEGER |
INTEGER UNSIGNED | BIGINT |
BIGINT | BIGINT |
BIGINT UNSIGNED | DECIMAL(20, 0) |
DOUBLE PRECISION | DECIMAL(20, 0) |
FLOAT | REAL |
REAL | REAL |
DECIMAL(p, s) | DECIMAL(p, s) |
CHAR(n) | CHAR(n) |
VARCHAR(n) | VARCHAR(n) |
TINYTEXT | VARCHAR(255) |
TEXT | VARCHAR(65535) |
MEDIUMTEXT | VARCHAR(16777215) |
LONGTEXT | VARCHAR |
ENUM(n) | VARCHAR(n) |
BINARY, VARBINARY, TINYBLOB, BLOB, MEDIUMBLOB, LONGBLOB | VARBINARY |
JSON | JSON |
DATE | DATE |
TIME(n) | TIME(n) |
DATETIME(n) | TIMESTAMP(n) |
TIMESTAMP(n) | TIMESTAMP(n) WITH TIME ZONE |
Event collection
You can check automatically collected events on Data Query user activity using the Cloud Trail service.
- Check Data Query events (Data Query events are collected per project.)
- For more details on Cloud Trail, refer to KakaoCloud’s Cloud Trail documentation.