Skip to main content

Key concepts

Data source

Data Query allows you to import data sources from KakaoCloud’s Data Catalog and MySQL for analysis and utilization.

  • All catalog lists created by users in Data Catalog are automatically loaded without creating separate connections.
  • For MySQL data, you can import data from the desired database through the data source creation procedure.

Query result storage location

All executed query result data is automatically saved in the Object Storage bucket set by the user.

  • Users can analyze and utilize query result data stored in the Object Storage bucket.
  • Query result storage paths can be set individually per user within a project to utilize query result data.
  • Query execution result data is stored as .metadata and .csv files.

Database

You can import and select all databases for Data Catalog and MySQL data sources.

  • All database information connected to the selected catalog is automatically loaded.
  • You can load the list of all databases connected to the MySQL instance group.

Table

You can load the list of all tables in the selected database from Data Catalog and MySQL.

  • You can check the list of all tables in the selected database.
  • You can check detailed column information for each table.
  • Table data preview helps with query writing.
  • You can generate the table's DDL to understand the structure in advance and assist with query writing.

Query editor

Users can load data sources, enter queries, and execute them on the query editor screen.

  • The query editor allows running queries and analyzing results for all connected data sources.
  • You can preview query results, and all query result data can be checked in the configured Object Storage bucket.

Execution flow chart

The execution flow chart visually shows the process of analyzing data with the executed query. Users can easily use it for execution plans and result analysis.

  • Real-time status monitoring: You can check the progress and performance metrics of each stage of query execution in real time.
  • Parallel processing visualization: Intuitively shows data distribution and parallel task structure to provide efficient data processing.
  • Detailed execution information: Provides detailed information such as data throughput and query execution time at each stage at a glance.

Query execution lifecycle

Query execution has the lifecycle shown below, allowing you to check the status of the query process.

Image
Query lifecycle

Query execution status

StatusDescription
WaitingThe task is in the queue, waiting for query execution
RunningThe query is currently running
SuccessThe query completed successfully
- When query result reuse is used, shown as Success - Query result reused
CanceledThe user stopped the query
FailedThe query execution did not complete due to an error

Data types

Supported data types

When writing queries, refer to the supported data types for each Data Catalog and MySQL source type.
Data types used in queries are automatically mapped according to the data source type.

caution

Unsupported types are not listed in the table.

Data Catalog types
HiveData Query
VARCHARBOOLEAN, TINYINT, SMALLINT, INTEGER, BIGINT, REAL, DOUBLE, TIMESTAMP, DATE, CHAR, VARCHAR
VARBINARYVARCHAR
TINYINTVARCHAR, SMALLINT, INTEGER, BIGINT, DOUBLE, DECIMAL
TIMESTAMPVARCHAR, DATE
SMALLINTVARCHAR, INTEGER, BIGINT, DOUBLE, DECIMAL
REALDOUBLE, DECIMAL
INTEGERVARCHAR, BIGINT, DOUBLE, DECIMAL
DOUBLEFLOAT, DECIMAL
DECIMALDOUBLE, REAL, VARCHAR, TINYINT, SMALLINT, INTEGER, BIGINT, DECIMAL
DATEVARCHAR
CHARVARCHAR, CHAR
BOOLEANVARCHAR
BIGINTVARCHAR, DOUBLE, DECIMAL
MySQL types
MySQLData Query
BITBOOLEAN
BOOLEANTINYINT
TINYINTTINYINT
TINYINT UNSIGNEDSMALLINT
SMALLINTVARCHAR, SMALLINT
SMALLINT UNSIGNEDINTEGER
INTEGERINTEGER
INTEGER UNSIGNEDBIGINT
BIGINTBIGINT
BIGINT UNSIGNEDDECIMAL(20, 0)
DOUBLE PRECISIONDECIMAL(20, 0)
FLOATREAL
REALREAL
DECIMAL(p, s)DECIMAL(p, s)
CHAR(n)CHAR(n)
VARCHAR(n)VARCHAR(n)
TINYTEXTVARCHAR(255)
TEXTVARCHAR(65535)
MEDIUMTEXTVARCHAR(16777215)
LONGTEXTVARCHAR
ENUM(n)VARCHAR(n)
BINARY, VARBINARY, TINYBLOB, BLOB, MEDIUMBLOB, LONGBLOBVARBINARY
JSONJSON
DATEDATE
TIME(n)TIME(n)
DATETIME(n)TIMESTAMP(n)
TIMESTAMP(n)TIMESTAMP(n) WITH TIME ZONE

Event collection

You can check automatically collected events on Data Query user activity using the Cloud Trail service.