Skip to main content

Key concepts

Data source

Data Query allows you to import and analyze data sources from KakaoCloud services such as Data Catalog, MySQL, and PostgreSQL.

  • All catalogs created by the user in Data Catalog are automatically loaded without requiring a separate connection.
  • For MySQL and PostgreSQL, you can retrieve data from your desired database by creating a data source.

Query result storage location

All executed query result data is automatically saved in the Object Storage bucket set by the user.

  • Users can analyze and utilize query result data stored in the Object Storage bucket.
  • Query result storage paths can be set individually per user within a project to utilize query result data.
  • Query execution result data is stored as .metadata and .csv files.

Database

You can load and select all databases from Data Catalog, MySQL, and PostgreSQL data sources.

  • Automatically loads all database information linked to the selected catalog.
  • Loads all databases associated with the MySQL instance group.
  • Loads all databases associated with the PostgreSQL instance group.

Table

Loads all tables from the selected database in Data Catalog, MySQL, or PostgreSQL.

  • You can check the list of all tables in the selected database.
  • You can check detailed column information for each table.
  • Table data preview helps with query writing.
  • You can generate the table's DDL to understand the structure in advance and assist with query writing.

Query editor

Users can load data sources, enter queries, and execute them on the query editor screen.

  • The query editor allows running queries and analyzing results for all connected data sources.
  • You can preview query results, and all query result data can be checked in the configured Object Storage bucket.

Execution flow chart

The execution flow chart visually shows the process of analyzing data with the executed query. Users can easily use it for execution plans and result analysis.

  • Real-time status monitoring: You can check the progress and performance metrics of each stage of query execution in real time.
  • Parallel processing visualization: Intuitively shows data distribution and parallel task structure to provide efficient data processing.
  • Detailed execution information: Provides detailed information such as data throughput and query execution time at each stage at a glance.

Query execution lifecycle

Query execution has the lifecycle shown below, allowing you to check the status of the query process.

이미지 Query lifecycle

Query execution status

StatusDescription
WaitingThe task is in the queue, waiting for query execution
RunningThe query is currently running
SuccessThe query completed successfully
- When query result reuse is used, shown as Success - Query result reused
CanceledThe user stopped the query
FailedThe query execution did not complete due to an error

Data types

Supported data types

When writing queries, refer to the supported data types for each Data Catalog, MySQL, or PostgreSQL source type.
Data types used in queries are automatically mapped according to the data source type.

caution

Unsupported types are not listed in the table.

Data Catalog types
HiveData Query
VARCHARBOOLEAN, TINYINT, SMALLINT, INTEGER, BIGINT, REAL, DOUBLE, TIMESTAMP, DATE, CHAR, VARCHAR
VARBINARYVARCHAR
TINYINTVARCHAR, SMALLINT, INTEGER, BIGINT, DOUBLE, DECIMAL
TIMESTAMPVARCHAR, DATE
SMALLINTVARCHAR, INTEGER, BIGINT, DOUBLE, DECIMAL
REALDOUBLE, DECIMAL
INTEGERVARCHAR, BIGINT, DOUBLE, DECIMAL
DOUBLEFLOAT, DECIMAL
DECIMALDOUBLE, REAL, VARCHAR, TINYINT, SMALLINT, INTEGER, BIGINT, DECIMAL
DATEVARCHAR
CHARVARCHAR, CHAR
BOOLEANVARCHAR
BIGINTVARCHAR, DOUBLE, DECIMAL
MySQL types
MySQLData Query
BITBOOLEAN
BOOLEANTINYINT
TINYINTTINYINT
TINYINT UNSIGNEDSMALLINT
SMALLINTVARCHAR, SMALLINT
SMALLINT UNSIGNEDINTEGER
INTEGERINTEGER
INTEGER UNSIGNEDBIGINT
BIGINTBIGINT
BIGINT UNSIGNEDDECIMAL(20, 0)
DOUBLE PRECISIONDECIMAL(20, 0)
FLOATREAL
REALREAL
DECIMAL(p, s)DECIMAL(p, s)
CHAR(n)CHAR(n)
VARCHAR(n)VARCHAR(n)
TINYTEXTVARCHAR(255)
TEXTVARCHAR(65535)
MEDIUMTEXTVARCHAR(16777215)
LONGTEXTVARCHAR
ENUM(n)VARCHAR(n)
BINARY, VARBINARY, TINYBLOB, BLOB, MEDIUMBLOB, LONGBLOBVARBINARY
JSONJSON
DATEDATE
TIME(n)TIME(n)
DATETIME(n)TIMESTAMP(n)
TIMESTAMP(n)TIMESTAMP(n) WITH TIME ZONE
PostgreSQL types
PostgreSQLData Query
BITBOOLEAN
BOOLEANBOOLEAN
SMALLINTSMALLINT
INTEGERINTEGER
BIGINTBIGINT
REALREAL
DOUBLEDOUBLE
NUMERIC(p, s)DECIMAL(p, s)
CHAR(n)CHAR(n)
VARCHAR(n)VARCHAR(n)
ENUM(n)VARCHAR(n)
BYTEAVARBINARY
DATEDATE
TIME(n)TIME(n)
TIMESTAMP(n)TIMESTAMP(n)
TIMESTAMP(n)TIMESTAMP(n) WITH TIME ZONE
MONEYVARCHAR
UUIDUUID
JSONJSON
JSONBJSON
HSTOREMAP(VARCHAR, VARCHAR)
ARRAYDisabled, ARRAY, or JSON

Event collection

You can check automatically collected events on Data Query user activity using the Cloud Trail service.