Create and Manage Crawlers

Create crawler

In Data Catalog, a crawler scans MySQL data, extracts metadata, and automatically updates the Data Catalog to simplify data discovery. Before creating a crawler, you must have a database and a MySQL instance.

info

For instructions on creating a database and a MySQL instance, see:

To create a crawler in the Data Catalog service:

  1. Go to the KakaoCloud console > Analytics > Data Catalog.
  2. In Crawlers, click Create crawler.
  3. In Create crawler, enter the required information and click Create.
    | Field | Description |
    | --- | --- |
    | Database | The name of the database to which tables will be added.<br>- Only databases in the ACTIVE state are listed.<br>- After selecting a database, you can check its network/subnet information.<br>- Crawlers are not supported for Iceberg-type catalogs. |
    | Crawler name | Name of the crawler |
    | MySQL full path | Select the MySQL instance to connect to and enter the database name on that instance.<br>- Only MySQL instances in the AVAILABLE state are listed. |
    | MySQL account | Enter the username and password configured when creating the MySQL instance.<br>- Connection test: after entering the MySQL full path and account information, click Test to verify the connection.<br>* If the connection test does not complete successfully, you cannot create the crawler. |
    | Description (optional) | Additional description for the crawler |
    | Table prefix (optional) | A prefix added to created table names. Tables are named Prefix + MySQL database name_table name.<br>- Allowed characters: lowercase letters, digits, and underscore (_) only (1–64 chars) |
    | Schedule | Manage when the crawler runs.<br>- On-demand crawlers run only when triggered manually and have no schedule. |
info

Only resources whose state is normal (ACTIVE / AVAILABLE) are listed for databases and MySQL.
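The table-prefix rule and the naming convention above (Prefix + MySQL database name_table name) can be sketched as follows. This is a hypothetical helper for illustration, not a KakaoCloud API:

```python
import re

# Allowed prefix: lowercase letters, digits, underscore; 1-64 characters
PREFIX_RE = re.compile(r"^[a-z0-9_]{1,64}$")

def catalog_table_name(mysql_db: str, table: str, prefix: str = "") -> str:
    """Derive the catalog table name: Prefix + MySQL database name_table name."""
    if prefix and not PREFIX_RE.fullmatch(prefix):
        raise ValueError("prefix must be 1-64 chars of lowercase letters, digits, or '_'")
    return f"{prefix}{mysql_db}_{table}"

print(catalog_table_name("sales", "orders", prefix="crawl_"))  # crawl_sales_orders
```

For example, with the prefix `crawl_`, a table `orders` in the MySQL database `sales` is registered in the catalog as `crawl_sales_orders`.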

Manage crawlers

This section explains how to manage crawlers in the Data Catalog service.

View crawler list

View the list of crawlers currently in use.

  1. Go to the KakaoCloud console > Analytics > Data Catalog.

  2. Click Crawlers to see the list.

    | Column | Description |
    | --- | --- |
    | Name | The crawler name entered at creation.<br>- Click the crawler name to open its Details tab. |
    | Description | Description entered at creation |
    | Status | Crawler status |
    | Schedule | The schedule on which the crawler runs |
    | Last run status | Status of the most recent run |
    | Last run time | Timestamp of the most recent run |
    | [More] icon | - Edit: modify the crawler description and schedule<br>- Run: run the crawler manually<br>- Delete: delete the crawler<br>* While the crawler state is CREATING / ALTERING / DELETING / RUNNING, you cannot edit, run, or delete it. |
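The availability rule above (Edit/Run/Delete are unavailable while the crawler is in a transitional state) can be sketched as a small check. The state names come from this document; the helper itself is illustrative:

```python
# States during which a crawler cannot be edited, run, or deleted
BUSY_STATES = {"CREATING", "ALTERING", "DELETING", "RUNNING"}

def actions_allowed(state: str) -> bool:
    """Return True if Edit/Run/Delete are available for the given crawler state."""
    return state.upper() not in BUSY_STATES

print(actions_allowed("ACTIVE"))   # True
print(actions_allowed("RUNNING"))  # False
```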

View crawler details

Check detailed information about a crawler.

  1. Go to the KakaoCloud console > Analytics > Data Catalog.
  2. Click Crawlers, then select the crawler whose details you want to view.
  3. Review the crawler’s details.

View crawler run history

Check the run history of a crawler.

  1. Go to the KakaoCloud console > Analytics > Data Catalog.

  2. Click Crawlers, then select the crawler whose history you want to view.

  3. In the details page, click the Run history tab to review past runs.

    info

    Crawler run history is retained for up to 90 days. Records older than 90 days are deleted automatically.

    | Column | Description |
    | --- | --- |
    | Start time | When the crawl started |
    | End time | When the crawl finished |
    | Duration | How long the crawler ran |
    | Status | Status of the run<br>- Succeeded: the crawl finished successfully<br>- Running: the crawl is in progress<br>- Failed: the crawl failed |

Delete a crawler

Delete crawlers you no longer need.

caution

Deleted crawlers and their run histories cannot be restored. If a catalog is deleted, its crawlers are deleted automatically.

  1. Go to the KakaoCloud console > Analytics > Data Catalog.
  2. Click Crawlers, then in the list click the [More] icon for the crawler you want to remove and select Delete.
  3. In the Delete dialog, enter the crawler name exactly and click Delete.