Create and manage crawler
Create crawler
A crawler in Data Catalog scans MySQL data, extracts metadata, and automatically updates Data Catalog to simplify data discovery.
To create a crawler, ensure that both the database and MySQL are already created. Follow these steps to create a crawler in the Data Catalog service.
For guidance on creating databases and MySQL, refer to the following guides:
How to create a database / How to create MySQL
-
Go to the KakaoCloud Console > Analytics > Data Catalog.
-
In the Crawler menu, click the [Create crawler] button.
-
Enter the required information in Create crawler and click [Create].
Field Description Database Name of the database to which the table will be added
- Only databases with an Active status are shown in the list.
- When selecting a database, network/subnet information can be checked.Crawler name Name of the crawler
- Must consist of lowercase letters, numbers, or underscores (4–64 characters)MySQL full path Select the MySQL instance and enter the database name of the MySQL instance
- Only MySQL instances in Available status are displayed in the list.MySQL account Enter the username and password set during MySQL creation
- Connection test: After entering the MySQL full path and account information, test the connection using the [Test connection] button.
* If the connection test fails, you cannot create the crawler.Description (optional) Add additional details about the crawler Table prefix (optional) Prefix for the table name; creates tables in the format of Prefix+MySQL database name_table name
- Must consist of lowercase letters, numbers, or underscores (1–64 characters)Schedule Manage when the crawler runs
- Choose from on-demand, hourly, daily, weekly, or monthly schedules.
- On-demand means the crawler runs only when manually triggered.
Only databases and MySQL instances in Active/Available status will be displayed in the list.
Manage crawler
This section explains how to manage crawlers in the Data Catalog service.
View crawler list
You can view the list of active crawlers in Data Catalog.
-
Go to the KakaoCloud Console > Analytics > Data Catalog.
-
Click the Crawler menu to view the crawler list.
Column Description Name Name of the crawler entered during creation
- You can click on the crawler name to access the details tab.Description Description entered during creation Status Status of the crawler Schedule Scheduled time for crawler execution Last run status Status of the most recent crawler run Last run date Date and time when the last crawler run occurred [More] icon - Modify: Update the crawler's description or schedule.
- Run: Manually trigger the crawler.
- Delete: Remove the crawler.
* If the crawler is in theCREATING
/ALTERING
/DELETING
/RUNNING
status, modification, execution, and deletion are not allowed.
View crawler details
You can check detailed information about a crawler.
- Go to the KakaoCloud Console > Analytics > Data Catalog.
- Click the Crawler menu and select the crawler to check.
- View the detailed information for the selected crawler.
View crawler run history
You can view the execution history of a crawler.
-
Go to the KakaoCloud Console > Analytics > Data Catalog.
-
Click the Crawler menu and select the crawler to view the run history.
-
In the detailed view, click the Run history tab to check the execution history.
infoOnly the run history within the last 90 days is displayed. Execution history older than 90 days is automatically deleted.
Field Description Start time Time the crawling started End time Time the crawling ended Execution time Duration of the crawler run Status Status of the crawler run
-Succeeded
: Crawling finished successfully.
-Running
: Crawling is in progress.
-Failed
: Crawling failed.
Delete crawler
You can delete crawlers that are no longer in use.
Deleted crawlers and their run histories cannot be recovered. If a catalog is deleted, the associated crawler will also be removed.
- Go to the KakaoCloud Console > Analytics > Data Catalog.
- Click the Crawler menu, then click the [More] icon for the crawler to delete, and select Delete.
- In the popup, enter the crawler name exactly as shown and click [Delete].