Practical machine learning workflows starting with Kubeflow
Using machine learning and AI in the cloud is no longer an area limited to specific developers or researchers. It is becoming a technology that is closer to practitioners who plan or operate services, and even to beginners encountering AI technology for the first time.
In line with this trend, KakaoCloud provides the latest version of Kubeflow. This time, we are newly providing two hands-on tutorial series that let anyone build machine learning pipelines directly based on Kubeflow.
The newly released tutorials are series on LLM (large language model) practice and web service traffic prediction. Beyond simple code examples, they let you easily experience the full practical process, from model training to serving, optimization, and automation.
📘 Build generative AI yourself - LLM workflow tutorial series
The first series is the LLM workflow tutorial. This series is structured so that you can practice the entire process of serving a large language model directly in a Kubeflow environment, fine-tuning it for your intended purpose, and finally building a document-based question answering system (RAG).
In particular, this series uses Meta Llama 3.2 from Hugging Face Hub together with Kanana, a model developed by Kakao. You can directly experience various LLM usage scenarios, from real-time inference to domain-specific training.
The LLM series consists of three parts.
-
Part 1: Create an LLM model serving endpoint Deploy a pretrained LLM to a cloud environment using KServe and create an endpoint that supports real-time inference.
-
Part 2: Fine-tune an LLM model Guides you through efficiently retraining a selected model on domain-specific data based on PEFT (LoRA, and more). It also includes how to save and reuse the model after training.
-
Part 3: Implement document-based RAG Complete an LLM use case by embedding user text documents into vectors, storing them in FAISS, and configuring a question answering API using LangChain.
Because this series lets you directly configure an LLM using CPU/GPU in a cloud environment, we believe it will be a very useful starting point for developers and AI planners who want to review actual productization possibilities.
📈 From logs to insights - Traffic prediction model tutorial series
The second series is a hands-on tutorial for building a traffic prediction model. This series walks through the process of collecting access log data from a web service and creating a time-series machine learning model that predicts future traffic based on that data.
In particular, this tutorial does not stop at analysis. It also covers serving the trained model as an API and automating the entire process with Kubeflow Pipelines. In other words, you can experience an end-to-end pipeline that covers data preprocessing, model development, hyperparameter optimization, deployment, and operations all at once.
The traffic prediction series consists of four parts.
-
Part 1: Collect and preprocess traffic data Collect web server log data and refine it into a form suitable for time-series analysis. Create features that reflect periodic patterns such as day of week and time of day, and build a dataset that can be used as input for machine learning models.
-
Part 2: Tune model hyperparameters Based on the results of baseline model training, use Kubeflow Katib to perform hyperparameter optimization and improve performance.
-
Part 3: Create a model serving API Deploy the trained model as a KServe-based InferenceService and perform predictions through API requests.
-
Part 4: Configure a model pipeline Automate the entire process, from data preprocessing and model training to performance validation and serving deployment, with Kubeflow Pipelines.
This series is highly recommended for MLOps beginners and data engineers because it lets you practice the complete flow of an operational machine learning service directly in a cloud environment.
🚀 Practical machine learning workflows starting with Kubeflow
Both series released this time are built on KakaoCloud Kubeflow. Kubeflow is a tool that simplifies complex MLOps processes and helps manage reproducible machine learning experiments easily. You can intuitively configure machine learning infrastructure such as GPU, storage, and network settings in the KakaoCloud console, and it provides features for deploying and operating various machine learning workloads in a consistent way.
These tutorials are designed as practical learning paths where you can acquire technology flows applicable to real work, going beyond simply following steps. From the latest generative AI technologies such as LLMs to predictive models and pipeline configuration, you do not merely copy and run complex code. Instead, you configure the meaning of each step yourself, understand the technical context, and build practical intuition.
You can directly practice and experience two machine learning fields currently receiving attention, generative AI and time-series prediction, in the KakaoCloud environment. Start building practical machine learning pipelines with Kubeflow-based hands-on tutorials.
📝 View all Machine Learning & AI tutorials
👉 Start KakaoCloud now

