How to replace a Transformer-based AI model



This document explains how to replace and execute a Transformer-based AI model (e.g., SegFormer) on the RZ/V2N platform using the DRP-AI TVM runtime.
The following is an example of how to implement the SegFormer model on RZ/V2N by adapting the existing Q09_crack_segmentation sample application.

Introduction

Model Overview

SegFormer is a state-of-the-art semantic segmentation model that combines Transformers with lightweight multilayer perceptron (MLP) decoders. It leverages a hierarchical Transformer encoder (MiT, Mix Vision Transformer) to extract multi-scale features and a lightweight MLP decoder to produce high-resolution segmentation maps.

Work Flow

In this workflow, the process begins with preparing a Transformer-based AI model and ends with running inference on the RZ/V2N board.
Steps 1 through 5 correspond to the sections described below in this guide.

Workflow


Step 1: Prepare the SegFormer ONNX Model

In your environment, prepare your own SegFormer model and convert it into ONNX format, which will be used in Step 3.
This step provides an example for reference purposes.

1. Operating Environment

Category   Item                                           Version
Hardware   Ubuntu Desktop (NVIDIA GPU with CUDA support)  20.04
Software   Python                                         3.10.6
           PyTorch                                        1.12.0+cu116
           Torchvision                                    0.13.0+cu116
           pip                                            25.3
           ONNX                                           1.19.0
           ONNX opset                                     11
           mmcv-full                                      1.6.0
           mmsegmentation                                 0.30.0

2. SegFormer Implementation Details

SegFormer is implemented based on the MMSegmentation repository.
The training code is taken from the master branch of the MMSegmentation GitHub repository, which is based on the mmcv library. Install the MMSegmentation package from the repository using the commands below:

git clone https://github.com/open-mmlab/mmsegmentation
cd mmsegmentation
git checkout 38900d5c51395dde78b494d9e86a8ed92cc81b49
pip install -v -e .
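
After installation, you can optionally confirm that the package versions match the table in section 1. A minimal check, assuming the versions listed above:

    # Quick sanity check of the mmcv / MMSegmentation installation.
    import mmcv
    import mmseg

    print(mmcv.__version__)    # expected: 1.6.0
    print(mmseg.__version__)   # expected: 0.30.0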

3. Model Conversion

After training the model and obtaining the PyTorch checkpoint file (.pth), the next step is to convert it into the ONNX format.
This conversion can be done with PyTorch’s standard API, torch.onnx.export (PyTorch 1.12), which takes your trained model and a sample input tensor and generates an ONNX model file as output.
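
The sketch below is for reference only: the config and checkpoint paths are hypothetical placeholders, and the dummy-forward trick mirrors the common MMSegmentation export approach. Adjust the input size and opset to your own training setup. Note that the model deployed in this guide outputs an integer class-index map (see the table in Step 4), so you may need to embed the final resize/argmax into the model before exporting.

    # Minimal ONNX export sketch (hypothetical paths; adjust to your environment).
    import torch
    from mmseg.apis import init_segmentor

    config_file = "segformer_crack_config.py"        # hypothetical training config
    checkpoint_file = "segformer_crack_latest.pth"   # hypothetical trained checkpoint

    model = init_segmentor(config_file, checkpoint_file, device="cpu")
    model.forward = model.forward_dummy              # trace the raw forward pass without test-time wrapping
    model.eval()

    dummy_input = torch.randn(1, 3, 240, 320)        # NCHW, matching the input shape used in this guide
    torch.onnx.export(
        model,
        dummy_input,
        "segformer.onnx",
        opset_version=11,
        input_names=["input"],
        output_names=["output"],
    )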


Step 2: Setup AI SDK Environment

Note Make sure that you have installed Docker on your Linux PC.


Once the ONNX model is generated, the next step is to prepare the execution environment for model compilation and deployment.

  1. Please follow Steps 1 to 5 in the official RZ/V AI SDK Getting Started guide below to set up the AI SDK environment and launch the Docker container.

    Target Board                        PRODUCT   AI SDK   DRP-AI TVM              DRP-AI Translator
    RZ/V2N Evaluation Board Kit (EVK)   RZ/V2N    v6.00    v2.5.1 (AI SDK v6.00)   i8 v1.04 (AI SDK v6.00)

After completing Getting Started Steps 1 to 5, the Docker container environment will be running and ready.


Step 3: Compile AI Model with DRP-AI TVM

This step explains how to compile your ONNX model using DRP-AI TVM to make it executable on the RZ/V2N EVK.

Note Before proceeding with Step 3, make sure that you have completed Step 2 and the Docker container has been created successfully.

3-1. Copy Your ONNX Model into the Working Directory

Please choose any preferred method to copy your ONNX model into the working directory inside the Docker container, which is $TVM_ROOT/tutorials.

In this document, we provide an example using the docker cp command for simplicity.

  1. If you are currently inside the Docker container, run the following command to exit.
    exit
  2. Run the following command on the host PC to copy the ONNX model into the working directory of your Docker container.
    sudo docker cp <path_to_file>/segformer.onnx rzv2n_ai_sdk_container:${TVM_ROOT}/tutorials/
    Note The example uses segformer.onnx; replace it with your own ONNX model file name.

3-2. Move to the Working Directory

  1. If the Docker container is not currently running, start it with the following command on your host PC.
    sudo docker start -i rzv2n_ai_sdk_container
  2. Once inside the Docker container, move to the working directory by running the following command.
    cd ${TVM_ROOT}/tutorials/ 

3-3. Confirm the model information

To compile the model with DRP-AI TVM, please verify the settings you used during training.

Note If any of these conditions differ from the training settings, the model accuracy will significantly drop.
Example of settings to verify
  • Input shape (e.g., 1x3x240x320)
  • Color order (e.g., RGB or BGR)
  • Preprocess
    • Normalization (mean and std)
    • Resize algorithm and input size
  • Tensor layout (e.g., NCHW or NHWC)
Example (these settings can also be checked in Netron):
  • Input shape: [1,3,240,320]
  • Tensor layout: NCHW
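
As an alternative to Netron, the same input information can be printed programmatically. A minimal sketch, assuming the example file name segformer.onnx used in this guide:

    # Print the input names and shapes of the ONNX model.
    import onnx

    model = onnx.load("segformer.onnx")
    for inp in model.graph.input:
        dims = [d.dim_value for d in inp.type.tensor_type.shape.dim]
        print(inp.name, dims)   # e.g. input [1, 3, 240, 320] -> NCHW layout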

3-4. Modify the sample script

Next, modify the sample script according to your model information. The input image format must match the conditions used during training (e.g., image size, normalization, channel order).
In this guide, we use compile_onnx_model_quant.py as the working example.
Please adjust the preprocessing steps according to the settings confirmed in Step 3-3.

Renesas modified the following processing to match the SegFormer model input specification; the modified version is provided below as a reference.

  1. Replace the original ImageNet preprocessing in L.101~L.112 with a simplified version aligned to the SegFormer model (a quick sanity check of this function is sketched at the end of this section).
     
    - def pre_process_imagenet_pytorch(img, mean=[0.485, 0.456, 0.406], stdev=[0.229, 0.224, 0.225], dims=None, need_transpose=False):   
    -   img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)
    -   img = Image.fromarray(img)
    -   img = F.resize(img, 256, Image.BILINEAR)
    -   img = F.center_crop(img, 224)
    -   img = F.to_tensor(img)
    -   std = stdev
    -   img = F.normalize(img, mean, std, inplace=False)
    -   if not need_transpose:
    -      img = img.permute(1, 2, 0) # NHWC
    -   img = np.asarray(img, dtype='float32')
    -   return img
    
    
    + def preprocess_image(image):
    +   mean = np.array([123.675, 116.28, 103.53], dtype=np.float32)
    +   std = np.array([58.395, 57.12, 57.375], dtype=np.float32)
    +   img_scale = (320, 240)
    +   resized_image = cv2.resize(image, img_scale)
    +   img_float = resized_image.astype(np.float32)
    +   img_normalized = (img_float - mean) / std
    +   img_chw = img_normalized.transpose(2, 0, 1)
    +   return img_chw
     
  2. Add the following line between L.155 and L.157 to print the input details of the ONNX model.
      for inp in model_inputs:
          if inp not in model_initializers:
              model_vars.append(inp)
              
    + print(f"Model inputs: {onnx_model.graph.input}")
    
      np.random.seed(41264126)
  3. Replace L.209 to match the input format required by the SegFormer model.
      for i in range(len(input_list)):
          img_file_name = str(input_list[i])
          image = cv2.imread(img_file_name)
    -     input_data = pre_process_imagenet_pytorch(image, mean, stdev, need_transpose=True)
    +     input_data = preprocess_image(image)
          input_data = np.expand_dims(input_data, 0)
          rt_mod.set_input(0, input_data)
          rt_mod.run()
          print("calib data", img_file_name)
  4. Update L.221 as shown below.
      drp_config = {
          "target": "InterpreterQuant",
          "drp_compiler_version": opts["drp_compiler_version"],
          "quantization_tool": opts["quantization_tool"],
          "quantization_option": opts["quantization_option"],
    -     "calibration_data": record_dirmodel_vars.append(inp)
    +     "calibration_data": record_dir
      }

This completes the modification of the sample script.
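
Before compiling, the modified preprocessing can be sanity-checked on a single image to confirm that the tensor fed to the model has the [1, 3, 240, 320] NCHW shape confirmed in Step 3-3. This is a minimal standalone sketch: sample.jpg is a hypothetical test image, and preprocess_image repeats the function added in item 1 above so the snippet runs on its own.

    # Sanity check of the SegFormer preprocessing output shape.
    import cv2
    import numpy as np

    def preprocess_image(image):                       # same function as in item 1 above
        mean = np.array([123.675, 116.28, 103.53], dtype=np.float32)
        std = np.array([58.395, 57.12, 57.375], dtype=np.float32)
        resized = cv2.resize(image, (320, 240))        # (width, height)
        normalized = (resized.astype(np.float32) - mean) / std
        return normalized.transpose(2, 0, 1)           # HWC -> CHW

    image = cv2.imread("sample.jpg")                   # hypothetical test image
    input_data = np.expand_dims(preprocess_image(image), 0)
    print(input_data.shape)                            # expected: (1, 3, 240, 320)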

3-5. Compilation

Using the modified sample script from the previous section, the SegFormer model can be compiled with the following command on your Docker container created in Step 2.

python3 compile_onnx_model_quant.py \
    ./segformer.onnx \
    -o ../data/segformer \
    -t $SDK \
    -d $TRANSLATOR \
    -c $QUANTIZER

where:
  • ./segformer.onnx : target ONNX model file to compile
  • -o : output directory for the compiled files
  • -t : path to the SDK (toolchain)
  • -d : path to the DRP-AI Translator
  • -c : path to the DRP-AI Quantizer
Note No calibration data was used in this sample case. Calibration is generally recommended; however, it was not required here because the inference results were nearly identical, and omitting calibration shortens the compilation time.

3-6. Confirming the output

After the compilation, the compiled AI model will be generated under the specified output directory.
In this example, the output directory was set to ${TVM_ROOT}/data/segformer/.
Check the generated files using the following command:

ls ${TVM_ROOT}/data/segformer/
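
If the compilation succeeded, the directory should contain the deployment artifacts used later in Step 5, i.e. deploy.so, deploy.json, and deploy.params (the exact file set may vary with the DRP-AI TVM version).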



Step 4: Build the Application

Overview

To run inference with the AI model compiled by DRP-AI TVM, a C++ inference application is required.
Since the SegFormer model is a segmentation model trained on a crack segmentation dataset, we can modify the source code of the existing AI application, Q09_crack_segmentation.
The following table summarizes the application elements; items that differ between the two columns require modification in the source code.

Category     Item                          Q09_crack_segmentation       This page                      Comment
AI Model     Model name                    U-Net                        SegFormer
             AI task                       Segmentation                 Segmentation
             Dataset                       Crack segmentation dataset   Crack segmentation dataset
             Target class                  Background/Crack             Background/Crack
             Number of classes             2                            2
             Input size                    224x224x3                    240x320x3
             Output size                   224x224x1                    240x320x1
             Input datatype                floating-point type          floating-point type
             Output datatype               floating-point type          integer type
             Pre-processing                Pre-processing for U-Net     Pre-processing for SegFormer   Differences in pre-processing are explained in Section 4-2: Modify the Source Code.
             Post-processing               Post-processing for U-Net    Post-processing for SegFormer  Differences in post-processing are explained in Section 4-2: Modify the Source Code.
Application  Target board                  RZ/V2H, RZ/V2N, RZ/V2L       RZ/V2N (, RZ/V2H)              RZ/V2N and RZ/V2H are sibling chips; the same application can run on both boards.
             Model folder name             "crack_segmentation_model"   "segformer_model"
             Application input data        USB Camera 1ch (640x480)     USB Camera 1ch (640x480)
             Application output            HDMI display 1920x1080       HDMI display 1920x1080
             Segmentation result display   RZ/V2H, RZ/V2N: Heatmap      RZ/V2N: Red highlight
                                           RZ/V2L: Green highlight

4-1. Prepare the Application

Check the Q09_crack_segmentation application and follow the instructions in “Application File Generation 1-4” to move to the application source code directory.

4-2. Modify the Source Code

Edit the following source file to adapt the application to the SegFormer model compiled in Step 3.
Path: ${PROJECT_PATH}/Q09_crack_segmentation/src/crack_segmentation.cpp

The following code examples show the differences between the original U-Net model and the SegFormer model.

  1. Modify the definition of the AI model's input size (L.101~L.102).
       /*Model input info*/
    -  #define MODEL_IN_H          (224)
    -  #define MODEL_IN_W          (224)
    +  #define MODEL_IN_H          (240)
    +  #define MODEL_IN_W          (320)
  2. Modify the AI model's output datatype in L.152, L.370 and L.467.
    • L.152
       
      -  std::vector<float> floatarr(1);
      +  std::vector<uint64_t> intarr(1);
       
    • L.370
         ******************************************/
      -  float *start_runtime(float *input)   
      +  uint64_t *start_runtime(float *input)
         {
             int ret = 0;
             /* Set Pre-processing output to be inference input. */
             model_runtime.SetInput(0, input);
            ... remaining code omitted ... 
         }
    • L.467
         int ret = 0;
      -  float *output; 
      +  uint64_t *output;
         /*font size to be used for text output*/
  3. Add a color palette for the red highlight implementation by inserting the following lines below L.178.
       /* Map to store input source list */
       std::map<std::string, int> input_source_map =
       {
           #ifndef V2N
               {"VIDEO", 1},
           #endif
           {"IMAGE", 2},
           {"USB", 3},
           #ifdef V2L
               {"MIPI", 4}
           #endif
    
       };
     
    +  std::vector<cv::Vec3b> palette = 
    +  {
    +      {0,0,200},
    +      {0,0,0}
    +  };
  4. For SegFormer pre-processing, modify the image resizing, add mean/std normalization, and remove the RGB conversion (L.352~L.362); also delete the separate image resizing step (L.492~L.494).
    • L.352~L.362
      -  cv::Mat start_preprocessing(cv::Mat frame)
      -  {
      -      cv::cvtColor(frame, frame, cv::COLOR_BGR2RGB);
      -      frame = hwc2chw(frame);
      -      /*convert to FP32*/
      -      frame.convertTo(frame, CV_32FC3,1.0 / 255.0, 0);
      -      /*deep copy, if not continuous*/
      -      if (!frame.isContinuous())
      -      frame = frame.clone();
      -      return frame;
      -  }
      +  cv::Mat start_preprocessing(const cv::Mat& image) 
      +  {
      +      cv::Scalar mean = {123.675f, 116.28f, 103.53f};
      +      cv::Scalar std = {58.395f, 57.12f, 57.375f};
      +      cv::Size img_scale(MODEL_IN_W,MODEL_IN_H);
      +      cv::Mat processed_image;
      +      cv::resize(image, processed_image, img_scale);
      +      processed_image.convertTo(processed_image, CV_32FC3);
      +      processed_image -= mean;
      +      processed_image /= std;
      +      processed_image = hwc2chw(processed_image);
      +      return processed_image;
      +  }
    • L.492~L.494
      -  cv::Size size(MODEL_IN_H, MODEL_IN_W);
      -  /*resize the image to the model input size*/
      -  cv::resize(frame, frame, size); 
  5. Update the post-processing and variable definitions according to the model's output datatype (L.410~L.422).
    -  floatarr.resize(g_out_size_arr);
        /* Post-processing for FP16 */
    -  if (InOutDataType::FLOAT16 == std::get<0>(output_buffer))
    -  {
    -      /* Extract data in FP16 <uint16_t>. */
    -      uint16_t *data_ptr = reinterpret_cast<uint16_t *>(std::get<1>(output_buffer));
    -      for (int n = 0; n < g_out_size_arr; n++)
    -      {
    -           /* Cast FP16 output data to FP32. */
    -           floatarr[n] = float16_to_float32(data_ptr[n]);
    -      }
    -  }
    -  return floatarr.data();
    +  intarr.resize(g_out_size_arr);
        /* Post-processing for FP16 */
    +  if (InOutDataType::INT64 == std::get<0>(output_buffer))
    +  {
    +      /* Extract data in INT64 <uint64_t>. */
    +     uint64_t *data_ptr = reinterpret_cast<uint64_t *>(std::get<1>(output_buffer));
    +     for (int n = 0; n < g_out_size_arr; n++)
    +     {
    +          intarr[n] = data_ptr[n];
    +     }
    +  }
    +  return intarr.data();
  6. Update the post-processing step by removing colour_convert and overlaying the colorized segmentation result directly onto the input image data (L.425~L.456).
    -  /*****************************************
    -   * Function Name : colour_convert
    -   * Description   : function to convert white colour to green colour.
    -   * Arguments     : Mat image
    -   * Return value  : Mat result
    -   ******************************************/
    -   cv::Mat colour_convert(cv::Mat image)
    -   {
    -       /* Convert the image to HSV */ 
    -       cv::Mat hsv;
    -       cv::cvtColor(image, hsv, cv::COLOR_BGR2HSV);
    -       /* Define the lower and upper HSV range for white color */
    -       cv::Scalar lower_white = cv::Scalar(0, 0, 200); // Adjust these values as needed
    -       cv::Scalar upper_white = cv::Scalar(180, 30, 255); // Adjust these values as needed
    -       /* Create a mask for the white color */
    -       cv::Mat mask;
    -       cv::inRange(hsv, lower_white, upper_white, mask);
    -       /* Create a green image */
    -       cv::Mat green_image = cv::Mat::zeros(image.size(), image.type());
    -       green_image.setTo(cv::Scalar(0, 100, 0), mask);
    -       /* Replace white regions in the original image with green */
    -       cv::Mat result;
    -       cv::bitwise_and(image, image, result, ~mask);
    -       cv::add(result, green_image, result);
    -       cv::resize(result, result, cv::Size(MODEL_IN_H, MODEL_IN_W));
    -       /* return result */
    -       return result;
    -   }
    +   cv::Mat overlay_segmentation(const cv::Mat& original_image, const cv::Mat& processed_mask, float alpha = 0.4f) 
    +   {
    +        int original_height = original_image.rows;
    +        int original_width = original_image.cols;
    +        cv::Mat mask_resized;
    +        cv::resize(processed_mask, mask_resized, cv::Size(original_width, original_height), 0, 0, cv::INTER_NEAREST);
    +        cv::Mat output_mask = cv::Mat::zeros(original_image.size(), original_image.type());
    +        for (int y = 0; y < mask_resized.rows; ++y) 
    +        {
    +            for (int x = 0; x < mask_resized.cols; ++x) 
    +            {
    +                int class_id = mask_resized.at<uchar>(y, x);
    +                if (class_id < 1) 
    +                {
    +                    output_mask.at<cv::Vec3b>(y, x) = palette[class_id];
    +                }
    +                else
    +                {
    +                    output_mask.at<cv::Vec3b>(y, x) = original_image.at<cv::Vec3b>(y, x);
    +                }
    +            }
    +        }
    +        cv::Mat overlayed_image;
    +        cv::addWeighted(original_image, 1.0f - alpha, output_mask, alpha, 0.0, overlayed_image);
    +        return overlayed_image;
    +   }
  7. Modify the run_inference() function to deep-copy the input frame in L.483.
       cv::Mat input_frame,output_frame;
    -  input_frame = frame; 
    +  input_frame = frame.clone();
     
  8. Update the segmentation result display to the SegFormer red-highlight style by applying overlay_segmentation (L.509~L.527).
    -  #ifdef V2H
    -     /* convert float32 format to opencv mat image format */ 
    -     cv::Mat img_mask(MODEL_IN_H,MODEL_IN_W,CV_32F,(void*)output);
    -     /* setting minimum threshold to heatmap */ 
    -     cv::threshold(img_mask,img_mask,min_threshold,0.0,cv::THRESH_TOZERO);
    -     cv::normalize(img_mask, img_mask, 0.0, 1.0, cv::NORM_MINMAX);
    -     /* Scale the float values to 0-255 range for visualization */
    -     cv::Mat heatmap_scaled;
    -     img_mask.convertTo(heatmap_scaled, CV_8U, 255.0);
    -     /* Create a grayscale heatmap */
    -     cv::applyColorMap(heatmap_scaled, img_mask, cv::COLORMAP_INFERNO);
    -  #elif V2L
    -     /* convert float32 format to opencv mat image format */ 
    -     cv::Mat img_mask(MODEL_IN_H,MODEL_IN_W,CV_32F,(void*)output); 
    -     cv::threshold(img_mask,img_mask,-0.5,255,cv::THRESH_BINARY);  
    -     img_mask.convertTo(img_mask,CV_8UC1);
    -  #endif  
    +  cv::Mat mask(MODEL_IN_H, MODEL_IN_W, CV_8UC1);
    +  for (int i = 0; i < MODEL_IN_H; ++i) 
    +  {
    +    for (int j = 0; j < MODEL_IN_W; ++j) 
    +    {
    +        mask.at<uchar>(i, j) = static_cast<uchar>(output[i * MODEL_IN_W + j]);
    +    }
    +  }
    +  output_frame = overlay_segmentation(input_frame, mask);
  9. Remove the post-processing for U-Net; for SegFormer, the process is consolidated into overlay_segmentation (L.537~L.561).
    -  total_time = pre_time + ai_time + post_time;
    -  cv::cvtColor(img_mask, output_frame, cv::COLOR_RGB2BGR);
    -  /* convert white colour from output frame to green colour */
    -  output_frame = colour_convert(output_frame);
    -  cv::resize(input_frame, input_frame, cv::Size(IMAGE_OUTPUT_WIDTH, IMAGE_OUTPUT_HEIGHT));
    -  cv ::cvtColor(input_frame, input_frame, cv::COLOR_RGB2BGR);
    -  cv::resize(output_frame, output_frame, cv::Size(IMAGE_OUTPUT_WIDTH, IMAGE_OUTPUT_HEIGHT));
    -  #ifdef V2H
    -      cv::threshold(output_frame, output_frame, 0.7, 255, 3);
    -  #endif
    -  /* blending both input and ouput frames that have same size and format and combined one single frame */
    -  cv::addWeighted(input_frame, 1.0, output_frame, 0.5, 0.0, output_frame);
    -  #ifdef V2H
    -      /* resize the output image with respect to output window size */
    -      cv::cvtColor(output_frame, output_frame, cv::COLOR_RGB2BGR);
    -  #elif V2L
    -      /* resize the output image with respect to output window size */
    -      cv::cvtColor(output_frame, output_frame, cv::COLOR_BGR2RGB);
    -  #endif
    +  total_time = pre_time + ai_time + post_time;
  10. Modify the model folder name in L.730.
       /* Model Binary */
    -  std::string model_dir = "crack_segmentation_model";
    +  std::string model_dir = "segformer_model";
     

4-3. Build the Application

Follow the Q09_crack_segmentation “Application File Generation 4 to 7” instructions to build the application.
After running the commands, the application executable (crack_segmentation) will be generated in ${PROJECT_PATH}/Q09_crack_segmentation/src/build.



Step 5: Deploy and Run Application on the Board

This step explains how to deploy and run the application on the RZ/V2N board.

5-1. Deploy Stage

Prerequisites

This section assumes that the microSD card setup has been completed by following Step 7-1 of the Getting Started Guide provided by Renesas.
Note If you prefer to deploy the files via SCP instead of using a microSD card, please refer to "5. Run on the Board" in the official How to compile Your Own Model | DRP-AI TVM on RZ/V series guide.

File Configuration

For deployment, the following files are required for the SegFormer application.

File                 Details
deploy.so            DRP-AI TVM compiled model (generated in Step 3)
deploy.json          Model graph definition file (generated in Step 3)
deploy.params        Model parameter file (generated in Step 3)
crack_segmentation   Application executable file (generated in Step 4)
libtvm_runtime.so    TVM runtime library required by the application

Instruction

  1. Insert the microSD card into the Linux PC.
  2. Run the following commands to mount partition 2, which contains the root filesystem.
    sudo mkdir /mnt/sd -p
    sudo mount /dev/sdb2 /mnt/sd
    Warning Change /dev/sdb to your microSD card device name.
  3. Create the application directory on the root filesystem.
    sudo mkdir -p /mnt/sd/home/weston/transformer/segformer_model
    Note The directory name "transformer" can be chosen by the user.
  4. Copy the necessary files from the execution environment to the /home/weston/transformer directory of the rootfs (SD card) for the board.
    Use the following commands to copy the files to the root filesystem:
    sudo cp $WORK/ai_sdk_setup/data/segformer/deploy.json /mnt/sd/home/weston/transformer/segformer_model
    sudo cp $WORK/ai_sdk_setup/data/segformer/deploy.params /mnt/sd/home/weston/transformer/segformer_model 
    sudo cp $WORK/ai_sdk_setup/data/segformer/deploy.so /mnt/sd/home/weston/transformer/segformer_model 
    sudo cp $WORK/ai_sdk_setup/data/rzv_ai_sdk/Q09_crack_segmentation/src/build/crack_segmentation /mnt/sd/home/weston/transformer
  5. Check that libtvm_runtime.so exists under the /usr/lib directory of the rootfs (SD card) for the board.
  6. The folder structure on the rootfs (SD card) should look like this:
    
        |-- usr
        |   `-- lib
        |       `-- libtvm_runtime.so
        `-- home
          `-- weston
            `-- transformer
                |-- segformer_model
                |   |-- deploy.json
                |   |-- deploy.params
                |   `-- deploy.so
                `-- crack_segmentation
        
  7. Run the following command to write the cached data to the microSD card.
    sync
  8. Run the following command to unmount the partition 2.
    sudo umount /mnt/sd
  9. Eject the microSD card by running the following command, then remove it from the Linux PC.
    sudo eject /dev/sdb
    Warning Change /dev/sdb to your microSD card device name.
  10. Follow the instructions in RZ/V2N EVK Getting Started Step 7-3: Boot RZ/V2N EVK to boot the board.

5-2. Run Stage

Prerequisites

This section assumes that the user has completed Step 7-3 of the Getting Started Guide provided by Renesas.
After completing the guide, the following conditions are expected:
  • The board setup is done.
  • The board is booted with the microSD card that contains the application files.

Instruction

  1. On the board terminal, go to the transformer directory of the rootfs.
    cd /home/weston/transformer/
    Note The directory name "transformer" can be chosen by the user (use the same name as in Step 5-1).
  2. Run the application.
    su 
    ./crack_segmentation USB
    exit # After the application has terminated.
  3. The following window appears on the HDMI screen.
  4. AI inference time:
    Board        AI inference time
    RZ/V2N EVK   Approximately 320 ms
  5. To terminate the application, switch from the application window to the terminal using Super (Windows key) + Tab, then press the ENTER key on the board terminal.

This completes the procedure for replacing a Transformer-based AI model.