基於OpenVINO C++ API部署YOLOv5-Seg實例分割模型

原創

2023-03-01 14:28

上一篇文章《基於OpenVNO部署YOLOv5-seg實時實例分割模型》介紹了基於OpenVINO Python API部署YOLOv5-Seg實例分割模型，本文介紹基於OpenVINO C++ API部署YOLOv5-Seg實例分割模型，主要步驟有：

配置OpenVINO C++開發環境
下載並轉換YOLOv5-Seg預訓練模型
使用OpenVINO Runtime C++ API編寫推理程序

下面，本文將依次詳述。

第一步，配置OpenVINO C++開發環境，請參考《在Windows中基於Visual Studio配置OpenVINO C++開發環境》

第二步，參考《基於OpenVNO部署YOLOv5-seg實時實例分割模型》克隆YOLOv5 Github 代碼倉到本地，然後運行命令獲得 yolov5s-seg ONNX 格式模型：yolov5s-seg.onnx：

python export.py --weights yolov5s-seg.pt --include onnx

接着運行命令獲得yolov5s-seg IR格式模型：yolov5s-seg.xml和yolov5s-seg.bin，如下圖所示

mo -m yolov5s-seg.onnx --compress_to_fp16

第三步：使用OpenVINO Runtime C++ API編寫推理程序。一個端到端的AI推理程序，主要包含五個典型的處理流程：

採集圖像&圖像解碼
圖像數據預處理
AI推理計算
對推理結果進行後處理
將處理後的結果集成到業務流程

基於OpenVINO Runtime C++API的同步推理代碼的關鍵片段如下所示：

int main(int argc, char* argv[]) {
    // -------- Get OpenVINO runtime version --------
    std::cout << ov::get_openvino_version().description << ':' << ov::get_openvino_version().buildNumber << std::endl;

    // -------- Step 1. Initialize OpenVINO Runtime Core --------
    ov::Core core;

    // -------- Step 2. Compile the Model --------
    auto compiled_model = core.compile_model(model_file, "GPU.1"); //GPU.1 is dGPU A770

    // -------- Step 3. Create an Inference Request --------
    ov::InferRequest infer_request = compiled_model.create_infer_request();

    // -------- Step 4. Read a picture file and do the preprocess --------
    cv::RNG rng;
    cv::Mat img = cv::imread(image_file); //Load a picture into memory
    cv::Mat masked_img;
    std::vector<float> paddings(3);       //scale, half_h, half_w
    cv::Mat resized_img = letterbox(img, paddings); //resize to (640,640) by letterbox
    // BGR->RGB, u8(0-255)->f32(0.0-1.0), HWC->NCHW
    cv::Mat blob = cv::dnn::blobFromImage(resized_img, 1 / 255.0, cv::Size(640, 640), cv::Scalar(0, 0, 0), true);

    // -------- Step 5. Feed the blob into the input node of YOLOv5 -------
    // Get input port for model with one input
    auto input_port = compiled_model.input();
    // Create tensor from external memory
    ov::Tensor input_tensor(input_port.get_element_type(), input_port.get_shape(), blob.ptr(0));
    // Set input tensor for model with one input
    infer_request.set_input_tensor(input_tensor);

    // -------- Step 6. Start inference --------
    infer_request.infer();

    // -------- Step 7. Get the inference result --------
    auto detect = infer_request.get_output_tensor(0);
    auto detect_shape = detect.get_shape();
    std::cout << "The shape of Detection tensor:"<< detect_shape << std::endl;
    auto proto = infer_request.get_output_tensor(1);
    auto proto_shape = proto.get_shape();
    std::cout << "The shape of Proto tensor:" << proto_shape << std::endl;

    // --------- Do the Post Process

    // Detect Matrix: 25200 x 117  
    cv::Mat detect_buffer(detect_shape[1], detect_shape[2], CV_32F, detect.data());
    // Proto Matrix:  1x32x160x160 => 32 x 25600
    cv::Mat proto_buffer(proto_shape[1], proto_shape[2] * proto_shape[3], CV_32F, proto.data());

    // -------- Step 8. Post-process the inference result -----------
   ...
}

完整範例代碼：https://gitee.com/ppov-nuc/yolov5_infer/blob/main/yolov5seg_openvino_dGPU.cpp

運行結果如下：

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

基於OpenVINO C++ API部署YOLOv5-Seg實例分割模型

如何解決報錯:unable to get local issuer certificate

torch.flatten vs torch.nn.Flatten 的區別

fetch_california_housing報錯：urllib.error.HTTPError: HTTP Error 403: Forbidden

在ImageNet 1k數據集上訓練yolov5m-cls分類模型

OpenCV vs Pillow 誰讀取文件速度更快？

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結