ESP32 Voice Interaction, Part 1: First Impressions of the ESP32-Korvo V1.1 Development Board

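    I recently started working on voice interaction and dug out an ESP32-LyraT development board that had been gathering dust for years. After a flurry of effort, the board still behaved like a dud: it could only wake up, and I never got command words working. Going through the official documentation and the code on GitHub turned up some clues: I needed to get familiar with the wake word engine WakeNet and the command word engine MultiNet. Both engines ship in one unified package, ESP-Skainet, at: https://github.com/espressif/esp-skainet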

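   The documentation recommends two development boards: the ESP32-Korvo V1.1 and the ESP32-LyraT-Mini. Since I already had a dual-microphone ESP32-LyraT, I did not feel like trying the single-microphone ESP32-LyraT-Mini, so I went for the ESP32-Korvo V1.1 at 230 yuan to see how well it handles far-field interaction.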

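   The board is introduced on this page: https://github.com/espressif/esp-skainet/blob/master/docs/zh_CN/hw-reference/esp32/user-guide-esp32-korvo-v1.1.md (it covers the board's hardware configuration, a description of the default firmware, the PCB schematic, the PCB layout, and more). Now let's try out the default firmware: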

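   The default firmware is very simple: the wake word is "嗨,樂鑫" ("Hi, Lexin"), followed by eight voice commands, which are also listed in the boot log below.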

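Connect the power USB cable and the serial USB cable, open a serial terminal, power on the board, and press the RST button. The following is printed: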

ets Jul 29 2019 12:21:46

rst:0x1 (POWERON_RESET),boot:0x13 (SPI_FAST_FLASH_BOOT)
configsip: 0, SPIWP:0xee
clk_drv:0x00,q_drv:0x00,d_drv:0x00,cs0_drv:0x00,hd_drv:0x00,wp_drv:0x00
mode:DIO, clock div:1
load:0x3fff0018,len:4
load:0x3fff001c,len:7128
load:0x40078000,len:11824
ho 0 tail 12 room 4
load:0x40080400,len:7344
entry 0x40080798
I (64) boot: Chip Revision: 3
I (71) boot_comm: chip revision: 3, min. bootloader chip revision: 0
I (42) boot: ESP-IDF v3.2.3-201-gfda11a6-dirty 2nd stage bootloader
I (42) boot: compile time 14:40:42
I (42) boot: Enabling RNG early entropy source...
I (48) qio_mode: Enabling default flash chip QIO
I (53) boot: SPI Speed      : 80MHz
I (57) boot: SPI Mode       : QIO
I (61) boot: SPI Flash Size : 16MB
I (66) boot: Partition Table:
I (69) boot: ## Label            Usage          Type ST Offset   Length
I (76) boot:  0 factory          factory app      00 00 00010000 003c0000
I (84) boot:  1 nvs              WiFi data        01 02 003d0000 00004000
I (91) boot: End of partition table
I (96) boot_comm: chip revision: 3, min. application chip revision: 0
I (103) esp_image: segment 0: paddr=0x00010020 vaddr=0x3f400020 size=0x16fbd4 (1506260) map
I (510) esp_image: segment 1: paddr=0x0017fbfc vaddr=0x3ffb0000 size=0x00414 (  1044) load
I (511) esp_image: segment 2: paddr=0x00180018 vaddr=0x400d0018 size=0xc7034 (815156) map
I (733) esp_image: segment 3: paddr=0x00247054 vaddr=0x3ffb0414 size=0x032bc ( 12988) load
I (737) esp_image: segment 4: paddr=0x0024a318 vaddr=0x40080000 size=0x00400 (  1024) load
I (740) esp_image: segment 5: paddr=0x0024a720 vaddr=0x40080400 size=0x1959c (103836) load
I (797) boot: Loaded app from partition at offset 0x10000
I (797) boot: Disabling RNG early entropy source...
WYB:Slave mode:80 !
WYB:ES7210_MCLK_CTL_REG02 = 0xC3!
************** enter es8311 init **************
Quantized wakeNet5: wakeNet5_v1_hilexin_5_0.95_0.90, mode:0 (Nov 20 2019 14:34:43)
Quantized MN1_2 (Dec  5 2019 20:22:00)
SHIFT: 8, 11, 17, 18, 18, 17, 7, 16, 15, 13, 
I (287) MN: ---------------------SPEECH COMMANDS---------------------
I (297) MN: Command ID0, phrase 0: guan bi dian deng
I (297) MN: Command ID1, phrase 1: da kai bai deng
I (307) MN: Command ID2, phrase 2: da kai hong deng
I (307) MN: Command ID3, phrase 3: da kai lv deng
I (317) MN: Command ID4, phrase 4: da kai lan deng
I (317) MN: Command ID5, phrase 5: da kai huang deng
I (327) MN: Command ID6, phrase 6: da kai cheng deng
I (327) MN: Command ID7, phrase 7: da kai zi deng
I (337) MN: ---------------------------------------------------------
ESP32-Korvo V1.1 Firmware V0.1
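
The second-stage bootloader line in the log reports ESP-IDF v3.2.3-201-gfda11a6-dirty. That string has the shape of `git describe` output: the nearest release tag, the number of commits made after that tag, the abbreviated commit hash prefixed with `g`, and a `-dirty` suffix when the working tree had uncommitted changes at build time. Below is a minimal Python sketch that splits such a string into its parts. The function name `parse_idf_version` and the regular expression are my own illustration, not part of ESP-IDF, and the pattern only handles plain numeric tags such as v3.2.3 (not release-candidate or beta tags).

```python
import re

# Shape assumed here: <tag>[-<commits>-g<hash>][-dirty], as produced by `git describe`.
DESCRIBE_RE = re.compile(
    r"^(?P<tag>v\d+(?:\.\d+)*)"                     # nearest release tag, e.g. v3.2.3
    r"(?:-(?P<commits>\d+)-g(?P<sha>[0-9a-f]+))?"   # commits since the tag, short hash
    r"(?P<dirty>-dirty)?$"                          # uncommitted local changes
)


def parse_idf_version(version):
    """Split a git-describe style version string into its components."""
    match = DESCRIBE_RE.match(version)
    if match is None:
        raise ValueError("unrecognized version string: %r" % version)
    tag = match.group("tag")
    return {
        "tag": tag,
        # Numeric tuple of the tag, convenient for comparing releases.
        "release": tuple(int(part) for part in tag[1:].split(".")),
        "commits_since_tag": int(match.group("commits") or 0),
        "commit": match.group("sha"),
        "dirty": match.group("dirty") is not None,
    }


if __name__ == "__main__":
    info = parse_idf_version("v3.2.3-201-gfda11a6-dirty")
    print(info)
    # Tuples compare element by element, so this checks "older than v3.3.2".
    print("older than v3.3.2:", info["release"] < (3, 3, 2))
```

Read this way, the default firmware was built from a tree 201 commits past the v3.2.3 tag, at commit fda11a6, with local modifications.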

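The output above shows that this firmware was built with ESP-IDF v3.2.3-201-gfda11a6-dirty, a fairly old version; audio development these days is mostly done on v3.3.2. Now let's test the voice performance: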

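Hi, Lexin, turn on the red light (da kai hong deng):

Hi, Lexin, turn on the purple light (da kai zi deng):

Accompanied by this serial output: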

Wake-up word detected.
phrase ID: 7, prob: 0.951218
1175.010010: command 7 detected.
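
The phrase ID in that output refers back to the command table printed at boot, so ID 7 is "da kai zi deng" (turn on the purple light). As a rough sketch of how a captured serial log could be post-processed on a PC, the Python below builds the ID-to-phrase table from the `MN: Command ID...` lines and then annotates each `phrase ID` line with its phrase and probability. The function names and the sample text are my own illustration and not part of ESP-Skainet; the line formats are assumed to match what this firmware printed above.

```python
import re

# ANSI color sequences; the leading ESC byte is optional because it is
# often lost when a terminal log is pasted into a web page.
ANSI_RE = re.compile(r"\x1b?\[[0-9;]*m")
# e.g. "I (307) MN: Command ID2, phrase 2: da kai hong deng"
COMMAND_RE = re.compile(r"MN: Command ID(\d+), phrase \d+: (.+)")
# e.g. "phrase ID: 7, prob: 0.951218"
PHRASE_RE = re.compile(r"phrase ID:\s*(\d+),\s*prob:\s*([0-9.]+)")


def clean(line):
    """Remove color codes and surrounding whitespace from one log line."""
    return ANSI_RE.sub("", line).strip()


def parse_commands(log_text):
    """Return {command_id: pinyin_phrase} from the boot-time command table."""
    commands = {}
    for raw_line in log_text.splitlines():
        match = COMMAND_RE.search(clean(raw_line))
        if match:
            commands[int(match.group(1))] = match.group(2).strip()
    return commands


def parse_detections(log_text, commands):
    """Return a list of (command_id, phrase, probability) for each detection."""
    detections = []
    for raw_line in log_text.splitlines():
        match = PHRASE_RE.search(clean(raw_line))
        if match:
            command_id = int(match.group(1))
            phrase = commands.get(command_id, "unknown")
            detections.append((command_id, phrase, float(match.group(2))))
    return detections


# A shortened sample in the same format as the log in this post.
SAMPLE_LOG = """\
[0;32mI (307) MN: Command ID2, phrase 2: da kai hong deng[0m
I (327) MN: Command ID7, phrase 7: da kai zi deng
Wake-up word detected.
phrase ID: 7, prob: 0.951218
1175.010010: command 7 detected.
"""

if __name__ == "__main__":
    table = parse_commands(SAMPLE_LOG)
    for command_id, phrase, prob in parse_detections(SAMPLE_LOG, table):
        print("command %d (%s), prob %.3f" % (command_id, phrase, prob))
```

Building the table from the log rather than hard-coding it keeps the script usable if the firmware is rebuilt with a different command list.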

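Speaking softly at close range works without any problem. From about 5 meters away, I had to raise my voice for the interaction to work. Overall, the experience is pretty good!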
