音頻信號增強

原創

西檬饭

2020-06-17 16:41

除了模型結構和超參數，數據是值得重視的。
因爲垃圾進，垃圾出。
數據量不夠是常見的問題，使用數據增強的方式產生合成新數據，從某種程度上可以提高模型的泛化能力。
這裏介紹一些基本的音頻信號增強方法。

語音增強方法

添加噪聲（noise injection），時間平移(shifting time), 改變音高和速度(changing pitch and speed).
使用Numpy添加噪聲和時間平移；
使用librosa修改pitch和speed；

noise injection

import numpy as np
def manipulate(data, noise_factor):
    noise = np.random.randn(len(data))
    augmented_data = data + noise_factor * noise
    # Cast back to same data type
    augmented_data = augmented_data.astype(type(data[0]))
    return augmented_data

Shifting Time

向左右平移數秒，右平移在左邊填0，左平移在右邊填0.

import numpy as np
def manipulate(data, sampling_rate, shift_max, shift_direction):
    shift = np.random.randint(sampling_rate * shift_max)
    if shift_direction == 'right':
        shift = -shift
    elif self.shift_direction == 'both':
        direction = np.random.randint(0, 2)
        if direction == 1:
            shift = -shift
    augmented_data = np.roll(data, shift)
    # Set to silence for heading/ tailing
    if shift > 0:
        augmented_data[:shift] = 0
    else:
        augmented_data[shift:] = 0
    return augmented_data

Changing Pitch

隨機修改音高

import librosa
def manipulate(data, sampling_rate, pitch_factor):
    return librosa.effects.pitch_shift(data, sampling_rate, pitch_factor)

Changing Speed

修改速度

import librosa
def manipulate(data, speed_factor):
    return librosa.effects.time_stretch(data, speed_factor)

不要隨便合成數據，要在瞭解信號的基礎上選擇合適的方式去產生。

譯自：Data Augmentation for Audio
作者： Edward Ma
原文鏈接：https://medium.com/@makcedward/data-augmentation-for-audio-76912b01fdf6

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

音頻信號增強

語音增強方法

noise injection

Shifting Time

Changing Pitch

Changing Speed

《日本蠟燭圖》讀書筆記 & 技術分析回測

一分鐘部署 Llama3 中文大模型，沒別的，就是快

Python多線程編程深度探索：從入門到實戰

《期貨-市場技術分析》讀書筆記

mongodb處理json數據很好

ffmpeg 百度雲盤

頂級 Javaer 都在用的 20 個類庫，真香！

[轉帖]cpupower

google瀏覽器插件開發

35K*14 薪，入職了！這公司只要不裁員，我能一直呆下去！

重讀經典神經網絡-AlexNet

卷積神經網絡-LeNet5

音頻信號增強

Python開發環境遷移

TensorlFlow 2.0基本API

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結