nnAudio.features.vqt.VQT

class nnAudio.features.vqt.VQT(sr=22050, hop_length=512, fmin=32.7, fmax=None, n_bins=84, filter_scale=1, bins_per_octave=12, norm=True, basis_norm=1, gamma=0, window='hann', pad_mode='reflect', earlydownsample=True, trainable=False, output_format='Magnitude', verbose=True)

Bases: torch.nn.modules.module.Module

Methods

__init__

Initializes internal Module state, shared by both nn.Module and ScriptModule.

forward

Convert a batch of waveforms to VQT spectrograms.

forward(x, output_format=None, normalization_type='librosa')

Convert a batch of waveforms to VQT spectrograms.

Parameters

x (torch tensor) –

Input signal should be in either of the following shapes.

  1. (len_audio)

  2. (num_audio, len_audio)

3. (num_audio, 1, len_audio) It will be automatically broadcast to the right shape