nnAudio.features.vqt.VQT
- class nnAudio.features.vqt.VQT(sr=22050, hop_length=512, fmin=32.7, fmax=None, n_bins=84, filter_scale=1, bins_per_octave=12, norm=True, basis_norm=1, gamma=0, window='hann', pad_mode='reflect', earlydownsample=True, trainable=False, output_format='Magnitude', verbose=True)
Bases: torch.nn.modules.module.Module
Methods
__init__
Initializes internal Module state, shared by both nn.Module and ScriptModule.
forward
Convert a batch of waveforms to VQT spectrograms.
- forward(x, output_format=None, normalization_type='librosa')
Convert a batch of waveforms to VQT spectrograms.
- Parameters
x (torch tensor) – Input signal. It should have one of the following shapes:
1. (len_audio)
2. (num_audio, len_audio)
3. (num_audio, 1, len_audio)
It will be automatically broadcast to the right shape.
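The shape rule above can be sketched in plain Python. This is only an illustration: `to_batch_shape` is a hypothetical helper, not part of nnAudio, and it assumes that "the right shape" is the three-dimensional layout (num_audio, 1, len_audio) listed as the third option. It works on shape tuples rather than tensors so that it runs without any dependencies.

```python
def to_batch_shape(shape):
    """Map an accepted input shape to (num_audio, 1, len_audio).

    Hypothetical helper for illustration only; it is not nnAudio code.
    Assumption: the broadcast target is the 3-D layout from the list above.
    """
    shape = tuple(shape)
    if len(shape) == 1:
        # (len_audio): a single waveform becomes a batch of one
        return (1, 1, shape[0])
    if len(shape) == 2:
        # (num_audio, len_audio): add the singleton middle axis
        return (shape[0], 1, shape[1])
    if len(shape) == 3 and shape[1] == 1:
        # (num_audio, 1, len_audio): already in the final layout
        return shape
    # Anything else is not one of the three documented shapes
    raise ValueError(f"unsupported input shape: {shape}")


if __name__ == "__main__":
    for s in [(22050,), (4, 22050), (4, 1, 22050)]:
        print(s, "->", to_batch_shape(s))
```

Rejecting a three-dimensional input whose middle axis is not 1 is a choice made for this sketch, based on the documented shape; the library itself may treat such input differently.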