aeneas.cmfcc¶
aeneas.cmfcc is a Python C Extension for computing the MFCCs from a WAVE mono file.
-
cmfcc.compute_from_data(data, sample_rate, filter_bank_size, mfcc_size, fft_order, lower_frequency, upper_frequency, emphasis_factor, window_length, window_shift)¶ Compute MFCCs for a given WAVE mono file, passed as a NumPy 1D array of
float64values in[-1.0, 1.0].The returned tuple
(mfcc, length, sr)contains the MFCCs as a NumPy 2D matrix of shape(n, mfcc_size), and the number of samples and sample rate of the WAVE file.The last two elements
lengthandsrare returned to make the signature of this function consistent with that of functioncmfcc.compute_from_file().Parameters: - data (
numpy.ndarray(1D)) – the audio data - sample_rate (int) – the audio sample rate
- filter_bank_size (int) – the number of Mel filters
- mfcc_size (int) – the number of MFCC coefficients
- fft_order (int) – the order of the FFT
- lower_frequency (float) – the lower frequency to cut, in Hz
- upper_frequency (float) – the upper frequency to cut, in Hz
- emphasis_factor (float) – the pre-emphasis factor
- window_length (float) – the length of the MFCC window, in seconds
- window_shift (float) – the shift of the MFCC window, in seconds
Return type: tuple
- data (
-
cmfcc.compute_from_file(audio_file_path, filter_bank_size, mfcc_size, fft_order, lower_frequency, upper_frequency, emphasis_factor, window_length, window_shift)¶ Compute MFCCs for a given WAVE mono file, passed as a file path on disk.
The returned tuple
(mfcc, length, sr)contains the MFCCs as a NumPy 2D matrix of shape(n, mfcc_size), and the number of samples and sample rate of the WAVE file.Parameters: - audio_file_path (string) – the path of the WAVE file to be created, UTF-8 encoded
- filter_bank_size (int) – the number of Mel filters
- mfcc_size (int) – the number of MFCC coefficients
- fft_order (int) – the order of the FFT
- lower_frequency (float) – the lower frequency to cut, in Hz
- upper_frequency (float) – the upper frequency to cut, in Hz
- emphasis_factor (float) – the pre-emphasis factor
- window_length (float) – the length of the MFCC window, in seconds
- window_shift (float) – the shift of the MFCC window, in seconds
Return type: tuple