tf.raw_ops.Mfcc  |  TensorFlow v2.16.1 (original) (raw)

tf.raw_ops.Mfcc

Stay organized with collections Save and categorize content based on your preferences.

Transforms a spectrogram into a form that's useful for speech recognition.

View aliases

Compat aliases for migration

SeeMigration guide for more details.

tf.compat.v1.raw_ops.Mfcc

tf.raw_ops.Mfcc(
    spectrogram,
    sample_rate,
    upper_frequency_limit=4000,
    lower_frequency_limit=20,
    filterbank_channel_count=40,
    dct_coefficient_count=13,
    name=None
)

Mel Frequency Cepstral Coefficients are a way of representing audio data that's been effective as an input feature for machine learning. They are created by taking the spectrum of a spectrogram (a 'cepstrum'), and discarding some of the higher frequencies that are less significant to the human ear. They have a long history in the speech recognition world, and https://en.wikipedia.org/wiki/Mel-frequency_cepstrumis a good resource to learn more.

Args
spectrogram A Tensor of type float32. Typically produced by the Spectrogram op, with magnitude_squared set to true.
sample_rate A Tensor of type int32. How many samples per second the source audio used.
upper_frequency_limit An optional float. Defaults to 4000. The highest frequency to use when calculating the ceptstrum.
lower_frequency_limit An optional float. Defaults to 20. The lowest frequency to use when calculating the ceptstrum.
filterbank_channel_count An optional int. Defaults to 40. Resolution of the Mel bank used internally.
dct_coefficient_count An optional int. Defaults to 13. How many output channels to produce per time slice.
name A name for the operation (optional).
Returns
A Tensor of type float32.

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates. Some content is licensed under the numpy license.

Last updated 2024-04-26 UTC.