Proceedings of the 2024 IEEE International Conference on Multimedia and Expo (ICME), (Niagara Falls, Ontario, Canada, July 15–19, 2024)

Why Some Audio Signal Short-Time Fourier Transform Coefficients Have Nonuniform Phase Distributions

doi: 10.1109/ICME57554.2024.10687591

Cite This Publication

Stephen D. Voran ORCID logo

Abstract:

The short-time Fourier transform (STFT) represents a window of audio samples as a set of complex coefficients. These are advantageously viewed as magnitudes and phases and the overall distribution of phases is very often assumed to be uniform. We show that when audio signal STFT phase distributions are analyzed per-frequency or per-magnitude range, they can be far from uniform. That is, the uniform phase distribution assumption obscures significant important details. We explain the significance of the nonuniform phase distributions and how they might be exploited, derive their source, and explain why the choice of the STFT window shape influences the nonuniformity of the resulting phase distributions.

Keywords: discrete Fourier transform (DFT); noise reduction; speech enhancement; source separation; phase distribution; phase recovery; short-time Fourier Transform (STFT)

For technical information concerning this report, contact:

Stephen D. Voran
Institute for Telecommunication Sciences
(303) 497-3839
svoran@ntia.gov

Disclaimer: Certain commercial equipment, components, and software may be identified in this report to specify adequately the technical aspects of the reported results. In no case does such identification imply recommendation or endorsement by the National Telecommunications and Information Administration, nor does it imply that the equipment or software identified is necessarily the best available for the particular application or uses.

For questions or information on this or any other NTIA scientific publication, contact the ITS Publications Office at ITSinfo@ntia.gov or 303-497-3572.

Back to Search Results