I'd also wonder about the time resolution: 16 kHz = 1/16000 sec between
samples = 62.5 microseconds.
That's relatively long for ITD (TDOA) estimation, which would suggest that
localisation of lower frequency signals would be impeded.
(I don't have evidence for this: it's just a suggestion).
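
To put a rough number on that, here is a minimal sketch of the arithmetic.
The Woodworth spherical-head formula, the 8.75 cm head radius and the
343 m/s speed of sound are assumptions brought in purely for illustration,
not measured values, and the function names are hypothetical. It only
counts whole-sample steps and says nothing about what a band-limited
filter can represent between samples.

```python
import math

def sample_period_us(fs_hz):
    """Time between successive samples, in microseconds."""
    return 1e6 / fs_hz

def woodworth_itd_us(azimuth_deg, head_radius_m=0.0875, speed_of_sound=343.0):
    """Approximate ITD in microseconds for a rigid spherical head.

    Assumed model (Woodworth): ITD = (a / c) * (theta + sin(theta)),
    with theta the source azimuth in radians (0 = straight ahead).
    """
    theta = math.radians(azimuth_deg)
    return 1e6 * head_radius_m / speed_of_sound * (theta + math.sin(theta))

if __name__ == "__main__":
    max_itd = woodworth_itd_us(90.0)  # source directly to one side
    for fs in (16000, 44100, 48000):
        period = sample_period_us(fs)
        print(f"{fs} Hz: period {period:.1f} us, "
              f"max ITD spans about {max_itd / period:.1f} samples")
```

Under these assumptions the largest ITD is somewhere around 650
microseconds, so at 16 kHz the whole ITD range covers only about ten
whole-sample steps on each side.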
--Leslie Smith
Junfeng Li wrote:
> Dear all,
>
> We are working on 3D audio rendering for signals with low sampling
> frequency.
> As you may know, HRTFs are normally measured at a high sampling
> frequency, e.g., 48 kHz or 44.1 kHz. However, the sampling frequency of
> the sound signals in our application is restricted to 16 kHz. Therefore,
> to render this low-frequency (≤8 kHz) signal, one straightforward way is
> to first downsample the HRTFs from 48 kHz/44.1 kHz to 16 kHz and then
> convolve them with the sound signals.
> However, the sound localization performance of signals rendered with
> this approach is greatly degraded, especially elevation perception. To
> improve the sound localization performance, I am now wondering whether
> there is a good method to solve or mitigate this problem in this
> scenario.
>
> Any discussion is welcome.
>
> Thanks a lot again.
>
> Best regards,
> Junfeng
>
--
Prof Leslie Smith (Emeritus)
Computing Science & Mathematics,
University of Stirling, Stirling FK9 4LA
Scotland, UK
Tel +44 1786 467435
Web: http://www.cs.stir.ac.uk/~lss
Blog: http://lestheprof.com