[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: "Spectral holes" not "Birdies"?



Spectral holes result when the bit allocation model simply runs out of bits after coding the most psychoacoustically relevant spectral bins and is forced to allocate no bits to certain bins.  This
is considered to be a strong cause of swirlies or birdies.

	-----Original Message----- 
	From: AUDITORY Research in Auditory Perception on behalf of Danijel Domazet 
	Sent: Thu 3/16/2006 11:43 PM 
	To: AUDITORY@xxxxxxxxxxxxxxx 
	Cc: 
	Subject: "Spectral holes" not "Birdies"?
	
	
	 
	Hi Maxime,
	What you are talking about here is known as "spectral holes" (I think). 
	 
	Spectral holes occur in modern audio encoders as a result of (over)quantization which sets all spectral components of some (scalefactor) bands to zero. This is directly related to the masking threashold calculated by the psychoacoustic model, so the threashold should be modified in bands that must not be zeroed. 
	 
	Avoiding spectral holes is one of the top problems in designing good psychoacoustic models. 
	 
	One of the methods for spectral hole avoidance is presented in 3GPP's AAC encoder specification: TS 26.403 (freely available at www.3gpp.org). 
	 
	Regards,
	Daniel
	 
	 
	 

		----- Original Message ----- 
		From: Maxime Leroy <mailto:m.leroy@xxxxxxxxxx>  
		To: AUDITORY@xxxxxxxxxxxxxxx 
		Sent: Wednesday, March 15, 2006 8:34 PM
		Subject: [AUDITORY] RE : - "Birdies"

		Thank you Bob for you comments,
		 
		"But simple early schemes had interactions
		between the input signal and the sample frequency that caused
		"birdies" at sum and/or difference frequencies."
		 
		I realise now "birdies" might not be exactly what i meant. 
		I will rephrase then, if you ever looked closely at the spectrogram of a sample of music encoded at low bit-rate (20-64Kbps)by either mp3 or AAC codecs, you might have noticed dark spots in some places where it is obvious the energy of the signal is not suppose to be so small. I suppose that artifact is due to the richness of the signal at this precise moment (in comparison with the bit-rate) and therefore bit allocation can not cope with the demand. Then the coder being unable to encode leaves a hole in the spectrogram.
		 
		If i'm correct with the above assumption, what i'd like to know is if there is any documentation or perceptual intepretation of this problem of coding.
		 
		Regards,
		 
		Maxime

		 
  _____  

		De: AUDITORY Research in Auditory Perception de la part de Bob Masta
		Date: mer. 15/03/2006 14:30
		Ã: AUDITORY@xxxxxxxxxxxxxxx
		Objet : Re: - "Birdies"
		
		

		Hi, Maxime.  I'm not sure exactly what you are looking for, and I don't
		have any references to provide.  But if you are looking for a perceptual
		description, here's what I know:
		
		"Birdies" are little whistling sounds that are related to the
		program material, but are not harmonics of it.  They used to
		be a serious problem in sigma-delta converters, which compare the
		input signal to a reconstruction of the output signal, and generate
		a "higher than" or "lower than" response on each sample.  That
		1-bit stream is then used to create the reconstruction for the
		comparison (and the eventual output).  Nowadays, this is all
		done at very high sample rates and then ultimately converted
		down to a nominal rate, and the reconstruction processing is
		very sophisticated.  But simple early schemes had interactions
		between the input signal and the sample frequency that caused
		"birdies" at sum and/or difference frequencies.  The birdies might
		be only 40 dB down, but even if they were much softer than that
		they were clearly audible, especially on sparse program material
		like simple sine waves, flutes, etc, since they appeared in
		non-harmonic locations and were not masked by the program
		itself.  They also often had the annoying habit of sweeping in the
		opposite direction to a sweep in the signal frequency, which made
		them really obvious.
		
		
		Hope that helps!
		
		Best regards,
		
		Bob Masta
		



-----------------------------------------
This message (including any attachments) may contain confidential
information intended for a specific individual and purpose.  If you
are not the intended recipient, delete this message.  If you are
not the intended recipient, disclosing, copying, distributing, or
taking any action based on this message is strictly prohibited.