perceptual audio coders what to listen for artifacts of
play

Perceptual Audio Coders What to listen for: Artifacts of - PowerPoint PPT Presentation

Perceptual Audio Coders What to listen for: Artifacts of Parametric Coding Heiko Purnhagen, Bernd Edler Laboratorium f ur Informationstechnologie University of Hannover AES 109th Convention, Los Angeles, September 22-25, 2000 Hannover


  1. Perceptual Audio Coders – What to listen for: Artifacts of Parametric Coding Heiko Purnhagen, Bernd Edler Laboratorium f¨ ur Informationstechnologie University of Hannover AES 109th Convention, Los Angeles, September 22-25, 2000 Hannover 1 Laboratorium für Informationstechnologie

  2. Introduction: Parametric Coding Analysis Coding Audio Bit- Signal stream Source Perception Model Model Established coding techniques: • Speech coding: Excitation + Resonances (CELP) ⇒ source model extensively exploited • Audio coding: Spectral Decomposition (MPEG-1/2) ⇒ perception model extensively exploited ⇒ waveform coding techniques Hannover 2 Laboratorium für Informationstechnologie

  3. Introduction: Parametric Coding What is Parametric Audio Coding? • Idea: use “abstract” representation of audio signals ( musical score is more compact than waveform ) – decompose input signal into components – select appropriate source models for components – describe components by model parameters – use perception models to pick relevant compon.s ⇒ attractive for very low bitrate coding • Sound represented by model parameters ⇒ waveform approximation not necessary Hannover 3 Laboratorium für Informationstechnologie

  4. Introduction: Parametric Coding Examples of Parametric Coders: • Sinusoidal coding N � t � x ( t ) = ˆ a i ( t ) · sin( ϕ i + 2 π 0 f i ( τ ) dτ ) i =1 • Extentions to sinusoidal coding: +noise, +transients • MPEG-4 HVXC (parametric speech coder) • MPEG-4 HILN (parametric audio coder) • Q-Design QDMC (?) ⇒ various approaches, ongoing development Hannover 4 Laboratorium für Informationstechnologie

  5. Introduction: Coder Example Example: MPEG-4 Parametric Audio Coder HILN (“Harmonic and Individual Lines plus Noise”) Component models and parameters in HILN: fundamental freq. & LPC spectrum harmonic lines: individual lines: frequency & amplitude [opt.: ampl. envelope, start phase] noise: LPC spectrum Note: non-deterministic decoder behaviour (noise generator, random start phases) Hannover 5 Laboratorium für Informationstechnologie

  6. Introduction: Coder Example Example: MPEG-4 HILN @ 6 kbit/s ( f s = 16 kHz) Original Parameter Model Based Signal Decoding Synthesis Harmonic Dequant Components Sinusoidal Demux Dequant Components Bit- Audio stream Signal Noise Dequant Components Block diagram of HILN decoder Hannover 6 Laboratorium für Informationstechnologie

  7. Potential Parametric Coding Artifacts Potential artifacts related to source models: • limitations of source models • bad decomposition (hard decisions are problematic) • bad parameter estimation Potential artifacts related to perception models: • quantisation (consider “just noticeable differences”) • selection of most relevant components • is phase information irrelevant? (transients, clipping in sinusoidal synthesiser) Hannover 7 Laboratorium für Informationstechnologie

  8. Examples of Artifacts • Parametric coding: no waveform approximation ⇒ difference signal meaningless – original: pop music – coded by parametric audio coder – difference signal (original-coded) • Limitations of source models: model noise with sinusoids (e.g. applause) – original: white noise – coded using 0 to 120 sinusoids Hannover 8 Laboratorium für Informationstechnologie

  9. Examples of Artifacts • Limitations of source models: no model for transient (percussive) components – original: castanets – coded using sinusoids + noise – same, but with amplitude envelopes enabled Hannover 9 Laboratorium für Informationstechnologie

  10. Examples of Artifacts • Limitations of source models: specialised speech model not suitable for music – original: speech – coded by parametric speech coder – original: pop music – coded by parametric speech coder Hannover 10 Laboratorium für Informationstechnologie

  11. Examples of Artifacts • Bad signal decomposition: many sinusoids forced on harmonic grid – original: orchestral music – coded (harmonic component too strong) • Bad signal decomposition: many tonal components modelled as noise – original: pop music – coded (noise component too strong) Hannover 11 Laboratorium für Informationstechnologie

  12. Summary & Outlook Summary: Parametric Coding • attractive for very low bitrate audio coding • new types of artifacts (sounds “synthetic” ?) • more chances for “unlucky” decisions in encoder Outlook: ongoing development • parametric coding is still a young technique • encoders will improve . . . • parametric encoding = “auditory scene analysis” ? Hannover 12 Laboratorium für Informationstechnologie

  13. further reading ... • Parametric Audio Coding Bibliography http://www.tnt.uni-hannover.de/˜purnhage/ • MPEG Audio Web Page (tutorials, test reports, etc.) http://www.tnt.uni-hannover.de/project/mpeg/audio/ Hannover 13 Laboratorium für Informationstechnologie

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend