Sound Synthesis Graduate School of Culture Technology (GSCT) Juhan - - PowerPoint PPT Presentation

sound synthesis
SMART_READER_LITE
LIVE PREVIEW

Sound Synthesis Graduate School of Culture Technology (GSCT) Juhan - - PowerPoint PPT Presentation

CTP 431 Music and Audio Computing Sound Synthesis Graduate School of Culture Technology (GSCT) Juhan Nam 1 Outlines Brief history of sound synthesis Additive Synthesis Subtractive Synthesis Analog synthesizers Nonlinear


slide-1
SLIDE 1

1

Sound Synthesis

CTP 431 Music and Audio Computing

Graduate School of Culture Technology (GSCT) Juhan Nam

slide-2
SLIDE 2

Outlines

§ Brief history of sound synthesis § Additive Synthesis § Subtractive Synthesis

– Analog synthesizers

§ Nonlinear Synthesis

– Ring modulation / Frequency modulation – Wave-shaping

§ Sample-based Synthesis

2

slide-3
SLIDE 3

Brief History

§ Telharmonium (Cahill, 1897)

– Room-size additive synthesizer using electro-magnetic “tone wheels” – Transmitted through telephone lines (subscription only!) – Sound like organ: evolved into Hammond B3 organ (drawbars) – https://www.youtube.com/watch?v=PPlbXl81Rs0

§ Theremin (Léon Theremin, 1928)

– Two metal antennas recognize the relative position of hands by detecting the change of electro-magnetic fields – Each of them controls amplitude and pitch of a tone – https://www.youtube.com/watch?v=w5qf9O6c20o – https://www.youtube.com/watch?v=pSzTPGlNa5U

3

slide-4
SLIDE 4

Brief History

§ Music Concrete (Pierre Schaeffer, 1948)

– Creating sounds by splicing the pieces of tapes where sounds are recorded: sampling-based synthesis – Related to musical composition – https://www.youtube.com/watch?v=c4ea0sBrw6M – Mellotron (1963): https://www.youtube.com/watch?v=HdkixaxjZCM

§ RCA Synthesizer: Mark II (1957)

– First programmable synthesizer – Room-size and off-line processing as a synthesizer and a sequencer – https://www.youtube.com/watch?v=rgN_VzEIZ1I

§ Moog Synthesizers (Moog, 1964)

– Mini-moog (1971): the first popular synthesizer – “Switched-On-Bach” by Wendy Carlos – https://www.youtube.com/watch?v=usl_TvIFtG0

4

slide-5
SLIDE 5

Brief History

§ Yamaha DX7 (1983)

– FM synthesis, the first commercially successful synthesizer – Electronic piano sounds in 80’s pop music

§ Fairlight CMI (1979)

– The first sampling-based digital synthesizer – https://www.youtube.com/watch?v=iOlPCpSmhRM

§ Kurzweil K250 (1983)

– The first synthesizer that faithfully reproduced an acoustic grand piano

§ Yamaha VL-1 (1994)

– The first commercially available physical-modeling synthesizer

5

slide-6
SLIDE 6

Sound Synthesis Techniques

§ Categories

– Additive synthesis – Subtractive synthesis – Non-linear: modulation / wave-shaping – Sample-based synthesis – Physical modeling

6

Memory (Storage) Programmability (by # of parameters) Reproducibility of natural sounds Interpretability

  • f parameters

Computa=on power Addi=ve

** ***** **** **** ****

Subtrac=ve

* *** ** *** **

Non-linear

* *** ** ** **

Sample-base

***** * ***** N/A * ~ ***

Physical model

*** ** **** ***** *** ~ *****

slide-7
SLIDE 7

Additive Synthesis

§ Synthesize sounds by adding multiple sine oscillators

– Also called Fourier synthesis

7

OSC OSC OSC

. . .

Amp (Env) Amp (Env) Amp (Env)

. . .

+

slide-8
SLIDE 8

Sound Examples

§ Web Audio Demo

– http://femurdesign.com/theremin/ – http://www.venlabsla.com/x/additive/additive.html

§ Examples (instruments)

– Kurzweil K150

  • https://soundcloud.com/rosst/sets/kurzweil-k150-fs-additive

– Kawai K5, K5000 – Hammond Organ

8

slide-9
SLIDE 9

Subtractive Synthesis

§ Synthesize sounds by filtering wide-band oscillators

– Source-Filter model – Examples

  • Analog Synthesizers: oscillators + resonant lowpass filters
  • Voice Synthesizers: glottal pulse train + formant filters

9

5 10 15 20 −60 −50 −40 −30 −20 −10 10 20 Frequency (kHz) Magnitude (dB) 5 10 15 20 −60 −50 −40 −30 −20 −10 10 20 Frequency (kHz) Magnitude (dB) 0.5 1 1.5 2 2.5 x 10

4

−60 −50 −40 −30 −20 −10 10 20 Frequency (kHz) Magnitude (dB)

Source Filter Filtered Source

slide-10
SLIDE 10

§ Moog Synthesizer 


Subtractive Synthesis

10

Envelope

Envelope LFO Wheels Slides Pedal Physical Control

Filter Oscillators Amp

Keyboard Audio Path SoN Control Parameter = offset + depth*control (e.g. filter cut-off frequency) (sta=c value) (dynamic value)

Parameter Parameter Parameter

slide-11
SLIDE 11

Oscillators

§ Classic waveforms 
 § Modulation

– Pulse width modulation – Hard-sync – More rich harmonics

11

50 100 150 200 −2 2 5 10 15 20 −60 −40 −20 20 Frequency (kHz) Magnitude (dB)

−6dB/oct

50 100 150 200 −1 1 5 10 15 20 −60 −40 −20 20 Frequency (kHz) Magnitude (dB)

−6dB/oct

50 100 150 200 −2 2 5 10 15 20 −60 −40 −20 20 Frequency (kHz) Magnitude (dB)

−12dB/oct

Sawtooth Triangular Square

slide-12
SLIDE 12

Amp Envelop Generator

§ Amplitude envelope generation

– ADSR curve: attack, decay, sustain and release – Each state has a pair of time and target level

12

Note On Note Off

AUack Decay Sustain Release

Amplitude (dB)

slide-13
SLIDE 13

Examples

§ Web Audio Demos

– http://www.google.com/doodles/robert-moogs-78th-birthday – http://webaudiodemos.appspot.com/midi-synth/index.html – http://aikelab.net/websynth/ – http://nicroto.github.io/viktor/

§ Example Sounds

– SuperSaw – Leads – Pad – MoogBass – 8-Bit sounds: https://www.youtube.com/watch?v=tf0-Rrm9dI0 – TR-808: https://www.youtube.com/watch?v=YeZZk2czG1c

13

slide-14
SLIDE 14

Modulation Synthesis

§ Modulation is originally from communication theory

– Carrier: channel signal, e.g., radio or TV channel – Modulator: information signal, e.g., voice, video

§ Decreasing the frequency of carrier to hearing range can be used to synthesize sound § Types of modulation synthesis

– Amplitude modulation (or ring modulation) – Frequency modulation

§ Modulation is non-linear processing

– Generate new sinusoidal components

14

slide-15
SLIDE 15

Ring Modulation / Amplitude Modulation

§ Change the amplitude of one source with another source

– Slow change: tremolo – Fast change: generate a new tone

15

OSC OSC

(1+ am(t))Ac cos(2π fct) am(t)Ac cos(2π fct)

Amplitude Modula=on

x

Carrier Modulator

OSC OSC

x

Carrier Modulator

+ Ring Modula=on

slide-16
SLIDE 16

Ring Modulation / Amplitude Modulation

§ Frequency domain

– Expressed in terms of its sideband frequencies – The sum and difference of the two frequencies are obtained according to trigonometric identity – If the modulator is a non-sinusoidal tone, a mirrored-spectrum with regard to the carrier frequency is obtained

16

fc+fm fc fc-fm

am(t) = Am sin(2π fmt))

carrier sideband sideband

slide-17
SLIDE 17

Examples

§ Tone generation

– SawtoothOsc x SineOsc – https://www.youtube.com/watch?v=yw7_WQmrzuk

§ Ring modulation is often used as an audio effect

– http://webaudio.prototyping.bbc.co.uk/ring-modulator/

17

slide-18
SLIDE 18

Frequency Modulation

§ Change the frequency of one source with another source

– Slow change: vibrato – Fast change: generate a new (and rich) tone – Invented by John Chowning in 1973 à Yamaha DX7

18

Ac cos(2π fct + β sin(2π fmt)) β = Am fm

Index of modula=on

OSC OSC

Carrier Modulator frequency

slide-19
SLIDE 19

Frequency Modulation

§ Frequency Domain

– Expressed in terms of its sideband frequencies – Their amplitudes are determined by the Bessel function – The sidebands below 0 Hz or above the Nyquist frequency are folded

19

y(t) = Ac Jk(

k=−∞ k=−∞

β)cos(2π( fc + kfm)t)

fc+fm fc fc+2fm fc+3fm fc-fm fc-2fm fc-3fm

carrier sideband1 sideband1 sideband2 sideband2 sideband3 sideband3

slide-20
SLIDE 20

Bessel Function

20

Jk(β) = (−1)n(β 2 )k+2n n!(n + k)! n=0

50 100 150 200 250 300 350 −0.5 0.5 1 beta J_(k) Carrier Sideband 1 Sideband 2 Sideband 3 Sideband 4

slide-21
SLIDE 21

Bessel Function

21

slide-22
SLIDE 22

The Effect of Modulation Index

22

500 1000 1500 2000 −1 1 Time (Sample) Amplitude 0.2 0.4 0.6 0.8 1 −60 −40 −20 20 Frequency (kHz) Magnitude (dB) Beta = 0 500 1000 1500 2000 −1 1 Time (Sample) Amplitude 0.2 0.4 0.6 0.8 1 −60 −40 −20 20 Frequency (kHz) Magnitude (dB) Beta = 1 500 1000 1500 2000 −1 1 Time (Sample) Amplitude 0.2 0.4 0.6 0.8 1 −60 −40 −20 20 Frequency (kHz) Magnitude (dB) Beta = 10 500 1000 1500 2000 −1 1 Time (Sample) Amplitude 0.2 0.4 0.6 0.8 1 −60 −40 −20 20 Frequency (kHz) Magnitude (dB) Beta = 20

fc = 500, fm = 50

slide-23
SLIDE 23

“Algorithms” in DX7

23 hUp://www.audiocentralmagazine.com/yamaha-dx-7-riparliamo-di-fm-e-non-solo-seconda-parte/yamaha-dx7-algorithms/

slide-24
SLIDE 24

Examples

§ Web Audio Demo

– http://www.taktech.org/takm/WebFMSynth/

§ Sound Examples

– Bell – Wood – Brass – Electric Piano – Vibraphone

24

slide-25
SLIDE 25

Non-linear Synthesis (wave-shaping)

§ Generate a rich sound spectrum from a sinusoid using non- linear transfer functions (also called “distortion synthesis”) § Examples of transfer function: y = f(x)

– y = 1.5x’ – 0.5x’3 – y = x’/(1+|x’|) – y = sin(x’) – Chebyshev polynomial: Tk+1(x) = 2xTk(x)-Tk-1(x)

25

50 100 150 200 −1 1 Time (Sample) Amplitude 5 10 15 20 −60 −40 −20 20 Frequency (kHz) Magnitude (dB)

50 100 150 200 −1 1 Time (Sample) Amplitude 5 10 15 20 −60 −40 −20 20 Frequency (kHz) Magnitude (dB)

−1 −0.5 0.5 1 −1 −0.5 0.5 1 Time (Sample) Amplitude

x’=gx: g correspond to the “gain knob” of the distor=on T0(x)=1, T1(x)=x, T2(x)=2x2-1, T2(x)=4x3-3x

slide-26
SLIDE 26

Sample-based Synthesis

26

Synthogy Ivory II Piano Foley (filmmaking) Ringtones

§ The majority of digital sound and music synthesis today is accomplished via the playback of stored waveforms

– Media production: sound effects, narration, prompts – Digital devices: ringtone, sound alert – Musical Instruments

  • Native Instrument Kontakt5: 43+ GB (1000+ instruments)
  • Synthogy Ivory II Piano: 77GB+ (Steinway D Grand, ….)
slide-27
SLIDE 27

Why Don’t We Just Use Samples?

§ Advantages

– Reproduce realistic sounds (needless to say) – Less use of CPU

§ Limitations

– Not flexible: repeat the same sound again, not expressive – Can require a great deal of storage – Need high-quality recording – Limited to real-world sounds

§ Better ways

– Modify samples based on existing sound processing techniques

  • Much richer spectrum of sounds

– Trade-off: CPU, memory and programmability

27

slide-28
SLIDE 28

Sampling Synthesis

§ Overview

28

Off-Line Processing Recording Storing in table Off-line Online Processing Sample Fetching On-line

Meta-data: pitch, loudness, ac=on, text

User input Audio Effect

  • Sample edi=ng
  • Pitch change
  • Filter, EQ, envelope
  • Normaliza=on
  • Pitch Change
  • Filter, EQ
  • Envelope
  • Delay-based effects

(e.g. room Effects)

  • Pitch, velocity,

Timbre

  • Speed, strength
  • text

Read Samples from the table

slide-29
SLIDE 29

Wavetable Synthesis

§ Playback samples stored in tables

– Multi-sampling: choose different sample tables depending on input conditions such pitch and loudness

  • Velocity switching

§ Reducing sample tables in musical synthesizers

– Sample looping: reduce the size of tables – Pitch shifting by re-sampling: avoid sampling every single pitch – Filtering: avoid sampling every single loudness

  • e.g. low-pass filtering for soft input

29

slide-30
SLIDE 30

Sample Looping

§ Find a periodic segment and repeat it seamlessly during playback

– Particularly for instruments with forced oscillation (e.g. woodwind) – Usually taken from the sustained part of a pitched musical note

§ It is not easy to find an exactly clean loop

– The amplitude envelopes often decays or modulated:

  • e.g. piano, guitar, violin

– Period in sample is not integer à non-integer-size sample table?

30

AUack Loop

Playback using looping

slide-31
SLIDE 31

Sample Looping

§ Solutions

– Decaying amplitude: normalize the amplitude

  • Compute the envelope and multiply it inverse
  • Then, multiply the envelope back later

– Non-integer period in sample

  • Use multiple periods for the loop such that the total period is

close to integers * e.g. Period = 100.2 samples à 5*Period = 501 samples – Amplitude modulation

  • Crossfade between the end of loop and the beginning of loop

meet

§ Automatic loop search

– Pitch detection and zero-crossing detection: c.f. samplers

31

slide-32
SLIDE 32

Concatenative Synthesis

§ Splicing sample segments based on input information

– Typically done in speech synthesis: unit selection

§ Sample size depends on applications

– ARS: limited expression and context-dependent

  • word or phrase level

– TTS: unlimited expression and context-independent

  • phone or di-phone (phone-to-phone transition) level

32

slide-33
SLIDE 33

Pitch Shifting (Re-sampling)

§ Change pitch by adjusting the playback rate given sampling rate

– Corresponding to sliding tapes on the magnetic header in a variable speed (c.f. music concrete) – Down-sampling: pitch goes up and time shrinks (“chipmunk effect”) – Up-sampling: pitch goes down and time expands

§ Interpolation from discrete samples

– Convolution with interpolation filters (e.g. windowed sinc) – Need to avoid aliasing for down sampling

  • Narrowing the bandwidth of the lowpass filter à the shape of

sinc function gets wider – “resample.m” in Matlab

33

slide-34
SLIDE 34

Pitch Shifting (Re-sampling)

§ Interpolation with the windowed sinc function

34

h(t) = w(t)sinc(t) = w(t)sin(πt) πt

x(d) = x(k)

k=−(L−1) k=L

h(d − k)

Delayed by d ( 0 < d < 1)

−5 −4 −3 −2 −1 1 2 3 4 5 −0.4 −0.2 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 Windowed Sinc Sample Time −5 −4 −3 −2 −1 1 2 3 4 5 −0.4 −0.2 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 Windowed Sinc Sample Time

slide-35
SLIDE 35

Types of Interpolators

35

They are all lowpass filters with different transi=on bands. In general, interpolators with higher orders have narrower transi=on bands.

−5 −4 −3 −2 −1 1 2 3 4 5 0.5 1 1.5 Windowed Sinc Sample Time −5 −4 −3 −2 −1 1 2 3 4 5 0.5 1 1.5 Linear Sample Time −5 −4 −3 −2 −1 1 2 3 4 5 0.5 1 1.5 3rd−order B−spline Sample Time −5 −4 −3 −2 −1 1 2 3 4 5 0.5 1 1.5 3rd−order Lagrange Sample Time

slide-36
SLIDE 36

Pitch Shifting (Re-sampling)

§ Change in time and spectrum by the pitch shifting

36

[The DaFX book]

x(αt)↔ 1 α X( f α )

Scaling theorem in FT