• Ei tuloksia

Introduction: Who are we?

N/A
N/A
Info
Lataa
Protected

Academic year: 2023

Jaa "Introduction: Who are we?"

Copied!
22
0
0

Kokoteksti

(1)

Formants from the Wave Equation and

Recording Speech During MRI

Pertti Palo <jpalo (ät) math.hut.fi>,

Antti Hannukainen, Ville Havu, Teemu Lukkari and Jarmo Malinen

AB

HELSINKI UNIVERSITY OF TECHNOLOGY

Institute of Mathematics

(2)

Introduction: Who are we?

Antti Hannukainen: Programming, FEM

Ville Havu: Mathematical analysis, numerics, FEM

Teemu Lukkari: Mathematical analysis, acoustics, signal processing

Jarmo Malinen: Mathematical analysis, measurements

Pertti Palo: Phonetics, programming, measurements, acoustics, PR . . .

(3)

Background

(a)

6 8 10 12

−1 0 1 2 2 3 4 5 6 7 8 9 10 11

cm cm

cm

(b)

Figure 1: (a) The wave equation model and (b) a sample vowel geometry

The main goal is to simulate vowels based on a wave equation model.

We need accurate anatomic data and simultaneously recorded sound to validate the simulation results.

(4)

Acoustical model in more detail

By using the velocity potential

Φ

the perturbation pressure can be expressed as

p

= ρ

0

Φ

t, where

Φ

is a function, which is related to the particle velocity by

v = −∇ Φ

.

Solve

Φ

, for a given input signal

u

:

 

 

 

 

 

 

Φ

tt

= c

2

∆Φ

for

( r , t) ∈ Ω × R , Φ = 0

for

( r , t) ∈ Γ

1

× R ,

Φ

∂ν

= 0

for

( r , t) ∈ Γ

2

× R ,

and

Φ

t

+ c

∂νΦ

= 2 q

c

ρ0

u

for

( r , t) ∈ Γ

3

× R ,

(1)

where

u = u( r , t)

is a power input signal at the glottis end (per unit area),

c

the

speed of sound within the VT,

ν

the outer normal of

∂ Ω

, and ∂νΦ

= ν · ∇ Φ

.

(5)

Mathematics: Finite Element Method in our project

x

y

z

˛

Our initial mesh had about 64000 tetrahedral elements.

Computation of the resonance (Helmholtz) problem took about half an hour.

With the time dependent case, we do not aim to produce real time synthesis, but do expect to get relatively close.

(6)

Results: Computed formants in F2-F1 plane

0.8 1 1.2 1.4 1.6 1.8 2 2.2

0.3 0.35 0.4 0.45 0.5 0.55 0.6 0.65 0.7 0.75 0.8

F1 / kHz

F2 / kHz

EB99 EB99 oe FEM oe

Figure 2: Computed formants and Olov Engwall’s measured formants for long vowels in F2-F1 plane.

(7)

Results: Pressure distributions

cm

cm

F4

2 4 6 8 10 12 14

−1 0 1 2

Figure 3: Four approximate eigenfunctions corresponding to the lowest eigenval- ues ie. pressure distributions for formants 1-4. Glottis is on the left and mouth on the right.

(8)

Intermission

That was our status last spring.

So what have we been up to during the last year or so?

(9)

Sound measurements: What would we like to get?

The fundamental frequency F0, . . .

F1, F2, F3 and, if possible, F4 . . .

. . . and their bandwidths . . .

. . . before, after and during the MR imaging sequence.

Access to clean speech signal in real time.

(10)

Sound measurements: What’s the problem then?

No metal allowed inside the MRI main coil.

No magnetic material allowed inside the MRI room.

All electronics in the MRI room have to be RF-shielded.

Strong acoustic noise (over 90 dB SPL) present during the imaging sequence.

(11)

What did we decide to do?

The recording system is based on three main design principles:

1. using air as signal medium when unavoidable,

2. using real-time analog electronics for first stages of signal processing, and 3. using DSP for post-processing.

(12)

System measurement setup

Figure 4: The setup for acoustic field measurements

(13)

Point like sound source

Figure 5: This sound source will be used to measure the frequency response of the sound recording setup as well as the directionality of the sound collector

(14)

Sound collector

There is a two channel sound collector in our system. One channel is for noise and the other for the contaminated speech.

(a) (b)

Figure 6: The sound collector from (a) below with the sound source and (b) above

(15)

Acoustic wave guides

(a) (b)

Figure 7: (a) The acoustic wave guides hanging from a magnet free stative and (b) the suspension in close up

(16)

Faraday cage

(a) (b)

Figure 8: (a) The Faraday cage houses the microphones and (b) the acoustic waveguides enter the cage through electromagnetic waveguides

(17)

Microphone array

Figure 9: The microphone array consists of four microphones.

(18)

De-noising amplifier

Analog electronics provide real time response.

Overvoltage and RF shielded inputs

One speech input channel

Up to three noise input channels

Optional low-pass filtering and independent amplifications

(19)

Tests: Acoustic wave guides

500 600 700 800 900 1000 1250 1500 1750 2000 2400 2800 3300

−18420

−16

−14

−12

−10

−8

−6

−4

−2 0 2

Frequency (Hz)

Attenuation (dB)

Figure 10: Frequency response of the acoustic wave guides

(20)

Tests: Does the noise cancellation work with acoustic components?

1000

420 500 600 700 800 900 1250 1500 1750 2000 2400 2800 3300

−25

−20

−15

−10

−5 0

Frequency (Hz)

CMRR (dB)

Acoustic CMRR, optimal @ 1kHz Electronic CMRR, optimal @ 1kHz Acoustic CMRR, optimal @ 2.8 kHz

Figure 11: CMRR of the whole system excluding the sound collector

(21)

Tests: Two channel signal source

We used a custom built acoustic signal source to obtain the previous data.

(22)

Full circle

 

 

 

 

 

 

Φ

tt

= c

2

∆Φ

for

( r , t) ∈ Ω × R , Φ = 0

for

( r , t) ∈ Γ

1

× R ,

Φ

∂ν

= 0

for

( r , t) ∈ Γ

2

× R ,

and

Φ

t

+ c

∂νΦ

= 2 q

c

ρ0

u

for

( r , t) ∈ Γ

3

× R ,

Thank you.

Questions, please?

Viittaukset

LIITTYVÄT TIEDOSTOT

1 Introduction 2 Time-Frequency masking for speech enhancement 3 Sound direction of arrival (DOA) estimation 4 Results 5 Conclusions and

The sound event detection setup familiar from previous DCASE challenges deals with audio material containing target sound events and a reference annotation containing the labels,

The semantic similarity within a suitable number of nearest neighbors could be used as an objective measure for recommending labels for sound events, and the common label could

A sound event detection system is used to de- tect sound events present in the tested context and the event histogram constructed from the recognition result is matched with

&#34;pointer&#34; Different objects were fixed to the drums hanging on strings or chains Niurenius writes: &#34;On the drum itself, tied by sinews, hang claws and bones from all

Attempting to expand the dominant orientation in sound studies on the sensory experience of sound, Pinch and Bijsterveld’s discussion of the Sound Ear demonstrates the

The analysis shows to what extent the muscle activation underlying the sound-producing motion and the resultant sound on the same musical instrument can have similar amplitude

 Capacitive sensors are also commonly used to measure pressure..  “Condenser” microphones measure changes in air pressure of incoming