Home

Teaching

UIUC Website

PmWiki

Edit SideBar

http://www.pmwiki.org/

Last Modified : Mon, 26 Oct 15

ECE537-2014SpeechProcessing

  • ECE-537: Fundamentals of Speech Processing ; Description; UIUC ;
  • Website Fall ;; OLD:; ; ;; 2008; ;
  • Professor: Jont B. Allen
  • Time: 12-12:50 MWF; Location: 3013 ECEB
  • Textbook Course description; About the final; Exam I
  • Office Hours: Monday 10AM; Location: 3036 ECEB
  • This week's schedule
    *Many of the documents on this page are encoded in a format called Djvu. You may freely download the software to display such documents from . In many cases I have a corresponding PDF file, which I am happy to make available to you upon request.

ECE 537 Schedule (Fall 2014)

L W D Date Lecture and Assignment
Part I: Acoustics/VT modeling/Signal processing (20 Lectures)
1 35 M 8/25 Introductions.
Lecture: Overview: Speech production, Acoustics, Psychophysics, neural and information processing.
Entropy: Its meaning, definition and the intuition behind it;
Read: Flanagan, Chapt. 1: (pdf); Entire Book: (djvu (2014), pdf (2014))
2 W 8/27 Lecture: Mechanisms of Speech production: Sounds of speech: Vowels and consonants;
Read: Chapter 2 (pdf); Chapter 2 of Orig. Book (djvu);
HW1: Basic Acoustics (due Wed 9/4/11) (HW01.pdf, HW01-sol.pdf)
3 F 8/29 Lecture: Pressure, volume velocity, impedance. Intensity, speech power; sound level; dB, dB-SPL
Read: Chapter 3 (pdf)
- 36 M 9/1 Labor Day Holiday -- No class
4 W 9/3 Lecture: 1D wave equation; ABCD-Transmission (Chain) matrix; Acoustic Transmission lines; Matrix formulations; Vowels, Formants; Helmholtz resonator (Lecture Supplement)
Read: Flanagan, Ch. 2.2, 3.1-3.2 (pdf)
HW1 due
HW2: TL and reflectance (due Wed Sep 10, 2014) (pdf, sol-pdf, txline.m)
5 2 F 9/5 Lecture: Solution of 1-D transmission line equation: Lip radiation impedance, The Helmholtz resonator
Read: Lecture on Horns (2014 pdf)
6 37 M 9/8 Lecture:d'Alembert solutions for 1-D and 3-D transmission lines; Sound propagation in tubes in speech production; Introduction to reflectance;
Read:: Wave model of the Cat eardrum, Parent and Allen 2007 (djvu, pdf), 2010 Human ear canal , ;
Review: Matlab filter design, bilinear Z, FIR, IIR
7 W 9/10 Lecture: Transmission Lines with complex loads and the Propagated Reflectance;
Text: Flanagan, Ch. VI (Sect. 6.262, pp 272-276);
Readings:Conversion tables for 2-ports (djvu,pdf)
Bilbao PhD Thesis p. 1-15 (djvu, pdf) 25%
HW2 due; Ask if you want more time?
HW3: TLs with complex terminations (due in two weeks on Wed 9/24);
Simulation of the middle ear:'' pdf
8 F 9/12 Lecture:Impedance and reflectance at a tube junction (Karal correction djvu), and half-sphere (the mouth)
Text:: Ch. 3, Sec. 3.3, pages 136-152
9 38 M 9/15 Lecture: 2-port networks;
Read:: Bilabo Thesis (see page 14) (djvu,pdf); Text Chap. 3, sect. 3.8.1 pages 31-34 radiation impedance load;
Review: Rosowski, Carney and Peak (1988) on the cat middle ear djvu; Background material: Guinan and Peak djvu; Lynch et al. 1982 djvu
10 W 9/17 Lecture: cont. 2-port and the 3-port nasal tract; Network Postulates djvu;
Conversion tables for 2-ports (djvu,pdf)
Optional Read: Thevenin (djvu), Norton (djvu)
Read: Peterson and Barney (1952) djvu and Peterson (1952) djvu
11 F 9/19 Lecture: Horn Radiation Impedance; Karal Correction; Bernoulli's equation
Text: Flanagan pp 41-53; Read: vanDen Berg (1957) djvu
12 39 M 9/22 Lecture: 'Signal processing review:' Fourier Series, Fourier Transform, Laplace Transform, ZT, DTFT, DFT, FFT
13 W 9/24 Lecture: Cepstral analysis;
HW3 Due (sol-pdf; Matlab Solution: , )
HW4:
Vocal-Tract Simulation: Due in 16 days (pdf, files)
14 F 9/26 Lecture: History of acoustics: , pdf
BC: Pythagoras; Aristotle;
17C: Mersenne, Marin; Galilei, Galileo; Hooke, Robert; Boyle, Robert; Newton, Sir Issac;
18C: Bernoulli, Daniel; Euler; Lagrange; d'Alembert;
19C: Gauss; Laplace; Fourier; Helmholtz; Heaviside; Strutt, William; Rayleigh, Lord; , AG
20C: Campbell, George; Hilbert, David; , Emmy; , Harvey; Nyquist, Harry; , Henrik; Dudley, Homer; Shannon, Claude; Flanagan, James;
15 40 M 9/29 Lecture: Review HW1, HW2, HW3;
The glottal oscillator & Bernoulli's (, , )
Read:: Flanagan Sec. 3.74 pages 69-72
16 W 10/1 Lecture: Linear prediction of speech;
Read: Flanagan Sec. 8.112, pp 372-376; Sec. 8.13, pp 390-395; Atal and Hanauer (1971) pdf,djvu
17 F 10/3 Lecture: 'STFT window methods;' Inverse STFT; STFT for speech processing with analysis/synthesis; filtering; Allen & Rabiner (1977) pdf,djvu)
Reading spectrograms
Read:: Flanagan Chap. 8, pp 361-363;
Begin writing your Final exam: Part I;
- 41 M 10/6 No office hours; Optional class: Discuss of Exam; No class due to Exam I: 7-9:30 PM; 3013 ECEB NO CRIB SHEET
Part II: Psychoacoustics (10 Lectures)
18 W 10/8 Lecture: Room acoustics; point source; 1, 2 and 6 wall Image method; Wall reflection coef. for finite impedance walls: (pdf); Di and Gilbert (1993) (djvu)
19 F 10/10 Lecture: Music and audio compression coding (MPEG)
Read: $\mu$-Law speech coding pdf; Music coding methods pdf
20 42 M 10/13 Special Guest Lecture on Digital Speech Coding/compression, Prof. Mark Hasagawa-Johnson;
Read: Speech coding, '2002 Wiley Encyclopedia of Telecommunications' pdf
21 W 10/15 Lecture: Psychoacoustics I: Internal noise model of the JND Riesz pure-tone intensity JND (1928)
Masking: Weber's and Fechner's Law; Intensity JND and the near-miss;
'Psychoacoustics II:' Introduction to loudness, Steven's Law;
Loudness Lecture notes (Allen): pdf
Read:: Flanagan Chapter 4 (pdf, original pdf, djvu), Allen Review (pages 20-30) , Fletcher and Munson (1933) (pp 82-94)
HW5: LPC (Due 1 week 10/22) (pdf, Speech samples)
HW4 due
22 F 10/17 Lecture: Cochlear Physiology I: Middle ear and inner ear (Cochlear) anatomy, basilar membrane, 1D Models, Hair cells, Nonlinear basilar membrane;
Frequency JND, semitone, Internal noise and Masking; relation between the intensity and frequency JND (Cochlear frequency response and the slope of the tuning curve);
Read:: Review of Cochlear Modeling: Part II (pp 19-28):; page 151(166)-155(170) of Fletcher's 1929 Book djvu
MIT/HST-725: The auditory system
23 43 M 10/20 Lecture: Cochlear Physiology II: traveling waves, neural tuning curves, critical bands, hair cells, neural masking, Upward spread of masking; Forward masking;
Auditory Pathway I: Neural Tuning
Read:: Review of Cochlear Modeling (pp 1-19) ()
24 W 10/22 Lecture: Cochlear Physiology III: Micromechanics, OHC, IHC Lecture Notes (): Modeling the Cochlea and Organ of Corti, and
Read:: Wegel and Lane (1924), Part II (djvu)
HW6: Radial waves/STFT/OLA/Speech coding (Due Mon 11/10 in 2 weeks)(pdf, speech files)

HW5 due

25 F 10/24 Lecture: Psychoacoustics III: Relations between Psychophysics and the cochlea; Greenwood's place-map function
Read:: Review of Cochlear Modeling (pp 1-19) (), Part I; Fletcher and Munson (1933) (pp 82-94) (djvu)
26 44 M 10/27 Lecture: Cochlear Critical bands
Read::'Harvey Fletcher's role ...' (all but the section on the Articulation Index (AI)) djvu)
27 W 10/29 Lecture: Cochlear Physiology IV: The nonlinear Cochlea; The upward spread of masking (USM) and 2-tone suppression (2TS)
Supplement: The Auditory Nerve ()
Read:: Features in speech Allen and Li, (2010)(pdf)
28 F 10/31 Lecture: Prof. Wickesberg, Part I: The central auditory system; Neural signal and the Auditory Pathway)
29 45 M 11/3 Lecture: Prof. Wickesberg, Auditory Pathway (AN+CN), Part II; KalluriJCNS (pdf,pdf)
30 W 11/5 Lecture: Prof. Wickesberg, Auditory Pathway (AN+CN): Part III: (pdf, pptx)
Part III: Human Speech Recognition/Information Processing (10 Lectures)
31 F 11/7 Lecture: Information theory I: Information, Entropy, Relative Entropy; Channel Capacity
Read:: Pierce, JR ''Early days of Information Theory (pdf)
32 46 M 11/10 Lecture: Information theory II: Morse code example Shannon Channel
Read:: pp 1-10 Shannon (1948) (pdf I, pdf III)
33 W 11/12 Lecture: Information theory III:; Entropy, Relative Entropy, Markov models, State diagram;
Read:: Shannon (1950) (pdf)
34 F 11/14 Lecture: EM algorithm: Example: Speech and noise separation

Read:: French and Steinberg (1947) (djvu); Good-Turing djvu
Discussion of the EM alg. with examples
35 47 M 11/17 Lecture: Articulation Index/Speech Transmission Index/Speech Intelligibility Index; Support material: RV Shannon et al (1995) djvu, Smith et al (2002) djvu
HW7: Information processing (Due Thur 12/11); (pdf, files)
36 W 11/19 Lecture: STI/SII
Read:: Steeneken & Houtgast (1980) Speech Transmission Index (djvu) and Houtgast (1989) Modulation detection (djvu)
37 F 11/21 Lecture: Human speech recognition (HSR), Articulation Index (AI), average score: Pc(AI)=1-echanceeminAI, The confusion matrix (CM), maximum entropy syllable error models, etc.; Allen notes (djvu)
Read:: Continue with French and Steinberg (1947)djvu; Miller Nicely (1955) djvu; Miller Nicely confusions as a function of the articulation index; entropy, grouping and chance djvu
DUE: Preliminary version of your Final Exam for review

- 48 - - Thanksgiving Holiday (11/22-11/30)
- 49 M 12/1 Exam II: (NO CLASS) Psychoacoustics, Physiology, Speech, LPC, Tubes, AI, Historical items;Place: 3013 ECEB; Time: 7-9 PM
Assignment: Work on Final exam
38 W 12/3 Lecture: Effects of language and semantic context Miller (1962), Boothroyd (1988),
Allen Notes: Events and the AI(djvu)
Read: Miller, Heisen and Lichten (1951) (djvu)
39 F 12/5 Lecture: Language context models, Boothroyd; Bronkhorst93
40 50 M 12/8 Lecture: Language context models, cont., Bronkhorst93
Read: W. Li Random texts exhibit Zipf's-Law djvu
41 W 12/10 HW7 due;
Course Review; Final paper Prep;
- R 12/11 Reading Day
- 51 M 12/12-19 [Final paper Due Dec 13-20 TBD]

Final

The final is a 15-25 page paper, written in the style and format (but single column) of a journal paper, that discusses everything that you have learned in this course. Writing style, spelling, figures, labels of figures, are all part of the grade.

Allen's Grading of the final

The final is graded based on a list of all the topics that are covered. If there is a paragraph that discusses each topic on my list, then you get at least 1 point, and if the discussion covers the topic effectively, you can get up to 5 points. There are at least 20 topics on the list. When you get to 100 points, you get an A+ on the exam. I expect that you draw on the homework as a starting point. Don't just dump the homework into the exam without modification, that wont get you points. Don't just dump a large number of unexplained figures (that you got from someone else for example) and expect to get points. I need words around each figure. I am looking for insightful comments that link the material together.

Your comments on the relevance of each of the topics I covered in this course, homework problems, exams, etc., are welcome. No points will be taken off, nor given, for strong opinions on my teaching style, or lack thereof, organization, or lack thereof, etc. Please put all such comments in a discussion section at the end of the paper, isolated from the rest of the material.

course description

- - - Not proofed beyond here

Powered by PmWiki