BE2M31ZRE - Speech processing

Schedule of Seminars
Summer semester 2018/2019

Tools for practical exercises:

  1. Tu - 19.2.2019 - Introduction: speech signals, analysis tools
    - interactive work with signals in MATLAB, Praat, and Wavesurfer
    - recording of a database for further work during the semester
    Guidelines for the seminar

  2. Tu - 26.2.2019 - Basic time- and frequency-domain characteristics
    - Energy, power, intensity, RMS value
    - Zero-crossing rate
    - Spectrogram and filter-bank spectrum
    Guidelines for the seminar

  3. Tu - 5.3.2019 - Pitch estimation
    - Pitch estimation from the autocorrelation function
    - Necessary pre-processing and post-processing
    - Estimation within MATLAB and Praat (Wavesurfer)
    Guidelines for the seminar

    Tu - 12.3.2019 - Cancelled - out of the office - HOMEWORK
    - Completion and delivery of the minidatabase (processing of recorded utterances at home)

  4. Tu - 19.3.2019 - Cepstrum, cepstral distance and voice activity detection
    - DFT, LPC, and MFCC cepstrum
    - Cepstrum of a longer speech utterance
    - Energy-based and cepstral-based voice activity detection
    Guidelines for the seminar

  5. Tu - 26.3.2019 - LPC vocoder: MATLAB implementation of principal parts
    - AR model of speech signal
    - elementary vocoder with error signal excitation
    - quantization within error-excited vocoder (RELP simulation)
    - vocoder with artificially generated excitation
    Guidelines for the seminar

  6. Tu - 2.4.2019 - LPC spectrum and formant estimation
    - computation of LPC spectrum (spectrogram)
    - formant estimation from AR model (LPC parameters)
    - Estimation within MATLAB and Praat (Wavesurfer)
    Guidelines for the seminar

  7. Tu - 9.4.2019 - Basics of classification based on GMM and VQ
    - principles of GMM and VQ
    - the recognition of vowels based on formants or cepstrum
    Guidelines for the seminar

  8. Tu - 16.4.2019 - DTW-based speech recognition
    - computation of the distance matrix
    - finding the optimal path through the given distance matrix
    - simple recognizer of isolated digits
    Guidelines for the seminar

  9. Tu - 23.4.2019 - HMM-based recognition I
    - Recognition of vowel sequence
    - Computation of GMM for particular states
    - Emission probability in selected states
    Guidelines for the seminar

  10. Tu - 30.4.2019 - HMM-based recognition II
    - Emission probability in all states
    - Transition probabilities
    - Passing through HMM, Viterbi decoding, optimum path
    Guidelines for the seminar

  11. Tu - 7.5.2019 - Speaker verification on the basis of vector quantization
    - cepstrum distribution for a given speaker, differences among speakers
    - creation of speaker codebook
    - cross-verification using the available database
    - interactive on-line verification
    Guidelines for the seminar

    Tu - 14.5.2019 - Cancelled - change of scheduling

  12. Tu - 21.5.2019 - Speaker identification on the basis of GMM
    - GMM-based speaker verification (a simple modification of the task from last seminar)
    - use of an extended feature vector
    - search for a selected speaker within a set of unknown utterances
    Guidelines for the seminar