Ceska verze teto stranky Back to main page

BE2M31ZRE - Speech processing - Seminars
Summer semester 2022/2023

Tools for practical exercises:

  1. Tu - 21.2.2023 - Introduction: speech signals, analysis tools
    - interactive work with signals in the environment of MATLAB, Praat, Wavesurfer
    - recording of database for further work during the semester
    Guidelines for the seminar
    Task to be delivered (5 POINTS) - till Fr 3.3.2023 10:00, see Tak No.1 - Delivery of minidatabes with recorded utterances for details.

  2. Tu - 28.2.2023 - Basic time- and frequency-domain characteristics
    - Energy, power, intensity, RMS value
    - Zero-crossing rate
    - Spectrogram and filter-bank spectrum
    Guidelines for the seminar

  3. Tu - 7.3.2023 - Pitch estimation
    - Pitch estimation from autocorrelation function
    - Necessary pre-processing and post-processing
    - Estimation within MATLAB and Praat (Wavesurfer)
    Guidelines for the seminar

  4. Tu - 14.3.2023 - LPC spectrum and formant estimation
    - computation of LPC spectrum (spectrogram)
    - formant estimation from AR model (LPC parameters)
    - Estimation within MATLAB and Praat (Wavesurfer)
    Guidelines for the seminar

  5. Tu - 21.3.2023 - LPC vocoder: MATLAB implementation of principles parts - checked homework (5 POINTS)
    - AR model of speech signal
    - elementary vocoder with error signal exication
    - quantization within error-excited vocoder (RELP simulation)
    - vocoder with atificially generated exication
    Guidelines for the seminar

  6. Tu - 28.3.2023 - Cepstrum, cepstral distance and voice activity detection
    - DFT, LPC, and MFCC cepstrum
    - Cepstrum of longer speech utterance
    - Energy-based and cepstral based voice activity detection
    Guidelines for the seminar

  7. Tu - 4.4.2023 - Basics of classification based on GMM and VQ
    - principles of GMM and VQ
    - the recognition of vowels based on formants or cepstrum
    Guidelines for the seminar

  8. Tu - 11.4.2023 - DTW-based speech recognition
    - computation of distance matrix
    - looking for the optimal path through given distance matrix
    - simple recognizer of isolated digits
    Guidelines for the seminar

  9. Tu - 18.4.2023 - HMM-based recognition I - checked homework (5 POINTS)
    - Recognition of vowel sequence
    - Computation of GMM for particular states
    - Emitted probability in selected states
    Guidelines for the seminar

  10. Tu - 25.4.2023 - HMM-based recognition II
    - Emitted probability in all states
    - Transient probabilities
    - Passing through HMM, Viterbi decoding, optimum path
    Guidelines for the seminar

  11. Tu - 2.5.2023 - Speaker verification on the basis of VQ and GMM
    - cepstrum distribution for given speaker, differences among speakers
    - creation of speaker codebook and GMM model
    - realization cross verification from available database
    - interactive on-line verification
    Guidelines for the seminar

    Tu - 9.5.2023 - Cancelled - change of scheduling

  12. Tu - 16.5.2023 - Speaker identification on the basis of GMM
    - GMM based speaker verification (a simple modification of the task from last seminar)
    - usage of extended feature vector
    - search of selected speaker within a set of unknown utterances
    Guidelines for the seminar

  13. Tu - 23.5.2023 - SEMESTER TEST (20 POINTS) + Speech synthesis - checked homework (5 POINTS)