Digital Speech Processing

By Prof. Shyamal Kumar Das Mandal   |   IIT Kharagpur
Learners enrolled: 895
ABOUT THE COURSE:
Oral speech may be the most natural, common and direct mode of human communication. Since the middle of the last century, speech has been an area of intense and active research and development (R&D), with the aim of making it a prime means of direct Human-Computer Interaction (HCI). The pace of this R&D has been further boosted by the general abundance of cheap computing power in the form of PCs, PDAs and mobile handsets. Man-to-machine communication in speech mode, and spoken messages delivered directly by machine, have yet to reach the minimum threshold level for widespread deployment. This calls for research in speech science and the development of speech technology. The course provides foundational knowledge of speech production and perception, along with the processing of speech signals in the digital domain.

INTENDED AUDIENCE: ECE, CS, EE, IE

PREREQUISITES: Digital Signal Processing or Signals and Systems

INDUSTRY SUPPORT: Companies such as Microsoft, Google and IBM that work in the area of speech technology development
Summary
Course Status : Completed
Course Type : Elective
Language for course content : English
Duration : 8 weeks
Category :
  • Electrical, Electronics and Communications Engineering
Credit Points : 2
Level : Undergraduate
Start Date : 25 Jul 2022
End Date : 16 Sep 2022
Enrollment Ends : 08 Aug 2022
Exam Date : 25 Sep 2022 IST

Note: This exam date is subject to change based on seat availability. You can check the final exam date on your hall ticket.


Course layout

Week 1: Introduction to speech processing, Digitization and Recording of speech signal, Review of Digital Signal Processing Concepts
Week 2: Human Speech Production, Acoustic Phonetics and Articulatory Phonetics, Different categories of speech sounds and Location of sounds in the acoustic waveform and spectrograms
Week 3: Uniform Tube Modeling of Speech Production, Speech Perception
Week 4: Time Domain Methods in Speech Processing, Analysis and Synthesis of Pole-Zero Speech Models
Week 5: Short-Time Fourier Transform, Analysis: FT view and Filtering view, Synthesis: Filter bank summation (FBS) Method and Overlap-add (OLA) Method
Week 6: Features Extraction, Extraction of Fundamental frequency
Week 7: Speech Prosody, Speech Prosody Modeling (Fujisaki Model)
Week 8: Speech based Applications (TTS, ASR and spoken language acquisition)

Books and references

  1. Discrete-Time Speech Signal Processing: Principles and Practice, by Thomas F. Quatieri
  2. Theory and Applications of Digital Speech Processing, by L. R. Rabiner and R. W. Schafer

Instructor bio

Prof. Shyamal Kumar Das Mandal

IIT Kharagpur
Dr. S. K. Das Mandal was born in October 1975. He received the B.E. degree in Electronics and Telecommunication Engineering in 1998 and the Ph.D. degree in 2007 from Jadavpur University, India, and currently works at the Indian Institute of Technology Kharagpur as an Assistant Professor. His current research interests include automatic speech recognition, speech synthesis, and computer-assisted spoken language acquisition.

Course certificate

The course is free to enroll and learn from. But if you want a certificate, you have to register and write the proctored exam conducted by us in person at any of the designated exam centres.
The exam is optional and carries a fee of Rs 1000/- (Rupees one thousand only).
Date and Time of Exams:  25 September 2022 Morning session 9am to 12 noon; Afternoon Session 2pm to 5pm.
Registration url: Announcements will be made when the registration form is open for registrations.
The online registration form has to be filled in and the certification exam fee has to be paid. More details will be made available when the exam registration form is published. Any changes will be mentioned then.
Please check the form for more details on the cities where the exams will be held, the conditions you agree to when you fill in the form, etc.

CRITERIA TO GET A CERTIFICATE

Average assignment score = 25% of average of best 6 assignments out of the total 8 assignments given in the course.
Exam score = 75% of the proctored certification exam score out of 100

Final score = Average assignment score + Exam score

YOU WILL BE ELIGIBLE FOR A CERTIFICATE ONLY IF AVERAGE ASSIGNMENT SCORE >=10/25 AND EXAM SCORE >= 30/75. If one of the 2 criteria is not met, you will not get the certificate even if the Final score >= 40/100.
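
The scoring rules above can be sketched in a few lines of Python. This is only an illustration, not an official NPTEL calculator: the function name `certificate_result` is hypothetical, and it assumes that each of the 8 assignments and the proctored exam are marked out of 100.

```python
def certificate_result(assignment_scores, exam_score):
    """Apply the certificate criteria to raw scores.

    Assumption (not stated on the course page): each assignment score
    and the exam score are given out of 100.
    """
    # Keep the best 6 of the assignments submitted.
    best_six = sorted(assignment_scores, reverse=True)[:6]

    # 25% of the average of the best 6 assignments (component out of 25).
    assignment_component = 0.25 * sum(best_six) / len(best_six)

    # 75% of the proctored exam score (component out of 75).
    exam_component = 0.75 * exam_score

    final_score = assignment_component + exam_component

    # Both thresholds must be met; a high final score alone is not enough.
    eligible = assignment_component >= 10 and exam_component >= 30
    return assignment_component, exam_component, final_score, eligible


# Example: good assignments and a passing exam.
print(certificate_result([80, 60, 70, 90, 50, 40, 100, 30], 50))

# Example: perfect assignments, but the exam component falls below 30/75,
# so no certificate is awarded even though the final score exceeds 40.
print(certificate_result([100] * 8, 36))
```

In the second example the final score is 52/100, yet the learner is not eligible because the exam component is 27/75.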

The certificate will have your name, photograph and the score in the final exam with the breakup. It will have the logos of NPTEL and IIT Kharagpur. It will be e-verifiable at nptel.ac.in/noc.

Only the e-certificate will be made available. Hard copies will not be dispatched.

Once again, thanks for your interest in our online courses and certification. Happy learning.

- NPTEL team

