Natural Language Processing

By Prof. Pawan Goyal   |   IIT Kharagpur
Learners enrolled: 7457
This course starts with the basics of text processing including basic pre-processing, spelling correction, language modeling, Part-of-Speech tagging, Constituency and Dependency Parsing, Lexical Semantics, distributional Semantics and topic models. Finally, the course also covers some of the most interesting applications of text mining such as entity linking, relation extraction, text summarization, text classification, sentiment analysis and opinion mining. 

PREREQUISITES : Basic knowledge of probabilities for the lectures and python for programming assignment 
INDUSTRY SUPPORT : Microsoft Research, Google, Adobe, Xerox, Flipkart, Amazon

Course Status : Completed
Course Type : Elective
Duration : 12 weeks
Category :
  • Computer Science and Engineering
  • Artificial Intelligence
  • Data Science
Credit Points : 3
Level : Undergraduate/Postgraduate
Start Date : 25 Jul 2022
End Date : 14 Oct 2022
Enrollment Ends : 08 Aug 2022
Exam Date : 29 Oct 2022 IST

Note: This exam date is subjected to change based on seat availability. You can check final exam date on your hall ticket.

Page Visits

Course layout

Week 1:  Introduction and Basic Text Processing
Week 2:  Spelling Correction, Language Modeling
Week 3:  Advanced smoothing for language modeling, POS tagging 
Week 4:  Models for Sequential tagging – MaxEnt, CRF 
Week 5:  Syntax – Constituency Parsing 
Week 6:  Dependency Parsing 
Week 7:  Distributional Semantics
Week 8:  Lexical Semantics
Week 9:  Topic Models 
Week 10: Entity Linking, Information Extraction 
Week 11: Text Summarization, Text Classification 
Week 12: Sentiment Analysis and Opinion Mining 

Books and references

1. Dan Jurafsky and James Martin. Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics and Speech Recognition. Prentice Hall, Second Edition, 2009. 
Some draft chapters of the third edition are available online: https://web.stanford.edu/~jurafsky/slp3/

2. Chris Manning and Hinrich Schütze. Foundations of Statistical Natural Language Processing. MIT Press, Cambridge, MA: May 1999.

Instructor bio

Prof. Pawan Goyal

IIT Kharagpur
Prof. Pawan Goyal is an Assistant Professor at the Department of Computer Science and Engineering, IIT Kharagpur. His research interests include Natural Language Processing, Text Mining, Information Retrieval and Sanskrit Computational Linguistics. He has published around 75 research papers in international conferences and journals. He has published in various top-tier conferences and journals including ACL, NAACL, EMNLP, SIGIR, KDD, CIKM, JCDL, ICWSM, CSCW, Coling, IEEE and ACM transactions. Prior to joining IIT Kharagpur, he received his B. Tech. degree in Electrical Engineering from IIT Kanpur in 2007 and his Ph.D. degree in the faculty of Computing and Engineering from University of Ulster, UK in 2011. He was then a Post Doctoral Fellow at INRIA Paris Rocquencourt.

Course certificate

The course is free to enroll and learn from. But if you want a certificate, you have to register and write the proctored exam conducted by us in person at any of the designated exam centres.
The exam is optional for a fee of Rs 1000/- (Rupees one thousand only).
Date and Time of Exams: 29 October 2022 Morning session 9am to 12 noon; Afternoon Session 2pm to 5pm.
Registration url: Announcements will be made when the registration form is open for registrations.
The online registration form has to be filled and the certification exam fee needs to be paid. More details will be made available when the exam registration form is published. If there are any changes, it will be mentioned then.
Please check the form for more details on the cities where the exams will be held, the conditions you agree to when you fill the form etc.


Average assignment score = 25% of average of best 8 assignments out of the total 12 assignments given in the course.
Exam score = 75% of the proctored certification exam score out of 100

Final score = Average assignment score + Exam score

YOU WILL BE ELIGIBLE FOR A CERTIFICATE ONLY IF AVERAGE ASSIGNMENT SCORE >=10/25 AND EXAM SCORE >= 30/75. If one of the 2 criteria is not met, you will not get the certificate even if the Final score >= 40/100.

Certificate will have your name, photograph and the score in the final exam with the breakup.It will have the logos of NPTEL and IIT Kharagpur .It will be e-verifiable at nptel.ac.in/noc.

Only the e-certificate will be made available. Hard copies will not be dispatched.

Once again, thanks for your interest in our online courses and certification. Happy learning.

- NPTEL team

MHRD logo Swayam logo


Goto google play store