NATURAL LANGUAGE PROCESSING
Academic Year 2024/2025 - Teacher: MISAEL MONGIOVI'Expected Learning Outcomes
Based on the Dublin descriptors, students will, at the end of the course, acquire:
1) Knowledge and understanding:
The student will have a solid understanding of the basic principles and tools for natural language processing.
They will be able to understand the most recent advancements in the state of the art in the field of natural language processing.
2) Ability to apply knowledge and understanding:
The student will be able to analyze a text to extract relevant syntactic and semantic information.
They will effectively use specific tools for addressing the main tasks in natural language processing.
They will be able to navigate the landscape of natural language processing techniques and propose innovative solutions to tackle practical problems in this field.
3) Judgment skills:
The student will be able to evaluate the techniques and tools available in the field of natural language processing and select those most suited to solve specific problems.
4) Communication skills:
The student will have acquired the typical lexicon of the field of natural language processing and will be able to use terms in a correct and unambiguous manner, facilitating communication with other experts in the field and with non-specialists.
5) Learning skills:
The student will possess both theoretical and practical methodological skills to face and solve new challenges in the field of natural language processing.
They will also have a solid autonomy in their studies, allowing them to delve into specific topics and stay updated on the latest developments and advancements in the sector.
Course Structure
Required Prerequisites
Attendance of Lessons
Detailed Course Content
Textbook Information
Course Planning
Subjects | Text References | |
---|---|---|
1 | Introduction to Natural Language Processing | |
2 | Text normalization | |
3 | N-gram language models | |
4 | Naive Bayes for text classification | |
5 | Logistic Regression for text classification | |
6 | Distributional semantics and word embeddings | |
7 | Lexical resources | |
8 | Neural language models | |
9 | Transformers | |
10 | BERT | |
11 | POS-tagging and Named Entity Resolution | |
12 | Syntactic parsers | |
13 | Semantic parsers | |
14 | GPT (Generative Pretrained Transformer) architectures | |
15 | Large Language Models (LLMs) in practice | |
16 | NLP applications |
Learning Assessment
Learning Assessment Procedures
Students with disabilities and/or specific learning disorders (SLD) must contact the professor, the CInAP representative of the DMI (Prof. Daniele), and the CInAP well in advance of the exam date to communicate their intention to take the exam using appropriate compensatory measures.
Examples of frequently asked questions and / or exercises
- Describe the smoothing techniques for N-gram-based language models, highlighting the advantages and limitations of various methods.
- Discuss the importance of word embeddings in understanding natural language. Explain the functioning of the Word2Vec model.
- Explore the role of Transformers within neural language models. Illustrate the functioning of the BERT model.