194.093 Natural Language Processing and Information Extraction
This course is in all assigned curricula part of the STEOP.
This course is in at least 1 assigned curriculum part of the STEOP.

2020W, VU, 2.0h, 3.0EC
TUWEL

Properties

  • Semester hours: 2.0
  • Credits: 3.0
  • Type: VU Lecture and Exercise
  • Format: Online

Learning outcomes

After successful completion of the course, students are able to extract structure from natural language data by applying standard methods for text segmentation, word and sequence tagging, or syntactic parsing. They will have a high-level overview of the most important rule-based and learning-based approaches to each task and the standard methods for evaluating them. Students will gain a fundamental understanding of artificial neural networks and methods for training them, with a special emphasis on architectures for processing sequential data, allowing them to solve a variety of NLP tasks with deep learning. An overview of information extraction tasks will be given, allowing students to approach various problems involving the extraction of structured information from unstructured text data. A survey of common specialized IE tasks is also provided, acquainting the students with some of the most common NLP applications.

Subject of course

- Basics of text processing: segmentation, tokenization, decompounding, stemming, lemmatization; regular expressions

- N-gram language modeling, simple classification tasks in NLP

- Part-of-speech tagging, named entity recognition, and shallow parsing with Hidden Markov Models

- Syntactic representations and syntactic parsing

- Basics of natural language semantics

- Neural network basics. Feed forward networks and recurrent neural networks

- Sequence modeling and sequence-to-sequence models. 

- Neural language modeling. Word vectors and contextualized language models. 

- Information extraction tasks: entity recognition, relation extraction, knowledge base population

- Information extraction applications: summarization, question answering, chatbots

Teaching methods

Lectures on the fundamentals

2 assignments (individually done)

1 Term project (done in groups)

Mode of examination

Immanent

Additional information

The link to the online lectures is in TUWEL.


Workload for Students (in hours):

  • Lectures: 24
  • Homework (2 Exercises): 16
  • Final Project: 35

Summe: 75

Lecturers

Institute

Course dates

DayTimeDateLocationDescription
Fri13:00 - 15:0002.10.2020 - 22.01.2021 (LIVE)Natural Language Processing and Information Extraction Lecture
Natural Language Processing and Information Extraction - Single appointments
DayDateTimeLocationDescription
Fri02.10.202013:00 - 15:00 Natural Language Processing and Information Extraction Lecture
Fri09.10.202013:00 - 15:00 Natural Language Processing and Information Extraction Lecture
Fri16.10.202013:00 - 15:00 Natural Language Processing and Information Extraction Lecture
Fri23.10.202013:00 - 15:00 Natural Language Processing and Information Extraction Lecture
Fri30.10.202013:00 - 15:00 Natural Language Processing and Information Extraction Lecture
Fri06.11.202013:00 - 15:00 Natural Language Processing and Information Extraction Lecture
Fri13.11.202013:00 - 15:00 Natural Language Processing and Information Extraction Lecture
Fri20.11.202013:00 - 15:00 Natural Language Processing and Information Extraction Lecture
Fri27.11.202013:00 - 15:00 Natural Language Processing and Information Extraction Lecture
Fri04.12.202013:00 - 15:00 Natural Language Processing and Information Extraction Lecture
Fri11.12.202013:00 - 15:00 Natural Language Processing and Information Extraction Lecture
Fri18.12.202013:00 - 15:00 Natural Language Processing and Information Extraction Lecture
Fri08.01.202113:00 - 15:00 Natural Language Processing and Information Extraction Lecture
Fri15.01.202113:00 - 15:00 Natural Language Processing and Information Extraction Lecture
Fri22.01.202113:00 - 15:00 Natural Language Processing and Information Extraction Lecture

Examination modalities

2 assignments, 1 term project

 

Course registration

Begin End Deregistration end
21.09.2020 08:00 04.11.2020 23:55 04.11.2020 23:55

Curricula

Study CodeObligationSemesterPrecon.Info
066 645 Data Science Not specified

Literature

No lecture notes are available.

Language

English