Our group includes PostDocs, PhD students, and student assistants, and is headed by Prof. Felix Naumann. If you are interested in joining our team, please contact Felix Naumann.
For bachelor students we offer German lectures on database systems in addition to paper- or project-oriented seminars. Within a one-year bachelor project, students finalize their studies in cooperation with external partners. For master students we offer courses on information integration, data profiling, and information retrieval enhanced by specialized seminars, master projects and we advise master theses.
Most of our research is conducted in the context of larger research projects, in collaboration across students, across groups, and across universities. We strive to make available most of our datasets and source code.
Understanding the human language by machines is one of the important topics in computer science. There is a large range of tools and technologies for natural language processing that are used by many users in daily life: from the simplest cases such as spell checkers and grammar checkers to more complicated systems such as speech recognition, machine translation, question answering, email categorization, hand writing recognition, and search engines.
Processing natural language is divided into two main parts:
end to end applications that are frequently used by many people,
intermediate techniques that are the core part of different applications.
In this lecture, the main techniques and applications of natural language processing will be introduced. In addition we briefly describe language modeling and machine learning concepts that are required to deal with language processing techniques and applications.
Task: implement the probabilistics CKY algorithm that use a set of rules R. The rules will be sent to the mailing list. For the start up you can use the small set of rules from the slides.
Deadline: 30.05.2012
Exercise 2:
Task: finding similar words based on their context in a corpus. The detailed description will be sent to the mailing list.
Deadline: 20.06.2012
Course Book
SPEECH AND LANGUAGE PROCESSING
An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition