Project Description

The large volume of biological data available for research is a great opportunity to apply existing machine learning algorithms, as well as newly developed ones, in medical research. To this end, machine learning and data mining methods need to be adapted to the particular needs of bioinformatics. Alongside biological data, the number of published scientific results has also grown. We therefore apply text mining methods to make these results more accessible and easier to retrieve.

Subprojects

Text Mining in the Medical Domain
Data-Intensive Computational Biology

Project-Related Publications

Heller, D., Krestel, R., Ohler, U., Vingron, M., Marsico, A.: ssHMM: Extracting Intuitive Sequence-Structure Motifs from High-Throughput RNA-Binding Protein Data. Nucleic Acids Research. 45, 11004–11018 (2017).

Park, J., Blume-Kohout, M., Krestel, R., Nalisnick, E., Smyth, P.: Analyzing NIH Funding Patterns over Time with Statistical Text Analysis. Scholarly Big Data: AI Perspectives, Challenges, and Ideas (SBD 2016) Workshop at AAAI 2016. AAAI (2016).

Grundke, M., Jasper, J., Perchyk, M., Sachse, J.P., Krestel, R., Neves, M.: TextAI: Enhancing TextAE with Intelligent Annotation Support. Proceedings of the 7th International Symposium on Semantic Mining in Biomedicine (SMBM 2016). pp. 80–84. CEUR-WS.org (2016).
