Borchert, F., Schapranow, M.-P.: HPI-DHC @ BioASQ DisTEMIST: Spanish Biomedical Entity Linking with Pre-trained Transformers and Cross-lingual Candidate Retrieval. Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum. bll. 244–258. , Bologna, Italy (2022).
Borchert, F., Lohr, C., Modersohn, L., Witt, J., Langer, T., Follmann, M., Gietzelt, M., Arnrich, B., Hahn, U., Schapranow, M.-P.: GGPONC 2.0 - The German Clinical Guideline Corpus for Oncology: Curation Workflow, Annotation Policy, Baseline NER Taggers. Proceedings of the Language Resources and Evaluation Conference. bll. 3650–3660. European Language Resources Association, Marseille, France (2022).
Despite remarkable advances in the development of language resources over the recent years, there is still a shortage of annotated, publicly available corpora covering (German) medical language. With the initial release of the German Guideline Program in Oncology NLP Corpus (GGPONC), we have demonstrated how such corpora can be built upon clinical guidelines, a widely available resource in many natural languages with a reasonable coverage of medical terminology. In this work, we describe a major new release for GGPONC. The corpus has been substantially extended in size and re-annotated with a new annotation scheme based on SNOMED CT top level hierarchies, reaching high inter-annotator agreement (γ=.94). Moreover, we annotated elliptical coordinated noun phrases and their resolutions, a common language phenomenon in (not only German) scientific documents. We also trained BERT-based named entity recognition models on this new data set, which achieve high performance on short, coarse-grained entity spans (F1=.89), while the rate of boundary errors increases for long entity spans. GGPONC is freely available through a data use agreement. The trained named entity recognition models, as well as the detailed annotation guide, are also made publicly available.
Henkenjohann, R., Bergner, B., Borchert, F., Bougatf, N., Hund, H., Eils, R., Schapranow, M.-P.: An Engineering Approach towards Multi-Site Virtual Molecular Tumor Board Software Support. In: Pissaloux, E., Papadopoulos, G., Achilleos, A., en Velázquez, R. (reds.) ICT for Health, Accessibility and Wellbeing. IHAW 2021. bll. 156–170. Springer, Cham (2022).