For centuries, the art world was an entirely analog domain. In recent years, however, considerable effort has gone into digitizing artworks in order to preserve them and make them available to a wider audience. Digitization also enables the application of machine learning techniques such as image tagging and captioning, which can enrich large amounts of digitized art with meta information.
Generated tags enable archivists and researchers to handle the data more efficiently, because they can filter a collection by categories such as portraits or photographs of architecture. Tagging is also a first step toward making digitally published artworks accessible to visually impaired people, who can get an idea of an image's visual content even when no captions (produced by a dedicated image captioning algorithm) are available.
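As a minimal sketch of the filtering use case described above, the snippet below filters a small collection of records by generated tags. The record IDs and tag names are hypothetical, chosen only for illustration:

```python
# Hypothetical archive records: each entry pairs an image ID with its
# automatically generated tags. IDs and tags here are illustrative only.
records = [
    {"id": "img-0001", "tags": {"portrait", "painting"}},
    {"id": "img-0002", "tags": {"architecture", "photograph"}},
    {"id": "img-0003", "tags": {"portrait", "photograph"}},
]

def filter_by_tags(records, required):
    """Return the records whose tag set contains all required tags."""
    required = set(required)
    return [r for r in records if required <= r["tags"]]

# Archivists could, for example, retrieve all portraits:
portraits = filter_by_tags(records, {"portrait"})
print([r["id"] for r in portraits])  # ['img-0001', 'img-0003']
```

In practice, such queries would run against a database index rather than an in-memory list, but the principle of category-based retrieval over generated tags is the same.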
Understanding and tagging images is already widely researched on natural images (e.g. ImageNet). However, data from the art domain differs both visually and semantically, especially in the case of paintings. Archival images can suffer from degradation over time, and artistic images often contain visual cues that differ from the typical objects depicted in natural photographs. This is especially true for more abstract artworks.
In addition, art datasets often lack the labels needed to apply supervised techniques.
In this seminar, we want to:
- tag and caption a real-life photograph archive provided by our project partner, the Wildenstein Plattner Institute,
- investigate how existing methods can be applied to our unlabeled data,
- study connections between image tagging and captioning, and
- investigate how we can bridge the domain gap between artistic and natural images.