The student teams approached the challenge through intensive user research and arrived at complementary insights from different perspectives.
Both teams began with interviews, observations, and immersive methods to understand the perspectives of content creators, users, and experts. It quickly became clear that perception of the problem depends heavily on personal involvement. While creators see existential risks, many users only take the issue seriously when they are personally affected.
A key lesson: Trust is less a technical issue than a deeply human and emotional one. As soon as people imagine that their own voice could be misused, their attitude changes abruptly.
The two teams defined two different starting points for solutions:
1. “Content consumption” perspective – guidance instead of overwhelm
One team focused on content users who feel increasingly overwhelmed in their digital daily lives. Their insight: This led to the prototype of an AI Voice Detector – an app that analyzes content across platforms, detects AI-generated voices, and enables users to specifically filter, flag, or mute them.
- Users do not want to have to actively verify whether content is authentic.
- They desire simple, intuitive control over the content they consume.
- Trust is built through transparency and immediate feedback mechanisms.
The key insight: People do not need more information – but rather better decision-making capabilities at the moment of consumption.
2. “Content creation” perspective – protection instead of powerlessness
The second team focused on content creators and their need for protection. Their interviews revealed: Based on this, the team developed the AuthentiVoice prototype: an end-to-end platform that guides creators through the entire process – from detecting cloned voices and prioritizing cases to taking legal action such as filing copyright claims or issuing cease-and-desist letters.
- Many creators feel powerless in the face of abuse.
- There is a lack of clear processes for how they can take action against voice cloning.
- Existing solutions usually stop at detection – not at taking action based on that detection.
The key insight here: The real problem is not just detecting fakes, but the lack of ability to respond to them.