Offline

Throughout the years, we have been tracking the progress of cascaded solutions and end-to-end approaches on a variety of settings, including diverse languages, domains, speaking styles and recording conditions. We would continue this tradition and challenge the communities on their SLT solutions, including those using LLMs, with our evaluation framework.

The SLT system’s performance will be evaluated with respect to its capability to produce translations similar to the target-language references. Such similarity will be measured in terms of multiple automatic metrics: COMET, BLEURT, BLEU, TER, and characTER. The submitted runs will be ranked based on the COMET calculated on the test set by using automatic resegmentation of the hypothesis based on the reference translation by mwerSegmenter. The detailed evaluation script can be found in the SLT.KIT. Moreover, a human evaluation will be performed on each participant’s best-performing submission.

IWSLT25INSTRUCT test set

The IWSLT25Instruct test set consists of audio recordings from the scientific domain, specifically presentations of research papers at major NLP conferences within the *ACL community. These recordings feature one of the authors presenting their paper’s scientific content in English.

Entries for Offline

Language en
de
zh

TVSERIES test set

TV Series is part of ITV Plc, which includes the UK’s largest commercial broadcaster. They create and produce a broad range of programming (drama, entertainment, factual) in 13 countries, which they distribute globally, providing high-quality subtitles. We would like to thank ITV Studios for providing IWLST with samples of their video content for research and evaluation purposes and would like to ask you not to use these videos and/or the accompanying subtitles for any commercial purposes or to make them publicly available on any other website.

Entries for Offline

Language en
de

CHALLENGEACCENT test set

The accented English-to-German test set is originated from the Edinburgh International Accents of English Corpus. It is a data featuring conversations, each containing two friends interacting on a daily topic, such as hobbies and vacation. The speakers were selected to cover a wide range of English speakers around the globe. In addition to the variety of accents, another major challenge is the presence of spontaneous speech.

Entries for Offline

Language en
de

BUSINESSNEWS test set

Business News is part of SRMG, the largest integrated media group in the MENA (Middle East and North Africa) region. An exclusive content agreement with ‘Bloomberg Media’ powers this distinguished business news multi-platform, drawing on Bloomberg’s comprehensive coverage from more than 2,700 journalists and analysts globally. Asharq Business with Bloomberg is a leading source for Arabic economic news rich in context and content and unparalleled market data, delivered through a TV channel and across digital and social media platforms.

Entries for Offline

Language en
ar
de