Technical component

Machine Translation System

Domain-specific Machine Translation

The Machine Translation system produces accurate, contextually appropriate translations by fine-tuning general-purpose machine translation models on domain-specific scientific data.

Three translation models are currently available, covering the following language pairs: French to English, Spanish to English, and Portuguese to English.

Functionalities

FR-EN CTranslate2 model

French-to-English translation model fine-tuned on scientific parallel data from all four pilots.

ES-EN CTranslate2 model

Spanish-to-English translation model fine-tuned on scientific parallel data from all four pilots.

PT-EN CTranslate2 model

Portuguese-to-English translation model fine-tuned on scientific parallel data from all four pilots.
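As an illustration of how models in CTranslate2 format are typically used, the following is a minimal sketch, not the project's actual integration code. It assumes that each model lives in a directory named after its language pair (for example models/fr-en) and that the models use a SentencePiece tokenizer stored as spm.model in the same directory. The directory layout, the file name, and the helper names model_dir, translate, and load are all hypothetical; only the ctranslate2.Translator and sentencepiece.SentencePieceProcessor calls come from the respective libraries.

```python
from pathlib import Path

# Language pairs listed on this page; every model translates into English.
SUPPORTED_PAIRS = {("fr", "en"), ("es", "en"), ("pt", "en")}


def model_dir(src, tgt="en", root="models"):
    """Return the directory for a language pair (hypothetical layout: root/src-tgt)."""
    if (src, tgt) not in SUPPORTED_PAIRS:
        raise ValueError(f"unsupported language pair: {src}-{tgt}")
    return Path(root) / f"{src}-{tgt}"


def translate(sentences, translator, encode, decode):
    """Translate sentences with any object exposing CTranslate2's translate_batch.

    encode turns a string into a list of tokens; decode does the reverse.
    """
    batch = [encode(sentence) for sentence in sentences]
    results = translator.translate_batch(batch)
    # Each result holds one or more hypotheses; take the best (first) one.
    return [decode(result.hypotheses[0]) for result in results]


def load(src, tgt="en", root="models"):
    """Load a translator and tokenizer functions (assumes SentencePiece, spm.model)."""
    # Imported lazily so the helpers above work without these packages installed.
    import ctranslate2
    import sentencepiece as spm

    path = model_dir(src, tgt, root)
    translator = ctranslate2.Translator(str(path), device="cpu")
    sp = spm.SentencePieceProcessor(model_file=str(path / "spm.model"))
    return translator, (lambda text: sp.encode(text, out_type=str)), sp.decode
```

With the models downloaded into the assumed layout, a call would look like translator, encode, decode = load("fr") followed by translate(["Les résultats sont présentés ci-dessous."], translator, encode, decode). Keeping tokenization outside translate makes it easy to swap in whichever tokenizer the released models actually require.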

Roadmap

Jan 2023: TRL 6 - Model demonstration in a relevant environment
Jun 2024: TRL 7 - Service prototype demonstration in a space environment
Dec 2025: TRL 9 - Actual system "flight proven" through successful mission operations

For

Research Communities

Provided by

Contacts

Sokratis Sofianopoulos

Related Articles

Machine Translation for the Scientific Domain

11 July 2024
SciLake's partners from Athena RC present advancements in Machine Translation at the 25th Annual Conference of the European Association for Machine Translation.

Domain-Specific Machine Translation for SciLake

10 January 2024
Sokratis Sofianopoulos and Dimitris Roussis from Athena RC present their cutting-edge Machine Translation system, which will be integrated into the Scientific Lake Service.