Q14 – I deal with texts that are full of in-lines and tags. Most SMT systems only offer plain text and it takes a long time to copy and paste the in-lines/tags back in place. Have you done anything to solve this problem?

Yes, that´s right. Statistical machine translation systems usually produce plain text output because this is also the format they can process. However, we are keen to see PangeaMT solutions in use and adapted to the most demanding language industry requirements. This is why we focused our effort on developing SMT engines capable of handling in-line coding typical of other content formats used in localization production environments. Thanks to an innovative inline parser, PangeaMT can identify in-lines without attempting to translate them. An in-line placeholder is first inserted, and then replaced by the in-line itself before the output.

If the engine has to process very heavy in-line text, translation quality may suffer. In that scenario, some clients may choose to have in-line instances identified and presented in a given segment position (e.g. beginning or end), and have their post-editors move the tag back to the right place. Or they may opt to have the engine produce in-lines where they should be.

Experience shows that this is a fair measure. To our knowledge, our in-line parser constitutes an innovation well-above the current level of maturity of well-known SMT systems. Moreover, PangeaMT solutions are the only ones providing you with a choice of content formats (txt / TMX / XLIFF) as our mission is two-fold: to follow open standards and democratize Machine Translation as much as possible.

Check out our online demo, where you can test our TMX generator and some abridged versions of our domain engines in a few language directions!