Blog

Author: Dr. Jingyi Han, Machine Translation Scientist @ Language Weaver

Introduction

Byte Pair Encoding (BPE) has become one of the most commonly used tokenization strategies due to its universality and effectiveness in handling rare words. Although many previous works show that subword models with embedding layers generally achieve more stable and competitive results in neural machine translation (NMT), character-based (see issue #60) and byte-based subword...

NMT 135 Recovering Low-Frequency Words in Non-Autoregressive Neural MT

Author: Dr. Patrik Lambert, Senior Machine Translation Scientist @ Iconic

Introduction

Non-Autoregressive Translation (NAT), in which the target words are generated independently, is attracting a lot of interest because of its efficiency. However, the assumption that target words are independent of each other leads to errors that affect translation quality. In this post, we take a look at a paper by Ding et al. (2021) which confirms...

NMT 134 A Targeted Attack on Black-Box Neural Machine Translation

Author: Dr. Karin Sim, Machine Translation Scientist @ Iconic

Introduction

Last week we looked at how neural machine translation (NMT) systems are naturally susceptible to gender bias. In today’s blog post, we look at the vulnerability of an NMT system to targeted attacks, which could result in unsolicited or harmful translations. Specifically, we report on work by Xu et al. (2021), which examines attacks on black-box NMT...

NMT 133 Evaluating Gender Bias in Machine Translation

Author: Akshai Ramesh, Machine Translation Scientist @ Iconic

Introduction

We often tend to personify aspects of life in ways that vary with the beholder's interpretation. There are plenty of examples of this: “Mother Earth”, Doctor (Man), Cricketer (Man), Nurse (Woman), Cook (Woman), etc. MT systems are trained on large parallel corpora, which encode this social bias. If that is the case, then to what...

Relativity Fest London 2021

Date: 18 - 19 May 2021
Location: Virtual
Title: Relativity Fest London

Iconic is pleased to be a Silver Level sponsor at Relativity Fest London, which takes place on the 18th - 19th of May, 2021. Relativity Fest London is noted as one of the largest e-discovery events in Europe, providing a platform to transform discovery. Iconic’s integrated translation solutions for e-discovery, compliance, and digital forensics,...

The eDisclosure Systems Buyers Guide – 2021 Edition (Andrew Haslam)

Every springtime, those in the eDiscovery space await Andrew Haslam’s updated and ever-popular eDisclosure Systems Buyers Guide. Compiled in conjunction with ComplexDiscovery, it is a one-of-a-kind, exhaustive resource that offers practical advice on selecting litigation support services and software, while also providing a definitive collection of vendor and software information in the UK. This important resource has even been referred to...
