
With AI translation service that rivals professionals, Lengoo attracts new $20M round


Abstract image of faces with gears, arrows, clouds, and faucets to illustrate the concept of learning.
Image Credits: DrAfter123 / Getty Images

Most people who use AI-powered translation tools do so for commonplace, relatively unimportant tasks like understanding a single phrase or quote. Those basic services won’t do for an enterprise offering technical documents in 15 languages — but Lengoo’s custom machine translation models might just do the trick. And with a new $20 million B round, they may be able to build a considerable lead.

The translation business is a big one, in the billions, and isn’t going anywhere. Releasing a document, piece of software or live website in multiple languages, perhaps dozens, is simply too common a need.

These days that work is done by translation agencies, which employ expert speakers to provide translation on demand at a high level of quality. The rise of machine translation as an everyday tool hasn’t affected them as much as you might think, since the occasional Portuguese user using Google’s built-in webpage translation on a Korean website is very much a niche case, and translating social media posts or individual sentences isn’t really something you could or would farm out to professionals.

In these familiar cases, “good enough” is the rule, since the bare meaning is all anyone really wants or needs. But if you’re releasing a product in 10 different markets speaking 10 different languages, it won’t do to have the instructions, warnings, legal agreements or technical documentation perfect in one language and merely fine in the other nine.


Lengoo started from a team working on automating that workflow between companies and translators.

“The next step to take obviously was automating the translation itself,” said CEO and founder Christopher Kränzler. “We’ll still need humans in the loop for a long time. The goal is to get the models to the level where they’re actually usable and the human has fewer translations to make.”

With machine learning capabilities constantly being improved, that’s not an unrealistic goal at all. Other companies have started down that road. DeepL and Lilt, for instance, made their cases by showing major improvements over Google and Microsoft frameworks, but never claimed to remove humans from the process.

Lengoo iterates on their work by focusing on speed and specificity: that is, making a language model that integrates all the jargon, stylistic preferences and formatting requirements of a given client. To do this they build a custom language model, not just by training it on the customer’s own documents and websites, but by continually adding in feedback from the translation process itself.

Illustration showing an infinity sign on which are various components of the machine learning feedback process.
A fanciful representation of the self-improving model process. Image: Lengoo 

“We have an automated training pipeline for the models,” said Kränzler. “The more people contribute to the correction process, the faster the process gets. Eventually we get to be about three times faster than Google or DeepL.”

A new client may start with a model customized on a few thousand documents from the last couple years. But whenever the model produces text that needs to be corrected, it remembers that particular correction and integrates it with the rest of its training.

Diagram showing how fewer corrections are needed after the AI receives additional feedback.
An exciting bar graph. After 30 iterations, the segments requiring no corrections have doubled, and those requiring few are much increased. Image: Lengoo 

While the “quality” of a translation can be difficult to quantify objectively, in this case there’s no problem, because working as a human translator’s tool means there’s a quality check built right in. How good the translation is can be measured by “correction distance,” essentially the number of changes the human has to make to the model’s suggested text. Fewer corrections mean not only a better translation but a faster one, so quality and speed both have objective metrics.
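To make the idea of a correction distance concrete, here is a minimal sketch in Python. The article does not say how Lengoo actually computes its metric, so this assumes a plain word-level Levenshtein (edit) distance between the model's suggestion and the translator's final text. The function names `correction_distance` and `normalized_correction_distance` are hypothetical, chosen for illustration only.

```python
from typing import List


def correction_distance(suggested: str, corrected: str) -> int:
    """Word-level edit distance between the model's suggestion and the
    human translator's final text (an assumed stand-in for the metric)."""
    a: List[str] = suggested.split()
    b: List[str] = corrected.split()
    # prev[j] is the distance between the first i-1 words of a
    # and the first j words of b (row i-1 of the usual DP table).
    prev = list(range(len(b) + 1))
    for i, word_a in enumerate(a, start=1):
        curr = [i]  # turning i words into zero words takes i deletions
        for j, word_b in enumerate(b, start=1):
            cost = 0 if word_a == word_b else 1
            curr.append(min(
                prev[j] + 1,         # delete word_a
                curr[j - 1] + 1,     # insert word_b
                prev[j - 1] + cost,  # keep or substitute
            ))
        prev = curr
    return prev[-1]


def normalized_correction_distance(suggested: str, corrected: str) -> float:
    """Scale the distance by the longer text so segments of different
    lengths can be compared; 0.0 means no corrections were needed."""
    longest = max(len(suggested.split()), len(corrected.split()))
    return correction_distance(suggested, corrected) / longest if longest else 0.0
```

Under these assumptions, a segment the translator accepts unchanged scores 0, and the score grows with each word the translator has to delete, insert or replace. Tracking the normalized score across retraining rounds would give the kind of trend the bar graph describes, though a production metric might well weight edits differently or work at the character level.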

The improvements have won over customers that were leery of over-automation in the past.

“At the beginning there was resistance,” admitted Kränzler. “People turn to Google Translate for everyday translations, and they see the quality is getting better — they and DeepL have been educating the market, really. People understand now that if you do it right, machine translation works in a professional use case. A big customer may have 30, 40, 50 translators, and they each have their own style… We can make the point that we’re faster and cheaper, but also that the quality, in terms of consistency, goes up.”

Although customizing a model with a client’s data is hardly a unique approach, Lengoo seems to have built a lead over rivals and slower large companies that can’t improve their products quickly enough to keep up. And they intend to solidify that lead by revamping their tech stack.

The issue is that due to relying on more or less traditional machine learning technologies, the crucial translator-AI feedback loop is limited. How quickly the model is updated depends on how much use it gets, but you’re not going to retrain a large model just to integrate a few hundred more words’ worth of content. It’s expensive computationally to retrain, so it can only be done sporadically.

But Lengoo plans to build its own, more responsive neural machine translation framework that integrates the various pipelines and processes involved. The result wouldn’t improve in real time, exactly, but would include the newest information in a much quicker and less involved way.

“Think of it as a segment by segment improvement,” said applied research lead Ahmad Taie (segments vary in size but generally are logical “chunks” of text). “You translate one segment, and by the next one, you already have the improvements made to the model.”

Making that core product feature better, faster and easier to implement customer by customer is key to keeping clients on the hook, of course. And while there will likely be intense competition in this space, Kränzler doesn’t expect it to come from Google or any existing large companies, which tend to pursue an acquire-and-integrate approach rather than an agile development one.


As for the human expert translators, the technology won’t replace them but may eventually extend their effectiveness by as much as an order of magnitude, which may shrink the workforce. But if international markets continue to grow and with them the need for professional translation, they might just keep pace.

The $20 million round, led by Inkef Capital, will allow Lengoo to make the jump to North American markets as well as additional European ones, and integrate with more enterprise stacks. Existing investors Redalpine, Creathor Ventures, Techstars (out of which program the company originated), and angels Matthias Hilpert and Michael Schmitt all joined in the round, along with new investors Polipo Ventures and Volker Pyrtek.
