Meta has developed an AI translator for an unwritten language
Meta has developed an artificially intelligent translation system that can convert an oral language — Hokkien — to spoken English. It's another step closer to finally making Star Trek's Universal Translator a reality.
CEO Mark Zuckerberg shared a video demonstrating the tech alongside software engineer Peng-Jen Chen in a Facebook post on Wednesday. In it, the two converse in English and Hokkien respectively, with Meta's AI system audibly translating. The demonstration appears fairly impressive, though like Meta's VR legs, it's highly likely the video was edited for illustrative purposes and the current product isn't quite as smooth.
Translation AI is typically trained on text, with researchers feeding their systems reams of written words for them to learn from. However, there are over 3,000 languages that are primarily spoken and have no widely used written system, making them difficult to incorporate into such training.
Hokkien is one such language. Used by over 45 million people across mainland China, Taiwan, Malaysia, Singapore, and the Philippines, Hokkien is an oral language without an official, standard written system.
As such, Hokkien speakers who need to write down information tend to do so phonetically, resulting in significant variation depending on the writer. There is very little recorded data translating Hokkien to English either, and professional human translators are scarce.
To work around this, Meta used written Mandarin as an intermediary between English and Hokkien when training its AI.
"Our team first translated English or Hokkien speech to Mandarin text, and then translated it to Hokkien or English — both with human annotators and automatically," Meta researcher Juan Pino said. "They then added the paired sentences to the data used to train the AI model."
Tweet may have been deleted
Of course, filtering a sentence through multiple languages can sometimes distort its meaning — as anyone who has ever played around with Google Translate knows. Meta also worked with Hokkien speakers to check translations, and is releasing its models, data, and research as open-source information for other researchers to utilise.
Related Stories
- Meta's VR legs aren't real and can't hurt you
- Anne Hathaway and Jimmy Fallon's Google translated songs are catchier than the originals
- Meta has made thousands of dollars from hate speech ads, report finds
- Google Meet can now translate speech and turn it into captions on the fly
- Meta employees aren't thrilled about Mark Zuckerberg's metaverse chase
Mashable has reached out to Meta for comment.
We're still quite a way off from having a fully functioning TARDIS Translation Matrix. Meta's oral translation system can currently only handle one sentence at a time, and only works with translations between Hokkien and English. Even so, it's promising progress toward the real-time oral translation Meta is aiming for.
Meta's Hokkien-English oral translation project is part of Meta's ongoing Universal Speech Translator program, which aims to be able to translate speech between two languages as they're being spoken.
(责任编辑:产品中心)
- 水产品占“四席”!广州南沙十个农产品上榜“国字号”
- “三为汉源 爱家树风”大宣讲走进汉源县大树镇
- The 'Queer Eye' guys visited a town that's actually called Yass
- Police seek arrest warrant for Seungri in Burning Sun scandal
- CeeDee Lamb secures record
- Best TV deal: Get the Fire TV 50
- ‘Former President Chun Doo
- US spy aircraft on missions over East Sea: aviation tracker
- It's Unnecessary But, AMD Is Basically Lying About CPU Performance
- 爽!购物节上好多安逸货
- 芦山县龙门乡五星村老两口传承芦山花灯
- Donald Trump denies mocking reporter, accuses him of “grandstanding” on disability.
- “笋货”上市采购旺!清远西牛麻竹笋迎秋季尝鲜热
- Elon Musk is tied to Tesla, for better or worse.
- Echo Dot (5th gen) deal — get it for $29.99 at Amazon
- Trump, Cruz, Rubio, Bush: Here’s where things stand in the GOP race.
- Liberia Ebola death first since being declared free of virus.
- Lyft deal: Spend $100, save $15 on a Lyft gift card
- Ford can make your Mustang Mach
- Moon invites Kim Jong