
A speech at an international diplomatic conference was used to train a machine-learning translation system. Credit: Janek Skarzynski/AFP/Getty
The dream of the Babel fish, the instant-translation creature imagined in the classic science-fiction series The Hitchhiker’s Guide to the Galaxy, may be a little closer to reality. Researchers at the technology giant Meta have developed a machine-learning system that almost instantly translates speech in 101 languages into words spoken by a speech synthesizer in any of 36 target languages.
The system, called SEAMLESSM4T (Massively Multilingual and Multimodal Machine Translation), can also perform speech-to-text, text-to-speech and text-to-text translation. The results are published in the January 15 issue of Nature1.
Meta, a Menlo Park, California-based company that operates social-media sites such as Facebook, WhatsApp and Instagram, is releasing SEAMLESSM4T as open source so that other researchers can build on it, much as it has done with its LLaMA family of large language models.
Lack of data
Machine translation has come a long way in the past few decades, thanks to the introduction of neural networks trained on large data sets. But although training data are abundant for major languages, especially English, they are scarce for many others. This inequality limits the range of languages that machines can be trained to translate. “This has implications for languages that appear less frequently on the Internet,” wrote Allison Koenecke, a computer scientist at Cornell University in Ithaca, New York, in a News & Views article accompanying the paper.
Meta’s team built on its previous work on speech-to-speech translation2 and on a project called No Language Left Behind3, which aimed to provide text-to-text translation for nearly 200 languages. Researchers at Meta and elsewhere have found through experience that making a translation system multilingual can improve its performance even for languages with limited training data, although it is unclear why this happens.
The research team collected millions of hours of audio files of speeches, as well as human translations of those speeches, from the Internet and other sources such as United Nations archives. The authors also collected transcripts of some of these speeches.
The team also trained a model on data it knew to be reliable to recognize when two pieces of content match in meaning. This allowed the researchers to automatically pair roughly 500,000 hours of audio and text, matching each fragment in one language with the corresponding fragment in another.
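Conceptually, this kind of automatic alignment can be illustrated by mapping fragments from both languages into a shared embedding space and pairing those whose vectors are most similar. The sketch below is a minimal illustration, not Meta’s actual pipeline: the embed function is a hypothetical stand-in for a multilingual speech or text encoder, and the cosine-similarity matching and threshold are assumptions made for the example.

```python
import numpy as np

def embed(fragment: str) -> np.ndarray:
    """Hypothetical stand-in for a multilingual encoder that maps a
    speech transcript or text fragment to a unit-length vector."""
    rng = np.random.default_rng(abs(hash(fragment)) % (2**32))
    v = rng.standard_normal(16)
    return v / np.linalg.norm(v)

def align(source_fragments, target_fragments, threshold=0.5):
    """Pair each source-language fragment with its most similar
    target-language fragment by cosine similarity (illustrative only)."""
    src = np.stack([embed(f) for f in source_fragments])
    tgt = np.stack([embed(f) for f in target_fragments])
    sims = src @ tgt.T  # cosine similarities, since vectors are unit-norm
    pairs = []
    for i, row in enumerate(sims):
        j = int(np.argmax(row))
        if row[j] >= threshold:  # keep only sufficiently confident matches
            pairs.append((source_fragments[i], target_fragments[j], float(row[j])))
    return pairs

if __name__ == "__main__":
    english = ["The meeting is adjourned.", "Thank you, Mr President."]
    french = ["Merci, Monsieur le Président.", "La séance est levée."]
    for en, fr, score in align(english, french, threshold=-1.0):
        print(f"{en!r} <-> {fr!r}  (similarity {score:.2f})")
```

With a real multilingual encoder in place of the dummy embed function, fragments with similarity above the threshold would be kept as training pairs, which is the general idea behind mining parallel speech and text at scale.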