IBL News | New York
Meta this month announced the release of SeamlessM4T, an open-source AI model that can translate and transcribe nearly 100 languages across text and speech.
It’s available along with a new translation data set named SeamlessAlign. According to Meta, this is a “significant breakthrough” in the field of AI-powered speech-to-speech and speech-to-text translation.
“Our single model provides on-demand translations that enable people who speak different languages to communicate more effectively,” Meta told TechCrunch.
Several companies, such as Google, Amazon, Microsoft, OpenAI, and a number of startups, are investing resources in developing sophisticated AI translation and transcription tools.
Google is building a “Universal Speech Model,” part of a broader effort to create a model that can understand the world’s 1,000 most-spoken languages.
Mozilla, meanwhile, spearheaded Common Voice, one of the largest multi-language collections of voices for training automatic speech recognition algorithms.