THENEXTWEB.COM
DeepL takes on next frontier in AI translation with DeepL Voice
German tech darling DeepL has (finally) launched a voice-to-text service. Its called DeepL Voice, and it turns audio from live or video conversations into translated text.DeepL users can now listen to people speaking a language they dont understand and automatically translate it to one they do in real-time.The new feature currently supports English, German, Japanese, Korean, Swedish, Dutch, French, Turkish, Polish, Portuguese, Russian, Spanish, and Italian.What makes the launch of DeepL Voice exciting is that it runs on the same neural networks as the companys text-to-text offering, which itclaims is the worlds best AI translator.As someone whos just moved to a foreign country, Im keen to try a voice-to-text translator that actually might work. All the ones Ive tried so far arent real-time theres a lag that renders them pretty useless and the translation quality is pretty poor.Register NowFor face-to-face conversations, you can launch DeepL Voice on your mobile and place it between you and the other speaker. It then displays your conversation so each person can follow translations easily on one device.You can also integrate DeepL Voice into Microsoft Teams and video-conference across language barriers. The translated text appears on a sidebar as captions. It remains to be seen whether DeepL Voice will be available on platforms like Zoom or Google Meet anytime soon. The next frontier While this is DeepLs first such offering, its unlikely to be its last. DeepLs founder and CEO, Jarek Kutylowskicalled real-time voice translationthe next frontier for the business.DeepL is already a leader in written translation, but real-time speech translation is an entirely different story, said DeepLs founder and CEO, Jarek Kutylowski.When translating speech as it happens, youre dealing with incomplete input, pronunciation issues, latency and more, all of which can lead to inaccurate translations and poor user experience.Sowe built a solution that would take these into account from the offset and enable businesses to break down language barriers by enabling them to communicate in multiple languages as required, said Kutylowski.Quality will likely be DeepL Voices differentiating factor from the countless other providers of voice-to-text translations. From a technological perspective, DeepLs success lies in the architecture of its neural networks, the input from human editors, and the training data. But Kutylowski also believes it has a key advantage over its competitors: focus.Focus is always an important thing, Kutylowski previously told TNW. Translate isnt the core business of Google its one of the 100 side gigs. The same goes if you consider LLMs and the OpenAIs of this world as our competition; translation is only one thing of what theyre doing and their GPU is doing a tonne of different things. Were focused on one particular area.In May, the DeepL reached a $2bn valuation after securing a new investment of $300mn (277mn). It covers 32 languages and counts over 100,000 business users. Story by Sin Geschwindt Sin is a climate and energy reporter at TNW. From nuclear fusion to escooters, he covers the length and breadth of Europe's clean tech ecos (show all) Sin is a climate and energy reporter at TNW. From nuclear fusion to escooters, he covers the length and breadth of Europe's clean tech ecosystem. He's happiest sourcing a scoop, investigating the impact of emerging technologies, and even putting them to the test. Sin has five years journalism experience and holds a dual degree in media and environmental science from the University of Cape Town, South Africa. Get the TNW newsletterGet the most important tech news in your inbox each week.Also tagged with
0 Kommentare 0 Anteile 18 Ansichten