Meta unveils Voicebox, a generative AI instrument for text-to-speech

19 June 2023

Voicebox is Meta’s breakthrough in generative voice AI, which transforms textual content into reasonable and expressive speech. The AI instrument, which works just like ChatGPT or Dall-E, is a complicated AI mannequin able to performing speech technology duties reminiscent of content material enhancing, sampling and conversion type, even with out particular coaching, because of in-context studying.

Meta unveils Voicebox, a generative AI tool for text-to-speech

It stands out from different speech synthesis fashions by excelling in varied duties reminiscent of noise suppression, text-to-speech synthesis and cross-linguistic type switch, pushing the boundaries of artificial speech technology. Voicebox additionally surpasses present fashions in pace, working at a charge 20 occasions quicker.

Voicebox has undergone in depth coaching utilizing a dataset of over 50,000 hours of unfiltered audio. The AI mannequin was educated utilizing Meta’s progressive “Move Matching” approach, a flexible various to the diffusion-based studying strategies employed by different generative fashions.

Meta’s coaching dataset consists of recorded speeches and transcriptions of public area audiobooks in a number of languages, reminiscent of English, French, Spanish, German, Polish, and Portuguese.

In keeping with Mark Zuckerberg, Voicebox is “the first-ever generative AI voice mannequin able to performing duties for which it has not been particularly educated”.

Supply: Mark Zuckerberg

Sooner or later, Voicebox and comparable AI fashions will be capable to present natural-sounding voices to digital assistants and non-player characters within the metaverse. They’ll additionally allow visually impaired individuals to listen to messages written in acquainted voices utilizing AI and provides creators straightforward instruments to edit audio tracks in movies.

Voicebox and the hazards of deepfakes

Nonetheless, Voicebox might pose moral and social challenges, particularly within the context of deepfakes. Deepfakes, created by AI fashions, are artificial media that manipulates an individual’s voice, typically in malicious methods. Voicebox might create compelling deepfakes that imitate somebody’s voice or make them say issues they by no means stated. This might have severe implications for privateness, safety and belief.

Microsoft President Brad Smith raised considerations final month in regards to the injury brought on by deepfakes. He highlighted the necessity for mechanisms to distinguish real materials from AI-generated materials, particularly in instances of malicious intent. He referred to as for accountability and safety measures to take care of human management over vital infrastructure ruled by AI programs. Moreover, he proposed a system the place builders monitor utilization and supply transparency to determine manipulated movies, just like a KYC method.

Meta says it’s conscious of the potential hurt Voicebox might trigger and that the corporate is engaged on an efficient method to distinguish between genuine speech and audio generated by Voicebox. Whereas Voicebox remains to be underneath improvement and never presently accessible to the general public, Meta acknowledges the potential dangers related to superior AI expertise.

Be taught extra:

Meta unveils Voicebox, a generative AI instrument for text-to-speech

Voicebox and the hazards of deepfakes

기술이 게임을 더 안전하게 만드는 방식

Veelive: The place the Metaverse Meets Actual-World Funding Alternatives

Somnia launches Devnet, opening public testing for its blockchain

LEAVE A REPLY Cancel reply

Most Popular

Branding: Layer 1 of a Profitable Web3 Advertising and marketing Technique

Ether’s Loss of life Cross Threatens Additional Draw back as ETH Worth Trades at Key Help Stage

Elon Musk Reportedly Plans AI Startup to Rival ChatGPT Maker OpenAI

Poppe: US Banking Disaster Might Set off Bitcoin Bull Market

EU finalizes controversial sensible contract kill swap guidelines below knowledge legislation

Gemini Recordsdata Lawsuit In opposition to Digital Forex Group and Barry Silbert Over Genesis and Earn Program

Environmental affect of AI: pioneering options for a sustainable future

EDITOR PICKS

Bitcoin Worry and Greed Index Stays Grasping at 63 as BTC Recovers

Longest streak of Bitcoin withdrawals outpacing deposits indicators market shift

Celsius reportedly ready litigation in opposition to creditor for leaking inside info

TREND NEWS

Branding: Layer 1 of a Profitable Web3 Advertising and marketing Technique

Ether’s Loss of life Cross Threatens Additional Draw back as ETH Worth Trades at Key Help Stage

Elon Musk Reportedly Plans AI Startup to Rival ChatGPT Maker OpenAI

POPULAR Tags

ABOUT US

FOLLOW US