According to reports, a new language model from Microsoft Vall-E, can mimic any voice using only a three-second sample. The 60,000 hours of English speech data that were used to test this recently released AI tool.
Researchers from Cornell University claimed in a report that it could mimic the emotions and tone of a speaker. This ability has been demonstrated by Microsoft on the TTS system’s GitHub page. Vall-E will recite any text while maintaining the tone of the 3-second audio, which can be spoken in any emotional state (angry, tired, neutral, amused, disgusted, etc.).
Evidently, such conclusions held true even when capturing words that the original speaker had never actually pronounced.
According to the researchers, “Vall-E emerges in-context learning capabilities and can be used to synthesize high-quality personalized speech with only a 3-second enrolled recording of an unseen speaker as an acoustic prompt. Experiment results show that Vall-E significantly outperforms the state-of-the-art zero-shot [text to speech] system in terms of speech naturalness and speaker similarity”. They also reveal: “In addition, we find Vall-E could preserve the speaker’s emotion and acoustic environment of the acoustic prompt in synthesis.”
However, the research in text-to-speech AI comes with a warning.
“Since Vall-E could synthesize speech that maintains speaker identity, it may carry potential risks in misuse of the model, such as spoofing voice identification or impersonating a specific speaker,” the researchers say. “We conducted the experiments under the assumption that the user agree to be the target speaker in speech synthesis. When the model is generalized to unseen speakers in the real world, it should include a protocol to ensure that the speaker approves the use of their voice and a synthesized speech detection model.”
The European Union (EU) is launching investigations into tech giants Apple, Google, and Meta (Facebook’s parent company) over concerns they might not be following a new law designed to promote fair competition in the digital market.
Google Chrome is on the brink of initiating a seismic shift that threatens to reshape the very foundation of the modern internet. The imminent demise…
In a highly anticipated market debut, Reddit stock took off, closing its first day of trading with a staggering 48% surge from its initial public offering (IPO) price of $34 per share. The social media giant’s stock concluded the day at $50.44, propelling its market capitalization to over $8 billion.
In a significant move in Vietnam’s corporate landscape, Vingroup, the country’s largest conglomerate, has completed the sale of a 41.5% stake and other assets in its retail arm, Vincom Retail, for a staggering $1.6 billion. This transaction ranks among the largest mergers and acquisitions (M&A) deals in Vietnam in recent years, underscoring the dynamism of the country’s economy and corporate sector.
The integration of artificial intelligence (AI) into various facets of healthcare has revolutionized patient care, and its impact on surgery is poised to be profound. Leveraging AI’s capabilities to connect, analyze, and predict based on operating room data holds the promise of enhancing surgical efficiency and clinical decision-making. Recognizing this potential, NVIDIA has partnered with Johnson & Johnson MedTech to explore new AI capabilities within the realm of surgery, with the goal of advancing the company’s connected digital ecosystem for surgical procedures.
Apple is reportedly in discussions with Alphabet, Google’s parent company, regarding the licensing of Google’s “Gemini” artificial intelligence (AI) training model for integration into iPhones.
An Illinois jury has delivered a significant verdict in a case involving Mead Johnson’s Enfamil baby formula, ordering the company to pay $60 million to the mother of a premature baby who died from an intestinal disease after consuming the formula.
In the fast-paced world of software development, innovation is the name of the game. Cognition, a leading tech company, has introduced a groundbreaking solution that promises to transform the industry as we know it: Devin AI, the world’s first AI software engineer. Devin AI is not just another tool in the developer’s toolbox; it’s a game-changer that leverages the power of artificial intelligence and machine learning to automate coding tasks and streamline the development process.
When it comes to acing a job interview, there are certain tactics that can help you stand out and impress prospective employers. From showcasing your passion for the role to demonstrating problem-solving skills, there are plenty of strategies to employ. However, it’s equally important to be mindful of phrases that could raise red flags during the interview process.
The ongoing saga between the United States and ByteDance, the Chinese tech giant behind the popular social media platform TikTok, has taken another turn. With the House approving a bill requiring ByteDance to divest TikTok within roughly six months, the possibility of a forced sale or effective ban looms large.
-
Cultural differences between the East and the West
-
Vietnam received recognition in the 29th World Travel Awards in numerous categories
-
Vietnamese dishes that fascinate foreigners
-
First NFT vending machine in the world is now operational in New York
-
The LEGO Group today began construction on a new $1 billion factory in Binh Duong, Vietnam
-
19 DeFi Startups To Watch
Google Maps to revolutionize navigation with Satellite features, eliminating dead zones
East Asia’s Growth Outpaces Global Average Amidst China’s Economic Challenges, Says World Bank
Google Maps to revolutionize navigation with Satellite features, eliminating dead zones
EU Probes Apple, Google and Meta for Potential Violations of New Digital Law
Exploring the Enchanting Wonders of Hawaii- A Journey to Remember
Vibrant Activities in Ha Nam Cultural and Tourism Week 2023
Exploring the Enchanting Wonders of Hawaii- A Journey to Remember
Unveiling the Heritage Guide: Vietnam Airlines’ Exquisite Travel Companion for Exploring Vietnam
Thomson Medical Group Expands Southeast Asian Presence with Acquisition of FV Hospital in Vietnam
By Rotation: The Startup Revolutionizing Fashion with Peer-to-Peer Clothing Rental
Talks with Leaders, Ep8: 1st AI Doctor Assistant in Vietnam provides 90+% correct diagnosis within 5s
Talks with Leaders, Special episode: Blockchain meets Metaverse, featuring business leaders, investors, tech experts and global influencers
NVIDIA and Johnson & Johnson MedTech’s AI Collaboration Revolutionizing Surgery
Google Maps to revolutionize navigation with Satellite features, eliminating dead zones
Valentino Joins the Metaverse: An Exciting Partnership with UNXD
Coinbase Faces Technical Woes Amidst Bitcoin Surge to $64,000
Coinbase Faces Technical Woes Amidst Bitcoin Surge to $64,000
Exploring the Enchanting Wonders of Hawaii- A Journey to Remember
Abode: Artist Challenges Adobe’s Dominance with Lifetime Creative Software Suite
By Rotation: The Startup Revolutionizing Fashion with Peer-to-Peer Clothing Rental
Threads App’s Rollercoaster Ride: Plummeting 82% of Users Raise Concerns for Meta’s Social Experiment
Job Interview Red Flags: Phrases to avoid according to a former Google recruiter
Exploring the Enchanting Wonders of Hawaii- A Journey to Remember
Vibrant Activities in Ha Nam Cultural and Tourism Week 2023
Thomson Medical Group Expands Southeast Asian Presence with Acquisition of FV Hospital in Vietnam
By Rotation: The Startup Revolutionizing Fashion with Peer-to-Peer Clothing Rental
- Google Maps to revolutionize navigation with Satellite features, eliminating dead zones - April 22, 2024
- East Asia’s Growth Outpaces Global Average Amidst China’s Economic Challenges, Says World Bank - April 4, 2024
- EU Probes Apple, Google and Meta for Potential Violations of New Digital Law - March 27, 2024