MyShell introduced OpenVoice, an open-source AI for voice cloning

TL;DR:

  • MyShell, in collaboration with MIT and Tsinghua University, introduces OpenVoice, an open-source AI for voice cloning.
  • OpenVoice offers remarkable speed and precision in voice cloning, allowing granular control over tone, emotion, accent, and more.
  • Dual AI models enable instant voice cloning, with the first model handling language styles and emotions and the second focusing on tone conversion.
  • OpenVoice’s training on diverse datasets, encompassing multiple languages and emotions, empowers it to clone voices with minimal data.
  • MyShell, a Calgary-based startup with over 400,000 users, positions itself as a decentralized platform for AI app creation.
  • MyShell offers various AI applications, including chatbot personalities, meme generators, and user-created text RPGs, some of which are available through a subscription fee.
  • MyShell’s decision to open source OpenVoice through HuggingFace demonstrates its commitment to an open model of AI development.

Main AI News:

In a groundbreaking development, MyShell, the Calgary-based startup that has already garnered over 400,000 users, has released OpenVoice, a cutting-edge open-source AI. Developed in collaboration with researchers from MIT and Tsinghua University, this innovative technology offers voice cloning with unparalleled speed and precision, ushering in a new era in the world of AI.

OpenVoice boasts the remarkable ability to clone voices using just seconds of audio input, providing users with unprecedented control over various vocal elements such as tone, emotion, accent, rhythm, and more. This remarkable achievement opens up a myriad of possibilities for industries ranging from entertainment to customer service.

MyShell’s recent announcement has been met with widespread anticipation, as it promises to revolutionize the field of voice cloning. The technology is underpinned by two distinct AI models working in tandem to deliver exceptional results: one for text-to-speech conversion and the other for voice tone cloning.

The first model is responsible for managing language styles, accents, emotions, and various speech patterns. It has been meticulously trained on a diverse dataset of 30,000 audio samples, featuring speakers of English, Chinese, and Japanese, each expressing a wide range of emotions. This extensive training enables OpenVoice to replicate nuanced vocal nuances.

The second model, known as the “tone converter,” is equally impressive. It has learned from a vast dataset comprising over 300,000 samples encompassing 20,000 distinct voices. This comprehensive training empowers OpenVoice to accurately clone voices with minimal data, a feat that significantly outpaces alternatives like Meta’s Voicebox.

MyShell’s commitment to democratizing AI is evident in its approach to OpenVoice. By open-sourcing this remarkable technology through HuggingFace, MyShell is actively contributing to the advancement of an open model of AI development, enabling developers and innovators to explore new horizons in voice cloning and beyond.

MyShell’s broader ecosystem includes an array of AI applications, including original text-based chatbot personalities, meme generators, user-created text RPGs, and more. While some content is accessible through a subscription fee, the company also provides opportunities for bot creators to promote their creations on its platform.

Conclusion:

MyShell’s OpenVoice represents a significant leap forward in voice cloning technology. With its precision and speed, it has the potential to disrupt various industries, from entertainment to customer service. MyShell’s broader ecosystem of AI applications positions the company as a leader in democratizing AI development, offering opportunities for innovation and accessibility. The open-sourcing of OpenVoice through HuggingFace signals a promising future for AI development and collaboration.

Source