Microsoft has introduced a new tool, the Azure AI Speech text-to-speech avatar, which lets users create realistic avatars that deliver scripted content. Unveiled at the Microsoft Ignite 2023 event, the feature is currently available in public preview and generates videos of photorealistic avatars speaking text-based content.
Users can upload images of the desired avatar appearance and provide a script for the virtual character to speak. Microsoft’s tool utilizes a model to animate the avatar, while a separate text-to-speech model, either prebuilt or trained on the person’s voice, vocalizes the script. This technology enables users to efficiently create videos for diverse purposes, such as training sessions, product introductions, and customer testimonials, using straightforward text input.
Avatars generated with this tool can communicate in multiple languages and, in chatbot scenarios, can use AI models such as OpenAI’s GPT-3.5 to respond to unscripted questions from customers. Microsoft says it is wary of potential misuse of the technology. To address this, most Azure subscribers will only have access to prebuilt avatars, while custom avatars are restricted to specific use cases and require registration and approval.
This precaution aligns with Microsoft’s commitment to safeguarding individual and societal rights, encouraging transparent human-computer interaction, and combatting the dissemination of harmful deepfakes and misleading content. Microsoft emphasizes that the text-to-speech avatar is designed with responsible usage in mind.
To further ensure responsible usage, Microsoft has laid out specific requirements for users of custom avatars, TechCrunch reported. Customers must obtain explicit written permission and consent statements from avatar talent, detailing the duration, use, and any content limitations. Customers must also include disclosures stating that the avatars are AI-generated.
The tool’s introduction adds an engaging dimension to digital content creation and underscores Microsoft’s attention to ethical considerations. As concerns over deepfake technology escalate, the company aims to strike a balance between technological innovation and responsible use, setting a standard for AI-driven content creation tools.
{Categories} _Category: Platforms,*ALL*,_Category: Implications{/Categories}
{URL}https://www.indiatoday.in/technology/news/story/amid-deepfake-scrutiny-microsoft-launches-software-to-effortlessly-create-text-to-speech-ai-avatar-2464115-2023-11-17{/URL}
{Author}unknown{/Author}
{Image}https://akm-img-a-in.tosshub.com/businesstoday/images/story/202311/microsoft-is-hiring-a-software-engineering-intern-application-details-163450979-16x9_0-original.jpg{/Image}
{Keywords}{/Keywords}
{Source}Implications{/Source}
{Thumb}{/Thumb}