Accelerate the in-vehicle digital experience with Azure Cognitive Services | Azure Blog and Updates

Microsoft is helping to reshape the way the automotive industry serves its drivers with in-vehicle infotainment systems. For example, Azure is partnering with XPENG to enable AI voice experiences for car manufacturers and customers. The solution offers the industry a fresh take on text-to-speech, with expressive voice, global languages, speaker consistency, and self-service customization. XPENG joins a growing trend of automakers rethinking their investments in ambient voice.

“This is a cutting-edge exploration of auto voice interaction in the auto industry,” XPENG automotive AI product senior expert Hao Chao said. “The technology delivers a whole new level of natural speech. With a deep understanding of urban mobility, we’re finding many more scenarios to leverage AI technology for a high level of driver-machine intuition.”

XPENG tapped into Microsoft’s neural text-to-speech technology for its in-car user experience. By using Microsoft’s neural text-to-speech with emotional styles, XPENG can provide a more enjoyable listening experience for its customers and combat listening fatigue. Microsoft’s neural text-to-speech provides fluency and naturalness comparable to a human voice. Coupled with multi-emotional voices, Microsoft text-to-speech is a refreshing replacement for the monotonous sound many car assistants have today.

“We’re excited to reimagine how speech and voice can improve the lives of drivers,” Azure AI Speech Product Lead Binggong Ding said. “While from a technical standpoint, we really want to make this a model that can serve all auto brands and their developers. How do we best optimize the use of synthetic speech to enable a high-fidelity voice experience without compromising sound quality? XPENG is building upon this challenge to provide a voice assistant that customers have been looking for.”

Microsoft’s long-term goal is to make advanced multi-emotional, global voice capabilities the new standard for global car brands and consumers. The technology adopted by XPENG added dozens of voice styles, unique emotional intensity control, and deduction abilities. It covers 90 certifications worldwide, including domestic policies, regulatory data center requirements, EU GDPR, and the stricter requirements of data privacy policy holders. Together with car manufacturers, Microsoft is creating new driving experiences with speech, based on the text-to-speech and speech-to-text capabilities within Azure Cognitive Services for speech.

Accelerated speech innovation

Voice is the new interface in ambient computing technology. The quality of text-to-speech and speech-to-text has improved in recent years thanks to research and technological leaps enabled by the development of neural networks. High-quality speech-to-text and text-to-speech meet the automaker's need to create the next generation of modern in-car speech experiences. Microsoft speech-to-text offers robust recognition capabilities that are speaker-independent and able to handle ambient noise while driving. Microsoft text-to-speech features a more fluid, natural-sounding voice, which can be a differentiator for automakers and customers alike. Both speech-to-text and text-to-speech also enhance hands-free control of the car infotainment system. Microsoft text-to-speech supports multiple speaking styles, including chat, newscast, and customer service. These advancements give drivers a more enjoyable driving experience. For more information about recent advancements, see the research results for speech-to-text, which reached human parity on the Switchboard research benchmark, and for neural text-to-speech, which is close to human parity.
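As a rough illustration of how a speaking style might be requested, the sketch below builds an SSML document that wraps the text in an `mstts:express-as` element. The helper name `build_styled_ssml`, the default voice `en-US-JennyNeural`, and the style identifiers are assumptions for illustration and are not taken from this article; which styles a given voice supports, and the accepted range for `styledegree`, should be checked against the current Speech service documentation.

```python
from typing import Optional
from xml.sax.saxutils import escape, quoteattr


def build_styled_ssml(
    text: str,
    voice: str = "en-US-JennyNeural",  # assumed voice name, for illustration only
    style: str = "chat",               # assumed style identifier
    style_degree: Optional[float] = None,
    lang: str = "en-US",
) -> str:
    """Return an SSML string asking a neural voice to speak `text` in `style`.

    `style_degree` is optional; when given, it is emitted as the
    `styledegree` attribute to adjust how strongly the style is expressed.
    """
    attrs = "style=" + quoteattr(style)
    if style_degree is not None:
        degree = format(style_degree, "g")  # 1.5 -> "1.5", 2.0 -> "2"
        attrs += " styledegree=" + quoteattr(degree)

    return (
        '<speak version="1.0" '
        'xmlns="http://www.w3.org/2001/10/synthesis" '
        'xmlns:mstts="https://www.w3.org/2001/mstts" '
        "xml:lang=" + quoteattr(lang) + ">"
        "<voice name=" + quoteattr(voice) + ">"
        "<mstts:express-as " + attrs + ">"
        + escape(text)  # escape &, <, > so the SSML stays well-formed
        + "</mstts:express-as></voice></speak>"
    )


if __name__ == "__main__":
    print(build_styled_ssml("Traffic ahead is light.", style="newscast"))
```

The resulting string could then be passed to a synthesizer, for example the Speech SDK's `speak_ssml_async` method, assuming the SDK is installed and configured with a valid key and region.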

Offering global languages

Microsoft helps automakers cover their global business. It recently hit a milestone of 100 languages and now supports 119 languages and variants with 278 voices out of the box. This is aligned with our company vision to empower every person and organization on the planet to achieve more. “100 languages is a good milestone for us to achieve our ambition for everyone to be able to communicate regardless of the language they speak,” said Xuedong Huang, Microsoft Technical Fellow and Azure AI Chief Technology Officer. With more languages and their variants covered, we’re excited to be powering natural and intuitive voice experiences for automakers.

Differentiation with customization

Microsoft empowers automakers to develop a highly realistic branded voice for more natural conversational interfaces using the custom neural voice capability. Based on neural text-to-speech technology and the multi-lingual, multi-speaker universal model, custom neural voice lets you create synthetic voices that are rich in speaking styles or adaptable across languages with as little as 30 minutes of audio. The realistic, natural-sounding voice of custom neural voice can represent brands and specific personas and let users interact with applications naturally in a conversational style. Check out this blog for a step-by-step guide on how to create a custom neural voice.

Compliance and responsible AI

Microsoft is committed to investing in meeting regulatory standards around the globe to satisfy automakers’ compliance requirements. The speech service, part of Azure Cognitive Services, is certified by SOC, FedRAMP, PCI DSS, HIPAA, HITECH, and ISO. Backed by Azure infrastructure, the speech service also offers enterprise-grade security, availability, compliance, and manageability.


Microsoft is committed to developing AI technology in a responsible way. We use different technical and policy solutions to safeguard against misuse of the technology. For example, we’re designing and releasing Custom Neural Voice with the intention of protecting the rights of individuals and society, fostering transparent human-computer interaction, and counteracting the proliferation of harmful deepfakes and misleading content. This aligns with Microsoft’s commitment to responsible AI. That commitment includes Transparency Notes, which communicate the purpose, capabilities, and limitations of an AI system.

Learn more

Azure Cognitive Services brings AI within reach. Learn how you can accelerate innovation with breakthrough AI research.
