
    Speaker Recognition

    Identify individual speakers or use speech as a means of authentication with Speaker Recognition

    Speaker Verification

    Use your voice for verification. The API can power applications with an intelligent verification tool: if a speaker claims a certain identity, their voice is used to verify that claim.

    To see how it works, select a pass phrase from the given list of phrases. Record three audio samples of that phrase to register your voice with the service; this step is called "enrollment". After enrollment is complete, you can start the verification step, using a different voice recording or phrase to test the service.
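
The enroll-then-verify flow described above can be illustrated with a toy sketch. This is not the service's API or its algorithm: it assumes each recording has already been reduced to a small numeric feature vector, and the names `enroll` and `verify`, the sample values, and the 0.9 threshold are all hypothetical choices made for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def enroll(samples):
    """Build a voice profile by averaging three or more feature vectors.

    Hypothetical helper: the real service defines its own enrollment.
    """
    if len(samples) < 3:
        raise ValueError("enrollment needs at least three samples")
    count = len(samples)
    return [sum(column) / count for column in zip(*samples)]


def verify(profile, sample, threshold=0.9):
    """Accept the identity claim if the sample is close enough to the profile.

    The threshold is an assumed value, not one published by the service.
    """
    return cosine_similarity(profile, sample) >= threshold


# Three made-up feature vectors standing in for three recordings of one phrase.
enrollment_samples = [
    [0.90, 0.10, 0.30],
    [1.00, 0.20, 0.25],
    [0.95, 0.15, 0.35],
]
profile = enroll(enrollment_samples)

accepted = verify(profile, [0.92, 0.12, 0.31])  # similar to the enrolled voice
rejected = verify(profile, [0.10, 0.90, 0.20])  # a very different voice
print(accepted, rejected)
```

Averaging the three samples is one simple way to smooth out variation between recordings, which may be part of why an enrollment step asks for more than one sample.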

    See it in action

    "i am going to make him an offer he cannot refuse"

    Read the phrase above three times to enroll your voice.


    Want to build this?

    Speaker Identification

    Identify who is speaking. The API can be used to determine the identity of an unknown speaker. Input audio of the unknown speaker is compared against a group of selected speakers, and if a match is found, the speaker's identity is returned.
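
As a rough illustration of identification, the toy sketch below compares an unknown sample against a group of enrolled profiles and returns the best match, or nothing if no profile is close enough. It is not the service's API or algorithm: the feature vectors, the `identify` function, the speaker labels, and the 0.9 threshold are all assumptions made for the example.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def identify(enrolled, sample, threshold=0.9):
    """Return the name of the closest enrolled speaker, or None if no match.

    `enrolled` maps a speaker name to that speaker's profile vector.
    The threshold is an assumed value chosen for this example.
    """
    best_name, best_score = None, threshold
    for name, profile in enrolled.items():
        score = cosine_similarity(profile, sample)
        if score >= best_score:
            best_name, best_score = name, score
    return best_name


# Made-up profiles standing in for a group of enrolled speakers.
enrolled = {
    "speaker_a": [0.95, 0.15, 0.30],
    "speaker_b": [0.10, 0.90, 0.20],
    "speaker_c": [0.30, 0.30, 0.90],
}

match = identify(enrolled, [0.12, 0.88, 0.25])    # close to speaker_b
no_match = identify(enrolled, [0.60, 0.60, 0.60])  # close to nobody
print(match, no_match)
```

Unlike verification, no identity is claimed up front: the sample is scored against every profile in the group, so the result depends on which speakers were selected.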

    We have selected five US presidents and enrolled them in the service using one of the speeches each gave. To see how the demo works, click one of the sample audio clips below, or upload one of your own, and the service will try to identify which president is speaking.

    See it in action

    President Barack Obama
    President George W Bush
    President William J Clinton
    President George H W Bush
    President Ronald Reagan
    President Jimmy Carter

    Want to build this?

    Explore the Cognitive Services APIs

    Computer Vision

    Distill actionable information from images

    Face

    Detect, identify, analyze, organize, and tag faces in photos

    Ink Recognizer PREVIEW

    An AI service that recognizes digital ink content, such as handwriting, shapes, and ink document layout

    Video Indexer

    Unlock video insights

    Custom Vision

    Easily customize your own state-of-the-art computer vision models for your unique use case

    Form Recognizer PREVIEW

    The AI-powered document extraction service that understands your forms

    Text Analytics

    Easily evaluate sentiment and topics to understand what users want

    Translator Text

    Easily conduct machine translation with a simple REST API call

    QnA Maker

    Distill information into conversational, easy-to-navigate answers

    Language Understanding

    Teach your apps to understand commands from your users

    Immersive Reader PREVIEW

    Empower users of all ages and abilities to read and comprehend text

    Speech Services

    Unified speech services for speech-to-text, text-to-speech and speech translation

    Speaker Recognition PREVIEW

    Use speech to identify and verify individual speakers

    Content Moderator

    Automated image, text, and video moderation

    Anomaly Detector PREVIEW

    Easily add anomaly detection capabilities to your apps

    Personalizer PREVIEW

    An AI service that delivers a personalized user experience
