AI & ML Academy - Prebuilt AI

Welcome to the AI & ML Academy (AIA) - Prebuilt AI!

This section gives an overview of the AI Cognitive Services pillars and the services that reside within each of them. It also introduces services such as Custom Vision and Custom Speech, which let you customize models within each pillar. Finally, it provides some best practices for each of these services.

AI Cognitive Services

Azure Cognitive Services are cloud-based artificial intelligence (AI) services that help you build cognitive intelligence into your applications. They are available as REST APIs, client library SDKs, and user interfaces. You can add cognitive features to your applications without having AI or data science skills. Cognitive Services enable you to build cognitive solutions that can see, hear, speak, understand, and even make decisions.

Vision

Cognitive Services for Vision offers (1) off-the-shelf APIs that are pretrained to tag and analyze your images and video, and (2) customizable models that you can train using your own data.

Computer Vision

Analyze content in images and video with a turn-key API service

  • Demo - Sandbox demos to see what you can do with the Computer Vision API
  • Vision Studio - Explore functionality by trying out each API (requires Azure account)
  • Computer Vision Learning Path - Get started analyzing images with the API
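As a rough illustration of what "turn-key" means here, the sketch below builds (but does not send) a REST request that asks the Computer Vision API to tag and describe an image. It assumes the v3.2 `analyze` route and key-based authentication; the endpoint, key, and image URL are placeholders, and the helper name `build_analyze_request` is hypothetical rather than part of any SDK.

```python
import json
import urllib.parse
import urllib.request


def build_analyze_request(endpoint, key, image_url, features=("Tags", "Description")):
    """Build (without sending) an image analysis request for a publicly reachable image."""
    # Assumption: the v3.2 "analyze" route; check the version your resource supports.
    query = urllib.parse.urlencode({"visualFeatures": ",".join(features)}, safe=",")
    url = f"{endpoint.rstrip('/')}/vision/v3.2/analyze?{query}"
    body = json.dumps({"url": image_url}).encode("utf-8")
    headers = {
        "Ocp-Apim-Subscription-Key": key,  # resource key from the Azure portal
        "Content-Type": "application/json",
    }
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


vision_request = build_analyze_request(
    "https://example-resource.cognitiveservices.azure.com",  # placeholder endpoint
    "<your-key>",                                            # placeholder key
    "https://example.com/street.jpg",                        # placeholder image
)
print(vision_request.full_url)
```

With a real endpoint and key, passing the request to `urllib.request.urlopen` would return a JSON document containing the requested features. The client library SDKs wrap the same call if you prefer not to work with REST directly.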

Custom Vision

Build custom Image Classification and Object Detection models for your scenario

Face API

Detect and identify people in images

  • Face API Learning Path - Get started detecting and analyzing faces
  • Transparency Note - Understand how Face API works, the choices you can make as a system owner that influence accuracy, and the importance of thinking about the whole system, including the technology, the people, and the environment

Language

Cognitive Services for Language similarly provides pre-trained, pre-configured models to use in a turnkey fashion, as well as customizable services that enable you to leverage your own data with the provided platform and tooling.

With an Azure subscription, navigate to the Language Studio to explore all of the tools offered for Natural Language Processing within Azure Cognitive Services.

Key functionality includes:

  • Named Entity Recognition: Identify and categorize named entities from input documents, including names of people, locations, organizations, events, products, addresses, phone numbers, emails, URLs, IP addresses, dates and times, and quantities.
  • Personally identifiable information (PII) and protected health information (PHI) detection: Identify, categorize, and redact sensitive information from documents, including names of people, job roles, phone numbers, organizations, addresses, emails, URLs, IP addresses, dates and times, quantities, ABA routing numbers, SWIFT codes, credit card numbers, International Bank Account Numbers (IBANs), and country/region-specific identification (e.g. U.S. Social Security Numbers).
  • Language detection: Determine which language a document is written in.
  • Sentiment analysis and opinion mining: Leverage Sentiment Analysis to label text as positive, neutral, or negative. Use Opinion Mining to gather more granular information about sentiment, including the subject the text is referring to as well as the associated opinion or sentiment.
  • Summarization: Summarize documents or conversations.
  • Key phrase extraction: Identify and extract the main concepts in text.
  • Entity linking: Identify entities in text and provide a Wikipedia link for more information.
  • Text Analytics for Health: Extract and label medical information from health documents such as doctor’s notes, discharge summaries, clinical documents, and electronic health records.
  • Custom Text Classification: Train your own text classification models using your data.
  • Conversational language understanding: Predict what the user’s intent is when they say a particular phrase or sentence so that you can reply accordingly.
  • Question answering: Find the most appropriate answer for a user question for conversational client applications.
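Several of the prebuilt features listed above share one REST route and differ mainly in the `kind` field of the request body. The sketch below builds (but does not send) a Sentiment Analysis request with Opinion Mining enabled. The `api-version` value, the placeholder endpoint and key, and the helper name `build_sentiment_request` are assumptions for illustration; confirm the version against the documentation for your resource.

```python
import json
import urllib.request


def build_sentiment_request(endpoint, key, texts, language="en", opinion_mining=True):
    """Build (without sending) an analyze-text request for sentiment analysis."""
    # Each document needs a unique id so results can be matched back to inputs.
    documents = [
        {"id": str(i), "language": language, "text": text}
        for i, text in enumerate(texts, start=1)
    ]
    payload = {
        "kind": "SentimentAnalysis",
        "parameters": {"opinionMining": opinion_mining},
        "analysisInput": {"documents": documents},
    }
    # Assumption: api-version 2023-04-01; newer versions may be available.
    url = f"{endpoint.rstrip('/')}/language/:analyze-text?api-version=2023-04-01"
    headers = {
        "Ocp-Apim-Subscription-Key": key,
        "Content-Type": "application/json",
    }
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


language_request = build_sentiment_request(
    "https://example-resource.cognitiveservices.azure.com",  # placeholder endpoint
    "<your-key>",                                            # placeholder key
    ["The rooms were great.", "The staff was unhelpful."],
)
print(json.dumps(json.loads(language_request.data), indent=2))
```

The response labels each document as positive, neutral, or negative. With Opinion Mining enabled, it also pairs the subjects mentioned in the text with the opinions expressed about them.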

To begin getting hands-on, refer to the following resources:

Speech

Cognitive Services for Speech likewise offers both pre-trained models and customizable services for speech to text, text to speech, speaker recognition, and speech translation.

Speech to Text:

  • Speech to Text Overview: Explore the capabilities of the STT service. There are also quick starts available below this tab on the same page.

Text to Speech:

  • Text to Speech Overview: Explore the capabilities of the TTS service. There are also quick starts available below this tab on the same page.
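To give a feel for the TTS service, the sketch below wraps plain text in SSML and builds (but does not send) a synthesis request. The region, key, voice name, output format, and `User-Agent` value are illustrative assumptions, and the helper names `build_ssml` and `build_tts_request` are hypothetical.

```python
import urllib.request
from xml.sax.saxutils import escape, quoteattr


def build_ssml(text, voice="en-US-JennyNeural", lang="en-US"):
    """Wrap plain text in a minimal SSML document for a single voice."""
    # escape() and quoteattr() keep characters such as & and < from breaking the XML.
    return (
        '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
        f"xml:lang={quoteattr(lang)}>"
        f"<voice name={quoteattr(voice)}>{escape(text)}</voice>"
        "</speak>"
    )


def build_tts_request(region, key, ssml, output_format="riff-24khz-16bit-mono-pcm"):
    """Build (without sending) a text-to-speech request for the given region."""
    url = f"https://{region}.tts.speech.microsoft.com/cognitiveservices/v1"
    headers = {
        "Ocp-Apim-Subscription-Key": key,
        "Content-Type": "application/ssml+xml",
        "X-Microsoft-OutputFormat": output_format,  # assumed audio format
        "User-Agent": "aia-prebuilt-ai-sketch",     # hypothetical application name
    }
    return urllib.request.Request(url, data=ssml.encode("utf-8"), headers=headers, method="POST")


ssml = build_ssml("Hello & welcome to the academy.")
tts_request = build_tts_request("eastus", "<your-key>", ssml)  # placeholder region and key
print(ssml)
```

Sending the request with a valid key would return audio bytes in the requested format. Changing the `voice` argument is enough to try a different prebuilt voice.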

Speaker Recognition:

  • Speaker Recognition Overview: Explore Speaker Recognition and some commonly asked questions. There is a quick start available below this tab on the same page.

Speech Translation:

Custom Neural Voice

Content Creation