AI & ML Academy - Prebuilt AI
Welcome to the AI & ML Academy (AIA) - Prebuilt AI!
In this section we will go through an overview of the AI Cognitive Services pillars and what services reside within each of them. There is also a level of customization that can be achieved in each of these pillars through services like Custom Vision and Custom Speech that we will introduce. Finally, we provide some best practices for each of these services.
AI Cognitive Services
Azure Cognitive Services are cloud-based artificial intelligence (AI) services that help you build cognitive intelligence into your applications. They are available as REST APIs, client library SDKs, and user interfaces. You can add cognitive features to your applications without having AI or data science skills. Cognitive Services enable you to build cognitive solutions that can see, hear, speak, understand, and even make decisions.
Vision
Cognitive Services for Vision offers (1) off-the-shelf APIs that are pretrained to tag and analyze your images and video, and (2) customizable models that you can train using your own data.
Computer Vision
Analyze content in images and video with a turn-key API service
- Demo - Sandbox demos to see what you can do with the Computer Vision API
- Vision Studio - Explore functionality by trying out each API (requires Azure account)
- Computer Vision Learning Path - Get started analyzing images with the API
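To give a feel for what calling the turn-key API over REST looks like, here is a minimal sketch that builds (but does not send) an image analysis request using only the Python standard library. It assumes the v3.2 `analyze` route and the `Tags` and `Description` visual features; the function name `build_analyze_request`, the endpoint, the key, and the image URL are all hypothetical placeholders, so check the current API reference for the version your resource supports.

```python
import json
import urllib.request
from urllib.parse import urlencode


def build_analyze_request(endpoint, key, image_url, features=("Tags", "Description")):
    """Build an image analysis request for a publicly reachable image URL.

    Assumption: the resource exposes the v3.2 `analyze` route. Nothing is
    sent here; the caller decides when to call urllib.request.urlopen().
    """
    # Keep commas unescaped so the query reads visualFeatures=Tags,Description
    query = urlencode({"visualFeatures": ",".join(features)}, safe=",")
    url = f"{endpoint.rstrip('/')}/vision/v3.2/analyze?{query}"
    body = json.dumps({"url": image_url}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            # The resource key from the Azure portal (placeholder in this sketch)
            "Ocp-Apim-Subscription-Key": key,
            "Content-Type": "application/json",
        },
    )
```

With a real endpoint and key, passing the returned object to `urllib.request.urlopen` would send the request and return a JSON response describing the image.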
Custom Vision
Build custom Image Classification and Object Detection models for your scenario
- Learning Path for Object Detection - Get started using AI to recognize objects in images using the Custom Vision service
- Learning Path for Image Classification - Get started classifying images using the Custom Vision service
- Rock Paper Scissors Code Sample - Hands-on lab to create a Node.js app with a custom Rock Paper Scissors image classifier
Face API
Detect and identify people in images
- Face API Learning Path - Get started detecting and analyzing faces
- Transparency Note - Understand how Face API works, the choices you can make as a system owner that influence accuracy, and the importance of thinking about the whole system, including the technology, the people, and the environment
Language
Cognitive Services for Language similarly provides pre-trained, pre-configured models to use in a turnkey fashion, as well as customizable services that enable you to leverage your own data with the provided platform and tooling.
With an Azure subscription, navigate to the Language Studio to explore all of the tools offered for Natural Language Processing within Azure Cognitive Services.
Key functionality includes:
- Named Entity Recognition: Identify and categorize named entities from input documents, including names of people, locations, organizations, events, products, addresses, phone numbers, emails, URLs, IP addresses, dates and times, and quantities.
- Custom Named Entity Recognition: Build custom models to extract domain-specific entities.
- Personally identifiable information (PII) and protected health information (PHI) detection: Identify, categorize, and redact sensitive information from documents, including names of people, job roles, phone numbers, organizations, addresses, emails, URLs, IP addresses, dates and times, quantities, ABA routing numbers, SWIFT codes, credit card numbers, International Bank Account Numbers (IBANs), and country/region-specific identification (e.g. U.S. Social Security Numbers).
- Language detection: Determine which language a document is written in.
- Sentiment analysis and opinion mining: Leverage Sentiment Analysis to label text as positive, neutral, or negative. Use Opinion Mining to gather more granular information about sentiment, including the subject the text is referring to as well as the associated opinion or sentiment.
- Summarization: Summarize documents or conversations.
- Key phrase extraction: Identify and extract the main concepts in text.
- Entity linking: Identify entities in text and provide a Wikipedia link for more information.
- Text Analytics for Health: Extract and label medical information from health documents such as doctor’s notes, discharge summaries, clinical documents, and electronic health records.
- Custom Text Classification: Train your own text classification models using your data.
- Conversational language understanding: Predict what the user’s intent is when they say a particular phrase or sentence so that you can reply accordingly.
- Question answering: Find the most appropriate answer for a user question for conversational client applications.
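Several of the pre-configured capabilities listed above share one request shape when called over REST, differing mainly in a `kind` field. The sketch below builds such a request body for a few of them. It assumes the `analyze-text` operation of the Language REST API (POSTed to `{endpoint}/language/:analyze-text` with an `api-version` query parameter and an `Ocp-Apim-Subscription-Key` header); the helper name `build_analyze_text_body` and the short task names in `TASK_KINDS` are hypothetical, chosen only for this illustration.

```python
# Hypothetical short names mapped to the `kind` values assumed for the
# Language REST API's analyze-text operation.
TASK_KINDS = {
    "named_entities": "EntityRecognition",
    "pii": "PiiEntityRecognition",
    "sentiment": "SentimentAnalysis",
    "key_phrases": "KeyPhraseExtraction",
    "entity_linking": "EntityLinking",
}


def build_analyze_text_body(task, texts, language="en", opinion_mining=False):
    """Return a JSON-serializable request body for one pre-configured task."""
    if task not in TASK_KINDS:
        raise ValueError(f"unknown task: {task!r}")

    parameters = {"modelVersion": "latest"}
    # Opinion mining is an option of sentiment analysis, not a separate task.
    if task == "sentiment" and opinion_mining:
        parameters["opinionMining"] = True

    # Each document needs an id that is unique within the request.
    documents = [
        {"id": str(i), "language": language, "text": text}
        for i, text in enumerate(texts, start=1)
    ]
    return {
        "kind": TASK_KINDS[task],
        "parameters": parameters,
        "analysisInput": {"documents": documents},
    }
```

For example, `build_analyze_text_body("sentiment", ["Great hotel.", "Slow service."], opinion_mining=True)` produces a body with two documents and opinion mining switched on. In practice the Python client library linked in the resources below wraps these calls for you.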
To begin getting hands-on, refer to the following resources:
- Learning Path for Azure Cognitive Services Language: Explore natural language processing within Azure
- GitHub Python Samples for Text: Common scenario operations with the Azure Text Analytics client library for Python
- Learning Path for Common Pre-configured Language APIs: Get started extracting insights from text
- Learning Path for Customizable Language solutions: Build custom text classification and custom NER models using the Language service
Speech
Cognitive Services for Speech likewise offers both pre-trained models and customizable services, covering speech to text, text to speech, speaker recognition, and speech translation.
Speech to Text:
- Speech to Text Overview: Explore the capabilities of the STT service. Quickstarts are also available below this tab on the same page.
Text to Speech:
- Text to Speech Overview: Explore the capabilities of the TTS service. Quickstarts are also available below this tab on the same page.
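Text to speech requests are typically expressed in Speech Synthesis Markup Language (SSML), which wraps the text in a `voice` element naming the voice to use. The sketch below builds a minimal SSML document as a string. The helper name `build_ssml` is hypothetical, and the default voice `en-US-JennyNeural` is an assumption used for illustration; pick any voice available in your region.

```python
from xml.sax.saxutils import escape


def build_ssml(text, voice="en-US-JennyNeural", lang="en-US"):
    """Wrap plain text in a minimal SSML document for a single voice.

    The text is XML-escaped so characters such as & and < do not break
    the markup. The voice and lang values are assumed to be trusted.
    """
    return (
        f'<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
        f'xml:lang="{lang}">'
        f'<voice name="{voice}">{escape(text)}</voice>'
        "</speak>"
    )
```

The resulting string can be pasted into the service's tooling or passed to an SDK or REST call that accepts SSML input.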
Speaker Recognition:
- Speaker Recognition Overview: Explore Speaker Recognition and some commonly asked questions. A quickstart is also available below this tab on the same page.
Speech Translation:
- Speech Translation Quickstart: Get hands-on translating speech from a microphone into the language of your choice.
Custom Neural Voice
- Custom Neural Voice Overview: Explore the different project types for Custom Neural Voice. This page is followed by a step-by-step guide on how to get started.
- Custom Neural Voice - Getting Started: Video tutorial for getting started.
- How to Create a Custom Neural Voice: A great tutorial along with best practices.
- Latest Updates on Azure Neural TTS: Stay up to date with the latest changes and view some examples.
- Announcing new voices and emotions: Check out some new additions to the service!
- Applying for Custom Neural Voice: Official form to start working with Custom Neural Voice.
- Transparency Note and Use Cases: Understand some key considerations when using Custom Neural Voice.
- Guidelines for responsible deployment of synthetic voice technology: Understand how to responsibly use synthetic voice technology.
- Data, privacy and security for Custom Neural Voice: Understand how the data will be used and processed.
Content Creation
- Speech Studio: Test out the features of Content Creation.
- Audio Content Creation: Tutorial on how to convert text to speech using Microsoft AI voices.