---
title: "AI-102: Azure AI Engineer Associate"
markmap:
---
- AI-102: Azure AI Engineer Associate
- Course: Designing and Implementing a Microsoft Azure AI Solution
- Study guide
- Practice assessment
- Exam prep videos
- Azure AI Hub
- Exam Sandbox: Experience the look and feel of the exam interface before taking it.
- Processes images and videos to understand their content
- Detects and recognizes human faces
- Builds and deploys custom image classification models
- Extracts text, key-value pairs, and tables from documents
- Extracts insights from videos and live streams
- Custom text classification
- Custom named entity recognition
- Conversational Language Understanding
- Entity Linking
- Key Phrase Extraction
- Language Detection
- Named Entity Recognition (NER)
- Orchestration workflow
- Personally identifiable information (PII) detection
- Question Answering
- Sentiment Analysis
- Summarization
- Text Analytics for Health
- Supports intermediate results, end-of-speech detection, automatic text formatting, profanity masking, and includes real-time speech-to-text and batch transcription
- Identifies the spoken language in a given audio stream
- Converts text to natural-sounding speech
- Identifies and verifies the people speaking based on audio
- Evaluates the pronunciation and provides feedback on the accuracy and fluency of the speech
- Translates streaming audio in real-time and provides result as text/synthesized speech
- Derives user intents from transcribed speech and acts on voice commands
- ==TODO==
- Document analysis model
- Prebuilt model
- Custom model
- Azure AI Search
- Fairness: AI systems should treat all people fairly.
- Fairlearn: An open-source toolkit for assessing and improving the fairness of machine learning models.
- Reliability and safety: AI systems should perform reliably and safely.
- Test the model
- Risk- and harm-related information should be accessible to model users
- Privacy and security: AI systems should respect privacy and maintain security.
- Personally identifiable information (PII) should be protected
- Inclusiveness: AI systems should empower everyone and engage people.
- Transparency: AI systems should be transparent and understandable.
- Interpretability/Intelligibility: The ability to explain the results of a model in a way that is understandable to humans.
- Accountability: AI systems should be accountable to people.
- Model governance: The process of managing the entire lifecycle of a model, including model creation, deployment, and monitoring.
- Organizational principles: Define the roles and responsibilities of the people involved in the model lifecycle.
- From the Azure portal
- Using Azure CLI
- Using client libraries (SDKs)
- Using ARM templates, Bicep, or Terraform
- Multi-service resource
- Multiple Azure AI resources with a single key and endpoint
- Consolidate billing for the services you use
- Single-service resource
- Single Azure AI resource with a single key and endpoint
- Use free tier for testing and development: only supported in single-service resources
- Endpoint URI is one of the three primary parameters for Azure AI
- Two access keys are provided for each Azure AI resource by default
- Protect the keys by using Azure Key Vault
- Authenticate with:
- Single or multi-service key
- Token (REST API)
- Entra ID identity
- ==Azure Container Instances== (ACI): on-demand standalone containers with minimal setup in a serverless environment.
- ==Azure Kubernetes Service== (AKS): Managed Kubernetes service for deploying, managing, and scaling containerized applications using Kubernetes.
- Enable diagnostic logging for an Azure AI resource:
- ==Log Analytics Workspace== to analyze logs and metrics (Azure Monitor)
- ==Event Hub== for streaming logs to other services
- ==Storage Account== for archiving logs with less expensive storage
- Metrics: capture regular data points about the behavior of the resource in time-series database
- Alerts: notify you when a metric breaches a threshold
- Diagnostics settings: configure the resource to send logs and metrics to a destination
- Activity logs: records operations made on the resource
- Azure Pricing Calculator
- Estimate the cost of Azure services
- Azure Cost Management and Billing
- Monitor and analyze costs
- Create budgets and alerts
- Optimize costs
- Billing administrative tasks
- ==TODO==
- ==TODO==
- ==TODO==
- ==TODO==
- Detect and filter harmful or inappropriate text content in applications
- Get an API endpoint + subscription key
- Send a request to the endpoint with the subscription key and the text to analyze
- Get a response with the classification of the text as JSON
- Harm categories (e.g. hate and fairness, sexual, violence, self-harm)
- Severity level from 0 to 7 (e.g. safe, low, medium, high)
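- A minimal sketch of this flow with Python's `requests` package (the endpoint, key, and API version are placeholder assumptions, not values from these notes):

  ```python
  import requests

  # Hypothetical Content Safety resource values -- substitute your own.
  endpoint = "https://<your-content-safety-resource>.cognitiveservices.azure.com"
  key = "<your-subscription-key>"

  response = requests.post(
      f"{endpoint}/contentsafety/text:analyze?api-version=2023-10-01",
      headers={"Ocp-Apim-Subscription-Key": key},
      json={"text": "Text to analyze"},
  )
  # The JSON response carries a severity score per harm category.
  for item in response.json()["categoriesAnalysis"]:
      print(item["category"], item["severity"])
  ```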
- Detect and filter harmful or inappropriate images in applications
- Get an API endpoint + subscription key
- Send a request to the endpoint with the subscription key and the image to analyze
- Get a response with the classification of the image as JSON
- Harm categories (e.g. hate and fairness, sexual, violence, self-harm)
- Severity level from 0 to 7 (e.g. safe, low, medium, high)
- Create Azure AI custom vision training and prediction resources.
- ==TODO==
- ==TODO==
- ==TODO==
- Azure AI Vision can extract printed and handwritten text from images
- ==OCR for images (version 4.0)==
- Inputs: Images: General, in-the-wild images
- Examples: labels, street signs, and posters
- Optimized for general, non-document images with a performance-enhanced synchronous API that makes it easier to embed OCR in your user experience scenarios.
- ==Document Intelligence read model==
- Inputs: Documents: Digital and scanned, including images
- Examples: books, articles, and reports
- Optimized for text-heavy scanned and digital documents with an asynchronous API to help automate intelligent document processing at scale.
- ==TODO==
- Image classification: Classify or assign a label to an image
- Object detection: Identify and locate objects in an image
- You can upload and tag your images to train the classifier or detector model.
- For both image classification and object detection, you need to:
- Create a new project
- Name and describe it
- Select a project type: Classification or Object Detection
- Select an available domain (General, Food, Landmarks, Retail, Logo etc.)
- Train and test the model
- Publish and consume the model
- For image classification you need to select either:
- Multilabel classification: Assign multiple labels to an image
- Multiclass classification: Assign a single label to an image
- Select the Train button to start training the model
- The training process can take a few minutes to a few hours
- Monitor the training process and check the metrics via the performance tab
- Delete obsolete iterations
- Available metrics:
- Precision
- A percentage value that indicates the proportion of true positive predictions in the total number of positive predictions.
- Recall
- A percentage value that indicates the proportion of true positive predictions in the total number of actual positive instances.
- mAP (mean Average Precision) - Object Detection only
- A metric that evaluates the precision-recall curve for object detection models.
- Additional metrics:
- Probability threshold: The level of confidence that a prediction needs to have in order to be considered correct (for the purposes of calculating precision and recall)
- Overlapping threshold: Sets the minimum allowed overlap between the predicted object's bounding box and the actual user-entered bounding box. If the bounding boxes don't overlap to this degree, the prediction won't be considered correct.
- Make your model available for consumption by others by publishing it.
- Select the Publish ✓ button
- Provide the model name and prediction resource
- Select the Publish button
- ==TODO==
- Analyze video content to extract topics, labels, named-entities, emotions, and scenes.
- A timeline is provided to navigate through the video content along with the dialogue and speaker identification.
- People counting
- Entrance and exit counting
- Social distancing and face/mask detection
- Identify the main points in a text
- Create an Azure AI language resource
- Get the endpoint and subscription key
- Send a request to the endpoint with the subscription key and the raw text to analyze
- Get a response with the key phrases as JSON: stream or store locally.
- 3 consumption ways:
- Language Studio
- REST API
- Docker container
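- A minimal sketch of the REST call with Python's `requests` package (the endpoint and key values are placeholders, not values from these notes):

  ```python
  import requests

  # Hypothetical Language resource values -- substitute your own.
  endpoint = "https://<your-language-resource>.cognitiveservices.azure.com"
  key = "<your-subscription-key>"

  body = {
      "kind": "KeyPhraseExtraction",
      "analysisInput": {
          "documents": [
              {"id": "1", "language": "en",
               "text": "Azure AI Language extracts the main talking points from raw text."}
          ]
      },
  }
  response = requests.post(
      f"{endpoint}/language/:analyze-text?api-version=2023-04-01",
      headers={"Ocp-Apim-Subscription-Key": key},
      json=body,
  )
  print(response.json()["results"]["documents"][0]["keyPhrases"])
  ```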
- Entity linking: identify and disambiguate entities in text.
- Different endpoint for entity linking.
- Named entity recognition: identify and classify named entities in text.
- Ex: person, location, organization, date, etc.
- Evaluates text and returns sentiment scores and labels for each sentence
- Sentiment analysis: Provides sentiment labels (such as "negative", "neutral", and "positive") based on the highest confidence score found by the service at the sentence and document level.
- This feature also returns confidence scores between 0 and 1, for each document and the sentences within it, for positive, neutral, and negative sentiment.
- Opinion mining: Also known as aspect-based sentiment analysis in Natural Language Processing (NLP).
- This feature provides more granular information about opinions related to words (such as the attributes of products or services) in text.
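- The same `:analyze-text` call pattern as the key-phrase sketch above applies; a hedged request sketch with `opinionMining` enabled:

  ```python
  import requests

  # Hypothetical Language resource values -- substitute your own.
  endpoint = "https://<your-language-resource>.cognitiveservices.azure.com"
  key = "<your-subscription-key>"

  body = {
      "kind": "SentimentAnalysis",
      "parameters": {"opinionMining": True},  # aspect-based sentiment
      "analysisInput": {
          "documents": [
              {"id": "1", "language": "en",
               "text": "The staff were great, but the room was dirty."}
          ]
      },
  }
  response = requests.post(
      f"{endpoint}/language/:analyze-text?api-version=2023-04-01",
      headers={"Ocp-Apim-Subscription-Key": key},
      json=body,
  )
  for doc in response.json()["results"]["documents"]:
      # Document-level label plus positive/neutral/negative confidence scores.
      print(doc["sentiment"], doc["confidenceScores"])
  ```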
- Evaluates a text and returns scored language identifiers.
- A wide range of languages is supported, including regional dialects.
- In case of mixed languages, the service will return the most used language with a low confidence score
- Identify, categorize and redact sensitive information in unstructured text.
- Create an Azure AI language resource
- Get the endpoint and subscription key
- Send a request to the endpoint with the subscription key and the raw text to analyze
- Get a response with the redacted text and detected PII entities as JSON: stream or store locally.
- The API is stateless in synchronous mode; in asynchronous mode, results are stored for 24 hours.
- Life-like speech synthesis (fluid and natural-sounding)
- Customizable voices
- Fine-grained audio controls (rate, pitch, pause, pronunciation, etc.)
- Flexible deployment (cloud or containers)
- Real-time transcription of audio streams into written text.
- High quality transcription
- Flexible deployment
- Customizable models
- Production-ready
- SSML can be used to fine-tune text-to-speech models outputs.
- SSML is a markup language that allows developers to control various aspects of speech synthesis, such as pronunciation, volume, pitch, rate, and more.
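- A minimal SSML sketch wrapped in the Python Speech SDK (assumes the `azure-cognitiveservices-speech` package; the voice name and resource values are illustrative):

  ```python
  import azure.cognitiveservices.speech as speechsdk

  # Hypothetical Speech resource values -- substitute your own.
  speech_config = speechsdk.SpeechConfig(subscription="<your-key>", region="<your-region>")
  synthesizer = speechsdk.SpeechSynthesizer(speech_config=speech_config)

  # SSML controls the voice, rate, pitch, and pauses of the synthesized output.
  ssml = """
  <speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xml:lang="en-US">
    <voice name="en-US-JennyNeural">
      <prosody rate="-10%" pitch="+5%">Welcome!</prosody>
      <break time="500ms"/>
      How can I help you today?
    </voice>
  </speak>
  """
  result = synthesizer.speak_ssml_async(ssml).get()
  print(result.reason)  # e.g. ResultReason.SynthesizingAudioCompleted
  ```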
- Custom neural voice (CNV) models can be used to create custom voices for text-to-speech applications.
- CNV models are trained on a speaker's voice data to create a custom voice that can be used in text-to-speech applications.
- Test custom speech solutions for Word Error Rate (WER) with accuracy testing and custom acoustic models:
- Needs improvement: >30%
- Acceptable: ~20%
- Ready for production: <10%
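- WER is conventionally computed as $\mathrm{WER} = \frac{S + D + I}{N}$, where $S$, $D$, and $I$ count substituted, deleted, and inserted words and $N$ is the number of words in the reference transcript (a standard definition, added here for context).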
- Taking a written or spoken input and determining the intent behind it.
- 2 methods:
- Pattern matching: for offline solutions
- Create code and speech configuration
- Initialize the intent recognizer and declare entities as intent
- Enable recognition of intent
- Instruct code to stop on intent recognition
- Display results
- Publish
- CLU (Conversational Language Understanding): prediction of intents
- Create a new project by importing a JSON file
- Train model
- Choose training mode and data splitting
- Deploy model
- Use model to recognize intents from an audio stream
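- A minimal pattern-matching sketch with the Python Speech SDK (the intents and the `{floorName}` entity placeholder are hypothetical, and exact pattern support varies by SDK version):

  ```python
  import azure.cognitiveservices.speech as speechsdk
  from azure.cognitiveservices.speech.intent import IntentRecognizer

  # Hypothetical Speech resource values -- substitute your own.
  speech_config = speechsdk.SpeechConfig(subscription="<your-key>", region="<your-region>")
  recognizer = IntentRecognizer(speech_config=speech_config)

  # Declare phrase patterns as intents; {floorName} marks an extractable entity.
  recognizer.add_intent("Take me to floor {floorName}.", "ChangeFloors")
  recognizer.add_intent("Open the door.", "OpenDoor")

  result = recognizer.recognize_once()  # stops after the first recognized utterance
  if result.reason == speechsdk.ResultReason.RecognizedIntent:
      print(result.intent_id, result.text)
  ```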
- Detect a word or short phrase within an audio stream or content
- Create a new project in speech studio
- Create a custom keyword:
- Create new model
- Provide name/description and the keyword
- Validate
- Select a model type and Create
- Basic: rapid prototyping
- Advanced: improved accuracy characteristics for product integration
- Select Tune to download the model
- This model can now be used
- 2 distinct types of endpoints enable:
- Text translation: Translate text between languages (real-time)
- REST API cloud-based translator
- Docker container based translator
- Supported methods:
- Languages: Returns a list of languages supported by the Translate, Transliterate, and Dictionary Lookup operations. This request doesn't require authentication; just copy and paste the following GET request into your favorite REST API tool or browser: `https://api.cognitive.microsofttranslator.com/languages?api-version=3.0`
- Translate: Renders single source-language text to multiple target-language texts with a single request.
- Transliterate: Converts characters or letters of a source language to the corresponding characters or letters of a target language.
- Detect: Returns the source language code and a boolean indicating whether the detected language is supported for text translation and transliteration.
- Dictionary lookup: Returns equivalent words for the source term in the target language.
- Dictionary example: Returns grammatical structure and context examples for the source term and target term pair.
- Document translation: Translate documents between languages (asynchronous)
- REST API cloud-based translator
- Client library SDK
- Supported methods:
- Translate large files: Translate whole documents asynchronously.
- Translate numerous files: Translate multiple files across all supported languages and dialects while preserving document structure and data format.
- Preserve source file presentation: Translate files while preserving the original layout and format.
- Apply custom translation: Translate documents using general and custom translation models.
- Apply custom glossaries: Translate documents using custom glossaries.
- Automatically detect document language: Let the Document Translation service determine the language of the document.
- Translate documents with content in multiple languages: Use the autodetect feature to translate documents with content in multiple languages into your target language.
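- A hedged sketch of a text-translation call with `requests` (key, region, and target languages are placeholders):

  ```python
  import uuid

  import requests

  # Hypothetical Translator resource values -- substitute your own.
  key = "<your-translator-key>"
  region = "<your-resource-region>"

  response = requests.post(
      "https://api.cognitive.microsofttranslator.com/translate",
      params={"api-version": "3.0", "from": "en", "to": ["fr", "de"]},
      headers={
          "Ocp-Apim-Subscription-Key": key,
          "Ocp-Apim-Subscription-Region": region,
          "X-ClientTraceId": str(uuid.uuid4()),
      },
      json=[{"text": "Hello, world!"}],
  )
  for translation in response.json()[0]["translations"]:
      print(translation["to"], translation["text"])
  ```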
- Train a custom model:
- Select train model, enter sample data and select full training
- Select sample-source language, target language and review training costs
- Select Train now then Train to start training
- Once trained, select Model details to review the model
- Test and publish a custom model
- Select Test model, enter sample data
- Test (human evaluation) the translation
- Select Publish model to make the model available
- Select a region and validate.
- Speech-to-speech service can translate an audio stream/input to another language as an audio output.
- Works in real-time.
- 4 ways to use speech translation:
- ==Speech translator API==
- Typically used for real-time translation of spoken languages
- ==Speech CLI==
- Experiment with minimal code solution
- ==Speech SDK==
- Use in your own applications
- ==Speech Studio==
- Typically used to test and tune speech services
- Intent: action or goal expressed in a user's utterance
- Utterance: spoken or written phrases
- Entity: A word or phrase within utterances that can be identified and extracted
- Learned component: enables predictions based on context learned while labeling utterances
- List component: Fixed set of related words with their synonyms
- Prebuilt component: Built-in entities like date, time, number, etc.
- Regex component: Regular expression to match entities
- To create entities:
- Navigate to Entities pivot
- Select Add and type entity name
- Define composition settings
- Attach a Learned, Prebuilt or List component
- CLU can be used to build a custom natural language understanding model that predicts intent and extracts information from utterances.
- Creation process:
- Select data and define schema
- Label data
- Train model
- View model performance results
- Tune the model
- Deploy
- Predict intents and entities
- Ensure training data set is representative and sufficient
- Insufficient data can lead to overfitting and lower accuracy
- Adding more labeled data can improve the accuracy of the model
- Ensure all entities are covered in test data
- Absence of labeled instances can reduce the accuracy of model evaluation
- Fix unclear or ambiguous distinction between intents and entities
- Similar data for different intents can lead to confusion
- You can solve this by merging similar entities or adding more examples
- Azure AI language models can be consumed from a client application using the REST API or SDKs.
- This enables users to use natural language as input to interact with the application.
- User's intent and entities are extracted and processed by the model to provide the desired output.
- Application performs the necessary actions.
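- A minimal prediction-request sketch with `requests` (project and deployment names are hypothetical):

  ```python
  import requests

  # Hypothetical Language resource and CLU project values -- substitute your own.
  endpoint = "https://<your-language-resource>.cognitiveservices.azure.com"
  key = "<your-subscription-key>"

  body = {
      "kind": "Conversation",
      "analysisInput": {
          "conversationItem": {"id": "1", "participantId": "user",
                               "text": "Turn off the kitchen lights"}
      },
      "parameters": {"projectName": "<your-project>",
                     "deploymentName": "<your-deployment>"},
  }
  response = requests.post(
      f"{endpoint}/language/:analyze-conversations?api-version=2023-04-01",
      headers={"Ocp-Apim-Subscription-Key": key},
      json=body,
  )
  prediction = response.json()["result"]["prediction"]
  print(prediction["topIntent"], prediction["entities"])
  ```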
- Export replicas of language understanding models to back them up and recover them in case of data loss.
- Export
- Create a `POST` request with `Ocp-Apim-Subscription-Key` to create the export task
- Use a `GET` request to get the status of the export task
- Use a `GET` request to download the exported model
- Import
- Create a `POST` request with `Ocp-Apim-Subscription-Key` to create the import task
- The body should contain the exported model as JSON
- Use a `GET` request to get the status of the import task
- Wait for successful completion of the task
- Enable custom question answering
- Create a new project with a name and a language
- Add question-answer pairs from source URLs or manually
- In this case, you need to type the question and the answer manually.
- Use different sources to populate Azure Question Answering project:
- Structured documents (manuals, guidelines, etc.)
- Questions will be derived from the headings and subheadings of the document
- Answers will be derived from the subsequent text
- Unstructured documents (articles, blogs, etc.)
- Question-and-answer documents (FAQs, etc.)
- Supported file formats: `.docx`, `.pdf`, `.txt`, `.html`, `.tsv`, `.csv`, ...
- In the knowledge base, source documents are imported as Questions. You can amend the questions and answers as needed.
- Select Save and train, then Test
- A test version of the knowledge base is created and you can analyze it with the Inspect button
- You can Publish the knowledge base to make it available for consumption through REST endpoint
- Multi-turn conversations are dialogues between a user and a bot that require multiple steps to complete.
- To create:
- Select Add follow-up prompts in the knowledge base
- Fill details of the prompt
- Create link to new pair
- Save
- Multiple follow-up prompts can be added to a single question by repeating the same process.
- Add alternate questions with differences in the sentence structure or wording to improve the accuracy of the model.
- Chit-chat is a feature that allows the bot to engage in casual conversation with the user.
- Gives the bot the ability to answer questions in a way that fits your brand
- Set a personality for the bot
- Automatically add simple question-answer pairs to the knowledge base
- Exporting a knowledge base allows you to save a copy of the knowledge base for:
- Backup purpose
- CI/CD integration
- Deployment region mobility
- Steps:
- Open the custom question answering project
- Select Export
- Select the export format (`.xlsx` or `.tsv`) that will be exported in a `.zip` file
- Multi-language question answering solutions can be created by training the model with data in multiple languages.
- Steps:
- When creating the new custom question answering project:
- Select *I want to select the language when I create a project in this resource*
- Enter basic information and create the project
- Add sources to deploy the project
- Azure AI Search (formerly known as "Azure Cognitive Search") is a cloud search service that gives developers infrastructure, APIs, and tools for building a rich search experience over private, heterogeneous content in web, mobile, and enterprise applications.
- On the search service itself, the two primary workloads are indexing and querying.
- Indexing engine
- Intake process that loads content into your search service and makes it searchable.
- Internally, inbound text is processed into tokens and stored in inverted indexes, and inbound vectors are stored in vector indexes.
- The document format that Azure AI Search can index is JSON. You can upload JSON documents that you've assembled, or use an indexer to retrieve and serialize your data into JSON.
- Applied AI through a skillset extends indexing with image and language models.
- If you have images or large unstructured text in source document, you can attach skills that perform OCR, describe images, infer structure, translate text and more.
- You can also attach skills that perform data chunking and vectorization.
- Query engine is used when your client app sends query requests to a search service and handles responses. All query execution is over a search index that you control.
- Semantic ranking is an extension of query execution. It adds secondary ranking, using language understanding to reevaluate a result set, promoting the most semantically relevant results to the top.
- Indexing engine
- Azure AI Search can index content from a variety of data sources:
- Azure Storage (Blobs, Tables)
- Azure Cosmos DB
- Azure SQL Database, managed instance or SQL server
- Both push and pull methods are supported.
- An index is a collection of JSON objects with unique keys and one or more fields.
- Index attributes can be:
- Searchable: Full-text search
- Filterable
- Facetable: Used for aggregations/categorization and hit count
- Sortable
- Retrievable: Enables the field to be returned in search results or hidden from them.
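- A sketch of an index definition created over REST, showing these attributes (the index name, fields, and API version are illustrative):

  ```python
  import requests

  # Hypothetical search service values -- substitute your own.
  endpoint = "https://<your-search-service>.search.windows.net"
  admin_key = "<your-admin-key>"

  index = {
      "name": "hotels",
      "fields": [
          {"name": "id", "type": "Edm.String", "key": True},
          {"name": "description", "type": "Edm.String",
           "searchable": True, "retrievable": True},
          {"name": "category", "type": "Edm.String",
           "filterable": True, "facetable": True, "sortable": True},
      ],
  }
  response = requests.put(
      f"{endpoint}/indexes/hotels?api-version=2023-11-01",
      headers={"api-key": admin_key},
      json=index,
  )
  print(response.status_code)  # 201 on create, 204 on update
  ```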
- A skillset is a reusable object in Azure AI Search that's attached to an indexer.
- Contains one or more skills that call built-in AI or external custom processing over documents retrieved from an external data source.
- Steps:
- Document Cracking
- Field mappings
- Skillset execution
- Output field mappings
- Push to index
- Up to 30 skills per skillset
- Can repeat skills
- Support chained operations, looping and branching
- An AI enrichment pipeline can include both built-in skills and custom skills that you personally create and publish.
- Your custom code executes externally from the search service (for example, as an Azure function), but accepts inputs and sends outputs to the skillset just like any other skill.
- The following data is required to set up a new custom skill in a skillset:
- `uri`
- `httpMethod` (PUT or POST)
- `httpHeaders`
- `timeout` (default 30s)
- `batchSize`: data records to send to the skill at once (1000 by default)
- `degreeOfParallelism`: maximum number of concurrent requests for this endpoint (between 1 and 10, default 5)
- For managed-identity connections: `resourceId` and `authResourceId`
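- A hedged sketch of one custom `WebApiSkill` entry inside a skillset definition (the function URL and input/output names are hypothetical):

  ```python
  # One entry for a skillset's "skills" array, expressed as a Python dict.
  custom_skill = {
      "@odata.type": "#Microsoft.Skills.Custom.WebApiSkill",
      "description": "Calls an external Azure function for custom enrichment",
      "uri": "https://<your-function-app>.azurewebsites.net/api/enrich",
      "httpMethod": "POST",
      "timeout": "PT30S",        # ISO 8601 duration: 30 seconds
      "batchSize": 1000,         # records sent to the skill per request
      "degreeOfParallelism": 5,  # concurrent requests to the endpoint
      "context": "/document",
      "inputs": [{"name": "text", "source": "/document/content"}],
      "outputs": [{"name": "customLabel", "targetName": "customLabel"}],
  }
  ```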
- An indexer definition consists of properties that uniquely identify the indexer, specify which data source and index to use, and provide other configuration options that influence run time behaviors, including whether the indexer runs on demand or on a schedule.
- Extracts and serializes data from a data source, passing it to a search service for data ingestion.
- Full text search semantics based on Lucene query syntax over the index.
- Simple Lucene Query Parser
- Full Lucene Query Syntax: for specialized query forms: wildcard, fuzzy search, proximity search, regular expressions.
- Queries are processed in 4 stages:
- Query parsing
- Lexical analysis
- Document retrieval
- Scoring
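- A minimal query sketch over REST (index name, filter, and facet values are illustrative):

  ```python
  import requests

  # Hypothetical search service values -- substitute your own.
  endpoint = "https://<your-search-service>.search.windows.net"
  query_key = "<your-query-key>"

  response = requests.post(
      f"{endpoint}/indexes/hotels/docs/search?api-version=2023-11-01",
      headers={"api-key": query_key},
      json={
          "search": "quiet room near the beach",  # full-text query
          "filter": "category eq 'Resort'",       # OData filter expression
          "facets": ["category"],                 # buckets with hit counts
          "top": 5,
      },
  )
  for doc in response.json()["value"]:
      print(doc["@search.score"], doc.get("description"))
  ```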
- Projection is a way to define the shape of the enriched data that you want to store in the knowledge store.
- Enriched documents are stored in the knowledge store.
- Useful for knowledge mining scenarios.
- Projections can be read from 3 types of sources:
- Files
- Objects
- Tables
- Azure AI Document Intelligence is a cloud service that uses machine learning to extract information from documents.
- Prebuilt models are trained on a wide range of document types and can extract information from documents with minimal configuration:
- Receipts
- Invoices
- Business cards
- Identity documents
- Contracts
- Tax forms
- Vaccination cards
- and more...
- You can train custom models to classify and extract information from documents that are specific to your organization.
- Custom extraction models can be trained to extract information from documents that are specific to your organization.
- Custom classification models can be trained to classify documents based on their content.
- Train, test, and publish a custom document intelligence model:
- Create a new project in Document Intelligence Studio
- Label data
- Train the model
- Test the model
- ==TODO==
- ==TODO==
- Create an Azure OpenAI resource to access the OpenAI API and use it to generate content:
- Identify subscription, resource group, region, and pricing tier
- Configure network security
- Confirm configuration to deploy the resource
- By CLI:
- `az cognitiveservices account create -n <resource-name> -g <resource-group> --subscription <subscription-id> --location <location> --kind OpenAI --sku <sku>`
- Azure OpenAI provides access to a range of models that can be used to generate content:
- GPT-4: Newest model for natural language and code generation
- GPT-3.5: Natural language and code generation
- DALL-E: Image generation
- Embeddings: Similarity, text and code search etc.
- Deploy a model:
- Select subscription and OpenAI resource
- Create a new deployment:
- Select the model
- Add a deployment name
- Set advanced features like content filtering, token rate limits, etc.
- By CLI:
- `az cognitiveservices account deployment create -n <resource-name> -g <resource-group> --deployment-name <deployment-name> --model-name <model-name> --model-version <model-version> --model-format "OpenAI" --scale-settings-scale-type "Standard"`
- You can submit prompt for multiple purposes:
- Classifying content
- Generating new content
- Transformation and translation
- Summarization
- Continuation
- Question answering
- Chat
- and more...
- Use prompt engineering to define precisely the code you want to generate:
- Define the problem
- Define the input
- Define the output
- Define the constraints
- Define the evaluation metric
- Break down complex problems into smaller, more manageable parts
- DALL-E is a model that can generate images from textual descriptions:
- Uses a neural network-based model
- Uses Natural Language Processing (NLP) to understand the textual description
- Specify style and content to generate images with specific characteristics
- ==TODO==
- Use ==Chat Playground== to familiarize with model parameters to control the generative behavior, like:
- Deployments: Your deployment name that is associated with a specific model.
- Temperature: Controls randomness.
- Lowering the temperature means that the model produces more repetitive and deterministic responses.
- Increasing the temperature results in more unexpected or creative responses.
- Try adjusting temperature or Top P but not both.
- Max length (tokens): Set a limit on the number of tokens per model response.
- The API supports a maximum of 4096 tokens shared between the prompt (including system message, examples, message history, and user query) and the model's response. One token is roughly four characters for typical English text.
- Top probabilities (Top P): Similar to temperature, this controls randomness but uses a different method. Lowering Top P narrows the model's token selection to likelier tokens. Increasing Top P lets the model choose from tokens with both high and low likelihood.
- Try adjusting temperature or Top P but not both.
- Multi-turn conversations: Select the number of past messages to include in each new API request. This helps give the model context for new user queries. Setting this number to 10 results in five user queries and five system responses.
- Stop sequences: Make the model end its response at a desired point. The model response ends before the specified sequence, so it won't contain the stop-sequence text. For GPT-35-Turbo, using `<|im_end|>` ensures that the model response doesn't generate a follow-up user query. You can include as many as four stop sequences.
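- A hedged sketch of a chat completions call exercising these parameters (the deployment name and API version are assumptions):

  ```python
  import requests

  # Hypothetical Azure OpenAI resource values -- substitute your own.
  endpoint = "https://<your-openai-resource>.openai.azure.com"
  api_key = "<your-api-key>"
  deployment = "<your-deployment-name>"

  response = requests.post(
      f"{endpoint}/openai/deployments/{deployment}/chat/completions"
      "?api-version=2024-02-01",
      headers={"api-key": api_key},
      json={
          "messages": [
              {"role": "system", "content": "You are a concise assistant."},
              {"role": "user", "content": "Summarize what Azure AI Search does."},
          ],
          "temperature": 0.7,  # adjust temperature OR top_p, not both
          "max_tokens": 200,   # cap on tokens in the response
      },
  )
  print(response.json()["choices"][0]["message"]["content"])
  ```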
- To improve generative AI responses, prompt engineering techniques can be used:
- Provide clear instructions
- Primary, supporting, and grounding content
- Providing cues
- Requesting output composition: length, style, formatting, etc.
- Using system messages
- Conversation history and few-shot learning
- Chain of thought
- You can use your own data with Azure OpenAI models to generate content that is specific to your organization:
- Set up a data source, such as blob storage
- Configure studio to connect to the data-source
- Use Azure OpenAI model per usual to generate content
- You can configure the model with specific parameters to control the generative behavior:
- Strictness determines the system's aggressiveness in filtering search documents based on their similarity scores.
- Retrieved documents is an integer that can be set to 3, 5, 10, or 20, and controls the number of document chunks provided to the large language model for formulating the final response.
- Limit responses attempts to only rely on your documents for responses.
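- A hedged sketch of the extra request-body fields for grounding on your own data (the `data_sources` shape follows the `2024-02-01` chat completions API as an assumption; all values are placeholders):

  ```python
  # Added to a chat completions request body (see the sketch further above)
  # to ground responses in an Azure AI Search index.
  body_extension = {
      "data_sources": [
          {
              "type": "azure_search",
              "parameters": {
                  "endpoint": "https://<your-search-service>.search.windows.net",
                  "index_name": "<your-index>",
                  "authentication": {"type": "api_key", "key": "<your-search-key>"},
                  "strictness": 3,       # filtering aggressiveness
                  "top_n_documents": 5,  # retrieved document chunks
                  "in_scope": True,      # limit responses to your documents
              },
          }
      ]
  }
  ```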
- Fine-tuning an Azure OpenAI model allows you to customize the model to better suit your needs
- Fine-tuning is expensive and time-consuming, but reduces the need for many examples to achieve good performance