Copyright (c) 2026 MindMesh Academy. All rights reserved. This content is proprietary and may not be reproduced or distributed without permission.

3.2.1. Azure Vision in Foundry Tools

Azure AI Vision provides the following pre-built capabilities:

FeatureDescriptionReturns
Image AnalysisComprehensive image understandingTags, captions, objects, brands
Optical Character RecognitionExtract printed and handwritten textText with bounding boxes
Spatial AnalysisAnalyze people movement in videoCounting, dwell time, social distancing
Key Outputs from Image Analysis:
Analysis TypeWhat It Provides
TaggingDescriptive keywords (metadata) for the image
CategorizationAssigns image to predefined categories
DescriptionNatural language caption with confidence score
Object DetectionObjects with bounding box coordinates
Brand DetectionRecognized commercial logos

⚠️ Exam Distinction:

  • Tagging = Keywords/metadata for image attributes
  • Categorization = Predefined category from a limited set
  • Description = Natural language sentence describing the image

What Azure Vision Eliminates: When using pre-built Azure Vision, you don't need to:

  • Choose a model (pre-trained)
  • Train a model (already trained)
  • Evaluate a model (already validated)

You STILL need to:

  • Provision an Azure resource
  • Perform inferencing (make predictions)