Copyright (c) 2026 MindMesh Academy. All rights reserved. This content is proprietary and may not be reproduced or distributed without permission.
3.2.1. Azure Vision in Foundry Tools
Azure Vision (also referred to as Azure AI Vision) provides the following pre-built capabilities:
| Feature | Description | Returns |
|---|---|---|
| Image Analysis | Comprehensive image understanding | Tags, captions, objects, brands |
| Optical Character Recognition | Extract printed and handwritten text | Text with bounding boxes |
| Spatial Analysis | Analyze people's movement in video feeds | Counting, dwell time, social distancing |
Key Outputs from Image Analysis:
| Analysis Type | What It Provides |
|---|---|
| Tagging | Descriptive keywords (metadata) for the image |
| Categorization | Assigns image to predefined categories |
| Description | Natural language caption with confidence score |
| Object Detection | Objects with bounding box coordinates |
| Brand Detection | Recognized commercial logos |
⚠️ Exam Distinction:
- Tagging = Keywords/metadata for image attributes
- Categorization = Predefined category from a limited set
- Description = Natural language sentence describing the image
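The three outputs also differ in shape, which a small sketch can make concrete. The payload below is hand-written for illustration (a hypothetical photo of a dog), not real service output. Its field names are assumed to loosely follow the Image Analysis 3.2 JSON response, so check them against the API version you use. `summarize` is a hypothetical helper, not part of any SDK.

```python
# Illustrative, hand-written result; the values are made up, not service output.
# Field names are assumed to follow the Image Analysis 3.2 response shape.
SAMPLE_RESULT = {
    # Tagging: an open-ended list of keywords, each with a confidence
    "tags": [
        {"name": "dog", "confidence": 0.99},
        {"name": "grass", "confidence": 0.97},
        {"name": "outdoor", "confidence": 0.95},
    ],
    # Categorization: names drawn from a fixed, predefined taxonomy
    "categories": [{"name": "animal_dog", "score": 0.98}],
    # Description: a natural language sentence with a confidence score
    "description": {
        "captions": [{"text": "a dog running on grass", "confidence": 0.91}]
    },
}


def summarize(result: dict) -> dict:
    """Pull tags, categories, and the most confident caption out of a result."""
    tags = [t["name"] for t in result.get("tags", [])]
    categories = [c["name"] for c in result.get("categories", [])]
    captions = result.get("description", {}).get("captions", [])
    # Pick the caption with the highest confidence, or None if there are none
    best = max(captions, key=lambda c: c["confidence"], default=None)
    return {
        "tags": tags,
        "categories": categories,
        "caption": best["text"] if best else None,
    }


summary = summarize(SAMPLE_RESULT)
```

Here `summary["tags"]` is a list of keywords, `summary["categories"]` holds a name from the limited category set, and `summary["caption"]` is a full sentence, which mirrors the distinction above.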
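What Azure Vision Eliminates: Because the pre-built service ships with models that are already trained and validated, you don't need to:
- Choose a model (the service uses its own pre-trained models)
- Train a model (no labeled training data required)
- Evaluate a model (the models are already validated)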
You STILL need to:
- Provision an Azure resource
- Perform inferencing (make predictions)
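The two remaining steps can be sketched with the Python standard library alone. The sketch assumes the Image Analysis 4.0 REST shape (path `computervision/imageanalysis:analyze`, `api-version=2023-10-01`, key passed in the `Ocp-Apim-Subscription-Key` header); confirm these against the current API reference. The endpoint, key, and image URL are placeholders, and `build_analyze_request` is a hypothetical helper name.

```python
import json
import urllib.parse
import urllib.request


def build_analyze_request(endpoint, key, image_url, features=("caption", "tags")):
    """Build (but do not send) an Image Analysis request for an image URL.

    Assumption: the path and api-version follow the Image Analysis 4.0 REST API.
    """
    query = urllib.parse.urlencode(
        {"api-version": "2023-10-01", "features": ",".join(features)},
        safe=",",  # keep the comma-separated feature list readable
    )
    url = f"{endpoint.rstrip('/')}/computervision/imageanalysis:analyze?{query}"
    body = json.dumps({"url": image_url}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Ocp-Apim-Subscription-Key": key,  # key of the provisioned resource
            "Content-Type": "application/json",
        },
    )


# Placeholder values; substitute the endpoint and key of your own resource.
request = build_analyze_request(
    "https://example-resource.cognitiveservices.azure.com/",
    "YOUR_KEY",
    "https://example.com/photo.jpg",
)
# Inferencing would then be a single call:
#   with urllib.request.urlopen(request) as response:
#       result = json.load(response)
```

Note that nothing here selects, trains, or evaluates a model: the endpoint and key come from the provisioned resource, and the POST is the inferencing step.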