What's new in Azure Vision in Foundry Tools?

Learn what's new in Azure Vision. Check this page to stay up to date with new features, enhancements, fixes, and documentation updates.

August 2024

Image Analysis 4.0 Preview API deprecation

On March 31, 2025, the Image Analysis 4.0 Preview APIs will be retired. Before that date, you need to migrate your Azure Image Analysis workloads to the Image Analysis 4.0 GA API.

We encourage you to make the transition sooner to gain access to improvements such as multimodal embedding, synchronous OCR, people detection, image tagging, smart cropping, caption, dense caption, and image object detection. These Image Analysis 4.0 preview APIs will be retired on March 31, 2025:

The following features will no longer be available upon retirement of the preview API versions, and they are removed from the Studio experience as of January 10, 2025:

To keep your models running smoothly, transition to Azure AI Custom Vision, which is now generally available. Custom Vision offers similar functionality to these retiring features.

February 2024

Multimodal embeddings GA: new multi-language model

The Multimodal embeddings API has been updated and is now generally available. The new 2024-02-01 API includes a new model that supports text search in 102 languages. The original English-only model is still available, but it can't be combined with the new model in the same search index. If you vectorized text and images using the English-only model, these vectors aren't compatible with multi-lingual text and image vectors.
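
The incompatibility described above can be guarded against in application code. The following is a minimal sketch, not part of the service or its SDK: `VectorIndex` and the model labels `"multilingual"` and `"english-only"` are hypothetical names chosen for illustration. It shows one way to refuse vectors produced by a different model than the one an index was built with.

```python
from dataclasses import dataclass, field


@dataclass
class VectorIndex:
    """Toy in-memory index that accepts vectors from a single embedding model."""

    model_version: str
    vectors: dict[str, list[float]] = field(default_factory=dict)

    def add(self, item_id: str, vector: list[float], model_version: str) -> None:
        # Vectors from different models live in different vector spaces,
        # so reject anything not produced by this index's model.
        if model_version != self.model_version:
            raise ValueError(
                f"index uses model {self.model_version!r}, got {model_version!r}"
            )
        self.vectors[item_id] = vector


# Hypothetical model labels and toy vectors, used only for illustration.
index = VectorIndex(model_version="multilingual")
index.add("photo-1", [0.12, 0.80, 0.05], model_version="multilingual")

try:
    index.add("photo-2", [0.33, 0.10, 0.91], model_version="english-only")
except ValueError as err:
    print(err)
```

Storing the model label next to each index makes an accidental mix fail loudly at insert time rather than silently degrade search results.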

See the language support page for the list of supported languages.

January 2024

New Image Analysis SDK 1.0.0-beta.1 (breaking changes)

The Image Analysis SDK was rewritten in version 1.0.0-beta.1 to better align with other Azure SDKs. All APIs have changed. See the updated quickstarts, samples and how-to-guides for information on how to use the new SDK.

Major changes:

November 2023

Analyze Image 4.0 GA

The Analyze Image 4.0 REST API is now in General Availability. Follow the Analyze Image 4.0 quickstart to get started.

The other features of Image Analysis, such as model customization, background removal, and multimodal embeddings, remain in public preview.

September 2023

Deprecation of outdated Computer Vision API versions

Computer Vision API versions 1.0, 2.0, 3.0, and 3.1 will be retired on September 13, 2026. Developers won't be able to make calls to these APIs after that date. We recommend that all affected customers migrate their workloads to the generally available Computer Vision 3.2 API at their earliest convenience by following this quickstart. Customers should also consider migrating to the Image Analysis 4.0 API (preview), which has our latest Image Analysis capabilities.

Visit our Q&A for any questions.
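
When planning a migration, it can help to scan configuration or source files for endpoint URLs that still target a retiring version. The sketch below assumes that request URLs embed the version as a `/vision/v<major>.<minor>/` path segment; that pattern, the example host name, and the helper `uses_retiring_version` are assumptions made for illustration, so adjust them to match the URLs your application actually uses.

```python
import re

# Versions named in the retirement notice above.
RETIRING_VERSIONS = {"1.0", "2.0", "3.0", "3.1"}

# Assumed URL shape: .../vision/v3.1/analyze (an illustration, not a spec).
VERSION_PATTERN = re.compile(r"/vision/v(\d+\.\d+)/")


def uses_retiring_version(url: str) -> bool:
    """Return True if the URL targets one of the retiring API versions."""
    match = VERSION_PATTERN.search(url)
    return bool(match) and match.group(1) in RETIRING_VERSIONS


# Hypothetical endpoints for demonstration.
urls = [
    "https://example.cognitiveservices.azure.com/vision/v3.1/analyze",
    "https://example.cognitiveservices.azure.com/vision/v3.2/analyze",
]
for url in urls:
    print(url, "->", "migrate" if uses_retiring_version(url) else "ok")
```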

May 2023

Image Analysis 4.0 Product Recognition (public preview)

The Product Recognition APIs let you analyze photos of shelves in a retail store. You can detect the presence and absence of products and get their bounding box coordinates. Use it in combination with model customization to train a model to identify your specific products. You can also compare Product Recognition results to your store's planogram document.

March 2023

Azure Vision Image Analysis 4.0 SDK public preview

The Florence foundation model is now integrated into Azure Vision. The improved Vision services enable developers to create market-ready, responsible Azure Vision applications across various industries. Customers can now seamlessly digitize, analyze, and connect their data to natural language interactions, unlocking powerful insights from their image and video content to support accessibility, drive acquisition through SEO, protect users from harmful content, enhance security, and improve incident response times. For more information, see Announcing Microsoft's Florence foundation model.

Image Analysis 4.0 SDK (public preview)

Image Analysis 4.0 is now available through client library SDKs in C#, C++, and Python. This update also includes the Florence-powered image captioning and dense captioning at human parity performance.

Image Analysis V4.0 Captioning and Dense Captioning (public preview):

"Caption" replaces "Describe" in V4.0 as the improved image captioning feature, with richer detail and semantic understanding. Dense Captions provides more detail by generating one-sentence descriptions of up to 10 regions of the image in addition to describing the whole image. Dense Captions also returns bounding box coordinates of the described image regions. There's also a new gender-neutral parameter that lets customers choose whether to enable probabilistic gender inference for alt-text and Seeing AI applications. Automatically deliver rich captions, accessible alt-text, SEO optimization, and intelligent photo curation to support digital content.

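As a rough illustration of how per-region captions and their bounding boxes might be consumed, the sketch below filters a Dense Captions result by confidence. The response shape (`denseCaptionsResult`, `values`, `boundingBox` with `x`, `y`, `w`, `h`) and the sample data are assumptions made for this example; check the API reference for the exact schema.

```python
# Hypothetical response fragment; the field names are assumptions.
sample_response = {
    "denseCaptionsResult": {
        "values": [
            {"text": "a street with parked cars", "confidence": 0.82,
             "boundingBox": {"x": 0, "y": 0, "w": 640, "h": 480}},
            {"text": "a red car parked at the curb", "confidence": 0.74,
             "boundingBox": {"x": 40, "y": 260, "w": 220, "h": 130}},
            {"text": "a blurry sign", "confidence": 0.31,
             "boundingBox": {"x": 500, "y": 90, "w": 60, "h": 40}},
        ]
    }
}


def region_captions(response, min_confidence=0.5):
    """Return (text, (x, y, w, h)) for each caption at or above min_confidence."""
    results = []
    for item in response.get("denseCaptionsResult", {}).get("values", []):
        if item["confidence"] < min_confidence:
            continue  # skip low-confidence regions
        box = item["boundingBox"]
        results.append((item["text"], (box["x"], box["y"], box["w"], box["h"])))
    return results


for text, box in region_captions(sample_response):
    print(box, text)
```
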
Video summary and frame locator (public preview):

Search and interact with video content in the same intuitive way you think and write. Locate relevant content without the need for extra metadata. Available only in Vision Studio.

Image Analysis 4.0 model customization (public preview)

You can now create and train your own custom image classification and object detection models, using Vision Studio or the v4.0 REST APIs.

Multimodal embeddings APIs (public preview)

The Multimodal embeddings APIs, part of the Image Analysis 4.0 API, enable the vectorization of images and text queries. They let you convert images and text to coordinates in a multi-dimensional vector space. You can now search with natural language and find relevant images using vector similarity search.
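
To make the idea of vector similarity search concrete, here is a minimal sketch that ranks images by cosine similarity to a query vector. The three-dimensional vectors are toy stand-ins, not real service output, and `cosine_similarity` and `search` are illustrative helpers rather than part of any SDK.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def search(query_vector, image_vectors, top_k=2):
    """Return the IDs of the top_k images most similar to the query."""
    scored = [
        (cosine_similarity(query_vector, vec), image_id)
        for image_id, vec in image_vectors.items()
    ]
    scored.sort(reverse=True)  # highest similarity first
    return [image_id for _, image_id in scored[:top_k]]


# Toy vectors standing in for image embeddings.
image_vectors = {
    "beach.jpg": [0.9, 0.1, 0.0],
    "forest.jpg": [0.1, 0.9, 0.1],
    "city.jpg": [0.0, 0.2, 0.9],
}
# Toy vector standing in for the embedding of a text query.
query_vector = [0.8, 0.2, 0.1]

print(search(query_vector, image_vectors))  # ['beach.jpg', 'forest.jpg']
```

In a real application, a vector database or search index would typically replace the linear scan shown here.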

Background removal APIs (public preview)

As part of the Image Analysis 4.0 API, the Background removal API lets you remove the background of an image. This operation can either output an image of the detected foreground object with a transparent background, or a grayscale alpha matte image showing the opacity of the detected foreground object.
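
One use of an alpha matte is compositing the foreground onto a new background. The sketch below works on tiny grayscale grids so that it stays dependency-free. It assumes a matte value of 255 means fully opaque foreground and 0 means fully transparent; `composite` is a hypothetical helper, not an API of the service.

```python
def composite(foreground, matte, background):
    """Blend foreground over background using a grayscale alpha matte.

    All inputs are equally sized 2D lists of 0-255 grayscale values.
    Assumption: matte 255 = opaque foreground, 0 = transparent.
    """
    out = []
    for fg_row, a_row, bg_row in zip(foreground, matte, background):
        row = []
        for fg, a, bg in zip(fg_row, a_row, bg_row):
            alpha = a / 255
            # Standard alpha blend: weight each source by its opacity.
            row.append(round(alpha * fg + (1 - alpha) * bg))
        out.append(row)
    return out


foreground = [[200, 200], [200, 200]]
matte = [[255, 0], [51, 255]]  # 51 / 255 = 20% opacity
background = [[100, 100], [100, 100]]

print(composite(foreground, matte, background))  # [[200, 100], [120, 200]]
```

For color images, the same blend applies to each channel independently.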

Azure Vision 3.0 & 3.1 previews deprecation

The preview versions of Azure Vision 3.0 and 3.1 APIs are scheduled to be retired on September 30, 2023. Customers won't be able to make any calls to these APIs past this date. Customers are encouraged to migrate their workloads to the generally available (GA) 3.2 API instead. Mind the following changes when migrating from the preview versions to the 3.2 API:

October 2022

Azure Vision Image Analysis 4.0 (public preview)

Image Analysis 4.0 has been released in public preview. The new API includes image captioning, image tagging, object detection, smart crops, people detection, and Read OCR functionality, all available through one Analyze Image operation. The OCR is optimized for general non-document images in a performance-enhanced synchronous API that makes it easier to embed OCR-powered experiences in your workflows.

September 2022

Azure Vision 3.0/3.1 Read previews deprecation

The preview versions of Azure Vision 3.0 and 3.1 Read API are scheduled to be retired on January 31, 2023. Customers are encouraged to refer to the How-To and QuickStarts to get started with the generally available (GA) version of the Read API instead. The latest GA versions provide the following benefits:

June 2022

Vision Studio launch

Vision Studio is a UI tool that lets you explore, build, and integrate features from Azure Vision into your applications.

Vision Studio provides you with a platform to try several service features, and see what they return in a visual manner. Using the Studio, you can get started without needing to write code, and then use the available client libraries and REST APIs in your application.

Azure Vision 3.2-preview deprecation

The preview versions of the 3.2 API are scheduled to be retired in December of 2022. Customers are encouraged to use the generally available (GA) version of the API instead. Mind the following changes when migrating from the 3.2-preview versions:

  1. The Analyze Image and Read API calls now take an optional model-version parameter that you can use to specify which AI model to use. By default, they use the latest model.
  2. The Analyze Image and Read API calls also return a model-version field in successful API responses. This field reports which model was used.
  3. Image Analysis APIs now use a different error-reporting format. See the API reference documentation to learn how to adjust any error-handling code.

May 2022

OCR (Read) API model is generally available (GA)

The latest Azure Vision OCR (Read) API model, which supports 164 languages, is now generally available as a cloud service and container.

See the OCR how-to guide to learn how to use the GA model.

February 2022

OCR (Read) API Public Preview supports 164 languages

Azure Vision's OCR (Read) API expands supported languages to 164 with its latest preview:

See the OCR how-to guide to learn how to use the new preview features.

September 2021

OCR (Read) API Public Preview supports 122 languages

Azure Vision's OCR (Read) API expands supported languages to 122 with its latest preview:

See the OCR how-to guide to learn how to use the new preview features.

August 2021

Image tagging language expansion

The latest version (v3.2) of the Image tagger now supports tags in 50 languages. See the language support page for more information.

July 2021

New HeadPose and Landmarks improvements for Detection_03

May 2021

Spatial Analysis container update

A new version of the Spatial Analysis container has been released with a new feature set. This Docker container lets you analyze real-time streaming video to understand spatial relationships between people and their movement through physical environments.

April 2021

Azure Vision v3.2 GA

Azure Vision API v3.2 is now generally available with the following updates:

March 2021

Azure Vision 3.2 Public Preview update

Azure Vision API v3.2 public preview has been updated. The preview release has all Azure Vision features along with updated Read and Analyze APIs.

February 2021

Read API v3.2 Public Preview with OCR support for 73 languages

Azure Vision Read API v3.2 public preview, available as a cloud service and Docker container, includes these updates:

See the Read API how-to guide to learn more.

January 2021

Spatial Analysis container update

A new version of the Spatial Analysis container has been released with a new feature set. This Docker container lets you analyze real-time streaming video to understand spatial relationships between people and their movement through physical environments.

October 2020

Azure Vision API v3.1 GA

Azure Vision API in General Availability has been upgraded to v3.1.

September 2020

Spatial Analysis container preview

The Spatial Analysis container is now in preview. The Spatial Analysis feature of Azure Vision lets you analyze real-time streaming video to understand spatial relationships between people and their movement through physical environments. Spatial Analysis is a Docker container you can use on-premises.

Read API v3.1 Public Preview adds OCR for Japanese

Azure Vision Read API v3.1 public preview adds these capabilities:

See the Read API how-to guide to learn more.

July 2020

Read API v3.1 Public Preview with OCR for Simplified Chinese

The Azure Vision in Foundry Tools Read API v3.1 public preview adds support for Simplified Chinese.

See the Read API how-to guide to learn more.

May 2020

Azure Vision API v3.0 entered General Availability, with updates to the Read API:

See the OCR overview to learn more.

March 2020

January 2020

Read API 3.0 Public Preview

You can now use version 3.0 of the Read API to extract printed or handwritten text from images. Compared to earlier versions, 3.0 provides:

Follow an Extract text quickstart to get started using the 3.0 API.

Azure update announcements for Foundry Tools