What's Changed
- Enable continuous decoding for NvTensorRtRtx EP by @anujj in #1697
- Use updated Decoder API with
skip_special_tokensby @sayanshaw24 in #1722 - Update extensions to include memleak fix by @baijumeswani in #1724
- Support batch processing for whisper example by @jiafatom in #1723
- Update onnxruntime_extensions dependency version by @baijumeswani in #1725
- Include C++ header in native nuget and fix compiler warnings by @baijumeswani in #1727
- Update Microsoft.Extensions.AI to 9.8.0 by @rogerbarreto in #1689
- Update Extensions commit for Qwen 2.5 Chat Template Tools Fix by @sayanshaw24 in #1730
- Whisper Truncation Extensions Commit Update by @sayanshaw24 in #1735
- Enable Cuda Graph for TensorRtRtx by default by @anujj in #1734
- Update sampling benchmark by @tianleiwu in #1729
- Add Windows WinML x64 build workflow by @chrisdMSFT in #1740
- Fix CUDA synchronization issue between ORT-GenAI and TRT-RTX inference by @anujj in #1733
- Hello WindowsML by @chrisdMSFT in #1711
- [CUDA] sampling kernel improvements by @tianleiwu in #1732
- Update GitHub Actions to latest versions by @snnn in #1749
- Update WinML version to 1.8.2091 by @nieubank in #1750
- Address macos packaging pipeline issues by @baijumeswani in #1747
- ProviderOptions level device filtering and APIs to configure model level device filtering by @vortex-captain in #1744
- Fix string indexing bug with Phi-4 mm tokenization by @kunal-vaishnavi in #1751
- Fix TRT-RTX EP regression by @gaugarg-nv in #1754
- Fix typo in C API header by @kunal-vaishnavi in #1753
- Enable WinML by default in ADO pipelines by @chrisdMSFT in #1755
- Change default build configuration to 'relwithdebinfo' by @baijumeswani in #1757
- Pin cmake and vcpkg versions in macOS workflows by @snnn in #1760
- Add TRT_RTX support for onnxruntime-genai-trt-rtx wheel by @anujj in #1736
- rel-0.10.0 by @chrisdMSFT in #1767
- Microsoft.ML.OnnxRuntimeGenAI.WinML.props by @chrisdMSFT in #1776
- Warning fix - ort_genai.h by @chrisdMSFT in #1778
- Microsoft.ML.OnnxRuntimeGenAI.targets by @chrisdMSFT in #1781
New Contributors
Full Changelog: v0.9.2...v0.10.0