Add Arm64 AdvSimd implementation of Matrix4x4 Invert by a74nh · Pull Request #128640 · dotnet/runtime

a74nh · 2026-05-27T11:11:19Z

Code proposed by the Arm MCP server guided workflow. https://github.com/arm/mcp

Testing using dotnet/performance InvertBenchmark shows a 17% improvement on Cobalt.


Copilot

Pull request overview

Note: Copilot was unable to run its full agentic suite in this review.

Adds an Arm64 (AdvSimd/NEON) intrinsic implementation for Matrix4x4 inversion to improve performance on Arm64 platforms.

Changes:

Add an AdvSimd.Arm64 fast-path in Invert.
Introduce AdvSimdImpl mirroring the existing DirectXMath/SSE-based inversion algorithm using NEON intrinsics.
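
The fast-path described above follows the usual hardware-intrinsics dispatch pattern in .NET: check `IsSupported` on the most specific instruction set first and fall back otherwise. The following is a minimal sketch of that pattern, not the PR's code. `InvertDispatch` and the `path` output are hypothetical names introduced for illustration, and every branch simply calls the public `Matrix4x4.Invert` so the sample stays runnable on any platform.

```csharp
using System;
using System.Numerics;
using System.Runtime.Intrinsics.Arm;
using System.Runtime.Intrinsics.X86;

public static class InvertDispatchSketch
{
    // Hypothetical wrapper: reports which branch a per-ISA dispatch would take,
    // then defers to the public API for the actual inversion.
    public static bool InvertDispatch(in Matrix4x4 matrix, out Matrix4x4 result, out string path)
    {
        if (AdvSimd.Arm64.IsSupported)
        {
            path = "AdvSimd.Arm64";      // where an Arm64 NEON path would be chosen
        }
        else if (Sse.IsSupported)
        {
            path = "Sse";                // where an x86/x64 SSE path would be chosen
        }
        else
        {
            path = "software fallback";  // scalar path for everything else
        }

        return Matrix4x4.Invert(matrix, out result);
    }

    public static void Main()
    {
        Matrix4x4 m = Matrix4x4.CreateScale(2f) * Matrix4x4.CreateTranslation(1f, 2f, 3f);

        bool ok = InvertDispatch(in m, out Matrix4x4 inv, out string path);

        Console.WriteLine($"path={path}, invertible={ok}");
        Console.WriteLine(m * inv); // should be close to the identity matrix
    }
}
```

Because `IsSupported` is treated as a JIT-time constant, a check like this normally costs nothing at run time: the branches that do not apply are dropped when the method is compiled.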

dotnet-policy-service · 2026-05-27T11:12:49Z

Tagging subscribers to this area: @dotnet/area-system-numerics
See info in area-owners.md if you want to be subscribed.

tannergooding · 2026-05-27T16:16:02Z

                }

+                [CompExactlyDependsOn(typeof(AdvSimd.Arm64))]
+                static bool AdvSimdImpl(in Impl matrix, out Impl result)


As a general nit, we're looking at landing #127690 which adds a number of xplat helper APIs and will allow us to unify the Arm64 and x64 implementations to a single code path.

It provides helpers like Vector128.ConcateLowerLower(row1, row2), which avoid having to extract a Vector64<T> where that isn't viable (such as on x64, or for SVE, where no "half width" vector exists for Vector<T>).

It also provides ones like Vector128.UnzipEven(vTemp1, vTemp2), which hide whether a platform or base type needs a shuffle or has a dedicated instruction.

If we can hold off until that lands, we should be able to just update Matrix4x4 to no longer have any architecture specific code paths.
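
To make the two helper shapes mentioned above concrete: the names suggest that `ConcateLowerLower(a, b)` (spelled as in the comment) joins the lower halves of two vectors, and that `UnzipEven(a, b)` gathers the even-indexed elements of both, as the Arm64 UZP1 instruction does. The sketch below only emulates those assumed semantics with APIs that already ship (`GetLower`, `Vector128.Create`, `GetElement`). `ConcatLowerHalves` and `UnzipEvenElements` are hypothetical stand-ins, not the signatures from #127690.

```csharp
using System;
using System.Runtime.Intrinsics;

public static class HelperSemanticsSketch
{
    // Assumed semantics: [a0, a1, b0, b1]
    public static Vector128<float> ConcatLowerHalves(Vector128<float> a, Vector128<float> b)
        => Vector128.Create(a.GetLower(), b.GetLower());

    // Assumed semantics: [a0, a2, b0, b2]
    public static Vector128<float> UnzipEvenElements(Vector128<float> a, Vector128<float> b)
        => Vector128.Create(a.GetElement(0), a.GetElement(2), b.GetElement(0), b.GetElement(2));

    public static void Main()
    {
        Vector128<float> a = Vector128.Create(0f, 1f, 2f, 3f);
        Vector128<float> b = Vector128.Create(10f, 11f, 12f, 13f);

        Console.WriteLine(ConcatLowerHalves(a, b));
        Console.WriteLine(UnzipEvenElements(a, b));
    }
}
```

The first emulation goes through `Vector64<float>`, which is exactly the step the comment says a cross-platform helper would avoid on targets without a half-width vector type.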

a74nh · 2026-05-28

> If we can hold off until that lands, we should be able to just update Matrix4x4 to no longer have any architecture specific code paths.

That would be a much better solution. This PR essentially duplicates the x86 code path.

What are the chances of #127690 landing and someone producing a combined version of Invert in time for .NET 11? If that is unlikely, would this PR be useful as a stopgap to help performance? Understood if you would rather not, for code size and churn reasons.

tannergooding · 2026-05-28

#127690 should be merged in the next few days; it's just waiting on secondary sign-off. It's part of the planned work for .NET 11.

Once that's done, updating Matrix4x4 to be xplat should be trivial; I can get it done relatively quickly.

a74nh · 2026-05-29T08:40:00Z

Closing this as it should be implemented with the new APIs once they are available.

Add Arm64 AdvSimd implementation of Matrix4x4 Invert

3236568


Copilot AI review requested due to automatic review settings May 27, 2026 11:11

github-actions Bot added the area-System.Numerics label May 27, 2026

dotnet-policy-service Bot added the community-contribution Indicates that the PR has been added by a community member label May 27, 2026

Copilot AI reviewed May 27, 2026


Comment thread src/libraries/System.Private.CoreLib/src/System/Numerics/Matrix4x4.Impl.cs


This was referenced May 27, 2026

slow macOS - "##[error]The job running on agent Azure Pipelines 9 ran longer than the maximum time of 60 minutes." dotnet/dnceng#1883

Open

The Operation will be canceled. The next steps may not contain expected logs. dotnet/dnceng#3008

Open

tannergooding reviewed May 27, 2026


This was referenced May 27, 2026

"We stopped hearing from agent Azure Pipelines 32. Verify the agent machine is running and has a healthy network connection" dotnet/dnceng#1886

Open

XHarness package install failure on iOS due to devicectl NSPOSIXErrorDomain error 49 #123796

Open

a74nh closed this May 29, 2026


Add Arm64 AdvSimd implementation of Matrix4x4 Invert #128640
a74nh wants to merge 1 commit into dotnet:main from a74nh:matrix_github

