fix: correct MPS execution on apple silicon by yanghan234 · Pull Request #145 · microsoft/mattersim

yanghan234 · 2026-04-07T10:16:28Z

Summary

register SphericalBasisLayer.coef as a buffer so it moves with the model on MPS
precompute graph-derived indexing values in batch_to_dict() and move the input dict to the target device
explicitly
remove MPS device-to-host synchronization hotspots in the M3GNet forward path and stress path

- Use batch_to_dict -> move_to_device pattern in deprecated get_properties method, consistent with predict/fit paths - Create index_map on the same device as num_triple_ij to avoid CPU/CUDA tensor mismatch in three_body_edge_map computation Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Merge the device transfer logic into batch_to_dict via its existing device parameter (now defaulting to None). This ensures every call site gets device placement automatically and removes the risk of forgetting a separate move_to_device call. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Add conftest.py with a device fixture that auto-detects available torch devices. Tests using the fixture run on all available backends. A --device flag allows restricting to a single device. Converted test_batch_relax.py from unittest to pytest style to use the device fixture. Verified passing on both cpu and mps. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- M3Gnet.forward now uses .get() with fallback computation for precomputed keys (total_num_atoms, bond_index_bias, etc.), so callers constructing input dicts directly won't KeyError. - batch_to_dict creates index_map on CPU (moved to device at the end), avoiding intermediate device mismatches. - Remove unused pytest import in test_batch_relax.py. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Prevents device mismatch if graph_batch tensors are already on a non-CPU device when batch_to_dict is called. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

fix: correct MPS execution in m3gnet forcefield

191cf36

yanghan234 force-pushed the hanyang/fix-mps-device-handling branch from 450d7cf to 191cf36 Compare April 7, 2026 12:20

yanghan234 and others added 6 commits April 7, 2026 13:27

test: log device name during test execution

1939a2d

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

fix: create index_map on same device as num_triple_ij in batch_to_dict

6ac93e7

Prevents device mismatch if graph_batch tensors are already on a non-CPU device when batch_to_dict is called. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: correct MPS execution on apple silicon#145

fix: correct MPS execution on apple silicon#145
yanghan234 wants to merge 7 commits intomainfrom
hanyang/fix-mps-device-handling

yanghan234 commented Apr 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

yanghan234 commented Apr 7, 2026

Summary

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant