Skip to content

AMDGPU tests are OOMing the runners #428

@kshyatt

Description

@kshyatt

Thanks @luraess for surfacing this

We're being bad neighbors in the AMD GPU CI, our tests are tooooooo memory heavy and it's OOMing some of the Docker runners.

Seems it allocs about 60 GB if not wrong. Runners 1 has "only" 32 GB. Maybe that's one of the reason. Dunno tho as in CUDA tests memory usage spikes at 80 GB. But maybe the runner can accommodate this for CUDA

Quick fix: restrict to target gfx1101 in our runners.

Could be a good forcing function to make our tests more modular and do what the GPUArrays testsuite does.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions