Replace rsmi_init with amdsmi_init (via dlsym) in intra_node_comm#3299

Open
adam360x wants to merge 62 commits into develop from users/adam360x/fix-rsmi-init-interposition

Replace all rsmi_* usage with AMDSMI (via dlsym) in intra_node_comm

edd012e
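
The commit title refers to resolving the AMD SMI entry point at run time through dlsym instead of linking against it directly. The sketch below shows that general technique only; it is not the code from this PR. The helper names `resolve_symbol` and `try_amdsmi_init` are hypothetical, and the library file name `libamd_smi.so`, the `int (*)(uint64_t)` signature, the flag value `1 << 1` for `AMDSMI_INIT_AMD_GPUS`, and `0` as the success status are assumptions that should be checked against the installed `amdsmi.h`.

```cpp
#include <dlfcn.h>

#include <cstdint>

// Assumed shape of amdsmi_init: takes a 64-bit flag mask and returns a status
// enum. A plain int return type is used here so the AMD SMI header is not needed.
using amdsmi_init_fn = int (*)(std::uint64_t);

// Hypothetical helper: open `lib` and look up `symbol`.
// Returns nullptr if either the library or the symbol is unavailable.
void* resolve_symbol(const char* lib, const char* symbol) {
  void* handle = dlopen(lib, RTLD_NOW | RTLD_LOCAL);
  if (handle == nullptr) {
    return nullptr;  // library not installed or not on the loader path
  }
  dlerror();  // clear any stale error state before the lookup
  void* sym = dlsym(handle, symbol);
  if (sym == nullptr) {
    dlclose(handle);  // nothing resolved, so the handle is no longer needed
    return nullptr;
  }
  // The handle is deliberately left open: the returned pointer is only valid
  // while the library stays loaded.
  return sym;
}

// Hypothetical wrapper: returns true only if the library was found and the
// init call reported success.
bool try_amdsmi_init() {
  auto init = reinterpret_cast<amdsmi_init_fn>(
      resolve_symbol("libamd_smi.so", "amdsmi_init"));  // assumed file name
  if (init == nullptr) {
    return false;
  }
  constexpr std::uint64_t kInitAmdGpus = 1ULL << 1;  // assumed AMDSMI_INIT_AMD_GPUS
  return init(kInitAmdGpus) == 0;                    // assumed AMDSMI_STATUS_SUCCESS == 0
}
```

With this shape, a caller can treat a missing library the same way as a failed initialization and fall back to another code path, rather than failing at load time on machines without AMD SMI.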
ROCm Repo Management API / Jenkins failed Jun 16, 2026 in 5h 52m 2s

Tests/Test Inductor/Run pytorch_inductor_1: warning in 'junit' step

Tests / Test PyTorch / Test PyTorch / Run pytorch_test_2

  • Error signal (error step):
    • Found 1 failure(s) in pytorch_reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-31aeee80e26252d1.xml report
    • Found 1 failure(s) in pytorch_reports/python-pytest/test_cuda/test_cuda-4a39cb2c0a7559f1.xml report
    • Found 1 failure(s) in pytorch_reports/python-pytest/test_cuda/test_cuda-bd9e9c3d53e31eba.xml report
  • Error signal (error step): Some tests are failed or errored
  • Error signal (error step): pytorch_test_2 failed
  • Archive JUnit-formatted test results (junit step): warning, 3 tests failed

Tests / Test Distributed / Test Distributed / Run pytorch_distributed_1

  • Error signal (error step):
    • Found 1 failure(s) in pytorch_reports/dist-nccl-init-file/distributed.test_distributed_spawn/distributed.test_distributed_spawn-59db5d6c2b2da568.xml report
  • Error signal (error step): Some tests are failed or errored
  • Error signal (error step): pytorch_distributed_1 failed
  • Archive JUnit-formatted test results (junit step): warning, 1 tests failed

Tests / Test Inductor / Test Inductor / Run pytorch_inductor_1

  • Error signal (error step):
    • Found 1 failure(s) in pytorch_reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-f3e73088079c8ba8.xml report
  • Error signal (error step): Some tests are failed or errored
  • Error signal (error step): pytorch_inductor_1 failed
  • Archive JUnit-formatted test results (junit step): warning, 1 tests failed

Details

  • Kill older PR Builds (2.6 sec)
  • Initialize (51 min)
    • Download CI scripts (31 sec)
    • Checkout Pytorch (1 min 13 sec)
    • Check base Docker image existence (10 sec)
    • Pull Docker Image (9 min 50 sec)
    • Build PyTorch (38 min)
  • Tests (5 hr 0 min)
    • Test PyTorch (9 ms)
      • Test PyTorch (1 hr 48 min)
        • Run pytorch_test_1 (1 hr 2 min)
        • Run pytorch_test_2 (45 min)
          Error: Found 1 failure(s) in pytorch_reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-31aeee80e26252d1.xml report
                 Found 1 failure(s) in pytorch_reports/python-pytest/test_cuda/test_cuda-4a39cb2c0a7559f1.xml report
                 Found 1 failure(s) in pytorch_reports/python-pytest/test_cuda/test_cuda-bd9e9c3d53e31eba.xml report - logs
          Error: Some tests are failed or errored - logs
          Error: pytorch_test_2 failed - logs
          Unstable: 3 tests failed - logs
    • Test Distributed (10 ms)
      • Test Distributed (5 hr 0 min)
        • Run pytorch_distributed_1 (2 hr 8 min)
          Error: Found 1 failure(s) in pytorch_reports/dist-nccl-init-file/distributed.test_distributed_spawn/distributed.test_distributed_spawn-59db5d6c2b2da568.xml report - logs
          Error: Some tests are failed or errored - logs
          Error: pytorch_distributed_1 failed - logs
          Unstable: 1 tests failed - logs
        • Run pytorch_distributed_2 (2 hr 51 min)
    • Test Inductor (8 ms)
      • Test Inductor (2 hr 56 min)
        • Run pytorch_inductor_1 (2 hr 56 min)
          Error: Found 1 failure(s) in pytorch_reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-f3e73088079c8ba8.xml report - logs
          Error: Some tests are failed or errored - logs
          Error: pytorch_inductor_1 failed - logs
          Unstable: 1 tests failed - logs
    • Test PyTorch Slow (12 ms)
      • Test PyTorch Slow (12 sec)
    • Microbenchmark (25 sec)
      • Microbenchmark (12 sec)
  • Post Build (2.6 sec)
  • Declarative: Post Actions (5.2 sec)