fix(e2e): reduce E2E test flakiness

e579fc6
Open

fix(e2e): reduce E2E test flakiness (sandbox events, duplicate CSE timing) #8480

Azure Pipelines / Agentbaker E2E failed May 25, 2026 in 37m 10s

Build #20260525.3 had test failures

Details

Tests

  • Failed: 2 (0.49%)
  • Passed: 405 (99.51%)
  • Other: 0 (0.00%)
  • Total: 407
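The percentages in the summary follow from the raw counts (2 failed and 405 passed out of 407). A minimal sketch of that arithmetic, assuming two-decimal rounding; the helper name `pct` is hypothetical and not part of the pipeline or the test suite:

```go
package main

import "fmt"

// pct formats count as a percentage of total with two decimals.
// Hypothetical helper for illustration; it guards against a zero total.
func pct(count, total int) string {
	if total == 0 {
		return "0.00%"
	}
	return fmt.Sprintf("%.2f%%", float64(count)/float64(total)*100)
}

func main() {
	const total = 407 // total tests reported for this build
	fmt.Println("Failed:", pct(2, total))
	fmt.Println("Passed:", pct(405, total))
	fmt.Println("Other:", pct(0, total))
}
```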

Annotations

Check failure on line 1024 in Build log

@azure-pipelines azure-pipelines / Agentbaker E2E

Build log #L1024

Script failed with exit code: 1

Check failure on line 1 in Test_ACL_ARM64/default

@azure-pipelines azure-pipelines / Agentbaker E2E

Test_ACL_ARM64/default

Failed
Raw output
=== RUN   Test_ACL_ARM64/default
=== PAUSE Test_ACL_ARM64/default
=== CONT  Test_ACL_ARM64/default
    azure.go:478: [0.000s] Looking up images in https://ms.portal.azure.com/#@microsoft.onmicrosoft.com/resource/subscriptions/c4c3550e-a965-4993-a50c-628fd38cd3e1/resourceGroups/aksvhdtestbuildrg/providers/Microsoft.Compute/galleries/PackerSigGalleryEastUS/images/aks-acl-arm64-gen2-tl/overview
    azure.go:567: [139.516s] Image version /subscriptions/c4c3550e-a965-4993-a50c-628fd38cd3e1/resourceGroups/aksvhdtestbuildrg/providers/Microsoft.Compute/galleries/PackerSigGalleryEastUS/images/aclgen2arm64TL/versions/1.1779606253.32116 is already in region westus3
    vhd.go:363: [139.516s] got version by tag branch=refs/heads/main: https://ms.portal.azure.com/#@microsoft.onmicrosoft.com/resource/subscriptions/c4c3550e-a965-4993-a50c-628fd38cd3e1/resourceGroups/aksvhdtestbuildrg/providers/Microsoft.Compute/galleries/PackerSigGalleryEastUS/images/aks-acl-arm64-gen2-tl/versions/1.1779606253.32116/overview
    test_helpers.go:390: [139.516s] TAGS {Name:Test_ACL_ARM64/default ImageName:aclgen2arm64TL OS:azurecontainerlinux Arch:arm64 NetworkIsolated:false NonAnonymousACR:false GPU:false WASM:false BootstrapTokenFallback:false KubeletCustomConfig:false Scriptless:false VHDCaching:false MockAzureChinaCloud:false VMSeriesCoverageTest:false}
    test_helpers.go:221: [139.517s] → running scenario...
    test_helpers.go:258: [139.517s] → preparing AKS node...
    vmss.go:476: [139.517s] → creating VMSS o3aw-2026-05-25-aclarm64default...
    vmss.go:384: [139.835s] VMSS portal link: https://ms.portal.azure.com/#@microsoft.onmicrosoft.com/resource/subscriptions/8ecadfc9-d1a3-4ea4-b844-0d9f87e4d7c8/resourceGroups/MC_abe2e-westus3_abe2e-kubenet-v4-e1f58_westus3/providers/Microsoft.Compute/virtualMachineScaleSets/o3aw-2026-05-25-aclarm64default/overview
    vmss.go:390: [139.835s] Managed cluster portal link: https://ms.portal.azure.com/#@microsoft.onmicrosoft.com/resource/subscriptions/8ecadfc9-d1a3-4ea4-b844-0d9f87e4d7c8/resourceGroups/MC_abe2e-westus3_abe2e-kubenet-v4-e1f58_westus3/providers/Microsoft.ContainerService/managedClusters/abe2e-kubenet-v4-e1f58/overview
    vmss.go:509: [143.727s] VM will be automatically deleted after the test finishes, to preserve it for debugging purposes set KEEP_VMSS=true or pause the test with a breakpoint before the test finishes or failed
    vmss.go:513: [143.727s] SSH Instructions: (may take a few minutes for the VM to be ready for SSH)
        ========================
        az network bastion ssh --target-resource-id "/subscriptions/8ecadfc9-d1a3-4ea4-b844-0d9f87e4d7c8/resourceGroups/MC_abe2e-westus3_abe2e-kubenet-v4-e1f58_westus3/providers/Microsoft.Compute/virtualMachineScaleSets/o3aw-2026-05-25-aclarm64default/virtualMachines/0" --name "abe2e-kubenet-v4-e1f58-bastion" --resource-group MC_abe2e-westus3_abe2e-kubenet-v4-e1f58_westus3 --auth-type ssh-key --username azureuser --ssh-key /tmp/private-key-1529357407
        
    bastionssh.go:304: [335.612s] Attempt 1/5 establishing SSH over bastion to 10.224.0.15
    vmss.go:563: [336.792s] VM reached running state
    vmss.go:533: [336.792s] ✓ creating VMSS o3aw-2026-05-25-aclarm64default done (197.3s)
    kube.go:132: [336.792s] → waiting for node o3aw-2026-05-25-aclarm64default to be ready...
    kube.go:154: [336.876s] node o3aw-2026-05-25-aclarm64default000000 is ready. Taints: [{"key":"node.cloudprovider.kubernetes.io/uninitialized","value":"true","effect":"NoSchedule"}] Conditions: [{"type":"MemoryPressure","status":"False","lastHeartbeatTime":"2026-05-25T01:01:26Z","lastTransitionTime":"2026-05-25T01:01:22Z","reason":"KubeletHasSufficientMemory","message":"kubelet has sufficient memory available"},{"type":"DiskPressure","status":"False","lastHeartbeatTime":"2026-05-25T01:01:26Z","last
... [The stack trace has been truncated as it exceeded the maximum allowed size. Please refer to the complete log available in the Test Run attachments for full details.]

Check failure on line 1 in Test_ACL_ARM64

@azure-pipelines azure-pipelines / Agentbaker E2E

Test_ACL_ARM64

Failed
Raw output
=== RUN   Test_ACL_ARM64
=== PAUSE Test_ACL_ARM64
=== CONT  Test_ACL_ARM64
--- FAIL: Test_ACL_ARM64 (0.00s)