Skip to content

Evaluation with GitHub Copilot #522

Evaluation with GitHub Copilot

Evaluation with GitHub Copilot #522

Re-run triggered April 25, 2026 15:17
Status Success
Total duration 59m 34s
Artifacts 102

copilot-evaluation.yml

on: workflow_dispatch
get-entries  /  get-entries
16s
get-entries / get-entries
Matrix: evaluate-with-copilot-cli
summarize-results  /  Results
45s
summarize-results / Results
requeue  /  requeue-if-needed
4s
requeue / requeue-if-needed
requeue  /  cleanup-ephemeral-tag
0s
requeue / cleanup-ephemeral-tag
Fit to window
Zoom out
Zoom in

Annotations

1 error and 2 warnings
bcbench.evaluate.testgeneration
Build failed during evaluation of microsoftInternal__NAV-176194: Build or publish failed for App\Layers\W1\Tests\ERM: C:\Source\App\Layers\W1\Tests\ERM\O365SalesItemChargeTests.Codeunit.al(320,40): error AL0185: Enum 'Default Quantity to Ship' is missing
bcbench.results.base
Result for microsoftInternal__NAV-176194 missing metrics: llm_duration
bcbench.results.base
Result for microsoftInternal__NAV-214825 missing metrics: llm_duration

Artifacts

Produced during runtime
Name Size Digest
evaluation-results-24930694425-microsoftInternal__NAV-176194
2.83 KB
sha256:b3e9504272aa51cfe84ebc2426de340dec8d1f473ab315d4f6dcb70e972a547c
evaluation-results-24930694425-microsoftInternal__NAV-214825
2.71 KB
sha256:ce277653e46281c91e11aac94376045c5c369441f09d8a08c4f784fb1dac16fe
evaluation-summary
1.03 KB
sha256:e188fa1a615a21e5c7b80d3441a21445de70c180ea33813cc06bbf3b1804a3ea