-
Notifications
You must be signed in to change notification settings - Fork 235
TMT: run tests with GPUs #1101
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TMT: run tests with GPUs #1101
Conversation
Reviewer's GuideThis PR configures Packit to trigger TMT tests on NVIDIA GPU instances by adding dedicated RPM and no-RPM jobs, updates unit and system tests for deterministic behavior, fixes container build paths, and provides a TMT orchestration script alongside FMF plans for automated test runs. Class diagram for new and updated TMT test job configurationclassDiagram
class PackitJob {
+string job
+string trigger
+list packages
+list targets
+string tmt_plan
+string identifier
+bool skip_build
}
class FMFPlan {
+string name
+list tests
+string hardware_requirements
}
PackitJob "*" -- "*" FMFPlan : uses
class GPUInstance {
+string type
+string vendor
}
FMFPlan "1" -- "*" GPUInstance : requests
%% Highlight new/modified jobs
class PackitJob {
<<new/modified>>
}
class FMFPlan {
<<new/modified>>
}
File-Level Changes
Possibly linked issues
Tips and commandsInteracting with Sourcery
Customizing Your ExperienceAccess your dashboard to:
Getting Help
|
8e6be74
to
8b8828a
Compare
@ericcurtin @rhatdan we're able to access gpu instances via TMT and that can be verified through the TMT log. See But, |
Look at: .github/workflows/ci.yml: sudo ./container-images/scripts/build_llama_and_whisper.sh This builds the released version of llama.cpp and whisper.cpp and installs them in the host or in a container. |
1b9f459
to
eaa58c5
Compare
54cc323
to
5116408
Compare
I'm now seeing these 2 errors in
and this as well (looks like issues accessing the url).
The unit tests are failing on
See the detailed logs at: https://artifacts.dev.testing-farm.io/74a1da74-2417-4d94-ab38-e067214441d5/ |
i see one issue was no python3-huggingface-hub installed. |
Signed-off-by: Sergio Arroutbi <[email protected]>
a342d35
to
42d7a7e
Compare
For the rootful case, the default store is at /var/lib/ramalama. Signed-off-by: Lokesh Mandvekar <[email protected]>
b8a38e7
to
6d49d41
Compare
This commit adds TMT test jobs triggered via Packit that fetches an instance with NVIDIA GPU, specified in `plans/no-rpm.fmf`, and can be verified in the gpu_info test result. In addition, system tests (nocontainer), validate, and unit tests are also triggered via TMT. Fixes: containers#1054 TODO: 1. Enable bats-docker tests 2. Resolve f41 validate test failures Signed-off-by: Lokesh Mandvekar <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hey @lsm5 - I've reviewed your changes - here's some feedback:
- plans/no-rpm.fmf and plans/rpm.fmf are added but empty—please populate them with the FMF metadata needed for TMT to pick up those test plans.
- Replacing the two Fedora targets with
fedora-all
may pull in unintended variants—please verify that it matches the original scope of development and latest-stable. - The new bats-tests.sh script duplicates existing CI orchestration logic—consider reusing or refactoring current CI scripts to avoid maintaining parallel test runners.
Prompt for AI Agents
Please address the comments from this code review:
## Overall Comments
- plans/no-rpm.fmf and plans/rpm.fmf are added but empty—please populate them with the FMF metadata needed for TMT to pick up those test plans.
- Replacing the two Fedora targets with `fedora-all` may pull in unintended variants—please verify that it matches the original scope of development and latest-stable.
- The new bats-tests.sh script duplicates existing CI orchestration logic—consider reusing or refactoring current CI scripts to avoid maintaining parallel test runners.
Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.
They are not empty.
This was intended because validate test breaks on F41.
needed for TMT tests such that the scripts can also be run locally without any TMT environment. Ideally this config should live inside Makefile, but that can be for later. |
@ericcurtin @rhatdan @smooge PTAL. There's one commit from @sarroutbi from #1567 as well to fix a unit test issue. |
LGTM |
This commit adds TMT test jobs triggered via Packit that fetches an instance with NVIDIA GPU, specified in
plans/no-rpm.fmf
, and can be verified in the gpu_info test result.In addition, system tests (nocontainer), validate, and unit tests are also triggered via TMT.
Fixes: #1054
TODO:
Summary by Sourcery
Tests:
Summary by Sourcery
Enable GPU-accelerated and comprehensive TMT-based test workflows via Packit and new FMF plans, updating configuration and test scripts to support the enhanced testing pipeline.
New Features:
/plans/rpm
and/plans/no-rpm
FMF plans for Fedora and CentOSbats-tests.sh
script to manually run docker or nocontainer bats tests under TMTEnhancements:
fedora-all
in.packit.yaml
container_build.sh
CI:
Tests:
test_accel_image
for stable unit testing