Conversation

@planetf1
Contributor

@planetf1 planetf1 commented Feb 2, 2026

Test Infrastructure Improvements

Type of PR

  • Bug Fix
  • New Feature
  • Documentation
  • Other

Description

Fixes test infrastructure issues that prevented tests from running on systems without the required backends. Tests were failing at collection time because backends were initialized at import time and some tests lacked the appropriate markers.

Fixes #396

Changes:

  • Fixed tests that called start_session() at class/module definition time by moving the calls into fixtures (see the fixture sketch after this list)
  • Added missing ollama and llm markers to tests using default backend
  • Added missing ollama marker to guardian safety examples (guardian.py, guardian_huggingface.py)
  • Marked the flaky test_kv with xfail(strict=False) - the model sometimes issues a safety refusal despite the provided context (see the marker sketch after this list)
  • Marked the broken docs/examples/aLora/101_example.py with a skip marker
  • Skipped docs/examples/mify/rich_document_advanced.py - CXXABI_1.3.15 not found on HPC systems with an old system libstdc++
    • Environment-specific issue, not a code bug
    • Example fails to import due to conda environment using system libstdc++.so.6 (too old)
    • Skipped to prevent test failures on affected systems
  • Documented NVIDIA MPS solution for GPU test sharing - added comprehensive guide in test/README.md
    • Explains "Parent Trap" issue: pytest parent holds CUDA context, blocking subprocesses in EXCLUSIVE_PROCESS mode
    • Solution 1 (Recommended): Enable NVIDIA MPS via job scheduler flag (e.g., mps=yes for LSF)
      • Allows multiple processes to share GPU without code changes
      • Verified on IBM HPC: 34/34 tests passed, 0 "CUDA device busy" errors, 5:46 runtime (Job 434111)
      • Note: MPS must be enabled per-job; it's not a repository setting
    • Solution 2 (Fallback): Run tests sequentially when MPS unavailable
    • MPS solves GPU sharing at the driver level - no application code changes are needed (an illustrative fallback sketch follows this list)
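
As a minimal sketch of the fixture change and the added markers — the import path, the instruct() call, and the teardown behaviour are assumptions for illustration; only start_session() and the ollama/llm marker names come from this PR:

```python
import pytest

# Before: the session was created at class/module definition time, so merely
# collecting the test file tried to initialize the backend:
#
#   class TestGenerate:
#       session = start_session()   # runs at import -> collection-time failure
#
# After: defer backend initialization to a fixture, so pytest can collect the
# file everywhere and only touches the backend when a test actually runs.


@pytest.fixture(scope="module")
def session():
    from mellea import start_session  # hypothetical import path

    s = start_session()
    yield s
    # Teardown hook: any backend cleanup (closing connections, freeing GPU
    # memory) would go here; omitted because the exact API is project-specific.


@pytest.mark.ollama  # deselected on systems without an Ollama backend
@pytest.mark.llm
def test_generate_smoke(session):
    result = session.instruct("Say hello.")  # hypothetical API call
    assert result is not None
```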
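
The flaky and environment-specific cases use standard pytest mechanisms; the reason strings below are paraphrased from the description, and the two snippets live in different files:

```python
import pytest

# --- in the module containing the flaky test --------------------------------
# strict=False lets the test pass when the model answers normally, while a
# safety refusal is reported as xfail rather than a hard failure.
@pytest.mark.xfail(
    strict=False,
    reason="model sometimes issues a safety refusal despite the provided context",
)
def test_kv(session):
    ...


# --- at the top of docs/examples/mify/rich_document_advanced.py (or its test
# wrapper) --------------------------------------------------------------------
# Importing the example pulls in a native extension that needs CXXABI_1.3.15;
# on HPC systems with an old system libstdc++ the import itself fails, so the
# module raises Skipped before the problematic import runs.
pytest.skip(
    "requires CXXABI_1.3.15 (newer libstdc++); unavailable on some HPC systems",
    allow_module_level=True,
)
```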
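
The MPS fix itself is a scheduler/driver setting rather than a code change; purely as an illustration of the Solution 2 fallback, a conftest-style guard could detect whether an MPS control daemon appears to be running and skip (or serialize) the GPU-sharing tests otherwise. The pipe-directory default and the environment variable are assumptions about a typical MPS setup, not something added by this PR:

```python
import os

import pytest


def mps_available() -> bool:
    """Heuristic check for a running NVIDIA MPS control daemon.

    MPS clients and the daemon rendezvous through a pipe directory; the
    conventional default is /tmp/nvidia-mps, overridable via
    CUDA_MPS_PIPE_DIRECTORY. Both values are assumptions about the local
    setup, not part of this PR.
    """
    pipe_dir = os.environ.get("CUDA_MPS_PIPE_DIRECTORY", "/tmp/nvidia-mps")
    return os.path.isdir(pipe_dir)


# Hypothetical marker for tests that spawn CUDA-using subprocesses: with the
# GPU in EXCLUSIVE_PROCESS mode and no MPS daemon, the pytest parent's CUDA
# context blocks those children (the "Parent Trap"), so skip instead of fail.
requires_gpu_sharing = pytest.mark.skipif(
    not mps_available(),
    reason="NVIDIA MPS not detected; run these tests sequentially instead",
)
```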

Results: 34/34 HuggingFace tests pass on HPC with MPS enabled. Tests properly skip when backends unavailable.

Verification:

  • Full mypy check passes (245 source files, 0 errors)
  • All pre-commit hooks pass (ruff format, ruff lint, mypy, uv-lock, codespell)
  • HPC tests verified: 34/34 tests pass with MPS enabled

Testing

  • Tests added to the respective file if code was changed
  • New code has 100% coverage if code was added
  • Ensure existing tests and GitHub automation pass (a maintainer will kick off the GitHub automation when the rest of the PR is populated)

@github-actions
Contributor

github-actions bot commented Feb 2, 2026

The PR description has been updated. Please fill out the template for your PR to be reviewed.

@mergify

mergify bot commented Feb 2, 2026

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

  • title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert|release)(?:\(.+\))?:

@planetf1 planetf1 changed the title from "test: optimize GPU cleanup to module scope" to "test: ensure all hf tests run (cuda)" Feb 3, 2026
@planetf1 planetf1 requested a review from jakelorocco February 3, 2026 16:02
@planetf1
Copy link
Contributor Author

planetf1 commented Feb 3, 2026

This PR now allows the HF tests to run on a well-provisioned environment.

Rebased about an hour ago - I'd appreciate any review comments, after which I can squash and rebase to clean up the commit log and run a final test.

@planetf1 planetf1 marked this pull request as ready for review February 3, 2026 16:03
@planetf1 planetf1 force-pushed the largetests branch 8 times, most recently from 3876742 to 938cc89 on February 3, 2026 19:28
- Add @pytest.mark.ollama to tests requiring Ollama backend
- Update test/README.md with comprehensive marker documentation
- Update .gitignore for logs/ and pytest output files