otto test¶

otto test runs test suites – classes that extend OttoSuite with a Test-prefixed name, which registers them automatically. Each suite becomes its own subcommand with typed CLI options.

`otto test --help`¶

otto test --help __ __ ____ / /_/ /_____ / __ \/ __/ __/ __ \ / /_/ / /_/ /_/ /_/ / \____/\__/\__/\____/ Usage: otto test [OPTIONS] COMMAND [ARGS]... Run a registered test suite by name, or select tests directly with --tests / -m — no suite required. ╭─ Options ────────────────────────────────────────────────────────────────────╮ │ --list-suites List test suites │ │ with run syntax │ │ and exit. │ │ --list-markers List the markers │ │ available to │ │ --markers and │ │ exit. │ │ --list-tests List the selected │ │ tests (optionally │ │ narrowed by a │ │ suite name / │ │ --markers) and │ │ exit. │ │ --markers -m EXPRESSION pytest -m marker │ │ expression │ │ applied after │ │ collection. With │ │ no suite name, │ │ runs a suite-less │ │ selection across │ │ all repos. │ │ --tests NAME[,NAME...] Run specific │ │ tests by exact │ │ name, across all │ │ suites and repos │ │ — no suite │ │ subcommand │ │ needed. │ │ Comma-separated; │ │ TestClass::name │ │ disambiguates. │ │ Combine with │ │ --markers to │ │ narrow. │ │ --iterations -i <int> Repeat each test │ │ N times within a │ │ single │ │ setup/teardown │ │ cycle (0 = │ │ disabled). │ │ [default: 0] │ │ --duration -d <int> Repeat tests for │ │ N seconds within │ │ a single │ │ setup/teardown │ │ cycle (0 = │ │ disabled). │ │ [default: 0] │ │ --threshold <float> Minimum per-test │ │ pass rate │ │ percentage │ │ required in │ │ stability mode │ │ (0-100). │ │ [default: 100.0] │ │ --results PATH Write test │ │ results (JUnit │ │ XML) to PATH │ │ (default: auto in │ │ log dir). │ │ --cov Collect gcov │ │ coverage from │ │ remote hosts │ │ after the suite │ │ finishes. │ │ --cov-dir <directory> Directory to │ │ write coverage │ │ data to. Implies │ │ --cov. Default │ │ when --cov is │ │ used alone: │ │ <output_dir>/cov. │ │ --overwrite-cov-… Allow --cov-dir │ │ to target an │ │ existing │ │ non-empty │ │ directory (its │ │ contents will be │ │ cleared before │ │ the run). │ │ --cov-clean --no-cov-clean Delete .gcda │ │ files on remote │ │ hosts before the │ │ test run. │ │ [default: │ │ cov-clean] │ │ --cov-report -r Generate an HTML │ │ coverage report │ │ after the suite │ │ finishes. Implies │ │ --cov. Default │ │ location: │ │ <output_dir>/cov… │ │ --cov-report-dir <directory> Directory to │ │ write the HTML │ │ coverage report │ │ to. Implies │ │ --cov-report (and │ │ --cov). Default │ │ when --cov-report │ │ is used alone: │ │ <output_dir>/cov… │ │ --overwrite-cov-… Allow │ │ --cov-report-dir │ │ to target an │ │ existing │ │ non-empty │ │ directory (its │ │ contents will be │ │ cleared before │ │ the report is │ │ rendered). │ │ --project-name <str> Title shown in │ │ the HTML report │ │ header (only used │ │ with │ │ --cov-report). │ │ [default: │ │ Coverage Report] │ │ --monitor --no-monitor Collect host │ │ performance │ │ metrics for the │ │ entire test run. │ │ [default: │ │ no-monitor] │ │ --monitor-interv… SECONDS [x>=1.0] Sampling interval │ │ for --monitor. │ │ [default: 5.0] │ │ --monitor-output PATH Override the │ │ destination for │ │ monitor data. │ │ Format inferred │ │ from suffix │ │ (.json or .db). │ │ Default: │ │ <output_dir>/mon… │ │ --monitor-hosts REGEX Regex matched │ │ against host IDs │ │ to restrict │ │ --monitor │ │ (re.search). │ │ --help -h Show this message │ │ and exit. │ ╰──────────────────────────────────────────────────────────────────────────────╯ ╭─ Commands ───────────────────────────────────────────────────────────────────╮ │ TestExample A minimal suite: `otto test TestExample` (auto-registered by │ │ its Test* name). │ ╰──────────────────────────────────────────────────────────────────────────────╯

Defining a test suite¶

Create a test_*.py file in one of your repo’s tests directories:

from typing import Annotated

import pytest
import typer

from otto import options
from otto.suite import OttoSuite


@options
class _Options:
    firmware: Annotated[str, typer.Option(
        help="Firmware version to validate against.",
    )] = "latest"

    check_interfaces: Annotated[bool, typer.Option(
        help="When True, verify all expected interfaces are up.",
    )] = True


class TestDevice(OttoSuite[_Options]):
    """Validate device configuration and connectivity."""

    Options = _Options

    async def test_device_reachable(self, suite_options: _Options) -> None:
        """Verify the device responds to basic connectivity checks."""
        self.logger.info(f"firmware={suite_options.firmware!r}")
        assert True

    @pytest.mark.timeout(30)
    async def test_firmware_version(self, suite_options: _Options) -> None:
        """Verify the running firmware matches the expected version."""
        assert True

    @pytest.mark.retry(2)
    async def test_management_plane(self) -> None:
        """Verify management-plane access (retried up to 2 times)."""
        assert True

    @pytest.mark.integration
    async def test_interface_state(self, suite_options: _Options) -> None:
        """Verify all expected interfaces are up (requires live device)."""
        if not suite_options.check_interfaces:
            pytest.skip("Interface check disabled via --no-check-interfaces")
        assert True

    @pytest.mark.parametrize("interface", ["eth0", "eth1", "mgmt0"])
    async def test_interface_up(self, interface: str) -> None:
        """Parametrized -- runs once per interface name."""
        assert True

Suite registration¶

OttoSuite.__init_subclass__ auto-registers any subclass whose name starts with Test (matching pytest’s own python_classes = Test* collection rule). Registration:

Reads the inner Options dataclass
Converts each field into a Typer CLI parameter
Creates a runner function with the matching signature
Adds the suite as a subcommand of otto test

This all happens at import time when otto scans your tests directories. Auto-registration is one seam among many; see Extension points for the registry machinery behind this and every other way otto can be extended.

Options classes¶

A suite’s inner Options class is expanded into otto test <Suite> flags and handed to each test method as suite_options — the test-suite stage of otto’s options lifecycle. See Options classes for how to define, validate, and share an options class (including inheriting a repo-wide base across suites).

Running suites¶

otto --lab my_lab test TestDevice
otto --lab my_lab test TestDevice --firmware 2.1
otto --lab my_lab test TestDevice --no-check-interfaces
otto test --list-suites                     # list suites with run syntax
otto test --list-markers                    # list markers available to --markers
otto test --list-tests                      # list every registered test
otto test --list-tests --markers slow TestDevice   # narrow by marker and/or suite

otto test <SuiteName> gets registry-backed completion and --list-suites for free, like every other registry — these candidates are the demo repo’s registered suites, resolved by the real completion machinery:

otto test <TAB><TAB> TestExample $ otto test

Suites can also run as a plain library call, with no CLI/Typer involved — see Running suites from Python in the Python library guide.

Running without a suite name¶

otto test doesn’t require a suite subcommand. Passing --tests and/or -m/--markers alone selects tests by exact name and/or marker expression across every suite and repo that has a match, including plain pytest test_* functions (not just OttoSuite classes). Bare otto test with no suite name and neither flag just prints help.

otto test --tests test_login                    # every test named test_login, any suite
otto test --tests TestB::test_login,test_plain   # disambiguate + mix suite/plain names
otto test -m "not integration"                   # marker expression, no suite name
otto test --tests test_login -m slow             # narrow a name selection by marker too

--tests NAME[,NAME...] matches on exact function name: a bare name (e.g. test_login) matches that name in every suite/repo, across all parametrizations; TestClass::test_name restricts to one suite. Unknown names raise an error with did-you-mean suggestions rather than silently running nothing.
-m EXPRESSION alone (no --tests, no suite name) runs the marker selection the same way — one pytest session per repo that has a match.

Tab-completing `--tests`¶

--tests tab-completes test names, matched by base name — a bare test_login selects every test_login[...] parametrization, and TestClass::test_login disambiguates:

otto test --tests <TAB><TAB> TestExample::test_logs_message test_example_function test_logs_message $ otto test --tests

Candidates come from a static source scan plus, once warmed, real pytest collection — so dynamically generated tests are included too, and the first slower TAB is a one-time cost. See Instructions and suites — the execution pipeline for the two-layer mechanism behind it. For the exact, fully-expanded per-parametrization list, otto test --list-tests still prints every collected id.

Multi-repo selection runs write one JUnit file per repo (junit_<repo>.xml) instead of the single-suite junit.xml. An explicit --results PATH fans out the same way: PATH’s stem gets _<repo> appended for each participating repo (e.g. --results custom.xml becomes custom_repoA.xml, custom_repoB.xml, …), so multiple repos’ sessions never overwrite each other’s results.
Stability (--iterations/-i, --duration/-d, --threshold), --cov*, --monitor*, and --results all apply to selection runs the same as to a named suite.

Suite-specific options and selection runs¶

Suite-specific options (declared on a suite’s Options class) only exist as CLI flags on that suite’s own subcommand — otto test TestDevice --flag. Selection runs (--tests/-m with no suite name) span multiple suites at once, so there’s no single flag set to parse; each suite’s Options class is instead default-constructed once per suite. If a suite’s Options has a required field (no default), its tests fail during the selection run with a hint to re-run that suite directly:

suite 'TestDevice' has required options — run `otto test TestDevice ...` to pass them (...)

Suites whose options are all optional (have defaults) run fine under selection — they just get their defaults instead of CLI-provided values.

Parent command options¶

These options live on otto test itself and must appear before the suite name on the command line (when a suite name is given at all — see Running without a suite name above):

--markers / -m EXPRESSION

Pytest marker expression. Example: --markers "not integration" TestDevice. With no suite name, runs the marker selection in every repo that has a match instead.

--tests NAME[,NAME...]

Run specific tests by exact name, across all suites/repos — no suite subcommand needed. Comma-separated; TestClass::name disambiguates. Combine with --markers to narrow further. Unknown names raise an error with did-you-mean suggestions. Example: --tests test_login,TestB::test_plain

--iterations / -i N

Repeat each test N times within a single setup/teardown cycle. Default: 0 (disabled). Example: --iterations 50 TestDevice

--duration / -d SECONDS

Repeat tests for SECONDS seconds within a single setup/teardown cycle. Default: 0 (disabled). Example: --duration 300 TestDevice

When both --iterations and --duration are specified, testing stops when either limit is reached first.

--threshold FLOAT

Minimum per-test pass rate percentage required in stability mode (0-100). Default: 100 (all iterations must pass). Example: --iterations 50 --threshold 95 TestDevice

--results PATH

Write JUnit XML results to PATH. Default: auto-generated in the log directory.

--monitor

Collect host performance metrics for the entire test run. Samples every host (or those matched by --monitor-hosts) on a fixed interval and emits per-test start/end events automatically. At the end of the run a format:1 JSON snapshot of all metrics and events is written to <output_dir>/monitor.json. The file is loadable in the dashboard via otto monitor <path>.

--monitor-interval SECONDS

Sampling interval for --monitor (minimum 1, default 5).

--monitor-output PATH

Override the destination for the captured monitor data. Format inferred from the suffix: .json (default) writes a self-contained format:1 snapshot, .db writes a SQLite session archive — both loadable via otto monitor <path>. Implies --monitor.

--monitor-hosts REGEX

Restrict --monitor to host IDs matching this regex (re.search). Implies --monitor.

Markers¶

@pytest.mark.integration: Requires live Vagrant VMs. Skip with --markers "not integration".
@pytest.mark.timeout(seconds): Fail the test if it runs longer than seconds.
@pytest.mark.retry(n): Retry a failing test up to n times before reporting failure.
@pytest.mark.parametrize("arg", [values]): Run the test once per value. Each parameter combination gets its own artifact directory.

Suite features¶

Logging¶

Every suite has a self.logger attribute:

self.logger.info("Starting test")
self.logger.info("[bold]Rich markup[/bold]", extra={"markup": True})

Per-test artifact directories¶

Each test gets a self.testDir directory for artifacts. Parametrized tests get unique directory names based on their parameter values.

Non-fatal assertions¶

Use self.expect() to record a failure without stopping the test:

self.expect(result.status == Status.Success, "Command should succeed")
self.expect("expected" in result.value, "Output should contain 'expected'")

All failed expectations are reported at the end of the test.

Monitoring from a suite¶

Start the monitor during a test to collect metrics:

async def test_performance(self) -> None:
    await self.start_monitor(hosts=[host1, host2])
    # ... run workload ...
    await self.add_monitor_event("workload started", color="blue")
    # ... wait for results ...
    await self.stop_monitor()