Skip to content

benchmarks.SpacemiTX60.baseline_dual_model_async.plot_results

Source: benchmarks/SpacemiTX60/baseline_dual_model_async/plot_results.py

benchmarks.SpacemiTX60.baseline_dual_model_async.plot_results

Plot sweep results produced by run_remote_sweep.py.

  • Reads the CSV
  • Filters successful runs (exit_code==0 and csv_parse_ok==1)
  • Aggregates repeats (mean/std) per sweep value
  • Produces PNG plots and aggregated CSV tables per sweep_param
  • Summarizes min_all/max_all and renders 2D grid heatmaps (if present)
  • NEW: Automatically plots ALL aggregated metrics (anything ending in *_mean)
Requires

pip install matplotlib

Usage

python3 plot_sweep_results.py --in_csv results/sweep.csv --out_dir results/plots

Optional

--no_plot_all_metrics Disable auto-plotting of every aggregated metric --verbose Print each generated PNG/CSV path

auto_plot_all_metrics(sp, xs, table_rows, out_dir, xlabel, baseline_vals, baseline_row, already_plotted, verbose)

For this sweep, plot every aggregated metric series that appears as _mean (with _std if present). Returns number of PNGs generated.

baseline_for_metric(mean_key, baseline_vals, baseline_row)

Try to find a sensible baseline horizontal line for an aggregated metric. - If we stored it in baseline_vals, use that. - Else, if it's a raw metric like d_lat_p99_ms_mean -> look up d_lat_p99_ms in baseline row.

effective_seconds(row, fallback_duration=10.0)

Many totals are accumulated only after warmup. Use effective = duration_s - warmup_s (per row).