Baseline Matching and Diagnostics Assessment
Purpose
The Baseline Matching and Diagnostics Assessment tool applies the Spatially Explicit Matched Dynamic Baseline (SEMDB) approach to prepare a defensible baseline package ahead of the Carbon Verification Audit. It screens donor candidates, validates matching quality, and emits baseline trajectory and uncertainty artifacts for audit preparation.
Typical Questions This Tool Helps Answer
- Is the donor pool sufficiently comparable to project units before verification packaging?
- Do matching diagnostics pass configured SMD and trend-consistency thresholds?
- What baseline trajectory and uncertainty outputs should be carried into downstream carbon audit reporting?
Background
The Baseline Matching and Diagnostics Assessment tool is an upstream baseline-defensibility workflow, not a replacement for downstream verification packaging. It helps teams reduce audit risk by enforcing donor eligibility and comparability diagnostics before claim interpretation.
This workflow is most useful when regulator or verifier scrutiny requires explicit evidence that baseline assumptions are stable and reproducible.
Inputs
| Parameter | Type | Required | Description |
|---|---|---|---|
| project_boundary | Vector | Yes | Project boundary polygons for treated units. |
| ecoregion_layer | Vector | Yes | Ecoregion scope for donor eligibility filtering. |
| year0_landcover | Raster | Yes | Baseline land-cover eligibility layer. |
| pre_period_ndvi_stack | Raster (multiband) | Yes | Multi-year pre-period NDVI stack. |
| elevation | Raster | Yes | Elevation covariate used in matching. |
| precipitation | Raster | Yes | Precipitation covariate used in matching. |
| soil_organic_carbon | Raster | Yes | SOC covariate used in matching. |
| soil_texture_class | Raster | Yes | Soil texture class constraint covariate. |
| roads | Vector | Yes | Accessibility covariate source. |
| tenure_status | Vector | No | Optional tenure/legal filtering source. |
Parameters
- `donor_pool_exclusion_distance_km` (optional): leakage-control exclusion distance; default `5.0`.
- `matching_mode` (optional): `mahalanobis` or `propensity_score`; default `mahalanobis`.
- `neighbors` (optional): nearest-neighbor count; default `1`.
- `with_replacement` (optional): donor reuse toggle; default `true`.
- `calliper_elevation_std`, `calliper_precipitation_std`, `calliper_soc_std` (optional): calliper constraints.
- `exact_match_soil_texture_class` (optional): enforce exact class matching; default `true`.
- `random_seed` (optional): deterministic seed for reproducibility; default `0`.
- `smd_threshold` (optional): maximum accepted SMD; default `0.1`.
- `parallel_trend_tolerance` (optional): trend-slope tolerance; default `0.01`.
- `pre_period_min_years` (optional): minimum pre-period depth; default `5`.
- `output_prefix` (required): output basename for all artifacts.
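To make the calliper and exact-match parameters more concrete, the sketch below filters a donor list for a single treated unit. It is illustrative only: the helper name `eligible_donors`, the dictionary layout, and the choice to scale each calliper by the donor-pool standard deviation are assumptions, not the tool's documented behavior.

```python
from statistics import stdev

def eligible_donors(treated, donors, callipers_std, exact_keys=("soil_texture_class",)):
    """Return indices of donors that satisfy exact-match and calliper rules.

    treated: dict of covariate values for one treated unit.
    donors: list of dicts with the same keys.
    callipers_std: {covariate: max |difference| in donor-pool standard deviations}.
    """
    # Assumption: calliper widths are scaled by the spread of the donor pool.
    spread = {k: stdev([d[k] for d in donors]) for k in callipers_std}
    keep = []
    for i, d in enumerate(donors):
        # Exact-match covariates must be identical (e.g. soil texture class).
        if any(d[k] != treated[k] for k in exact_keys):
            continue
        # Every calipered covariate must fall inside its allowed window.
        if all(abs(d[k] - treated[k]) <= c * spread[k] for k, c in callipers_std.items()):
            keep.append(i)
    return keep

donors = [
    {"elevation": 100.0, "soil_texture_class": 3},
    {"elevation": 200.0, "soil_texture_class": 3},
    {"elevation": 300.0, "soil_texture_class": 3},
    {"elevation": 110.0, "soil_texture_class": 4},
]
treated = {"elevation": 120.0, "soil_texture_class": 3}
print(eligible_donors(treated, donors, {"elevation": 0.5}))  # [0]
```

Tightening a calliper shrinks the eligible set, which is why overly narrow callipers can contribute to a sparse donor pool.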
Outputs
| Artifact | Runtime Output Key | Type | Description |
|---|---|---|---|
| Donor candidate mask | donor_candidate_mask | Raster | Donor eligibility output after filtering. |
| Matching assignments preview | matching_assignments_preview | CSV | Treated-control assignment preview. |
| Matching preview summary | matching_preview | JSON | Matching configuration and preview diagnostics. |
| SMD diagnostics | smd_diagnostics | CSV | Covariate balance diagnostics and threshold checks. |
| Parallel trends diagnostics | parallel_trends | CSV | Pre-period trend consistency diagnostics. |
| Baseline trajectory | baseline_trajectory | CSV | Baseline trajectory output for downstream workflows. |
| Baseline uncertainty | baseline_uncertainty | CSV | Baseline uncertainty envelope output. |
| Baseline trajectory summary | baseline_trajectory_summary | JSON | Machine-readable trajectory summary. |
| Baseline spatial preview | baseline_spatial_preview | Raster | Spatial preview of baseline signal behavior. |
| Preflight report | preflight | JSON | Input and rule-check readiness output. |
| Workflow summary contract | summary | JSON | Main SEMDB output contract for handoff. |
QA and Acceptance Criteria
Minimum acceptance before downstream Carbon Verification Audit:
- Required inputs validated and aligned.
- Donor pool remains operationally adequate after filters.
- SMD diagnostics pass agreed threshold policy.
- Parallel trend diagnostics pass agreed tolerance policy.
- Baseline trajectory/uncertainty artifacts are complete.
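The SMD and trend criteria above can be sanity-checked by hand on small samples. The following is a minimal sketch, assuming the common SMD definition (absolute mean difference divided by the square root of the averaged group variances) and assuming that trend consistency is judged by the gap between least-squares slopes of the treated and control pre-period series. The source does not state the tool's exact formulas, and the function names here are hypothetical.

```python
from statistics import mean, variance

def standardized_mean_difference(treated, control):
    """Absolute SMD using the averaged-variance pooled standard deviation."""
    pooled_sd = ((variance(treated) + variance(control)) / 2) ** 0.5
    if pooled_sd == 0:
        return 0.0 if mean(treated) == mean(control) else float("inf")
    return abs(mean(treated) - mean(control)) / pooled_sd

def ols_slope(values):
    """Least-squares slope of values against their index (one step per year)."""
    n = len(values)
    x_bar = (n - 1) / 2
    y_bar = mean(values)
    num = sum((i - x_bar) * (v - y_bar) for i, v in enumerate(values))
    den = sum((i - x_bar) ** 2 for i in range(n))
    return num / den

def check_diagnostics(treated_cov, control_cov, treated_ndvi, control_ndvi,
                      smd_threshold=0.1, parallel_trend_tolerance=0.01):
    """Compare one covariate and one NDVI series against the threshold policy."""
    smd = standardized_mean_difference(treated_cov, control_cov)
    slope_gap = abs(ols_slope(treated_ndvi) - ols_slope(control_ndvi))
    return {
        "smd": smd,
        "smd_pass": smd <= smd_threshold,
        "slope_gap": slope_gap,
        "trend_pass": slope_gap <= parallel_trend_tolerance,
    }
```

The default arguments mirror the documented defaults for `smd_threshold` and `parallel_trend_tolerance`; a project with a different agreed policy would pass its own values.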
Troubleshooting
- Sparse donor pool: review exclusion distance and eligibility constraints.
- SMD failures: tighten callipers and inspect covariate outliers.
- Trend failures: increase pre-period depth and review NDVI stack quality.
- Reproducibility mismatch: lock `random_seed` and verify unchanged inputs.
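One way to verify that inputs are unchanged between two runs is to fingerprint each input file and compare the digests. The sketch below uses only the Python standard library; the helper names are hypothetical and are not part of the tool.

```python
import hashlib
from pathlib import Path

def fingerprint_inputs(paths, chunk_size=1 << 20):
    """Return {path: SHA-256 hex digest} for each input file."""
    digests = {}
    for p in paths:
        h = hashlib.sha256()
        with Path(p).open("rb") as fh:
            # Read in chunks so large rasters do not need to fit in memory.
            for chunk in iter(lambda: fh.read(chunk_size), b""):
                h.update(chunk)
        digests[str(p)] = h.hexdigest()
    return digests

def changed_inputs(before, after):
    """List paths whose digest differs or that are missing from either run."""
    return sorted(k for k in set(before) | set(after) if before.get(k) != after.get(k))
```

Storing the digest map next to the run outputs lets a later rerun with the same `random_seed` be compared against identical inputs before any differences in results are investigated.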
For additional operator support, share these artifacts with Whitebox support:
- `summary`
- `preflight`
- `smd_diagnostics`
- `parallel_trends`
Example
import whitebox_workflows as wbw

# Create the environment with the Pro tier enabled.
env = wbw.WbEnvironment(include_pro=True, tier="pro")

# Run the workflow with the required inputs; optional parameters keep their defaults.
result = env.baseline_matching_and_diagnostics_assessment(
    project_boundary="data/project_boundary.gpkg",
    ecoregion_layer="data/ecoregions.gpkg",
    year0_landcover="data/landcover_y0.tif",
    pre_period_ndvi_stack="data/ndvi_pre_stack.tif",
    elevation="data/elevation.tif",
    precipitation="data/precip.tif",
    soil_organic_carbon="data/soc.tif",
    soil_texture_class="data/soil_texture.tif",
    roads="data/roads.gpkg",
    output_prefix="output/semdb_baseline",
)
print(result)
References
- Tool implementation: `wbtools_pro/src/tools/workflow_products/baseline_matching_and_diagnostics_assessment.rs`
Advanced Operational Guidance
- Use fixed-seed reruns for verifier-facing reproducibility.
- Archive `summary`, `preflight`, `smd_diagnostics`, and `parallel_trends` together.
- Keep threshold profiles stable within a reporting cycle.
Positioning vs Carbon Verification Audit
- Baseline Matching and Diagnostics Assessment: baseline construction and comparability diagnostics.
- Carbon Verification Audit: downstream verification packaging and audit-ready reporting.
Use SEMDB first, then Carbon Verification Audit once baseline diagnostics are acceptable.
When To Use This Workflow
Use Baseline Matching and Diagnostics Assessment when baseline defensibility is under scrutiny and you need explicit comparability diagnostics before audit-stage reporting.
Results Delivery Checklist
- Input provenance and temporal scope documented.
- Threshold policy values recorded (`smd_threshold`, `parallel_trend_tolerance`).
- SMD and trend outputs reviewed and accepted.
- Baseline trajectory plus uncertainty outputs attached to handoff package.
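Recording the threshold policy can be as simple as writing the values used for the run to a small JSON document that travels with the handoff package. The sketch below is one possible approach, not a format the tool defines: the helper `threshold_policy_record` is hypothetical, and only the default values are taken from the Parameters section.

```python
import json

# Defaults as documented in the Parameters section.
DOCUMENTED_DEFAULTS = {
    "smd_threshold": 0.1,
    "parallel_trend_tolerance": 0.01,
    "random_seed": 0,
    "pre_period_min_years": 5,
}

def threshold_policy_record(overrides=None):
    """Merge overrides onto the documented defaults and return a JSON string."""
    overrides = overrides or {}
    unknown = set(overrides) - set(DOCUMENTED_DEFAULTS)
    if unknown:
        # Reject misspelled keys so the record cannot silently drift from the run.
        raise KeyError(f"unknown policy keys: {sorted(unknown)}")
    return json.dumps({**DOCUMENTED_DEFAULTS, **overrides}, indent=2, sort_keys=True)

print(threshold_policy_record({"smd_threshold": 0.05}))
```

Keeping this record alongside `summary` and `preflight` makes it easier to show that thresholds stayed stable within a reporting cycle.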
Common Questions
Q: Can we skip SEMDB and go directly to Carbon Verification Audit? A: You can, but you lose explicit baseline-defensibility diagnostics that many reviewers request.
Q: What is the most common SEMDB review blocker? A: Imbalance or trend diagnostics that fail policy thresholds.
Q: Does SEMDB issue certified credits? A: No. It is an upstream baseline diagnostics workflow that supports defensible verification preparation.