docs: add column_value_anomalies test documentation#2183
docs: add column_value_anomalies test documentation#2183devin-ai-integration[bot] wants to merge 3 commits intodocsfrom
Conversation
Co-Authored-By: Yosef Arbiv <yosef.arbiv@gmail.com>
🤖 Devin AI EngineerI'll be helping with this pull request! Here's what you should know: ✅ I will automatically:
Note: I can only respond to comments from users who have write access to this repository. ⚙️ Control Options:
|
|
Preview deployment for your docs. Learn more about Mintlify Previews.
💡 Tip: Enable Workflows to automatically generate PRs for you. |
|
Important Review skippedAuto reviews are disabled on base/target branches other than the default branch. Please check the settings in the CodeRabbit UI or the ⚙️ Run configurationConfiguration used: defaults Review profile: CHILL Plan: Pro Run ID: You can disable this status message by setting the Use the checkbox below for a quick retry:
✨ Finishing Touches🧪 Generate unit tests (beta)
Comment |
…config Co-Authored-By: Yosef Arbiv <yosef.arbiv@gmail.com>
Co-Authored-By: Yosef Arbiv <yosef.arbiv@gmail.com>
Summary
Adds documentation for a new
column_value_anomaliestest — a row-level anomaly detection test that operates directly on raw column values rather than on aggregate metrics (min, max, average) per time bucket.This is a docs-first PR ahead of implementation in the dbt-data-reliability package. The new page:
column_anomaliestest with a comparison tabletimestamp_column,where_expression,anomaly_sensitivity,anomaly_direction,detection_period,training_period,detection_delay)docs.jsonUpdates since last revision
time_bucketandseasonalityfrom the config block and examples. Since this test operates on raw individual values (no per-bucket aggregation), time bucketing is not applicable. The test usestraining_periodanddetection_perioddirectly to define the historical baseline and evaluation windows.training_period/detection_periodinstead oftime_bucket.Screenshots
Mintlify preview: https://elementary-devin-1775975833-column-value-anomalies-docs.mintlify.app/data-tests/anomaly-detection-tests/column-value-anomalies
Review & Testing Checklist for Human
time_bucketandseasonalityshould be excluded — these were intentionally removed since this test doesn't aggregate per bucket. Verify this aligns with the planned implementation.timestamp_columnshould be required — this doc says it's required, unlike other anomaly tests where it's "highly recommended." Confirm this is desired.ignore_small_changes,anomaly_exclude_metrics, anddimensionsthat other anomaly tests support. Verify whether these should be supported for this test type.<Note>block, and<pre>config block render correctly.Notes
dbt-data-reliabilityhas not been created yet — this PR documents the intended API surface.<Note>block.code-quality,create_pylon_issue) are pre-existing issues unrelated to this PR.Link to Devin session: https://app.devin.ai/sessions/97ebe636d57244fb82a7452e1521604e
Requested by: @arbiv