impr(workflow engine): Add caching for detector lookup by data source #106085
base: master
Conversation
saponifi3d
left a comment
let me know if it'd help to hop on a call / discuss any of these comments too! this whole area is pretty confusing, even for those with context. haha
Two review comments on src/sentry/workflow_engine/endpoints/validators/base/detector.py are now outdated and resolved.
saponifi3d
left a comment
Up to you on flattening the cache stuff and pulling it off of the detector model; I would recommend it, as it will allow us to DRY up the code around purging / updating the cache.
saponifi3d
left a comment
love the change with the transaction!
mind adding some tests to make sure all the invalidation stuff is working as expected?
One specific test case I'd like to include: change a detector's enabled attribute to false, save it, and check that this correctly invalidates the cache. (This is a code path that exists with Uptime; they have some code that will disable a detector based on billing information.)
Another test case would be on the detector endpoint: with a hot cache, trigger a successful PUT / POST and then ensure the cache is correctly invalidated.
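A minimal sketch of those two cases, assuming the existing fixtures in the detector details test class (self.detector, self.organization, an attached data source) and the bulk_fetch_enabled_detectors helper from this PR; the endpoint parameters and request payload are illustrative, not the PR's actual test code:

from sentry.workflow_engine.processors.data_source import bulk_fetch_enabled_detectors

def test_disable_invalidates_cache(self):
    data_source = self.detector.data_sources.first()
    assert data_source is not None

    # Warm the cache, then disable the detector the way the uptime billing
    # path does (via Detector.update) and expect a fresh, empty result.
    assert bulk_fetch_enabled_detectors(data_source.source_id, data_source.type)
    self.detector.update(enabled=False)
    assert bulk_fetch_enabled_detectors(data_source.source_id, data_source.type) == []

def test_put_invalidates_cache(self):
    data_source = self.detector.data_sources.first()
    assert data_source is not None

    # Warm the cache, rename the detector via a successful PUT, then confirm a
    # fresh fetch reflects the new name rather than a stale cached instance.
    bulk_fetch_enabled_detectors(data_source.source_id, data_source.type)
    self.get_success_response(
        self.organization.slug, self.detector.id, method="put", name="Renamed Detector"
    )
    fetched = bulk_fetch_enabled_detectors(data_source.source_id, data_source.type)
    assert [d.name for d in fetched] == ["Renamed Detector"]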
Cursor Bugbot has reviewed your changes and found 1 potential issue.
src/sentry/workflow_engine/processors/data_source.py

      This will also prefetch all the subsequent data models for evaluating the detector.
      """
-     return list(
-         Detector.objects.filter(
-             enabled=True, data_sources__source_id=source_id, data_sources__type=query_type
+     cache_key = get_detectors_by_data_source_cache_key(source_id, query_type)
+     detectors = cache.get(cache_key)
+     if detectors is None:
+         detectors = list(
+             Detector.objects.filter(
+                 data_sources__source_id=source_id,
+                 data_sources__type=query_type,
+                 enabled=True,
+             )
+             .select_related("workflow_condition_group")
+             .prefetch_related("workflow_condition_group__conditions")
+             .distinct()
+             .order_by("id")
The cache storage mechanism doesn't have a version or cache invalidation strategy for the Detector objects themselves. While individual cache entries are invalidated correctly by invalidate_detectors_by_data_source_cache(), there's a risk that stale Detector model instances (with outdated attributes like name, enabled, etc.) could be cached and returned. Consider implementing cache versioning or model change hooks (e.g., via Django signals on Detector.save()) to ensure consistency, especially since detector attributes can be modified directly via .update() calls without always triggering the cache invalidation callbacks.
Severity: MEDIUM
Location: src/sentry/workflow_engine/processors/data_source.py#L14-L30
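One way to add the model-change hook Bugbot suggests is a post_save receiver; this is a sketch only, assuming the invalidate_detectors_by_data_source_cache helper and module paths from this PR, and note that plain queryset .update() calls bypass post_save, so it still would not cover that path on its own:

from django.db.models.signals import post_save
from django.dispatch import receiver

from sentry.workflow_engine.models import Detector
from sentry.workflow_engine.processors.data_source import (
    invalidate_detectors_by_data_source_cache,
)

@receiver(post_save, sender=Detector)
def invalidate_detector_cache_on_save(sender, instance, **kwargs):
    # Purge every cache entry keyed on this detector's data sources so the next
    # bulk_fetch_enabled_detectors call re-reads from the database.
    for source_id, source_type in instance.data_sources.values_list("source_id", "type"):
        invalidate_detectors_by_data_source_cache(source_id, source_type)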
src/sentry/uptime/subscriptions/subscriptions.py

  def delete_uptime_detector(detector: Detector):
      uptime_subscription = get_uptime_subscription(detector)

      # Capture data source info before any state changes
      data_sources = list(detector.data_sources.values_list("source_id", "type"))

      remove_uptime_seat(detector)
      detector.update(status=ObjectStatus.PENDING_DELETION)
      RegionScheduledDeletion.schedule(detector, days=0)
      delete_uptime_subscription(uptime_subscription)

      for source_id, source_type in data_sources:
          invalidate_detectors_by_data_source_cache(source_id, source_type)
In delete_uptime_detector() at line 585-598, the cache is invalidated synchronously (not via transaction.on_commit()). This differs from all other functions in this file that use transaction.on_commit(). If the subsequent delete_uptime_subscription() or RegionScheduledDeletion.schedule() calls modify database state, the synchronous invalidation could occur before the transaction commits, creating a brief window where the cache is cleared but the detector still exists in the database. For consistency and safety, this should use transaction.on_commit() like the other functions.
Severity: HIGH
Location: src/sentry/uptime/subscriptions/subscriptions.py#L585-L598
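If that suggestion is taken, the tail of delete_uptime_detector() could defer the loop until commit with transaction.on_commit; a sketch using the names from the fragment above (the using= routing argument is an assumption):

from django.db import router, transaction

def delete_uptime_detector(detector: Detector):
    uptime_subscription = get_uptime_subscription(detector)

    # Capture data source info before any state changes.
    data_sources = list(detector.data_sources.values_list("source_id", "type"))

    remove_uptime_seat(detector)
    detector.update(status=ObjectStatus.PENDING_DELETION)
    RegionScheduledDeletion.schedule(detector, days=0)
    delete_uptime_subscription(uptime_subscription)

    def invalidate_cache():
        for source_id, source_type in data_sources:
            invalidate_detectors_by_data_source_cache(source_id, source_type)

    # Runs only after the surrounding transaction (if any) commits; with no
    # transaction open, Django executes the callback immediately.
    transaction.on_commit(invalidate_cache, using=router.db_for_write(Detector))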
src/sentry/workflow_engine/endpoints/validators/base/detector.py

          instance.save()

          self._invalidate_cache_by_detector(instance)
The _invalidate_cache_by_detector() method (lines 145-149) is called in both update() (line 214) and delete() (line 236). However, in update() it's called INSIDE the atomic transaction block, while in other locations like uptime subscriptions it's called via transaction.on_commit(). This inconsistency could cause cache invalidation to occur before the transaction commits. For consistency with other parts of the codebase and to avoid race conditions, consider wrapping the _invalidate_cache_by_detector() call in update() with a transaction.on_commit() callback instead of calling it synchronously.
Severity: HIGH
Location: src/sentry/workflow_engine/endpoints/validators/base/detector.py#L145-L214
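For the update() path the same fix would mean registering the invalidation from inside the atomic block; Django only runs on_commit callbacks after the outermost transaction commits, so registering inside atomic is safe. A sketch assuming the atomic block and the _invalidate_cache_by_detector helper shown above:

from django.db import router, transaction

def update(self, instance, validated_data):
    with transaction.atomic(router.db_for_write(Detector)):
        ...  # apply validated_data to the detector, condition group, etc.
        instance.save()

        # Deferred until the outermost transaction commits, instead of the
        # current synchronous call inside the atomic block.
        transaction.on_commit(
            lambda: self._invalidate_cache_by_detector(instance),
            using=router.db_for_write(Detector),
        )
    return instance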
src/sentry/receivers/rule_snooze.py

      detector = alert_rule_detector.detector
      detector.update(enabled=is_enabled)

      # Invalidate cache after transaction commits (signal may be called within a transaction)
      data_sources = list(detector.data_sources.values_list("source_id", "type"))

      def invalidate_cache():
          for source_id, source_type in data_sources:
In _update_workflow_engine_models() at line 23-30, the code defines a nested function invalidate_cache() inside a closure and then schedules it via transaction.on_commit(). This is correct, but ensure the signal handler context doesn't already have an active transaction that might not commit as expected. Django signals can be tricky with transactions. Document or verify that this signal is always called within a request-response cycle where transaction behavior is predictable.
Severity: MEDIUM
Location: src/sentry/receivers/rule_snooze.py#L23-L30
tests/sentry/workflow_engine/endpoints/test_organization_detector_details.py

  from sentry.workflow_engine.processors.data_source import bulk_fetch_enabled_detectors

          data_source = self.detector.data_sources.first()
          assert data_source is not None

          # Warm the cache
          cached_detectors = bulk_fetch_enabled_detectors(data_source.source_id, data_source.type)
          assert len(cached_detectors) == 1
          assert cached_detectors[0].id == self.detector.id

          valid_data = {
              "id": self.detector.id,
              "projectId": self.project.id,
              "name": "Updated Detector Name",
              "type": MetricIssue.slug,
              "dateCreated": self.detector.date_added,
              "dateUpdated": timezone.now(),
              "conditionGroup": {
                  "id": self.data_condition_group.id,
                  "organizationId": self.organization.id,
                  "logicType": self.data_condition_group.logic_type,
                  "conditions": [
                      {
                          "id": self.condition.id,
                          "comparison": 100,
                          "type": Condition.GREATER,
                          "conditionResult": DetectorPriorityLevel.HIGH,
                          "conditionGroupId": self.data_condition_group.id,
                      },
                      {
                          "id": self.resolve_condition.id,
                          "comparison": 100,
                          "type": Condition.LESS_OR_EQUAL,
                          "conditionResult": DetectorPriorityLevel.OK,
                          "conditionGroupId": self.data_condition_group.id,
                      },
                  ],
              },
              "config": self.detector.config,
          }
In test_put_invalidates_cache() at line 1071-1110, the test verifies that the cache is invalidated after a PUT request. However, it only tests one data source. If a detector has multiple data sources (allowed by the system depending on configuration), ensure the cache is invalidated for ALL of them. Consider adding a test case with multiple data sources to verify comprehensive coverage.
Severity: MEDIUM
Location: tests/sentry/workflow_engine/endpoints/test_organization_detector_details.py#L1071-L1110
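A possible follow-up test along those lines; a sketch that assumes the workflow_engine test fixtures create_data_source / create_data_source_detector exist with these signatures and that the PUT payload can be reduced to a rename:

def test_put_invalidates_cache_for_all_data_sources(self):
    # Attach a second data source to the detector (fixture helpers assumed).
    second = self.create_data_source(organization=self.organization)
    self.create_data_source_detector(data_source=second, detector=self.detector)

    # Warm a cache entry per data source.
    for ds in self.detector.data_sources.all():
        assert bulk_fetch_enabled_detectors(ds.source_id, ds.type)

    self.get_success_response(
        self.organization.slug, self.detector.id, method="put", name="Renamed Detector"
    )

    # Every entry should have been purged, so fresh fetches see the new name.
    for ds in self.detector.data_sources.all():
        fetched = bulk_fetch_enabled_detectors(ds.source_id, ds.type)
        assert [d.name for d in fetched] == ["Renamed Detector"]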
Okay, basically we're caching the bulk query and adapting the single-detector query in the subscription processor to work with that.
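For readers without the full diff, the caching layer described here has roughly the following shape; this is a sketch reconstructed from the fragments above, with the key format, TTL, and cache.set call being assumptions rather than the PR's exact code:

from django.core.cache import cache

from sentry.workflow_engine.models import Detector

DETECTORS_BY_DATA_SOURCE_TTL = 300  # assumed TTL, in seconds

def get_detectors_by_data_source_cache_key(source_id, query_type):
    # Key format is illustrative; the PR defines the real one.
    return f"workflow-engine:detectors-by-data-source:{query_type}:{source_id}"

def invalidate_detectors_by_data_source_cache(source_id, query_type):
    cache.delete(get_detectors_by_data_source_cache_key(source_id, query_type))

def bulk_fetch_enabled_detectors(source_id, query_type):
    # Cache the bulk query; the subscription processor's single-detector lookup
    # reads through this same entry instead of issuing its own query.
    cache_key = get_detectors_by_data_source_cache_key(source_id, query_type)
    detectors = cache.get(cache_key)
    if detectors is None:
        detectors = list(
            Detector.objects.filter(
                data_sources__source_id=source_id,
                data_sources__type=query_type,
                enabled=True,
            )
            .select_related("workflow_condition_group")
            .prefetch_related("workflow_condition_group__conditions")
            .distinct()
            .order_by("id")
        )
        cache.set(cache_key, detectors, timeout=DETECTORS_BY_DATA_SOURCE_TTL)
    return detectors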