Support native advanced queries in LightningStore #318
Conversation
Pull Request Overview
This PR introduces native support for advanced queries in LightningStore, enabling filtering, sorting, and pagination across rollouts, attempts, spans, workers, and resources. The changes modernize the query interface while maintaining backward compatibility through deprecated parameter aliases.
Key changes:
- Introduced `PaginatedResult`, a type that behaves like a sequence while carrying pagination metadata (`limit`, `offset`, `total`)
- Added comprehensive filtering with `_must` groups and AND/OR logic combinators
- Extended all query methods with `sort_by`, `sort_order`, `limit`, and `offset` parameters
- Refactored adapters to accept `Sequence[Span]` instead of `List[Span]` for better type flexibility
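For orientation, the sketch below shows how a call against the reworked query surface might look. It is a hypothetical usage example: `query_rollouts`, `status_in`, `sort_by`, `sort_order`, `limit`, and `offset` are named in this PR and its review comments, but the concrete field names, status values, and the exact signature are assumptions.

```python
# Hypothetical usage sketch: field names ("start_time") and status values
# ("running", "queued") are assumptions, not taken from the PR.
result = await store.query_rollouts(
    status_in=["running", "queued"],  # replaces the deprecated `status`
    sort_by="start_time",             # assumed sortable field
    sort_order="desc",
    limit=20,
    offset=0,
)

# PaginatedResult behaves like a sequence while carrying pagination metadata.
for rollout in result:
    print(rollout)
print(result.limit, result.offset, result.total)
```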
Reviewed Changes
Copilot reviewed 27 out of 27 changed files in this pull request and generated 15 comments.
| File | Description |
|---|---|
| agentlightning/types/core.py | Defines new PaginatedResult, FilterOptions, SortOptions, and FilterField types for query interface |
| agentlightning/store/base.py | Updates LightningStore interface with expanded query method signatures |
| agentlightning/store/collection_based.py | Implements filtering/sorting/pagination using collection primitives, adds deprecation warnings |
| agentlightning/store/collection/memory.py | Refactors filter matching logic to support _must groups, moves to module-level functions |
| agentlightning/store/collection/base.py | Moves type definitions to core.py for broader visibility |
| agentlightning/store/client_server.py | Updates server routes and client methods to handle new query parameters |
| agentlightning/store/threading.py | Pass-through wrapper updated for new signatures |
| agentlightning/store/memory.py | Adapts get_running_rollouts to convert set to list for within filter |
| agentlightning/adapter/ | Updates adapters to accept Sequence[Span] for broader compatibility |
| tests/store/ | Comprehensive test coverage for new filtering, sorting, pagination features |
| docs/ | Updates documentation to reflect new query capabilities |
| examples/ | Migrates from deprecated status/rollout_ids to status_in/rollout_id_in |
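To make the `examples/` row above concrete, here is a hedged before/after sketch of the parameter migration it describes; the call site, the `query_rollouts` method, and the argument values are illustrative assumptions rather than code taken from the examples.

```python
# Before: deprecated keyword arguments (still accepted, but emit a DeprecationWarning).
rollouts = await store.query_rollouts(status=["running"], rollout_ids=["r-1", "r-2"])

# After: the *_in spellings introduced by this PR.
rollouts = await store.query_rollouts(status_in=["running"], rollout_id_in=["r-1", "r-2"])
```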
agentlightning/types/core.py (Outdated)
```python
def __repr__(self) -> str:
    first_item_repr = repr(self.items[0]) if self.items else "empty"
    items_repr = f"[{first_item_repr}, ...]" if len(self.items) > 1 else first_item_repr
    slice_repr = f"{self.offset}:{self.offset + self.limit}" if self.limit >= 0 else f"{self.offset}:"
```
Copilot AI (Nov 17, 2025)
[nitpick] The __repr__ implementation relies on `self.limit >= 0` to detect the bounded case, which obscures the intent for the unlimited path. When limit is -1 (unlimited), the slice f"{self.offset}:" is displayed correctly, but the condition would read more clearly as self.limit == -1 instead of self.limit >= 0.
For example, the current code shows (1: of 5) when limit=-1 and offset=1, which correctly represents [1:], i.e. "from offset 1 to end"; the representation is technically correct but the condition could be clearer.
Suggested change:

```diff
-slice_repr = f"{self.offset}:{self.offset + self.limit}" if self.limit >= 0 else f"{self.offset}:"
+slice_repr = f"{self.offset}:" if self.limit == -1 else f"{self.offset}:{self.offset + self.limit}"
```
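For reference, a minimal, self-contained sketch of a `PaginatedResult`-like class with the suggested condition applied, so the `(1: of 5)` example above can be reproduced. The real class is more general and its exact `__repr__` template is not shown in this diff, so the class name and output format here are assumptions.

```python
from dataclasses import dataclass
from typing import Any, List


@dataclass
class PaginatedResultSketch:  # hypothetical stand-in for the real PaginatedResult
    items: List[Any]
    limit: int  # -1 means "no limit"
    offset: int
    total: int

    def __repr__(self) -> str:
        first_item_repr = repr(self.items[0]) if self.items else "empty"
        items_repr = f"[{first_item_repr}, ...]" if len(self.items) > 1 else first_item_repr
        # Treat -1 explicitly as the "unlimited" sentinel instead of relying on `limit >= 0`.
        slice_repr = f"{self.offset}:" if self.limit == -1 else f"{self.offset}:{self.offset + self.limit}"
        return f"PaginatedResult({items_repr}, {slice_repr} of {self.total})"


# PaginatedResultSketch(items=["b", "c", "d", "e"], limit=-1, offset=1, total=5)
# -> PaginatedResult(['b', ...], 1: of 5)
```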
```python
FilterOptions = Mapping[
    Union[str, Literal["_aggregate", "_must"]],
    Union[FilterField, Literal["and", "or"], Mapping[str, FilterField]],
]
"""A mapping of field name -> operator dict.

Each operator dict can contain:

- "exact": value for exact equality.
- "within": iterable of allowed values.
- "contains": substring to search for in string fields.

The filter can also have a special field called "_aggregate" that can be used to specify the logic
to combine the results of the filters:

- "and": all conditions must match. This is the default value if not specified.
- "or": at least one condition must match.

All conditions within a field and between different fields are
stored in a unified pool and combined using `_aggregate`.
"""
```
Copilot AI (Nov 17, 2025)
The FilterOptions type uses Union[FilterField, Literal["and", "or"], Mapping[str, FilterField]] for values, but this allows _aggregate to be mapped to a FilterField or Mapping[str, FilterField], which doesn't make semantic sense. The _aggregate key should only map to Literal["and", "or"], and _must should only map to Mapping[str, FilterField] or a sequence of such mappings.
Consider using a more precise type definition with separate TypedDict for the structure, or add validation to catch incorrect usage at runtime.
Suggested change (replace the `FilterOptions` alias and docstring above with a `TypedDict`):

```python
class FilterOptions(TypedDict, total=False):
    """A mapping of field name -> operator dict, with special keys for aggregation and must conditions.

    Each field name maps to a FilterField.

    Special keys:
    - "_aggregate": specifies the logic to combine the results of the filters ("and" or "or").
    - "_must": a mapping or sequence of mappings for required conditions.
    """

    _aggregate: Literal["and", "or"]
    _must: Union[Mapping[str, FilterField], Sequence[Mapping[str, FilterField]]]
    # Other fields map to FilterField.
    # Use NotRequired for optional fields (Python 3.11+); otherwise total=False covers it.
    # Example: field_name: FilterField
    # For Python <3.11, we can't use NotRequired, so total=False is used above.
```
Let's do this in the future.
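For readers following the `FilterOptions` discussion above, here is a hedged example of a filter mapping that fits the current alias and the documented operators. The field names are illustrative, and the reading of `_must` as a group of always-required conditions is inferred from the PR overview rather than stated in the snippet.

```python
# Illustrative only: "status", "worker_id", and "rollout_id" are example field names.
filters = {
    "_aggregate": "or",                           # at least one pooled condition must match
    "status": {"within": ["running", "failed"]},  # membership filter
    "worker_id": {"contains": "gpu"},             # substring filter
    # Inferred semantics: conditions under "_must" are required regardless of "_aggregate".
    "_must": {"rollout_id": {"exact": "r-42"}},
}
```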
/ci
🚀 CI Watcher for correlation id-3543099762-mi3fhx1d triggered by comment 3543099762
✅ All runs completed.
Pull Request Overview
Copilot reviewed 28 out of 28 changed files in this pull request and generated 4 comments.
```python
    resolved_rollout_ids = rollout_id_in
elif rollout_ids is not None:
    warnings.warn("rollout_ids is deprecated, use rollout_id_in instead", DeprecationWarning, stacklevel=3)
    resolved_rollout_ids = rollout_ids
```
Copilot AI (Nov 18, 2025)
The deprecation warning for rollout_ids is missing the stacklevel parameter. It should be stacklevel=3 to match the pattern used for the status parameter on line 409, ensuring the warning points to the caller's code rather than this implementation.
| filters["status"] = {"within": list(status_in)} | ||
| if worker_id_contains is not None: | ||
| filters["worker_id"] = {"contains": worker_id_contains} | ||
| filters["_aggregate"] = filter_logic |
Copilot AI (Nov 18, 2025)
The filter_logic is always added to filters, even when no actual filter conditions are present. This differs from query_rollouts (line 423-424) which only adds _aggregate when filter_logic == 'or'. This inconsistency could cause unexpected behavior if the collection implementation treats an empty filter dict with _aggregate differently than a truly empty filter dict.
| filters["_aggregate"] = filter_logic | |
| if filters: | |
| filters["_aggregate"] = filter_logic |
```python
return await self.collections.workers.query(
    filter=filters or None,
```
Copilot AI (Nov 18, 2025)
The expression filters or None will evaluate to None when filters is {'_aggregate': 'and'}, but this still represents a filter with aggregate logic. This could lead to incorrect behavior. Consider checking if not filters or filters == {'_aggregate': 'and'} or only setting _aggregate when there are actual filter conditions, similar to the pattern in query_rollouts.
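One possible shape of the fix being suggested here, mirroring the `query_rollouts` pattern the comment refers to. This is a sketch: the standalone helper, its name, and its parameters are hypothetical, assembled from the snippets above rather than taken from the PR.

```python
from typing import Any, Dict, Iterable, Literal, Optional


def build_worker_filters(
    status_in: Optional[Iterable[str]] = None,
    worker_id_contains: Optional[str] = None,
    filter_logic: Literal["and", "or"] = "and",
) -> Optional[Dict[str, Any]]:
    """Hypothetical helper: return None when no real conditions exist, and only
    attach "_aggregate" alongside actual conditions (and only for the non-default "or")."""
    filters: Dict[str, Any] = {}
    if status_in is not None:
        filters["status"] = {"within": list(status_in)}
    if worker_id_contains is not None:
        filters["worker_id"] = {"contains": worker_id_contains}
    if filters and filter_logic == "or":  # "and" is the default, so it can be omitted
        filters["_aggregate"] = filter_logic
    return filters or None


# build_worker_filters() -> None, never {"_aggregate": "and"}.
# build_worker_filters(worker_id_contains="gpu", filter_logic="or")
#   -> {"worker_id": {"contains": "gpu"}, "_aggregate": "or"}
```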
```python
logger.warning(f"Contradictory exact filters for field '{pk}': {exact_value} != {candidate}")
return ()
```
Copilot AI (Nov 18, 2025)
[nitpick] The warning for contradictory exact filters is only implemented in _iter_matching_items for primary key prefix optimization. Consider whether this validation should also be performed in the general _item_matches_filters function to catch contradictory filters on non-primary-key fields as well.
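If that generalization were pursued, it might look roughly like the sketch below. The real `_item_matches_filters` signature is not shown in this diff, so this standalone helper and the assumed shape of the `_must` payload are illustrative only.

```python
import logging
from typing import Any, Dict, Mapping, Optional

logger = logging.getLogger(__name__)


def find_contradictory_exact(filters: Mapping[str, Any]) -> Optional[str]:
    """Return the first field whose pooled "exact" constraints disagree, else None.

    Hypothetical helper sketching how the primary-key check could be generalized
    to any field before items are evaluated against the filters.
    """
    must = filters.get("_must") or {}
    # "_must" may be a single mapping or a sequence of mappings (per the review above).
    groups = [filters] + ([must] if isinstance(must, Mapping) else list(must))
    seen: Dict[str, Any] = {}
    for group in groups:
        for field, spec in group.items():
            if field.startswith("_") or not isinstance(spec, Mapping) or "exact" not in spec:
                continue
            if field in seen and seen[field] != spec["exact"]:
                logger.warning(
                    "Contradictory exact filters for field '%s': %r != %r",
                    field, seen[field], spec["exact"],
                )
                return field
            seen[field] = spec["exact"]
    return None
```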