fix: ODFV output projection in offline retrieval (#6099) by jyejare · Pull Request #6140 · feast-dev/feast

jyejare · 2026-03-23T08:28:10Z

Summary

Fixes #6099 - Ensures offline retrieval honors ODFV feature projection, matching online retrieval behavior.

Problem

When requesting a subset of features from an OnDemandFeatureView:

Online retrieval ✅ Returns only requested features
Offline retrieval ❌ Returns ALL ODFV output features (before this fix)

This caused schema mismatches between training and serving pipelines.

Solution

Modified RetrievalJob.to_arrow() in offline_store.py to:

Parse requested features from metadata.features
Build a mapping of ODFV name → requested feature names
Filter ODFV transformation output to only include requested columns

Example

Before this fix:

features = ["my_odfv:feature_a"]
offline_result = store.get_historical_features(features=features, ...)
# Columns: driver_id, event_timestamp, feature_a, feature_b, feature_c ❌

After this fix:

features = ["my_odfv:feature_a"]
offline_result = store.get_historical_features(features=features, ...)
# Columns: driver_id, event_timestamp, feature_a ✅

Changes

Modified: `sdk/python/feast/infra/offline_stores/offline_store.py`

Updated RetrievalJob.to_arrow() method (lines 140-184)
Added filtering logic for ODFV output projection
Maintains backward compatibility

Added: Test in `sdk/python/tests/integration/offline_store/test_universal_historical_retrieval.py`

test_odfv_projection() - Comprehensive test verifying:
- Single feature request returns only that feature
- Multiple feature request returns only requested features
- Unrequested features are NOT included
- Offline and online retrieval have consistent behavior
Parametrized for both full_feature_names=True and False

Testing

The new test test_odfv_projection verifies:

✅ Requesting 1 out of 3 ODFV features → returns only that 1 feature
✅ Requesting 2 out of 3 ODFV features → returns only those 2 features
✅ Unrequested features are NOT included in the result
✅ Offline and online retrieval return consistent schemas

Backward Compatibility

✅ Falls back to old behavior if metadata is unavailable
✅ No breaking changes to existing functionality
✅ Only affects ODFV feature projection

Impact

This fix ensures:

✅ Consistent behavior between online and offline retrieval
✅ No schema mismatches in ML pipelines
✅ More efficient - doesn't compute/return unnecessary features
✅ Matches user expectations - returns exactly what was requested

franciscojavierarceo · 2026-03-23T12:59:19Z

sdk/python/feast/infra/offline_stores/offline_store.py

+
+            if metadata and metadata.features:
+                for feature_ref in metadata.features:
+                    if ":" in feature_ref:


this is going to be brittle after my feature view version PR lands as feature references will now support @vN syntax.

This commit fixes issue feast-dev#6099 where offline retrieval (get_historical_features) was returning ALL OnDemandFeatureView output features, even when only a subset was requested, while online retrieval correctly returned only requested features. Changes: - Modified RetrievalJob.to_arrow() to filter ODFV outputs based on requested features from metadata, matching online retrieval behavior - Added test_odfv_projection to verify the fix and prevent regression Before this fix: - Online: features=['odfv:feature_a'] -> returns feature_a only ✓ - Offline: features=['odfv:feature_a'] -> returns feature_a, feature_b, feature_c ✗ After this fix: - Both online and offline return only the requested features ✓ This ensures schema consistency between training (offline) and serving (online) pipelines, preventing downstream issues in ML workflows. Fixes feast-dev#6099

- Fix empty list edge case: Use explicit dict key check instead of 'or' operator to avoid treating empty sets as falsy - Use sets instead of lists for requested features to prevent duplicates and improve lookup performance (O(1) instead of O(n))

Some RetrievalJob implementations don't implement the metadata property and raise NotImplementedError. Wrap metadata access in try-except to gracefully handle this case and maintain backward compatibility. Fixes CI test failure in test_retrieval_job_dataframe.py

jyejare requested review from a team as code owners March 23, 2026 08:28

jyejare requested review from dmartinol, ejscribner and shuchu and removed request for a team March 23, 2026 08:28

jyejare changed the title ~~Fix ODFV output projection in offline retrieval (#6099)~~ fix: ODFV output projection in offline retrieval (#6099) Mar 23, 2026

This comment was marked as resolved.

Sign in to view

jyejare marked this pull request as draft March 23, 2026 09:14

franciscojavierarceo reviewed Mar 23, 2026

View reviewed changes

Ambient Code Bot added 3 commits March 23, 2026 20:40

jyejare force-pushed the fix/odfv-output-projection-6099 branch from 6dc5107 to a6bbfda Compare March 23, 2026 15:10

jyejare marked this pull request as ready for review March 23, 2026 15:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: ODFV output projection in offline retrieval (#6099)#6140

fix: ODFV output projection in offline retrieval (#6099)#6140
jyejare wants to merge 3 commits intofeast-dev:masterfrom
jyejare:fix/odfv-output-projection-6099

jyejare commented Mar 23, 2026 •

edited by devin-ai-integration bot

Loading

Uh oh!

This comment was marked as resolved.

Uh oh!

franciscojavierarceo Mar 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

jyejare commented Mar 23, 2026 • edited by devin-ai-integration bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Problem

Solution

Example

Changes

Modified: sdk/python/feast/infra/offline_stores/offline_store.py

Added: Test in sdk/python/tests/integration/offline_store/test_universal_historical_retrieval.py

Testing

Backward Compatibility

Impact

Uh oh!

This comment was marked as resolved.

Uh oh!

franciscojavierarceo Mar 23, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jyejare commented Mar 23, 2026 •

edited by devin-ai-integration bot

Loading

Modified: `sdk/python/feast/infra/offline_stores/offline_store.py`

Added: Test in `sdk/python/tests/integration/offline_store/test_universal_historical_retrieval.py`