-
Notifications
You must be signed in to change notification settings - Fork 77
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use cached features in /rows #1573
Conversation
Codecov ReportPatch coverage:
Additional details and impacted files@@ Coverage Diff @@
## main #1573 +/- ##
==========================================
- Coverage 92.17% 90.44% -1.73%
==========================================
Files 77 193 +116
Lines 5443 12027 +6584
==========================================
+ Hits 5017 10878 +5861
- Misses 426 1149 +723
Flags with carried forward coverage won't be shown. Click here to find out more.
☔ View full report in Codecov by Sentry. |
parquet_files=response["content"]["parquet_files"], partial=response["content"]["partial"] | ||
parquet_files=response["content"]["parquet_files"], | ||
partial=response["content"]["partial"], | ||
features=None, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
what is it used for? better not to provide the field if it's always None
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
config_parquet_content
is instantiated for mypy typing I believe, but we don't return the features anyway in DatasetParquetResponse
so no need to specify it.
I'll add a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
maybe we could test the other branch of the if/else
@@ -30,6 +31,18 @@ class FileSystemError(Exception): | |||
pass | |||
|
|||
|
|||
def _clean_mongo_objects(obj: Any) -> Any: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
maybe move this to the root function in simple_cache.py that gets the content out of the MongoDB?
also: I think we have to update the openapi spec |
ArgoCD Diff for commit
|
Legend | Status |
---|---|
✅ | The app is synced in ArgoCD, and diffs you see are solely from this PR. |
The app is out-of-sync in ArgoCD, and the diffs you see include those changes plus any from this PR. | |
🛑 | There was an error generating the ArgoCD diffs due to changes in this PR. |
|
/rows needs the cached
features
since they're not always available in the parquet metadata.This was causing some
Image
columns to be seen as a struct of binary data, which are not supported in the viewer (shown as "null").Therefore I'm now passing the
features
fromconfig-parquet-and-info
toconfig-parquet
and then toconfig-parquet-metadata
. I kept it backward compatible in case a cached value doesn't have this field yet.Therefore there's no need for a mongo migration. We can just re-run all the
config-parquet
andconfig-parquet-metadata
jobs. I incremented their versions.close #1421