Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix(batch-exports): Only read person properties for S3 #18095

Merged
merged 5 commits into from
Oct 19, 2023

Conversation

tomasfarias
Copy link
Contributor

@tomasfarias tomasfarias commented Oct 19, 2023

Problem

Only S3 exports deal with person_properties; all other destinations do not include this field in their schemas. It can be quite the heavy field, so let's not read it to avoid heavy queries. It may even be droppable from S3, but starting with the safe ones that do not require it.

Changes

Only read person_properties if use_s3_fields is set to True, which is only done in s3_batch_export.py. It's a bit rough solution, ideally we would have all fields be configurable and then each destination sets it's own. But this is just to fix an issue, and we can develop it further later.

👉 Stay up-to-date with PostHog coding conventions for a smoother review.

How did you test this code?

Nothing should change for any destination.

@tomasfarias tomasfarias merged commit db9238d into master Oct 19, 2023
@tomasfarias tomasfarias deleted the fix/do-not-read-person-properties branch October 19, 2023 16:39
daibhin pushed a commit that referenced this pull request Oct 23, 2023
Co-authored-by: github-actions <41898282+github-actions[bot]@users.noreply.github.com>
Justicea83 pushed a commit to Justicea83/posthog that referenced this pull request Oct 25, 2023
Co-authored-by: github-actions <41898282+github-actions[bot]@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant