Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Test with realistic data from H&M article dataset #5

Merged
merged 1 commit into from
Sep 26, 2024

Conversation

agourlay
Copy link
Member

This PR introduces a test using a realistic dataset from H&M. (see articles.csv)

The dataset weights around 30MB and is included in the repository for convenience.

The unit test loads the CSV and creates one payload per row extracting all field.

The CSV is loaded twice in the storage to generate load and to be able to validate data consistency.

Copy link
Contributor

@coszio coszio left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great to test on real data!

An additional idea would be to perform a whole storage update by shifting point offsets by one

@agourlay agourlay merged commit f6dbea2 into master Sep 26, 2024
2 checks passed
@agourlay agourlay deleted the test-realistic-data-set-HM branch September 26, 2024 15:25
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants