Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Custome data for bench marking #546

Open
praweenkumar opened this issue Oct 2, 2024 · 1 comment
Open

Custome data for bench marking #546

praweenkumar opened this issue Oct 2, 2024 · 1 comment

Comments

@praweenkumar
Copy link

Hey @erikbern and ANN Benchmarking team!,

Thanks for adding this clean implementation.

i have two query -

  1. not a easy way to run test with custom data, i have data in .pkl format.
  2. i dont see oracle 23AI added in engine.

Could you provide help with these 2 ?

@maumueller
Copy link
Collaborator

Hi @praweenkumar,

  1. You will have to write some custom code to transform your dataset to the hdf5 file format that we are using. See the examples in https://github.com/erikbern/ann-benchmarks/blob/main/ann_benchmarks/datasets.py for inspiration on how to do that.

  2. Feel free to contribute the dataset yourself as a PR. We are primarily interested in datasets that add some diversity to the existing ones, so a plot with the performance curves of some of our implementations would be appreciated to judge that.

Best,
Martin

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants