Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PARQUET-2261 Size Statistics #14000

Merged
merged 173 commits into from
Dec 6, 2023
Merged

Commits on Aug 29, 2023

  1. stub in new SizeStatistics

    etseidl committed Aug 29, 2023
    Configuration menu
    Copy the full SHA
    9589fc3 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    21ab768 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    76d54cf View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    7d2c41e View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    58e64a1 View commit details
    Browse the repository at this point in the history

Commits on Aug 30, 2023

  1. Configuration menu
    Copy the full SHA
    d7a6591 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    b52d226 View commit details
    Browse the repository at this point in the history

Commits on Aug 31, 2023

  1. Configuration menu
    Copy the full SHA
    3de71b0 View commit details
    Browse the repository at this point in the history
  2. checkpoint option 2

    etseidl committed Aug 31, 2023
    Configuration menu
    Copy the full SHA
    b81d2cc View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    a2ad28e View commit details
    Browse the repository at this point in the history

Commits on Sep 1, 2023

  1. Configuration menu
    Copy the full SHA
    dc3c637 View commit details
    Browse the repository at this point in the history
  2. fix bug in histogram calc

    etseidl committed Sep 1, 2023
    Configuration menu
    Copy the full SHA
    b61de00 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    da4aac1 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    44051f3 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    44e0f23 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    6a5b021 View commit details
    Browse the repository at this point in the history
  7. latest from rapidsai#197

    etseidl committed Sep 1, 2023
    Configuration menu
    Copy the full SHA
    51773e3 View commit details
    Browse the repository at this point in the history
  8. a few more tweaks

    etseidl committed Sep 1, 2023
    Configuration menu
    Copy the full SHA
    5fc47b9 View commit details
    Browse the repository at this point in the history
  9. add some comments

    etseidl committed Sep 1, 2023
    Configuration menu
    Copy the full SHA
    abba811 View commit details
    Browse the repository at this point in the history

Commits on Sep 4, 2023

  1. Configuration menu
    Copy the full SHA
    be7f7f1 View commit details
    Browse the repository at this point in the history

Commits on Sep 5, 2023

  1. move cutoffs to header

    etseidl committed Sep 5, 2023
    Configuration menu
    Copy the full SHA
    5843073 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    ee205cd View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    09c0c49 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    18d4158 View commit details
    Browse the repository at this point in the history

Commits on Sep 7, 2023

  1. Configuration menu
    Copy the full SHA
    fb3b6cc View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    94a8d64 View commit details
    Browse the repository at this point in the history
  3. latest from parquet-2261

    etseidl committed Sep 7, 2023
    Configuration menu
    Copy the full SHA
    0c23dca View commit details
    Browse the repository at this point in the history

Commits on Sep 8, 2023

  1. add TODO

    etseidl committed Sep 8, 2023
    Configuration menu
    Copy the full SHA
    06f0bef View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    74da962 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    00974b8 View commit details
    Browse the repository at this point in the history

Commits on Sep 9, 2023

  1. Configuration menu
    Copy the full SHA
    4140d30 View commit details
    Browse the repository at this point in the history

Commits on Sep 11, 2023

  1. Configuration menu
    Copy the full SHA
    db0779d View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    da2fb0e View commit details
    Browse the repository at this point in the history
  3. fix for string list

    etseidl committed Sep 11, 2023
    Configuration menu
    Copy the full SHA
    5376a83 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    cf531cf View commit details
    Browse the repository at this point in the history
  5. clean up enums

    etseidl committed Sep 11, 2023
    Configuration menu
    Copy the full SHA
    971931d View commit details
    Browse the repository at this point in the history
  6. refactor list functors

    etseidl committed Sep 11, 2023
    Configuration menu
    Copy the full SHA
    c4126c1 View commit details
    Browse the repository at this point in the history
  7. fix for int list functor

    etseidl committed Sep 11, 2023
    Configuration menu
    Copy the full SHA
    fae64c0 View commit details
    Browse the repository at this point in the history

Commits on Sep 12, 2023

  1. more list refactoring

    etseidl committed Sep 12, 2023
    Configuration menu
    Copy the full SHA
    6a50029 View commit details
    Browse the repository at this point in the history
  2. move functors to cpp file. they're only used by the read methods, so

    having them in the header serves no purpose.
    etseidl committed Sep 12, 2023
    Configuration menu
    Copy the full SHA
    9725f70 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    c3b6422 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    e84baf6 View commit details
    Browse the repository at this point in the history
  5. Revert "use CRTP to get rid of pure virtual"

    This reverts commit e84baf6.
    etseidl committed Sep 12, 2023
    Configuration menu
    Copy the full SHA
    fa50d62 View commit details
    Browse the repository at this point in the history
  6. fix get_uxx functions

    etseidl committed Sep 12, 2023
    Configuration menu
    Copy the full SHA
    28d71c0 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    a474bec View commit details
    Browse the repository at this point in the history
  8. rework implementation of the column_orders field in file meta data

    as a demonstration of `ParquetFieldOptional`
    etseidl committed Sep 12, 2023
    Configuration menu
    Copy the full SHA
    9220f35 View commit details
    Browse the repository at this point in the history
  9. clean up

    etseidl committed Sep 12, 2023
    Configuration menu
    Copy the full SHA
    9f2e898 View commit details
    Browse the repository at this point in the history
  10. more cleanup

    etseidl committed Sep 12, 2023
    Configuration menu
    Copy the full SHA
    a91a196 View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    fd9e3f8 View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    4a677f6 View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    e70b810 View commit details
    Browse the repository at this point in the history
  14. Configuration menu
    Copy the full SHA
    bf9e073 View commit details
    Browse the repository at this point in the history
  15. more consts

    etseidl committed Sep 12, 2023
    Configuration menu
    Copy the full SHA
    eb519f2 View commit details
    Browse the repository at this point in the history
  16. what can you apply apart from const...more const!

    and missed an 'if'
    etseidl committed Sep 12, 2023
    Configuration menu
    Copy the full SHA
    943be91 View commit details
    Browse the repository at this point in the history
  17. clean up header

    etseidl committed Sep 12, 2023
    Configuration menu
    Copy the full SHA
    1cfe326 View commit details
    Browse the repository at this point in the history

Commits on Sep 13, 2023

  1. Configuration menu
    Copy the full SHA
    9823f11 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    a0a758f View commit details
    Browse the repository at this point in the history

Commits on Sep 14, 2023

  1. add FIXME

    etseidl committed Sep 14, 2023
    Configuration menu
    Copy the full SHA
    ac7b665 View commit details
    Browse the repository at this point in the history
  2. Merge branch 'refactor_parquet_thrift' of github.com:etseidl/cudf int…

    …o refactor_parquet_thrift
    etseidl committed Sep 14, 2023
    Configuration menu
    Copy the full SHA
    76f16dd View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    4f49ef1 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    e757616 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    71e8eab View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    13200ff View commit details
    Browse the repository at this point in the history

Commits on Sep 15, 2023

  1. Configuration menu
    Copy the full SHA
    16df9e5 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    075e11e View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    8a4820b View commit details
    Browse the repository at this point in the history

Commits on Sep 18, 2023

  1. Configuration menu
    Copy the full SHA
    5700b21 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    45f3249 View commit details
    Browse the repository at this point in the history
  3. use thrust::optional rather than std::optional as some fields may

    wind up on device.
    also rename the field functors to use snake_case
    etseidl committed Sep 18, 2023
    Configuration menu
    Copy the full SHA
    0ade852 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    aac2f33 View commit details
    Browse the repository at this point in the history
  5. missed a use of std::optional

    etseidl committed Sep 18, 2023
    Configuration menu
    Copy the full SHA
    0ae2fc4 View commit details
    Browse the repository at this point in the history

Commits on Sep 19, 2023

  1. Configuration menu
    Copy the full SHA
    f6dcb52 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    10df4a0 View commit details
    Browse the repository at this point in the history
  3. more snake case

    etseidl committed Sep 19, 2023
    Configuration menu
    Copy the full SHA
    15a2831 View commit details
    Browse the repository at this point in the history

Commits on Sep 20, 2023

  1. Configuration menu
    Copy the full SHA
    aad3908 View commit details
    Browse the repository at this point in the history
  2. finish merge

    etseidl committed Sep 20, 2023
    Configuration menu
    Copy the full SHA
    ec81897 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    9fafa22 View commit details
    Browse the repository at this point in the history

Commits on Sep 21, 2023

  1. Configuration menu
    Copy the full SHA
    ae164b9 View commit details
    Browse the repository at this point in the history

Commits on Sep 22, 2023

  1. Configuration menu
    Copy the full SHA
    05a7fa2 View commit details
    Browse the repository at this point in the history

Commits on Sep 27, 2023

  1. Configuration menu
    Copy the full SHA
    3cc69eb View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    5ac163b View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    9680b3d View commit details
    Browse the repository at this point in the history

Commits on Sep 28, 2023

  1. Configuration menu
    Copy the full SHA
    6854683 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    ec1f138 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    b2a9488 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    348bf11 View commit details
    Browse the repository at this point in the history

Commits on Sep 29, 2023

  1. Configuration menu
    Copy the full SHA
    57b64ba View commit details
    Browse the repository at this point in the history

Commits on Oct 3, 2023

  1. Configuration menu
    Copy the full SHA
    3b405c8 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    6d53225 View commit details
    Browse the repository at this point in the history

Commits on Oct 4, 2023

  1. Configuration menu
    Copy the full SHA
    b53bca1 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    e96b225 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    069eb3d View commit details
    Browse the repository at this point in the history

Commits on Oct 6, 2023

  1. Configuration menu
    Copy the full SHA
    4c17cf0 View commit details
    Browse the repository at this point in the history

Commits on Oct 9, 2023

  1. Configuration menu
    Copy the full SHA
    fe6f1a8 View commit details
    Browse the repository at this point in the history

Commits on Oct 10, 2023

  1. Configuration menu
    Copy the full SHA
    5576e6d View commit details
    Browse the repository at this point in the history

Commits on Oct 11, 2023

  1. Configuration menu
    Copy the full SHA
    3334cec View commit details
    Browse the repository at this point in the history

Commits on Oct 16, 2023

  1. Configuration menu
    Copy the full SHA
    b111ff2 View commit details
    Browse the repository at this point in the history

Commits on Oct 18, 2023

  1. Configuration menu
    Copy the full SHA
    e4c9911 View commit details
    Browse the repository at this point in the history

Commits on Oct 20, 2023

  1. Configuration menu
    Copy the full SHA
    1b0b435 View commit details
    Browse the repository at this point in the history

Commits on Oct 25, 2023

  1. Configuration menu
    Copy the full SHA
    e311781 View commit details
    Browse the repository at this point in the history
  2. finish merge

    etseidl committed Oct 25, 2023
    Configuration menu
    Copy the full SHA
    07805bb View commit details
    Browse the repository at this point in the history

Commits on Oct 26, 2023

  1. fix alignment issue

    etseidl committed Oct 26, 2023
    Configuration menu
    Copy the full SHA
    df13cb7 View commit details
    Browse the repository at this point in the history

Commits on Oct 27, 2023

  1. Configuration menu
    Copy the full SHA
    2223be2 View commit details
    Browse the repository at this point in the history

Commits on Oct 28, 2023

  1. Configuration menu
    Copy the full SHA
    551554d View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    a028df8 View commit details
    Browse the repository at this point in the history
  3. more clean up

    etseidl committed Oct 28, 2023
    Configuration menu
    Copy the full SHA
    244c369 View commit details
    Browse the repository at this point in the history

Commits on Oct 30, 2023

  1. Configuration menu
    Copy the full SHA
    03ea33a View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    74f25d7 View commit details
    Browse the repository at this point in the history
  3. clean up comments

    etseidl committed Oct 30, 2023
    Configuration menu
    Copy the full SHA
    f8481c9 View commit details
    Browse the repository at this point in the history
  4. more cleanup

    etseidl committed Oct 30, 2023
    Configuration menu
    Copy the full SHA
    1bc77cd View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    1eeaee0 View commit details
    Browse the repository at this point in the history

Commits on Oct 31, 2023

  1. Configuration menu
    Copy the full SHA
    6e4b9b1 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    536642c View commit details
    Browse the repository at this point in the history

Commits on Nov 1, 2023

  1. Configuration menu
    Copy the full SHA
    59982b2 View commit details
    Browse the repository at this point in the history
  2. add some documentation

    etseidl committed Nov 1, 2023
    Configuration menu
    Copy the full SHA
    1393232 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    ddc5463 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    7826c0c View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    b6067ac View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    cf973f4 View commit details
    Browse the repository at this point in the history
  7. fix docstring

    etseidl committed Nov 1, 2023
    Configuration menu
    Copy the full SHA
    8746540 View commit details
    Browse the repository at this point in the history
  8. rename function

    etseidl committed Nov 1, 2023
    Configuration menu
    Copy the full SHA
    43c980f View commit details
    Browse the repository at this point in the history

Commits on Nov 2, 2023

  1. Configuration menu
    Copy the full SHA
    72b9a57 View commit details
    Browse the repository at this point in the history
  2. add some more stats checks

    etseidl committed Nov 2, 2023
    Configuration menu
    Copy the full SHA
    ff7ffbc View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    69a081b View commit details
    Browse the repository at this point in the history
  4. clean up another comment

    etseidl committed Nov 2, 2023
    Configuration menu
    Copy the full SHA
    d4a1dba View commit details
    Browse the repository at this point in the history

Commits on Nov 3, 2023

  1. Configuration menu
    Copy the full SHA
    a6f772d View commit details
    Browse the repository at this point in the history
  2. add num_valid to EncPage

    etseidl committed Nov 3, 2023
    Configuration menu
    Copy the full SHA
    12ed759 View commit details
    Browse the repository at this point in the history
  3. remove unused aliases

    etseidl committed Nov 3, 2023
    Configuration menu
    Copy the full SHA
    717d53b View commit details
    Browse the repository at this point in the history

Commits on Nov 6, 2023

  1. Configuration menu
    Copy the full SHA
    551f728 View commit details
    Browse the repository at this point in the history

Commits on Nov 7, 2023

  1. Configuration menu
    Copy the full SHA
    1953714 View commit details
    Browse the repository at this point in the history

Commits on Nov 8, 2023

  1. Configuration menu
    Copy the full SHA
    cf793d6 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    90829f8 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    fa36c23 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    3ae2289 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    26fc762 View commit details
    Browse the repository at this point in the history

Commits on Nov 9, 2023

  1. Configuration menu
    Copy the full SHA
    18d4939 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    7cb0524 View commit details
    Browse the repository at this point in the history

Commits on Nov 11, 2023

  1. Configuration menu
    Copy the full SHA
    9440cd0 View commit details
    Browse the repository at this point in the history

Commits on Nov 13, 2023

  1. Configuration menu
    Copy the full SHA
    cebc8ee View commit details
    Browse the repository at this point in the history

Commits on Nov 14, 2023

  1. Configuration menu
    Copy the full SHA
    e1b427d View commit details
    Browse the repository at this point in the history

Commits on Nov 16, 2023

  1. Configuration menu
    Copy the full SHA
    2155f18 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    206e741 View commit details
    Browse the repository at this point in the history

Commits on Nov 17, 2023

  1. Configuration menu
    Copy the full SHA
    220704a View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    68dcc58 View commit details
    Browse the repository at this point in the history

Commits on Nov 20, 2023

  1. Configuration menu
    Copy the full SHA
    ed573ea View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    fe24d51 View commit details
    Browse the repository at this point in the history

Commits on Nov 21, 2023

  1. Configuration menu
    Copy the full SHA
    24c73f8 View commit details
    Browse the repository at this point in the history
  2. fix typo

    etseidl committed Nov 21, 2023
    Configuration menu
    Copy the full SHA
    6687b32 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    7d2075b View commit details
    Browse the repository at this point in the history

Commits on Nov 28, 2023

  1. Configuration menu
    Copy the full SHA
    335d0e7 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    01fc44f View commit details
    Browse the repository at this point in the history
  3. a few fixes from review

    etseidl committed Nov 28, 2023
    Configuration menu
    Copy the full SHA
    b06d515 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    0dcb674 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    9d66218 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    223bfab View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    2001271 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    ef38d1e View commit details
    Browse the repository at this point in the history
  9. fix cut-and-paste error

    etseidl committed Nov 28, 2023
    Configuration menu
    Copy the full SHA
    7eaad88 View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    2c4bebb View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    b458eb6 View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    0665183 View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    c5d7e50 View commit details
    Browse the repository at this point in the history
  14. get rid of unnecessary casts

    etseidl committed Nov 28, 2023
    Configuration menu
    Copy the full SHA
    9f859be View commit details
    Browse the repository at this point in the history
  15. remove unused function

    etseidl committed Nov 28, 2023
    Configuration menu
    Copy the full SHA
    acef3d5 View commit details
    Browse the repository at this point in the history

Commits on Nov 29, 2023

  1. move num_data_pages/num_dict_pages into EncColumnChunk

    root out yet another use of num_data_pages()
    etseidl committed Nov 29, 2023
    Configuration menu
    Copy the full SHA
    6a8a02e View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    49ea23f View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    0c6b6b5 View commit details
    Browse the repository at this point in the history

Commits on Dec 2, 2023

  1. Configuration menu
    Copy the full SHA
    33ab1e9 View commit details
    Browse the repository at this point in the history

Commits on Dec 5, 2023

  1. Configuration menu
    Copy the full SHA
    36785e2 View commit details
    Browse the repository at this point in the history
  2. clean up some comments

    etseidl committed Dec 5, 2023
    Configuration menu
    Copy the full SHA
    862752e View commit details
    Browse the repository at this point in the history

Commits on Dec 6, 2023

  1. Configuration menu
    Copy the full SHA
    3bacb21 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    78efd2e View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    97f2344 View commit details
    Browse the repository at this point in the history