Optimal paging queries #1679

coadan · 2021-12-30T11:15:51Z

Maybe I'm incorrect in my assumptions here, but here we go:

There are many cases where graph queries aren't necessary because your dataset is aggregated/denormalized and you don't need to join on anything, but you still want simple pagination over the result sets.

I know that Datalog queries provide offset and limit to page over the result tuples returned from the graph index. However, the fact that the whole query has to be realized seems to be a limiting factor for efficiently displaying larger sets of denormalized data. I'm also not sure it's good for the hardware to spill to disk every time a query with a deep offset occurs, because that would rip through the write ops of the underlying SSDs, no?

Example use case: we want to store synced emails in XTDB and display them directly to the user with pagination. Is it really not possible to do this in an optimal way, especially with potentially millions of them spilling to disk every time?

Answered by jarohen

jarohen · 2022-01-04T16:20:01Z

Maintainer

Hey @coadan 👋

You're right that :offset requires XT to page through all the results you want to skip (the same goes for Postgres etc., fwiw). In similar situations, I've tended to filter on attributes in the documents instead ('cursor-based pagination'). For example, if you were paging through emails in the order they were received, and the last email on the page was received last Thursday, your client then requests '100 emails starting from last Thursday'. In this case, the query planner can skip straight to the first item on the next page.

HTH!

James
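
To make the difference between the two approaches concrete, here is a minimal Python sketch. It is not XTDB code: the sorted list stands in for a sorted attribute index, and `build_index`, `page_by_offset`, and `page_by_cursor` are hypothetical names made up for this illustration. The cursor is a `(received_at, id)` pair rather than a bare timestamp, on the assumption that several emails can share a timestamp and the id is needed to break ties.

```python
from bisect import bisect_right
from datetime import datetime, timedelta
from itertools import islice


def build_index(emails):
    """Sort (received_at, id) pairs; the id breaks ties between equal timestamps."""
    return sorted((e["received_at"], e["id"]) for e in emails)


def page_by_offset(index, offset, limit):
    """Offset paging: islice still steps over the first `offset` entries."""
    return list(islice(iter(index), offset, offset + limit))


def page_by_cursor(index, cursor, limit):
    """Cursor paging: binary-search to the first entry after `cursor`."""
    start = 0 if cursor is None else bisect_right(index, cursor)
    page = index[start:start + limit]
    # The last entry of this page becomes the cursor for the next request.
    next_cursor = page[-1] if page else cursor
    return page, next_cursor


# Hypothetical sample data: ten emails, two per minute, so timestamps repeat.
base = datetime(2021, 12, 30, 11, 0)
emails = [
    {"id": f"email-{i}", "received_at": base + timedelta(minutes=i // 2)}
    for i in range(10)
]
index = build_index(emails)

# Walk the whole set three emails at a time, carrying the cursor forward.
cursor, pages = None, []
while True:
    page, cursor = page_by_cursor(index, cursor, limit=3)
    if not page:
        break
    pages.append(page)
```

Both functions return the same pages, but the offset version does work proportional to the offset, while the cursor version does a logarithmic seek no matter how deep the page is. In a real database the same idea would presumably be expressed as a range predicate on the sort attribute combined with a limit.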

1 reply

coadan Jan 12, 2022
Author

Thanks, I will try to design around this!
