Skip to content

kindywu/data-voyager

Repository files navigation

data voyager: A simple and practical data exploration tool based on dataFusion and reedline-repl-rs

image

image

image

data voyager

  • Arrow
  • DataFusion
  • Polars

duckdb clickhouse pgcli

  • duckdb

    • SELECT count(*) FROM 'data/user_stats.parquet' WHERE last_visited_at >= '2024-06-15';
  • clickhouse local

    • SELECT count(*) FROM file('data/user_stats.parquet', Parquet) WHERE last_visited_at >= toDateTime('2024-06-15', 'Asia/Shanghai'); // 0.065 sec
  • pgcli postgres://kindy:kindy@localhost:5432/stats

    • SELECT count(*) FROM user_stats WHERE last_visited_at >= '2024-06-15'; // 0.131 sec

other

chmod +x /usr/bin/duckdb

dump 5M rows to parquet file

  • clickhouse select * from postgresql('localhost:5432', 'stats', 'user_stats', 'kindy', 'kindy') into outfile 'data/user_stats.parquet'

select * from postgresql('localhost:5432', 'stats', 'user_stats', 'kindy', 'kindy') limit 100 into outfile 'assets/user_stats.ndjson'

About

data voyager for arrow, polars, dataFusion, etc

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages