data voyager: A simple, practical data exploration tool built on DataFusion and reedline-repl-rs
- Arrow
- DataFusion
- Polars
- duckdb
  - SELECT count(*) FROM 'data/user_stats.parquet' WHERE last_visited_at >= '2024-06-15';
- clickhouse local
  - SELECT count(*) FROM file('data/user_stats.parquet', Parquet) WHERE last_visited_at >= toDateTime('2024-06-15', 'Asia/Shanghai'); // 0.065 sec
- pgcli postgres://kindy:kindy@localhost:5432/stats
  - SELECT count(*) FROM user_stats WHERE last_visited_at >= '2024-06-15'; // 0.131 sec
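The timings noted next to the queries come from running the same count in different engines. A minimal sketch of how such a measurement could be scripted is shown here, assuming Python's built-in sqlite3 as a stand-in engine (it is not one of the tools listed in these notes) and a handful of hypothetical sample rows. The helper names `build_demo_db` and `timed_count` are made up for illustration.

```python
import sqlite3
import time

# Same shape as the count queries in the notes; the table is a local stand-in.
QUERY = "SELECT count(*) FROM user_stats WHERE last_visited_at >= '2024-06-15'"


def build_demo_db():
    """Create an in-memory user_stats table with hypothetical sample rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE user_stats (user_id INTEGER, last_visited_at TEXT)")
    rows = [
        (1, "2024-06-14 23:59:59"),  # before the cutoff
        (2, "2024-06-15 00:00:00"),  # on the cutoff
        (3, "2024-06-16 08:30:00"),  # after the cutoff
        (4, "2024-05-01 12:00:00"),  # before the cutoff
    ]
    conn.executemany("INSERT INTO user_stats VALUES (?, ?)", rows)
    return conn


def timed_count(conn, query=QUERY):
    """Run the query once and return (row count, elapsed seconds)."""
    start = time.perf_counter()
    (count,) = conn.execute(query).fetchone()
    elapsed = time.perf_counter() - start
    return count, elapsed


if __name__ == "__main__":
    conn = build_demo_db()
    count, elapsed = timed_count(conn)
    print(f"count={count} elapsed={elapsed:.6f} sec")
```

A single run is noisy. Repeating the query several times and taking the minimum or median would give a fairer comparison between engines.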
chmod +x /usr/bin/duckdb
- clickhouse
  - SELECT * FROM postgresql('localhost:5432', 'stats', 'user_stats', 'kindy', 'kindy') INTO OUTFILE 'data/user_stats.parquet';
  - SELECT * FROM postgresql('localhost:5432', 'stats', 'user_stats', 'kindy', 'kindy') LIMIT 100 INTO OUTFILE 'assets/user_stats.ndjson';