Yordan Grigorov
Haralampos Gavriilidis
Sergey Redyuk
https://docs.google.com/presentation/d/1ddOAj5-91dMmr7gBm28uBOANgcL1zlLOv41waXOuHt4/edit?usp=sharing
https://drive.google.com/file/d/1EusaxeYp8BGZFN4AVRxKpls9tO-Y2D-B/view?usp=sharing
Coming soon!
grizzly
- Scripts that prepare workflows for execution with grizzly
- Experiments with grizzly
modin
- Scripts that prepare workflows for execution with modin
- Experiments with modin
p2d2
- The product of the thesis
- "Python PushDown to Database Management System"
papers
- PDFs of papers that are/were referenced by the thesis
- PDFs of papers that I find interesting and could be relevant for the thesis
wflows
- Data science workflows adapted from Kaggle wflows/qgenenv - contains the qgen binary from TPC-H and its output (result.sql, result2.sql, result3.sql)
data
- Data for the wflows and the scripts to import them. See comments in wflows to see what data is required. Or just import all.
tpch-kit
- submodule containing gregrahn's fork of the TPC-H implementation. I had some difficulties compiling the original TPC-H and also this fork promises better PostgreSQL compatibility.
latex
- The bachelor thesis
retired
- The "trash bin". Contains code I might not want to delete just yet.