Scope of this project is to apply Algorithm 1: CVOPT-SASG: Algorithm computing a random sample for a single aggregate, single group-by as refers in the follow source : https://arxiv.org/pdf/1909.02629.pdf input streams-data are coming from : https://openaq.org/
Loaded into a Kafka topic (Removed in last version) Read From Kafka topic (Removed in last version) // Comment code, Lose Tuples cause Time events.//
- Run FirstPAssMain
- Run SecondPassMain
- Run FirstPassMain
- Run KafkaMain
- Run SecondPassMain
Resolving Time Issues, New KafkaConsumer (Connection between first and second Flink jobs).