
Applications Metrics Polling POC #4

Open
atosatto opened this issue Jul 8, 2016 · 6 comments

atosatto (Member) commented Jul 8, 2016

We should definitely put together an initial reference implementation of the Application Metrics Polling system.

The responsibility of this system will be to execute the checks defined by the container label kappa.metric every kappa.rate units of time.
Available checks will be defined in the kappa configuration file.
Each metric should give kappa the desired number of active containers for the given service.
The communication between Kappa's Engine and the Applications Metrics Polling component will happen asynchronously through channels, and each metric will be lazily spawned into its own goroutine.
The first implementation of the Applications Metrics Polling will rely on external bash scripts to actually perform the HTTP requests or cgroup queries that extract the desired number of instances of each container, but the design should make it easy to plug in different Applications Metrics Polling implementations.
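
To make the shape of this concrete, here is a minimal Go sketch of what the poller could look like; Metric, DesiredCount, and Poll are made-up names for the sake of the example, not a settled API:

```go
package polling

import "time"

// Metric describes one application metric to poll. Name and Rate would
// come from the kappa.metric and kappa.rate container labels.
type Metric struct {
	Name string
	Rate time.Duration
	// Check returns the desired number of active containers.
	Check func() (int, error)
}

// DesiredCount is the message the poller sends back to the Engine.
type DesiredCount struct {
	Metric string
	Count  int
}

// Poll lazily spawns one goroutine per metric and reports the desired
// container counts to the Engine asynchronously through a channel.
func Poll(metrics []Metric) <-chan DesiredCount {
	out := make(chan DesiredCount)
	for _, m := range metrics {
		m := m // capture the loop variable for the goroutine
		go func() {
			ticker := time.NewTicker(m.Rate)
			defer ticker.Stop()
			for range ticker.C {
				if n, err := m.Check(); err == nil {
					out <- DesiredCount{Metric: m.Name, Count: n}
				}
			}
		}()
	}
	return out
}
```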

fntlnz (Contributor) commented Jul 8, 2016

I created a little diagram to show what is going on there.

[diagram of the tick/metric flow, attached as a downloadable PDF]

The tick part is based on the rate, while the metric is the actual code executed to obtain the number of containers to scale to.

DOUBT: what are we supposed to do when a tick fails?

Also note that I called them ticks for a reason: I thought it would be better to treat them as single sequential units instead of allowing them to overlap, as happens with cron jobs.
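
For what it's worth, Go's time.Ticker gives us this behaviour almost for free: a plain receive loop runs checks strictly one after another, and since the ticker's channel buffers at most one pending tick, ticks that fire while a check is still running are simply dropped rather than queued. A small runnable demo:

```go
package main

import (
	"fmt"
	"time"
)

func main() {
	ticker := time.NewTicker(100 * time.Millisecond)
	defer ticker.Stop()
	for i := 0; i < 3; i++ {
		<-ticker.C
		fmt.Println("tick", i)
		// Simulate a check that outruns the rate: ticks firing while
		// we sleep are dropped (the ticker channel buffers only one),
		// so checks stay strictly sequential instead of piling up
		// like overdue cron jobs.
		time.Sleep(250 * time.Millisecond)
	}
}
```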

atosatto (Member, Author) commented Jul 9, 2016

Uhm.
Regarding your doubt, I think we have two choices (plus logging):

  • exponentially back off on failure
  • keep polling the metric even if it is failing

I believe the 2nd strategy would be the best one, since I expect users to test and monitor their metrics.
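
A minimal sketch of that second strategy, reusing the hypothetical names from the sketch above (plus the standard log package); on failure we just log and wait for the next tick:

```go
// Inside the per-metric goroutine: on failure we log and keep polling
// at the same rate, leaving monitoring of the metric to the user.
for range ticker.C {
	n, err := m.Check()
	if err != nil {
		log.Printf("metric %q failed: %v", m.Name, err)
		continue
	}
	out <- DesiredCount{Metric: m.Name, Count: n}
}
```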

> I thought it would be better to treat them as single sequential units instead of allowing them to overlap, as happens with cron jobs.

@fntlnz could you be more precise about this? What do you mean by overlapping? Are you referring to the fact that, if a metric is slow to compute the number of desired containers and the rate is too high, we could easily queue up too many metric-polling routines?

I'll open a PR with some code addressing this issue and count on your (@fntlnz & @jnardiello) valuable feedback.

fntlnz (Contributor) commented Jul 12, 2016

@atosatto Yep exactly that.

Let's consider this:

```
t0 -> metric1 check is fired
t1 -> metric1 check finished
t2 -> metric2 check is started
t3 -> metric3 check is started
t4 -> metric3 check is finished
t5 -> metric2 check is finished
```

As you can imagine, allowing metrics to overlap and start in an unmanaged way could lead to more problems than it solves.
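
That trace is exactly what you get with the cron-style approach of spawning a goroutine per tick; a contrived runnable example of the behaviour we want to avoid:

```go
package main

import (
	"fmt"
	"math/rand"
	"time"
)

func main() {
	ticker := time.NewTicker(50 * time.Millisecond)
	defer ticker.Stop()
	for i := 1; i <= 3; i++ {
		<-ticker.C
		// Cron-style: every tick gets its own goroutine, so a slow
		// check can still be running when the next one starts, and
		// completions interleave like the t0..t5 trace above.
		go func(id int) {
			time.Sleep(time.Duration(rand.Intn(200)) * time.Millisecond)
			fmt.Printf("metric%d check finished\n", id)
		}(i)
	}
	time.Sleep(300 * time.Millisecond) // let the stragglers finish
}
```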

fntlnz (Contributor) commented Jul 12, 2016

For the failing-metric thing I would go with the same option you chose.

> keep polling the metric even if it is failing

Mainly because if the check fails, the containers are untouched. I would expose an API, or give some other way to check programmatically whether something is failing.
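
Something as simple as recording each metric's last error and exposing it would be enough; a hypothetical sketch (Status and its methods are invented names, just to make the idea concrete):

```go
package polling

import "sync"

// Status records the outcome of each metric's last check so that the
// Engine (or, say, an HTTP endpoint) can report which ones are failing.
type Status struct {
	mu   sync.Mutex
	last map[string]error // metric name -> last check error, nil if OK
}

func NewStatus() *Status {
	return &Status{last: make(map[string]error)}
}

// Record stores the result of a metric's latest check.
func (s *Status) Record(metric string, err error) {
	s.mu.Lock()
	defer s.mu.Unlock()
	s.last[metric] = err
}

// Failing returns the names of the metrics whose last check failed.
func (s *Status) Failing() []string {
	s.mu.Lock()
	defer s.mu.Unlock()
	var failing []string
	for name, err := range s.last {
		if err != nil {
			failing = append(failing, name)
		}
	}
	return failing
}
```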

atosatto (Member, Author) commented:

Very good. All we need now is the code! 😆
I've already written a few lines of code to address this issue.
I'll submit a PR as soon as there's enough code to review, so you can give me feedback while I work on it.

Thank you!

atosatto (Member, Author) commented Jul 27, 2016

Just submitted PR #5 to track the changes addressing this issue.
