next: Changes queued for the v2016.07 release #956

lukego · 2016-06-29T06:10:13Z

Changes queued up for v2016.07.

Fixes needed:

Selftest in core.timer is printing tens of thousands of debug messages. Have to account for why this has started happening and likely remove some debug printouts (seems excessive to print a message on every timer invocation even when developer_debug is enabled... could be a use case for timeline instead soon).
NFV selftest is failing CI. Could be related to migration of davos from Ubuntu to NixOS that is ongoing. @eugeneia?
Performance regression detected by Hydra tests developed over on NFV appears to allocate #951. This benchmark is automatically executing as the issue-951 jobset. Results can be found by clicking an Evaluation (i.e. test run) and locating the benchmark-report job.

Set the shared memory path (shm.path) to a private namespace for each app with prefix "app/$name". This means that apps can create shm objects such as counters and by default these will appear in a local namespace for that app.

I rewrote the git-workflow explanation with the goal of briefly but clearly explaining both how to submit a change to Snabb Switch and also how to be a subsystem maintainer. This is intended to be a chapter in the manual.

Contains "XXX rewrite" where old sections have been removed but replacements not written yet.

This file now contains a diagram so it needs to have separate .src.md and .md versions. This can be cleaned up when these changes eventually merge with the on-demand markdown (#829).

also stopped capitalizing "Pull Request" because it seemed awkward.

- Use "apps/" instead of "app/" for uniformity - Set shm path to "apps/$name" when calling `app:stop' too - Unlink "apps/$name" after `app:stop' using `shm.unlink' - Add a test case to core.app selftest

# Conflicts: # src/core/app.lua

…eric representation.

Resolved conflicts with incoming changes from master branch: *.src.md replaced with simply *.md "Snabb Switch" being renamed to "Snabb"

Ditaa images are not kept in tree anymore...

option and support injecting a function to determine the current time.

This reverts commit 8bb3215.

MRG_RXBUF is enabled by default in the beginning, and QEMU will initially negotiate a feature set with Snabb NFV that includes MRG_RXBUF. This adds a field onto the virtio header, in legacy mode. However if we later negotiate to not have MRG_RXBUF, we need to re-set this value to not have the extra fields. (cherry picked from commit df8e83c)

lukego · 2016-07-01T21:42:49Z

I cherry-picked the virtio-net fix df8e83c from lwaftr repo for renegotiating virtio-net options with guests that switch drivers while running.

…dicate." Reason: its actually slower than the initial naive version. This reverts commit c186591. # Conflicts: # src/apps/vhost/vhost_user.lua

eugeneia · 2016-07-04T12:59:38Z

@lukego I have tracked down the regression to the statistics counters in vhost_user. I pushed e3fcbbf to #931 which reverts a premature optimization I did, and this removes half of the slowdown. That leaves us with a 2.5% regression. Now looking into what can be done about the remaining slowdown.

Edit: well “premature optimization” is the wrong term here really as I tried to outsmart the compiler and failed.

eugeneia · 2016-07-04T13:29:58Z

Profiling tells me that 10% of time is spent in counter.add, and does not highlight vhost_user in particular. Theory: more counters → more work. Possible solutions I can think of:

optimize counter.add
update vhost_user to invoke counter.add less, e.g. aggregate updates like intel10g does

eugeneia · 2016-07-04T13:45:16Z

Alternatively we could omit counting successfully processed rx/tx packets in vhost_user, and rely on the existing link statistics to reflect these. (rx/tx packes/bytes app-counters are mostly redundant with respect to link stats.)

# Conflicts: # src/apps/vhost/vhost_user.lua # src/lib/protocol/README.md

…-next

Fixed packet documentation merge conflict

lukego · 2016-07-05T05:59:53Z

@eugeneia Looking at a new Hydra run it looks like the same performance regression is there on both next and your statistics-superset branch. If you push new code to your branch it should automatically rerun. Hopefully you can find results by clicking around the snabb-new-tests project on Hydra. Currently we have two relevant CI jobs, regression-min-2016.07 running a quick 5-iteration benchmark and regression-2016.07 running a thorough 30-iteration benchmark.

@domenkozar I would like to generalize these CI jobs. Instead of specifically testing for a performance regression on the 2016.07 release they could be continuously showing the comparative performance of a collection of branches: master, next, lukego/optimize, eugeneia/optimize, wingo/optimize, kbara/optimize, etc. That way everybody can always interact with the CI simply by pushing to their optimization branch and waiting to see the result. Just a matter of renaming the jobsets and locking in the branch names?

Meta: I reckon that all "out of band" information we are discussing should be brought "in band" somehow. For example, if profiler information is required for interpreting results then the CI tests should include runs that are profiled and publish the reports. This way we are all looking at the same information and can see exactly how it was produced.

lukego · 2016-07-05T07:41:03Z

@domenkozar This is really cool testing to have up and running, btw! I am especially impressed that Hydra is smart enough to reuse all previous results that are still valid. So if one branch is updated then its benchmarks will be rerun but the results for the unmodified branches will be reused. This seems extremely efficient and means we can include a lot of branches in the benchmark campaigns.

domenkozar · 2016-07-05T09:39:59Z

@lukego I've renamed the CI jobs to next-regression-benchmarks and next-regression-benchmarks-small: https://hydra.snabb.co/project/snabb-new-tests

Once /optimize branches exist we can also change those :)

By "profiler information" do you mean we should write those tests and include them? I'm +1 on that, let's add more regressions tests.

The counter value for the interface speed should be in units of bps. The ifSpeed SNMP object must obey RFC3635 sec. 3.2.8.

domenkozar · 2016-07-05T15:26:19Z

Looks like SnabbBot errored out due to snabblab/snabblab-nixos#52

# Conflicts: # src/apps/vhost/vhost_user.lua

Allow multiple CPUs via --cpu argument

lukego and others added 30 commits February 22, 2016 13:35

engine: Set shm path to "app/$name"

b34c3ee

Set the shared memory path (shm.path) to a private namespace for each app with prefix "app/$name". This means that apps can create shm objects such as counters and by default these will appear in a local namespace for that app.

doc/git-workflow.md: Rewritten based on new experience

80614e1

I rewrote the git-workflow explanation with the goal of briefly but clearly explaining both how to submit a change to Snabb Switch and also how to be a subsystem maintainer. This is intended to be a chapter in the manual.

doc/git-workflow.md: Added section on upstreaming subsystems

12d0009

doc/git-workflow.md: Partly rewritten draft

af8ae46

Contains "XXX rewrite" where old sections have been removed but replacements not written yet.

doc/git-workflow.md: Created .src.md

eccb614

This file now contains a diagram so it needs to have separate .src.md and .md versions. This can be cleaned up when these changes eventually merge with the on-demand markdown (#829).

git-workflow.md: Rewrote section on becoming a maintainer

113b607

git-workflow.md: Wrote about upstreaming subsystem branches

4a37b98

also stopped capitalizing "Pull Request" because it seemed awkward.

Amendments to #766:

94ff234

- Use "apps/" instead of "app/" for uniformity - Set shm path to "apps/$name" when calling `app:stop' too - Unlink "apps/$name" after `app:stop' using `shm.unlink' - Add a test case to core.app selftest

Merge PR #766 (engine: Set shm path to "app/$name") into yang

aac0c8c

# Conflicts: # src/core/app.lua

core.counter: Qualify counter names using `shm.resolve'.

fad0f43

snabb top: add `--app' option to print app counters.

7ed4ed0

snabb top: unlink own shm tree to avoid clutter.

eb9005b

vhost_user: Add RFC 7223 app counters.

5fbe0d6

Intel_app: Add RFC 7223 app counters.

8bb3215

snabb top: Add --link parameter to list link counters.

7a55478

core.app: Put app counters under "counters/<app>", update snabb top.

dde5da2

Add a test demonstrating pci device ids, with capital A-F

7a6296f

lib.json: Import JSON4Lua 1.0.0, include encode functionality.

924ff4e

lib.macaddress: Support numeric initialization; add method to get num…

8e34093

…eric representation.

core.link: Create “discontinuity-time” counters.

5f9efd2

snabb top: add `--yang' option to print YANG model as JSON.

7b39148

remove superfluous syscall.sysctl

5d6baa4

Update CONTRIBUTING.md.

e457300

Add 'start' method to apps

c3c69d2

Call app.start instead of app_table[name].start

9dabdd2

Merge branch 'master' into doc-git-workflow-redux

6964969

Resolved conflicts with incoming changes from master branch: *.src.md replaced with simply *.md "Snabb Switch" being renamed to "Snabb"

doc: Remove stale .images/Branches.png file

6c46b5d

Ditaa images are not kept in tree anymore...

snabb top --yang: Represent uint64_t as decimal string.

8984741

[core.lib] Generalize `timer' to optionally accept 'repeating'

ee00d16

option and support injecting a function to determine the current time.

Revert "Intel_app: Add RFC 7223 app counters."

45490b8

This reverts commit 8bb3215.

Revert "lib.protocol.ethernet: Add n_mcast, branch-free Multicast pre…

e3fcbbf

…dicate." Reason: its actually slower than the initial naive version. This reverts commit c186591. # Conflicts: # src/apps/vhost/vhost_user.lua

eugeneia and others added 8 commits July 4, 2016 15:48

Merge branch 'statistics-superset' into io-stats-1

b3abaa0

# Conflicts: # src/apps/vhost/vhost_user.lua # src/lib/protocol/README.md

intel_app: fix wrong “speed” counter value.

b418f8f

apps.ipv6.nd_light: revise counters, south = rx / north = tx.

d52d89c

Merge PR #944 (test_env.sh: output kernel console to stdout) into max…

e99037a

…-next

Merge PR #948 (default to eugeneia/snabb-nfv-test-vanilla) into max-next

06bed19

Merge PR #949 (ESP tunnel support for SnabbNFV) into max-next

9738d15

Merge #961 branch 'eugeneia/max-next-v2016.07-2' into next

3549398

Merged #947 (RawSocket/TAP i/o stats counters)

b31acfd

Fixed packet documentation merge conflict

Fix interface speed

900ff8e

The counter value for the interface speed should be in units of bps. The ifSpeed SNMP object must obey RFC3635 sec. 3.2.8.

Katerina Barone-Adesi and others added 4 commits July 5, 2016 18:39

Merge remote-tracking branch 'upstream/next' into kbara-next

d9e52ab

Merge PR #953 (Misc NFV stats counters) into kbara-next

51c9d51

vhost/vhost_user: avoid callbacks.

905bf8d

Merge PR #963 (Fix interface speed) into statistics-superset

8317e61

eugeneia mentioned this pull request Jul 8, 2016

Fixes for next/2016.07 #964

Merged

lukego and others added 3 commits July 11, 2016 10:38

Merge #947 branch 'snabbco/kbara-next' into next

cf1810e

Merge branch 'next' into statistics-superset

f891072

# Conflicts: # src/apps/vhost/vhost_user.lua

Merge #964 branch 'eugeneia/statistics-superset' into next

3e1f4dc

eugeneia merged commit 3e1f4dc into master Jul 13, 2016

eugeneia added a commit that referenced this pull request Jul 13, 2016

Merge PR #956 (v2016.07 release) into master

6bef9cc

dpino pushed a commit to dpino/snabb that referenced this pull request Sep 25, 2017

Merge pull request snabbco#956 from Igalia/multiprocess-cpu

9a4b06e

Allow multiple CPUs via --cpu argument

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

next: Changes queued for the v2016.07 release #956

next: Changes queued for the v2016.07 release #956

lukego commented Jun 29, 2016 •

edited

Loading

lukego commented Jul 1, 2016

eugeneia commented Jul 4, 2016 •

edited

Loading

eugeneia commented Jul 4, 2016

eugeneia commented Jul 4, 2016 •

edited

Loading

lukego commented Jul 5, 2016

lukego commented Jul 5, 2016

domenkozar commented Jul 5, 2016 •

edited

Loading

domenkozar commented Jul 5, 2016 •

edited

Loading

next: Changes queued for the v2016.07 release #956

next: Changes queued for the v2016.07 release #956

Conversation

lukego commented Jun 29, 2016 • edited Loading

lukego commented Jul 1, 2016

eugeneia commented Jul 4, 2016 • edited Loading

eugeneia commented Jul 4, 2016

eugeneia commented Jul 4, 2016 • edited Loading

lukego commented Jul 5, 2016

lukego commented Jul 5, 2016

domenkozar commented Jul 5, 2016 • edited Loading

domenkozar commented Jul 5, 2016 • edited Loading

lukego commented Jun 29, 2016 •

edited

Loading

eugeneia commented Jul 4, 2016 •

edited

Loading

eugeneia commented Jul 4, 2016 •

edited

Loading

domenkozar commented Jul 5, 2016 •

edited

Loading

domenkozar commented Jul 5, 2016 •

edited

Loading