next: Collected changes for the v2016.03 release #778

Merged: 48 commits into master, Mar 1, 2016

Conversation

lukego
Member

@lukego lukego commented Feb 24, 2016

I am declaring that next is now a release candidate for the v2016.03 release. I plan to focus on testing and stability and to have it in good shape on March 1st for potential merge by @eugeneia onto master. I will leave the [wip] tag in place for the next few days so that we have a chance to test.

Kacper Wysocki and others added 30 commits January 10, 2016 16:55
* lib/luajit/src/lj_record.c (lj_record_tailcall): Tail calls don't
  contribute to loop unroll limit.
* src/Makefile (MDSRC): Rename from RMSRC, and search for any .md.src
  file.  Update users.
  (MDOBJS): Rename from RMOBJS.
… functions

* src/lib/ctable.lua (make_equal_fn): Add specialized implementations
  for 2-byte and 6-byte keys as well.
  (hash_32): Rename from hash_i32, as it works on any 32-bit number.
  (hashv_32, hashv_48, hashv_64): New functions.
  (selftest): Update to test the new equal functions.
* src/doc/genbook.sh: Add section for specialized data structures.
* src/lib/README.ctable.md: New file.
Thanks to Kristian Larsson for catching it.
Convert all shell scripts from having path dependencies like:
  #!/bin/bash
to
  #!/usr/bin/env bash

The immediate utility of this change is to fix annoying breakage on
NixOS. For example, now "make doc/snabbswitch.html" works.

The exception is that I did not descend into subdirectories of
dependencies, e.g. ljsyscall and pflua.

I made the changes with an Emacs macro in the hope of avoiding an
unfortunate typo that would break something.
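For illustration, the rewrite described above amounts to a one-line transformation on each script's first line. A minimal Python sketch (the actual change was made with an Emacs macro, not this code):

```python
# Sketch of the shebang rewrite: change an exact "#!/bin/bash" first line
# to "#!/usr/bin/env bash", leaving any other shebang untouched.
def fix_shebang(text):
    lines = text.splitlines(keepends=True)
    if lines and lines[0].rstrip("\n") == "#!/bin/bash":
        lines[0] = "#!/usr/bin/env bash\n"
    return "".join(lines)

print(fix_shebang("#!/bin/bash\necho hello\n"))
```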
Streamline the core.memory API:

- Rewrite the blurb.
- Keep dma_alloc() and virtual_to_physical().
- Remove dma_min_addr and dma_max_addr (unused).
- Deprecate huge_page_size (used but should it be?)

Backstory here is that the core.memory module has exported low-level
information to other modules: the VFIO module (since removed) that
needed to whitelist all DMA memory with the kernel, a /dev/vhost-net
accelerator for tap devices (since removed) that also needed to
whitelist all packet memory, and to a lesser extent the vhost-user
implementation (that is still alive and well but may be overusing the
memory API e.g. in dead code that should be removed).

Adding the deprecation note on "huge_page_size" may or may not be a good
idea. Have to look more closely at the usage and clarify with a later
commit.
Remove the variable 'pagebits' and the function
'VirtioNetDevice:translate_physical_addr()'. These are not used and
they depend on parts of the core.memory API that I want to remove.
Thanks to Max Rottenkolber for review.
Rework the parsing of the Snabb command line to allow us to ship
a nicely named snabb-lwaftr binary, but have it map internally to the
also nicely-named 'programs.lwaftr' module tree.  Gets rid of the
nasty "snabb" name duplication in the module tree.
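The idea can be sketched as mapping the binary's basename onto the module tree. This is a hypothetical Python illustration only; the real parser lives in Snabb's Lua sources and the function name here is made up:

```python
import os
import re

# Hypothetical sketch: map a binary named "snabb-lwaftr" onto the
# module tree name "programs.lwaftr".
def program_module(argv0):
    name = os.path.basename(argv0)
    m = re.fullmatch(r"snabb-(\w+)", name)
    return "programs." + m.group(1) if m else None

print(program_module("/usr/local/bin/snabb-lwaftr"))  # programs.lwaftr
```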
This is intended to make GitHub match the ".md" suffix and treat the
files as Markdown. This should enable better support in the web
interface, e.g. showing diffs of the rendered Markdown.

Link to relevant Github feature:
https://help.github.com/articles/rendering-differences-in-prose-documents/
syscall.util.if_nametoindex() only needs to be passed the name of the
interface. The version of the function that accepts an open socket as a second
argument is internal to ljsyscall, and not even exported to client code.

Also, this moves the call to syscall.util.if_nametoindex() to the top of
RawSocket:new() to simplify error handling: this way one less explicit call
to syscall.close() is needed in case of errors.
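For comparison, Python's standard library exposes the same single-argument interface (this assumes a Linux host with a loopback interface named lo):

```python
import socket

# if_nametoindex takes only the interface name, mirroring the
# single-argument ljsyscall call discussed above.
idx = socket.if_nametoindex("lo")
print(idx)  # a positive integer
```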
@domenkozar
Member

@eugeneia I'll take a look at this tomorrow

@eugeneia
Member

@domenkozar You would need to add a NixOS package for https://github.com/eugeneia/snabbswitch-docker (see $(VM_TARBALL) task in the Makefile) that adds the contents of the created tarball into /etc/skel/.test_env/.

@lukego
Member Author

lukego commented Feb 25, 2016

Thus spake the wise man:

A little knowledge is a dangerous thing.

I have taken @sleinen's bait and started learning R for making sense of test results. Bear with me, I am surely doing it all wrong. Here is a first visualization of around 200 test results from lugano-1: comparing master and next by running the DPDK packet forwarding benchmark with a DPDK 1.7 guest and three different packet sizes (64, 128, 256):

[Image: R plot of Mpps by packet size (64, 128, 256) comparing master and next, DPDK 1.7 guest]

This seems to be telling us an encouraging message: that 256-byte consistently hits line rate (~4.4 Mpps), that 128 and 64 byte both exceed this, that variance is low, and that master and next are much the same. If this is indeed the message then this seems like a valuable kind of test to run (and particularly on release candidates).

There is one outlier with next and 64-byte packets: it would be interesting to make that dot into a hyperlink to the relevant log file (which I know can be done in R using SVG output).

I also found the function aov (analysis of variance) and that seems to basically work:

> summary(aov(dat17.mpps ~ dat17.branch * dat17.packetsize, data=df17))
                               Df Sum Sq Mean Sq   F value Pr(>F)    
dat17.branch                    1   0.00    0.00     3.244 0.0732 .  
dat17.packetsize                2  75.22   37.61 33379.059 <2e-16 ***
dat17.branch:dat17.packetsize   2   0.00    0.00     1.469 0.2328    
Residuals                     195   0.22    0.00                     
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

If I am reading this correctly (massive "if") then it is saying that different packet sizes have a major impact on performance (as expected) but different branches do not (as desired i.e. same perf from head and next).
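For intuition, the F value in the table above is the ratio of between-group variance to within-group variance. A tiny pure-Python sketch with made-up numbers (not the PR's data):

```python
# One-way ANOVA F statistic: between-group variance over within-group
# variance. Illustrative only; the numbers below are invented.
def anova_f(groups):
    k = len(groups)
    n = sum(len(g) for g in groups)
    grand = sum(sum(g) for g in groups) / n
    means = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (m - grand) ** 2 for g, m in zip(groups, means))
    ss_within = sum(sum((x - m) ** 2 for x in g) for g, m in zip(groups, means))
    return (ss_between / (k - 1)) / (ss_within / (n - k))

# Widely separated group means give a large F (a strong effect):
print(anova_f([[1, 2, 3], [11, 12, 13]]))  # 150.0
```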

I like the fact that with R this is all scripted and so analysis like this could easily be integrated into the CI.

End braindump. It may be that we look back on this post and chuckle at how I managed to completely misinterpret every aspect of R on my first day trying to use it. But that was then and this is now, so I am feeling quite proud for the moment :).

@lukego
Member Author

lukego commented Feb 25, 2016

(The cool thing about the analysis of variance is that it can tell you both how much effect a variable has and how likely that is to really be caused by random background noise. I am really keen to explore looking for relationships between multiple factors in the context of #755 e.g. how sensitive are memcpy/checksum to alignment and how much does that depend on which level of the cache hierarchy they are hitting.)

@lukego
Member Author

lukego commented Feb 25, 2016

cc @domenkozar: perhaps we could have Hydra run these kinds of analyses and make the results available for every PR.

@lukego
Member Author

lukego commented Feb 26, 2016

Here is a picture of the same test above but with the DPDK 2.1 version of l2fwd:

[Image: R plot of the same test with the DPDK 2.1 version of l2fwd in the guest]

On the one hand it is good for the v2016.03 release that master and next have consistent performance. On the other hand we see both lower performance overall (#665) and surprisingly lower results for 128-byte packets compared with 256-byte packets. Performance is sensitive to guest behavior in ways that we don't understand yet.

I am excited to press ahead with parallel efforts of measurement and optimization. These are both really valuable and mutually beneficial activities. There is a lot that can be done in both areas and for more applications. (I am focusing on the NFV application here only because it is the one that has upstream performance tests available: going forward I see this activity as encompassing all programs.)

Onward! One step at a time...

@lukego
Member Author

lukego commented Feb 26, 2016

@mwiget mentioned seeing lower performance (~10%) when leaving an application idle for a while before starting to process traffic. This is likely some JIT behavior that we need to understand and correct (following @wingo's fine example with #707). The way forward that I see would be to define a test case where the waiting time before processing traffic can be controlled so that we can measure its impact and see which code changes successfully reduce variance.

This is all extremely interesting from my perspective :-)

@wingo
Contributor

wingo commented Feb 26, 2016

I can believe that. I feel like sometimes if we start taking too many side traces, LuaJIT doesn't react appropriately, and if you do a jit.flush() things get back to good performance. Something to look at in the future.

@lukego
Member Author

lukego commented Feb 26, 2016

Hey wow cool awesome!

I asked R to do an analysis of variance with the branch, the packet size, and the container (DPDK version) and here is what it says:

> summary(aov(dat.mpps ~ dat.branch * dat.packetsize * dat.container, data=df))
                                         Df Sum Sq Mean Sq   F value Pr(>F)    
dat.branch                                1   0.01    0.01     1.857  0.174    
dat.packetsize                            2  54.59   27.30  7321.203 <2e-16 ***
dat.container                             1 271.85  271.85 72912.023 <2e-16 ***
dat.branch:dat.packetsize                 2   0.01    0.00     0.734  0.481    
dat.branch:dat.container                  1   0.00    0.00     0.001  0.970    
dat.packetsize:dat.container              2  29.06   14.53  3897.513 <2e-16 ***
dat.branch:dat.packetsize:dat.container   2   0.00    0.00     0.037  0.964    
Residuals                               390   1.45    0.00                     
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

and if I am interpreting this correctly then it is really providing valuable information that is consistent with the graphs above:

  1. Switching branches (master/next) does not make a difference.
  2. Switching packet size does make a difference.
  3. Switching containers also makes a difference.
  4. The differences caused by the packet size and the container are dependent on each other: packet size impacts each container differently (128-byte is faster with DPDK 1.7 and slower with DPDK 2.1).

This starts to look like exactly the thing I was looking for in #755 as a means to search for interactions between data alignment (src/dst/len) and memory hierarchy for different functions. And we already have the data in a CSV file to try this out with!

@lukego
Member Author

lukego commented Feb 27, 2016

Charging ahead with R and surely making a generous helping of mistakes...

Post hoc analysis with Tukey's test is another way to use the ANOVA results. This can be easier to interpret because it tells us how much the average changes between groups of results: the best estimate and the +/- range that can be predicted with 95% confidence.

Here is what Tukey's test says about the effect on Mpps of branch
(master vs next):

> TukeyHSD(aov(mpps ~ branch, data=df), ordered=TRUE)
  Tukey multiple comparisons of means
    95% family-wise confidence level
    factor levels have been ordered

Fit: aov(formula = mpps ~ branch, data = df)

$branch
                   diff        lwr       upr     p adj
master-next 0.008300208 -0.1769718 0.1935722 0.9298626

This says that the estimated difference is 0.0083 Mpps (i.e. very small) and we predict with 95% confidence that if we reran the test the effect would be within -0.177 and +0.194 of that value (i.e. not clear that there is a positive or negative difference overall).
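As a rough illustration of that kind of interval (made-up numbers, and a normal quantile instead of the studentized-range statistic that TukeyHSD actually uses, so only the Python stdlib is needed):

```python
from statistics import NormalDist, mean, stdev

# Approximate 95% CI for the difference of two group means.
# Illustrative numbers only; TukeyHSD uses the studentized range
# rather than the plain z quantile used here.
a = [4.41, 4.43, 4.40, 4.42, 4.44]
b = [4.40, 4.42, 4.41, 4.43, 4.41]
diff = mean(a) - mean(b)
se = (stdev(a) ** 2 / len(a) + stdev(b) ** 2 / len(b)) ** 0.5
z = NormalDist().inv_cdf(0.975)  # ~1.96
lwr, upr = diff - z * se, diff + z * se
print(diff, lwr, upr)
```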

Here is the same analysis but with instead comparing the DPDK/L2FWD version in the guest:

> TukeyHSD(aov(mpps ~ dpdk, data=df), ordered=TRUE)
  Tukey multiple comparisons of means
    95% family-wise confidence level
    factor levels have been ordered

Fit: aov(formula = mpps ~ dpdk, data = df)

$dpdk
                    diff      lwr     upr p adj
dpdk1.7-dpdk2.1 1.644667 1.554203 1.73513     0

This estimates that DPDK 1.7 shows +1.64 Mpps in this dataset and we can say with 95% confidence that the real difference is between +1.55 and +1.74 Mpps.

Cool information!

Just for another angle, maybe pointlessly fancy, we can also look at the effect of branch and DPDK version in combination:

> TukeyHSD(aov(mpps ~ branch * dpdk, data=df), ordered=TRUE)
  Tukey multiple comparisons of means
    95% family-wise confidence level
    factor levels have been ordered

Fit: aov(formula = mpps ~ branch * dpdk, data = df)

[... clipped redundant info ...]

$`branch:dpdk`
                                     diff        lwr       upr     p adj
master:dpdk2.1-next:dpdk2.1   0.008071301 -0.1602539 0.1763965 0.9993226
next:dpdk1.7-next:dpdk2.1     1.644434343  1.4748576 1.8140111 0.0000000
master:dpdk1.7-next:dpdk2.1   1.652963458  1.4846382 1.8212887 0.0000000
next:dpdk1.7-master:dpdk2.1   1.636363042  1.4680378 1.8046883 0.0000000
master:dpdk1.7-master:dpdk2.1 1.644892157  1.4778278 1.8119565 0.0000000
master:dpdk1.7-next:dpdk1.7   0.008529115 -0.1597961 0.1768543 0.9992013

which makes it look like the effects are independent: the same results with the same DPDK version, and a consistent 1.6 Mpps boost with DPDK 1.7 vs 2.1.

@lukego
Member Author

lukego commented Feb 28, 2016

(FYI: My mum tells me that this use and interpretation of Tukey's test is right.)

This file contains only the documentation for the ctable module and
not the whole lib/ API, so README.md was a misnomer.
Extracted the documentation from the pmu.lua source file into a
markdown-formatted README.
Extracted the documentation for the checksum module from the source
code and into a separate markdown-formatted README file.
@lukego
Member Author

lukego commented Mar 1, 2016

@eugeneia ship'able from my perspective.

@eugeneia eugeneia changed the title [wip] next: Collected changes for the v2016.03 release next: Collected changes for the v2016.03 release Mar 1, 2016
@eugeneia eugeneia self-assigned this Mar 1, 2016
@eugeneia eugeneia merged commit 0ae7382 into master Mar 1, 2016
eugeneia added a commit that referenced this pull request Mar 1, 2016
@eugeneia eugeneia added the merged label Mar 1, 2016
dpino pushed a commit to dpino/snabb that referenced this pull request Mar 22, 2017