From f88b557482e88dc7c40eae6e4d5986e21cd1bc37 Mon Sep 17 00:00:00 2001
From: moebiusband73
Date: Mon, 12 Oct 2020 07:53:07 +0200
Subject: [PATCH] Update README.md

---
 README.md | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 37b31bf..034bfe0 100644
--- a/README.md
+++ b/README.md
@@ -37,7 +37,7 @@ TAG ?= GCC
 ENABLE_OPENMP ?= false
 ENABLE_LIKWID ?= false
 
-OPTIONS = -DSIZE=40000000ull
+OPTIONS = -DSIZE=120000000ull
 OPTIONS += -DNTIMES=10
 OPTIONS += -DARRAY_ALIGNMENT=64
 #OPTIONS += -DVERBOSE_AFFINITY
@@ -47,6 +47,8 @@ OPTIONS += -DARRAY_ALIGNMENT=64
 
 The verbosity options enable detailed output about affinity settings, allocation sizes and timer resolution.
 
+*Notice:* OpenMP involves significant overhead through barrier cost, especially on systems with many memory domains. The default problem size is set to almost 4GB to have enough work vs overhead. If you suspect that the result should be better you may try to further increase the problem size. To compare to original stream results on X86 systems you have to ensure that streaming store instructions are used. For the ICC toolchain this is now the default (Option `-qopt-streaming-stores=always`).
+
 2. Build with:
 ```
 make