Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WIP: studies on CMS DY with phase space optimizations #970

Draft
wants to merge 406 commits into
base: master
Choose a base branch
from
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
406 commits
Select commit Hold shift + click to select a range
ef8cff8
[prof] in gg_tt.mad, profile REWGT (as "test14" for the moment, wip)
valassi Aug 12, 2024
e073613
[prof] in gg_tt.mad, add a second "program initial_i/o" counter"
valassi Aug 12, 2024
fbd5322
[prof] in gg_tt.mad driver.f, clean up comments in counters_register …
valassi Aug 12, 2024
04e39de
[prof] in gg_tt.mad driver.f, rename timers for unwgt, rewgt, scale, …
valassi Aug 12, 2024
62d7c4e
[prof] in gg_tt.mad dsample.f, remove the timer for grouping function…
valassi Aug 12, 2024
d3165cb
[prof] in gg_tt.mad auto_dsig1.f, add profiling for matrix1 also in d…
valassi Aug 12, 2024
d474e21
[prof] in gg_tt.mad, revert the profiling for matrix1 in dsig1
valassi Aug 12, 2024
59dbf04
[prof] in gg_tt.mad, profile ranmar (in ranmar.f: but this causes dou…
valassi Aug 12, 2024
117bd1e
[prof] in gg_tt.mad, revert the profiling of ranmar
valassi Aug 12, 2024
c356280
[prof] in gg_tt.mad driver.f, profile bridge creation/deletion (as te…
valassi Aug 12, 2024
6f86051
[prof] in gg_tt.mad, cleanly define Cudacpp initialise (bridge creati…
valassi Aug 12, 2024
255c343
[prof] in gg_tt.mad, start cleaning up timers: remove the two PROGRAM…
valassi Aug 12, 2024
568e024
[prof] in gg_tt.mad, complete cleanup of timers, with better names an…
valassi Aug 12, 2024
e1e212e
[prof] in gg_tt.mad counters.cc, add "OVERALL MEs" and "OVERALL NON-M…
valassi Aug 12, 2024
c80aa78
[prof] in gg_tt.mad counters add again an env variable CUDACPP_RUNTIM…
valassi Aug 12, 2024
5b24462
[prof] in gg_tt.mad counters.cc, consider printing throughputs only f…
valassi Aug 12, 2024
c9a72f3
[prof] in gg_tt.mad counters.cc, revert the last change
valassi Aug 12, 2024
c330fb1
[prof] in gg_tt.mad counters.cc, fix clang format
valassi Aug 12, 2024
555d91f
[prof] regenerate CODEGEN patch from gg_tt.mad including additional c…
valassi Aug 12, 2024
56404b3
[prof] regenerate all processes
valassi Aug 12, 2024
b20a1b0
[cmsdy] check out CODEGEN logs from prof for easier merging
valassi Aug 12, 2024
521798a
Merge branch 'prof' (including detailed timing profiles from WIP PR #…
valassi Aug 12, 2024
7122b9c
[cmsdy] regenerate all processes (ops this is not what I was normally…
valassi Aug 12, 2024
0ab28f3
[cmsdy] regenerate pp_dy3j.mad (with the extra counters from WIP PR #…
valassi Aug 12, 2024
29cf1fe
[cmsdy] undo the generation of all processes, focus on pp_dy3j alone
valassi Aug 12, 2024
22193a9
[cmdsy] in tlau, recreate dy3j gridpack using fortran on itgold91, in…
valassi Aug 12, 2024
e80e952
[cmsdy] in tlau/fromgridpacks, adapt the parseGridpackLogs.sh script …
valassi Aug 12, 2024
efdccf7
[cmsdy] new tlau run of ppdy3j for ALL backends from gridpack on itgo…
valassi Aug 12, 2024
699733e
[cmsdy] in tlau, modify the parseGridpackLogs.sh to create a summary.…
valassi Aug 12, 2024
d29e75b
[cmsdy] fix the tlau parseGridpackLogs.sh script and the resulting pp…
valassi Aug 12, 2024
5a2f534
[prof] rerun 102 tput tests on itscrd90 - all ok
valassi Aug 13, 2024
f24186c
[cmsdy] improve formatting in tlau parseGridpackLogs.sh script
valassi Aug 14, 2024
dae52f0
[cmsdy] include aggregate eevent counts in tlau parseGridpackLogs.sh …
valassi Aug 14, 2024
2e59eca
[cmsdy] in tlau recompute pp_dy3j.mad/summary.txtfrom parseGridpackLo…
valassi Aug 14, 2024
82f87c2
[prof] rerun 30 tmad tests on itscrd90 WITH NEW COUNTERS - all as exp…
valassi Aug 13, 2024
93cf80e
[prof] in gg_tt.mad, profile gen_mom (13) and sample_get_discrete_x (…
valassi Aug 14, 2024
f77cd1f
[prof] in gg_tt.mad, profile also subsections of genmom... is there a…
valassi Aug 14, 2024
1927c68
[cmsdy] empty commit, profile one individual dy3j subprocess
valassi Aug 14, 2024
00f0a37
[cmsdy] in pp_dy3j.mad P0_gux_taptamggux, add profiling for sample_ge…
valassi Aug 14, 2024
6d24179
[cmsdy] in pp_dy3j.mad P0_gux_taptamggux, add profiling for ranmar nt…
valassi Aug 15, 2024
e234927
[cmsdy] in pp_dy3j.mad P0_gux_taptamggux, remove profiling for ranmar…
valassi Aug 15, 2024
8beb5a5
[genps] in gg_tt.mad add newgenps.f (this is an as-is copy of sample_…
valassi Aug 14, 2024
605d212
[cmsdyps] move newgenps.f from gg_tt.mad to pp_dy3j.mad P0_gux_taptam…
valassi Aug 15, 2024
8f8317a
[genps] in gg_tt.mad newgenps.f, remove all dead code from sample_get_x
valassi Aug 15, 2024
7894922
[genps] in gg_tt.mad newgenps.f, comment out (but keep) warning messages
valassi Aug 15, 2024
50d08bc
[genps] in gg_tt.mad newgenps.f, further remove dead code for ituple=…
valassi Aug 15, 2024
ca7a84e
[genps] in gg_tt.mad newgenps.f, further comment out (but keep) warni…
valassi Aug 15, 2024
f5de5b3
[genps] in gg_tt.mad, integrate newgenps.f into the full build and fu…
valassi Aug 15, 2024
707aeb5
[cmsdyps] move makefile and dsample.f changes from gg_tt.mad to pp_dy…
valassi Aug 15, 2024
ba083d3
[cmsdyps] in pp_dy3j.mad newgenps, move counters to the new get_sampl…
valassi Aug 15, 2024
2aebfba
[cmsdyps] in pp_dy3j.mad newgenps, (temporarely) add back the checks …
valassi Aug 15, 2024
cc08fd2
[cmsdyps] in pp_dy3j.mad newgenps, comment out again the checks for w…
valassi Aug 15, 2024
8ad0b11
[genps] in gg_tt.mad newgenps. replace ddum(j) by ddum as I see no re…
valassi Aug 15, 2024
1f4ce0b
[cmsdyps] in ppdy3j, remove Source/htuple.f #967
valassi Aug 15, 2024
2625730
[genps] in gg_tt.mad add Source/newranmar.f (as-is copy of ntuple fro…
valassi Aug 15, 2024
fb6cbd9
[cmsdyps] move newranmar.f (and Source makefile changes) from gg_tt.m…
valassi Aug 15, 2024
c69fb2e
[genps] in gg_tt.mad newranmar.f, simplify the code removing initiali…
valassi Aug 15, 2024
9337450
[genps] in gg_tt.mad newgenps.f, finally remove all warnings instead …
valassi Aug 15, 2024
d52decc
[genps/cmsdyps] in gg_tt.mad newgenps.f, call the simpler ntuple_new …
valassi Aug 15, 2024
08fade2
[genps] in gg_tt.mad newgenps.f, add an optional debug printout
valassi Aug 15, 2024
c811df8
[genps] in gg_tt.mad newgenps.f, further assume that nzoom<=0 (else c…
valassi Aug 15, 2024
23a1358
[genps] in gg_tt.mad newgenps.f, simplify the code for xbin_min and x…
valassi Aug 15, 2024
e29cfbf
[genps] in gg_tt.mad newgenps.f and newranmar.f, further strip down f…
valassi Aug 15, 2024
654ec2a
[genps/cmsdyps] in gg_tt.mad newgenps.f, cosmetics
valassi Aug 15, 2024
b13d376
[genps] in gg_tt.mad newgenps.f, remove unused variable y
valassi Aug 15, 2024
291bcf5
[genps/cmsdyps] in gg_tt.mad newgenps.f, save xbin_min and xbin_max a…
valassi Aug 15, 2024
079207d
[cmsdy] in pp_dy3j.mad P0_gux_taptamggux, remove profiling for sample…
valassi Aug 15, 2024
5ac5d0a
[cmsdyps] in pp_dy3j.mad counters.cc, try to use steady_clock instead…
valassi Aug 16, 2024
ab34245
[cmsdyps] in pp_dy3j.mad counters.cc, revert to high_resolution_clock
valassi Aug 16, 2024
b3c0467
[cmsdyps] in pp_dy3j.mad newgenps.f, fix the counter for sample_get_x…
valassi Aug 16, 2024
7ed248b
[cmsdyps] in pp_dy3j.mad, add a new timers.h header (presently includ…
valassi Aug 16, 2024
e0bd2b3
[cmsdyps] in pp_dy3j.mad counters.cc and timers.h, try to use the new…
valassi Aug 16, 2024
5eb2721
[cmsdyps] in pp_dy3j.mad counters.cc, allow the use of ChronoTimers i…
valassi Aug 16, 2024
159f93b
[cmsdyps] in pp_dy3j.mad newgenps.f, temporarely disable again the sa…
valassi Aug 16, 2024
3797210
[cmsdyps] in pp_dy3j.mad counters.h, add a debug printout about which…
valassi Aug 16, 2024
753b904
[cmsdyps] in pp_dy3j.mad counters.cc, increase to 30 counters and add…
valassi Aug 17, 2024
c863d69
[cmsdyps] in pp_dy3j.mad driver.f and newgenps.f, move the SampleGetX…
valassi Aug 17, 2024
e9e560a
[cmsdyps] in pp_dy3j.mad counters.cc, rename env as CUDACPP_RUNTIME_D…
valassi Aug 17, 2024
89d7965
[cmsdyps] in pp_dy3j.mad driver.f and newgenps.f, comment out the Sam…
valassi Aug 17, 2024
4572a55
[cmsdyps] in pp_dy3j.mad driver.f, rename Random2Momenta counter as P…
valassi Aug 17, 2024
f8088be
[cmsdyps] in pp_dy3j.mad, port timermap.h to the new timers.h, remove…
valassi Aug 17, 2024
918481b
[cmsdyps] in pp_dy3j.mad, for simplicity rename the new timers.h as t…
valassi Aug 17, 2024
2a2d768
[cmsdyps] in ppdy3j, add back Source/htuple.f (postpone #967 till we …
valassi Aug 17, 2024
ef888b6
[cmsdyps] in ppdy3j Source/dsample.f, improve the comment about ntupl…
valassi Aug 17, 2024
940929a
[cmsdyps/prof] in gg_tt.mad, backport changes from pp_dy3j.mad (P0_gu…
valassi Aug 17, 2024
5fd648b
[cmsdyps/prof] rerun one tput test for ggtt with the new timers, chec…
valassi Aug 17, 2024
f7c5d3b
[cmsdyps/prof] in gg_tt.mad timermap.h, move to using rdtsc timers by…
valassi Aug 17, 2024
411397e
[cmsdyps/prof] in tput/throughputX.sh, add a printout about chrono vs…
valassi Aug 17, 2024
6622d45
[cmsdyps/prof] rerun one tput test for ggtt with chrono timers, no ch…
valassi Aug 17, 2024
b6a2964
[cmsdyps/prof] rerun one tput test for ggtt with rdtsc timers, essent…
valassi Aug 17, 2024
848ab8b
[cmsdyps] in pp_dy3j.mad timer.h, further modify ChronoTimer to use i…
valassi Aug 18, 2024
bfbf229
[cmsdyps] in pp_dy3j.mad timer.h, add functions getCountsSinceStart (…
valassi Aug 18, 2024
c318aba
[cmsdyps] in pp_dy3j.mad (backport gg_tt.mad) timermap.h, move to usi…
valassi Aug 18, 2024
ded24bf
[cmsdyps] in pp_dy3j.mad timermap.h, cosmetics (change case of variab…
valassi Aug 18, 2024
6f36d44
[cmsdyps] in pp_dy3j.mad timer.h, add secondsPerCount() function - ch…
valassi Aug 18, 2024
c859c91
[cmsdyps] in pp_dy3j.mad, complete moving timermap.h to getCountsSinc…
valassi Aug 18, 2024
4e1fcdc
[cmsdyps] in pp_dy3j.mad timer.h and counters.h, improve variable/fun…
valassi Aug 19, 2024
64c6b3f
[cmsdyps/prof] in gg_tt.mad, backport latest changes in timers and co…
valassi Aug 19, 2024
a930ca1
[cmsdyps/prof] rerun one tput test for ggtt with new chrono timers - …
valassi Aug 19, 2024
c400ec6
[cmsdyps/prof] rerun one tput test for ggtt with new rdtsc timers - t…
valassi Aug 19, 2024
3dec51e
[cmsdyps/prof] in gg_tt.mad timermap.h and check_sa,cc, fix the calib…
valassi Aug 19, 2024
c0a2aee
[cmsdyps/prof] rerun one tput test for ggtt with new chrono timers - …
valassi Aug 19, 2024
a4edb68
[cmsdyps/prof] rerun one tput test for ggtt with new rdtsc timers - n…
valassi Aug 19, 2024
20178c7
[prof] in gg_tt.mad, revert the last two commits (remove test profili…
valassi Aug 19, 2024
17aeb61
[prof] go back to previous tput and tmad logs for easier merging of c…
valassi Aug 19, 2024
2af35cb
[cmsdyps/prof] in gg_tt.mad, backport changes from pp_dy3j.mad (P0_gu…
valassi Aug 17, 2024
0f65d33
[cmsdyps/prof] rerun one tput test for ggtt with the new timers, chec…
valassi Aug 17, 2024
83202ca
[cmsdyps/prof] in gg_tt.mad timermap.h, move to using rdtsc timers by…
valassi Aug 17, 2024
c077f83
[cmsdyps/prof] in tput/throughputX.sh, add a printout about chrono vs…
valassi Aug 17, 2024
88f6916
[cmsdyps/prof] rerun one tput test for ggtt with chrono timers, no ch…
valassi Aug 17, 2024
d10e7f4
[cmsdyps/prof] rerun one tput test for ggtt with rdtsc timers, essent…
valassi Aug 17, 2024
90c863b
[cmsdyps/prof] in gg_tt.mad, backport latest changes in timers and co…
valassi Aug 19, 2024
609b4e4
[cmsdyps/prof] rerun one tput test for ggtt with new chrono timers - …
valassi Aug 19, 2024
d06e6a4
[cmsdyps/prof] rerun one tput test for ggtt with new rdtsc timers - t…
valassi Aug 19, 2024
48c8c79
[cmsdyps/prof] in gg_tt.mad timermap.h and check_sa,cc, fix the calib…
valassi Aug 19, 2024
9bf5e6e
[cmsdyps/prof] rerun one tput test for ggtt with new chrono timers - …
valassi Aug 19, 2024
a1c9b7a
[cmsdyps/prof] rerun one tput test for ggtt with new rdtsc timers - n…
valassi Aug 19, 2024
5fe76e0
[prof] in CODEGEN, backport the latest changes to timermap.h, check_s…
valassi Aug 19, 2024
3435f56
[prof] in CODEGEN, fix clang format for timermap.h, check_sa.cpp, tim…
valassi Aug 19, 2024
6f7076a
[prof] regenerate CODEGEN patch from gg_tt.mad including htuple comme…
valassi Aug 19, 2024
0db0718
[prof] in gg_tt.mad, fix clang format for timermap.h, check_sa.cpp, t…
valassi Aug 19, 2024
e2b46f2
[prof] regenerate gg_tt.mad, all ok
valassi Aug 19, 2024
5d75bb4
[prof] regenerate all processes
valassi Aug 19, 2024
6eb36a6
[prof] rerun a simple tmad test for ggtt... times look ok but through…
valassi Aug 19, 2024
4e7e07c
[prof] in gg_tt.mad and CODEGEN, fix a silly bug in throughputs (was …
valassi Aug 19, 2024
2e43faf
[prof] revert tmad run of ggtt with throughput bug
valassi Aug 19, 2024
42cad8d
[prof] rerun again a simple tmad test for ggtt... now times and throu…
valassi Aug 19, 2024
607abfc
[prof] regenerate gg_tt.mad, all ok
valassi Aug 19, 2024
9a03440
[prof] manually fix counters.cc in all generated processes
valassi Aug 19, 2024
7af63a2
[prof] in CODEGEN, backport the latest changes to timermap.h, check_s…
valassi Aug 19, 2024
8335edd
[prof] in CODEGEN, fix clang format for timermap.h, check_sa.cpp, tim…
valassi Aug 19, 2024
a882deb
[prof] regenerate CODEGEN patch from gg_tt.mad including htuple comme…
valassi Aug 19, 2024
87751e5
[prof] in gg_tt.mad, fix clang format for timermap.h, check_sa.cpp, t…
valassi Aug 19, 2024
8dfb8ef
[prof] in gg_tt.mad and CODEGEN, fix a silly bug in throughputs (was …
valassi Aug 19, 2024
08668e3
[cmsdyps] in pp_dy3j.mad P0_gux_taptamggux, fix clang format for time…
valassi Aug 19, 2024
6df453d
[cmsdyps] in pp_dy3j.mad P0_gux_taptamggux, copy timermap.h, counters…
valassi Aug 19, 2024
fdfd04c
[cmsdyps/cmsdy] in CODEGEN/generateAndCompare.sh, add gux_taptamggux …
valassi Aug 19, 2024
4196897
[cmsdyps/cmsdy] in CODEGEN/generateAndCompare.sh gux_taptamggux, try …
valassi Aug 19, 2024
983cd79
[cmsdyps/cmsdy] in CODEGEN/generateAndCompare.sh, try to add pp_tapta…
valassi Aug 19, 2024
065310a
[cmsdyps/cmsdy] revert the last 3 changes in CODEGEN/generateAndCompa…
valassi Aug 19, 2024
c17393f
[cmsdy] in dsample.f of pp_dy3j.mad P0_gux_taptamggux (also copied to…
valassi Aug 15, 2024
6a32f46
[cmsdy] in dsample.f of pp_dy3j.mad P0_gux_taptamggux (also copied to…
valassi Aug 15, 2024
40a0cbd
[cmsdy] in dsample.f of pp_dy3j.mad P0_gux_taptamggux (also copied to…
valassi Aug 15, 2024
5e9000f
[cmsdy] in dsample.f of pp_dy3j.mad P0_gux_taptamggux (also copied to…
valassi Aug 19, 2024
f0a7a3a
[prof] rerun 102 tput tests (with new rdtcs timers) on itscrd90 - all ok
valassi Aug 20, 2024
db32587
[prof] ** COMPLETE PROF ** rerun 30 tmad tests on itscrd90 (with new …
valassi Aug 20, 2024
95329f3
[prof] move to upstream/master codegen logs to ease merging
valassi Aug 21, 2024
9b394e6
Merge remote-tracking branch 'upstream/master' (with hel #960, mac #9…
valassi Aug 21, 2024
56d73ff
[prof] regenerate all processes after merging upstream/master
valassi Aug 21, 2024
9ac0039
[prof] in gg_tt.mad and CODEGEN timers/counters, disable Rdtsc counte…
valassi Aug 21, 2024
5c8d579
[prof] regenerate all processes after disabling Rdtsc counters on pla…
valassi Aug 21, 2024
ebcf619
[grid] move to upstream/master codegen logs for easier merging
valassi Aug 22, 2024
6ad8641
Merge remote-tracking branch 'upstream/master' (with hel #960, mac #9…
valassi Aug 22, 2024
d75ac0b
[grid] regenerate gg_tt.mad and gq_ttq.mad, all ok no change
valassi Aug 22, 2024
c68a9a7
[grid] in CODEGEN launch_plugin.py, make cudacpp_bldall a boolean par…
valassi Aug 22, 2024
2060dc1
[grid] regenerate gg_tt.mad and gq_ttq.mad, only launch_plugin.py has…
valassi Aug 22, 2024
9f7aa25
[grid] rerun tlau gridpack creation of gqttq (simplest process with t…
valassi Aug 22, 2024
5345ec0
[grid] add logs from a new gridpack execution of tlau gqttq on ALL ba…
valassi Aug 22, 2024
bebfd72
[grid] in tlau/fromgridpacks add parseGridpackLogs.sh
valassi Aug 7, 2024
70f5c47
[grid] further improvements in tlau parseGridpackLogs.sh, iterate ove…
valassi Aug 7, 2024
bfac4f7
[grid] further refine the tlau parseGridpackLogs.sh script
valassi Aug 7, 2024
324c613
[grid] further refine the tlau parseGridpackLogs.sh script
valassi Aug 8, 2024
289a973
[grid] in tlau compute gq_ttq.mad/summary.txt for the first time from…
valassi Aug 22, 2024
0615825
[cmsdy] in dsample.f of pp_dy3j.mad P0_gux_taptamggux [!NB: forgot to…
valassi Aug 19, 2024
af9cc07
[prof] regenerate CODEGEN patch from gg_tt.mad including the #969 per…
valassi Aug 19, 2024
76829d2
[cmsdyps/prof] in tput/throughputX.sh, add a printout about chrono vs…
valassi Aug 17, 2024
e88e6e5
[prof] in CODEGEN, backport the latest changes to timermap.h, check_s…
valassi Aug 19, 2024
3bd1333
[prof] in CODEGEN, fix clang format for timermap.h, check_sa.cpp, tim…
valassi Aug 19, 2024
3632d88
[prof] regenerate CODEGEN patch from gg_tt.mad including htuple comme…
valassi Aug 19, 2024
0a54164
[prof] in gg_tt.mad and CODEGEN, fix a silly bug in throughputs (was …
valassi Aug 19, 2024
5f1e4df
[cmsdy] regenerate gg_tt.mad after cherry picking the latest CODEGEN …
valassi Aug 19, 2024
2ea9a1e
[prof] regenerate CODEGEN patch from gg_tt.mad, after regenerating gg…
valassi Aug 19, 2024
40399e1
[cmsdy] regenerate pp_dy3j.mad (with new timers/counters and with #96…
valassi Aug 19, 2024
e8baa12
[cmdsy] in tlau, recreate dy3j gridpack using fortran on itgold91, in…
valassi Aug 19, 2024
2719e1c
[cmsdy] new tlau run of ppdy3j for ALL backends from gridpack on itgo…
valassi Aug 19, 2024
b125b65
[cmsdy] modify tags (PhaseSpaceSampling instead of Random2Momenta) in…
valassi Aug 19, 2024
5328973
[cmsdy] in tlau recompute pp_dy3j.mad/summary.txt from parseGridpackL…
valassi Aug 19, 2024
02d7a90
[cmsdy] in tlau/lauX.sh print out whether CUDACPP_RUNTIME_SKIPXBINCHE…
valassi Aug 21, 2024
249b46d
[cmsdy] finally port from pp_dy3j.mad to gg_tt.mad the dsample.f [tha…
valassi Aug 22, 2024
e959f7f
[prof] regenerate CODEGEN patch from gg_tt.mad, finally including the…
valassi Aug 22, 2024
3fc3ac0
[cmsdy] regenerate pp_dy3j.mad on itgold91, now including CUDACPP_RUN…
valassi Aug 22, 2024
6397737
[cmdsy] in tlau, recreate dy3j gridpack using fortran on itgold91, in…
valassi Aug 22, 2024
278c5d5
[cmsdy] in tlau/lauX.sh print out whether CUDACPP_RUNTIME_SKIPXBINCHE…
valassi Aug 22, 2024
2043ce4
[cmsdy] new tlau run of ppdy3j for ALL backends from gridpack on itgo…
valassi Aug 22, 2024
ada7969
[cmsdy] in tlau recompute pp_dy3j.mad/summary.txt on itgold91 (includ…
valassi Aug 22, 2024
d06a9db
[cmsdy] new tlau run of ppdy3j for ALL backends from gridpack on itgo…
valassi Aug 22, 2024
348664c
[cmsdy] in tlau recompute pp_dy3j.mad/summary.txt on itgold91 (with C…
valassi Aug 22, 2024
212e236
[cmsdyps] in gg_tt.mad, copy newranmar.f and newgenps.f from pp_dy3j.…
valassi Aug 19, 2024
89fd588
[cmsdyps] in gg_tt.mad, move newgenps.f to SubProcesses and add it in…
valassi Aug 22, 2024
eb6078c
[cmsdyps] in pp_dy3j.mad, move newgenps.f to SubProcesses and add it …
valassi Aug 22, 2024
9e1dbff
[cmsdyps] in CODEGEN, add newgenps.f and newranmar.f to the list of f…
valassi Aug 22, 2024
6956473
[cmsdyps] regenerate CODEGEN patch from gg_tt.mad including newgenps.…
valassi Aug 22, 2024
b780372
[cmsdyps] regenerate gg_tt.mad, all ok
valassi Aug 22, 2024
e3c60d3
[cmsdyps] regenerate pp_dy3j.mad, with newgenps.f and newranmar.f eve…
valassi Aug 22, 2024
b7e11e2
[cmsdyps] move to codegen logs from branch cmsdy, to ease merging
valassi Aug 22, 2024
b747b6c
Merge branch 'cmsdy' (with #968 and #969 improvements - but not yet t…
valassi Aug 22, 2024
742d161
[cmsdyps] regenerate CODEGEN patch from (merged) gg_tt.mad after merg…
valassi Aug 22, 2024
ea87504
[cmsdy] [cmsdy] check out CODEGEN logs from branch grid for easier me…
valassi Aug 22, 2024
5223f79
Merge branch 'grid' (including also upstream/master merge with hel #9…
valassi Aug 22, 2024
7b2d8b2
Merge branch 'cmsdy' (including grid which include upstream/master me…
valassi Aug 22, 2024
90431ba
[grid] regenerate all processes (before re-merging into cmsdy)
valassi Aug 23, 2024
264316e
Merge branch 'grid' (after regenerating all processes) into cmsdy
valassi Aug 23, 2024
51781f7
[prof] in gg_tt.mad and CODEGEN timers/counters, disable Rdtsc counte…
valassi Aug 21, 2024
3dfdf55
[cmsdy] check out CODEGEN logs from branch prof for easier merging
valassi Aug 23, 2024
e165549
Merge branch 'prof' into cmsdy
valassi Aug 23, 2024
988fb93
[cmsdy] regenerate all processes (after merging prof into cmsdy)
valassi Aug 23, 2024
fd1b762
[cmsdyps] check out tput logs from branch cmsdy for easier merging
valassi Aug 23, 2024
ea7ea6d
[cmsdy] in CODEGEN/allGenerateAndCompare.sh add (temporarely?) pp_dy0…
valassi Aug 23, 2024
facebfe
[cmsdy] regenerate pp_dy012j.mad
valassi Aug 23, 2024
dfdc9de
[cmsdy] regenerate pp_dy3j.mad
valassi Aug 23, 2024
2350ef0
Merge branch 'cmsdy' (including the latest grid and prof branches wit…
valassi Aug 23, 2024
c60de03
[prof] in CODEGEN/generateAndCompare.sh, add gux_taptamggux (similar …
valassi Aug 23, 2024
3a94376
[prof] add gux_taptamggux.mad to CODEGEN/allGenerateAndCompare.sh
valassi Aug 23, 2024
af682f3
[prof] add gux_taptamggux.mad to the repo, for timer tests
valassi Aug 23, 2024
5f22187
[prof] in gux_taptamggux.mad, switch on SampleGetX profiling as a test
valassi Aug 23, 2024
5c0a2ed
[prof] in gux_taptamggux.mad timer.h, add the option to remove overhe…
valassi Aug 23, 2024
ad9b747
[prof] in gux_taptamggux.mad timer.h, add instead a getTotalOverheadS…
valassi Aug 23, 2024
464b9d7
[prof] in gux_taptamggux.mad counters.cc, add the option to remove ti…
valassi Aug 23, 2024
e33250a
[prof] in gux_taptamggux.mad counters.cc, improve handling of TEST CO…
valassi Aug 23, 2024
eba8039
[prof] in gux_taptamggux.mad counters.cc, add a mechanism for declari…
valassi Aug 23, 2024
51bbbaa
[prof] in gux_taptamggux.mad counters.cc, add a printout of the estim…
valassi Aug 23, 2024
fe44fa9
[prof] in gux_taptamggux.mad, declare SampleGetX as included in Phase…
valassi Aug 23, 2024
5d3da5a
[prof] in gux_taptamggux.mad timer.h, remove all handling of overhead…
valassi Aug 23, 2024
3577a55
[prof] in gux_taptamggux.mad counters.h, move here the handling of co…
valassi Aug 23, 2024
6dcab81
[prof] in gux_taptamggux.mad counters.h, improve the handling of coun…
valassi Aug 23, 2024
ef82161
[prof] move to CODEGEN logs from the latest upstream/master for easie…
valassi Sep 2, 2024
eb7e826
Merge remote-tracking branch 'upstream/master' (including new CI and …
valassi Sep 2, 2024
a7267f0
[grid] move to CODEGEN logs from the latest upstream/master for easie…
valassi Sep 2, 2024
271afaf
Merge remote-tracking branch 'upstream/master' (including new CI and …
valassi Sep 2, 2024
526c1af
[cmsdy] move to CODEGEN logs from the latest upstream/master for easi…
valassi Sep 2, 2024
312c171
Merge branch 'prof' (including upstream/master with Source/makefile #…
valassi Sep 2, 2024
b20a4f8
Merge branch 'grid' (also including the latest master - should be a n…
valassi Sep 2, 2024
058d066
Merge branch 'cmsdy' (including latest prof, grid and master with Sou…
valassi Sep 2, 2024
ffa3f06
[cmsdyps] in CODEGEN, move the dependency on newramar.o to output.py …
valassi Sep 2, 2024
11a2a31
[cmsdyps] regenerate gg_tt.mad - it is different but functionally equ…
valassi Sep 2, 2024
2525410
[prof] move to tput/tmad logs from the latest upstream/master for eas…
valassi Sep 16, 2024
34041b7
[prof] move to auto_dsig1.f from the latest upstream/master in all ge…
valassi Sep 16, 2024
4df3dfa
Merge remote-tracking branch 'upstream/master' (including june24, goo…
valassi Sep 16, 2024
4d91140
[prof] in gg_tt.mad auto_dsig1.f, add back all counters as in the pro…
valassi Sep 16, 2024
4526aac
[prof] regenerate CODEGEN patch from gg_tt.mad after merging an old u…
valassi Oct 4, 2024
5eacb46
[prof] regenerate all processes after merging an old 'upstream/master…
valassi Oct 4, 2024
95a9070
[prof] move to the latest upstream/master CODEGEN logs for easier mer…
valassi Oct 4, 2024
7c6ba3d
[prof] move to dsample.f from the latest upstream/master in all gener…
valassi Oct 4, 2024
416a52b
Merge remote-tracking branch 'upstream/master' (including v1.0.0 and …
valassi Oct 4, 2024
3ec0964
[prof] regenerate CODEGEN patch from gg_tt.mad after merging upstream…
valassi Oct 4, 2024
817dd25
[prof] regenerate all processes after merging upstream/master(v1.0.0 …
valassi Oct 4, 2024
1aa687e
Merge branch 'prof' (including latest prof, plus upstream/master with…
valassi Oct 5, 2024
6fdb131
[cmsdy] regenerate CODEGEN patch from gg_tt.mad after merging the lat…
valassi Oct 5, 2024
619777a
Merge branch 'cmsdy' (including latest prof, with master for v1.00.00…
valassi Oct 5, 2024
c63eb88
[cmsdyps] regenerate CODEGEN patch from gg_tt.mad after merging the l…
valassi Oct 5, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
The diff you're trying to view is too large. We only load the first 3000 changed files.
Original file line number Diff line number Diff line change
@@ -1,8 +1,141 @@
diff --git b/epochX/cudacpp/gg_tt.mad/SubProcesses/P1_gg_ttx/auto_dsig.f a/epochX/cudacpp/gg_tt.mad/SubProcesses/P1_gg_ttx/auto_dsig.f
index bc9bcfeb9..0c1962d3e 100644
--- b/epochX/cudacpp/gg_tt.mad/SubProcesses/P1_gg_ttx/auto_dsig.f
+++ a/epochX/cudacpp/gg_tt.mad/SubProcesses/P1_gg_ttx/auto_dsig.f
@@ -315,8 +315,10 @@ C ENDDO

C set the running scale
C and update the couplings accordingly
+ CALL COUNTERS_START_COUNTER( 5, VECSIZE_USED ) ! FortranUpdateScaleCouplings=5
CALL UPDATE_SCALE_COUPLING_VEC(ALL_P, ALL_WGT, ALL_Q2FACT,
$ VECSIZE_USED)
+ CALL COUNTERS_STOP_COUNTER( 5 ) ! FortranUpdateScaleCouplings=5

IF(GROUPED_MC_GRID_STATUS.EQ.0) THEN
C If we were in the initialization phase of the grid for MC over
diff --git b/epochX/cudacpp/gg_tt.mad/SubProcesses/P1_gg_ttx/auto_dsig1.f a/epochX/cudacpp/gg_tt.mad/SubProcesses/P1_gg_ttx/auto_dsig1.f
index db3c284ca..f1cd4e976 100644
--- b/epochX/cudacpp/gg_tt.mad/SubProcesses/P1_gg_ttx/auto_dsig1.f
+++ a/epochX/cudacpp/gg_tt.mad/SubProcesses/P1_gg_ttx/auto_dsig1.f
@@ -127,6 +127,7 @@ C Continue only if IMODE is 0, 4 or 5
IF(IMODE.NE.0.AND.IMODE.NE.4.AND.IMODE.NE.5) RETURN


+ CALL COUNTERS_START_COUNTER( 4, 1 ) ! FortranPDFs=4
IF (ABS(LPP(IB(1))).GE.1) THEN
C LP=SIGN(1,LPP(IB(1)))
IF (DSQRT(Q2FACT(IB(1))).EQ.0D0) THEN
@@ -148,6 +149,7 @@ C LP=SIGN(1,LPP(IB(2)))
ENDIF
G2=PDG2PDF(LPP(IB(2)),0, IB(2),XBK(IB(2)), QSCALE)
ENDIF
+ CALL COUNTERS_STOP_COUNTER( 4 ) ! FortranPDFs=2
PD(0) = 0D0
IPROC = 0
IPROC=IPROC+1 ! g g > t t~
@@ -186,7 +188,9 @@ C Select a flavor combination (need to do here for right sign)
R=R-DABS(PD(IPSEL))/PD(0)
ENDDO

+ CALL COUNTERS_START_COUNTER( 6, 1 ) ! FortranReweight=6
DSIGUU=DSIGUU*REWGT(PP,1)
+ CALL COUNTERS_STOP_COUNTER( 6 ) ! FortranReweight=6

C Apply the bias weight specified in the run card (default is 1.0)
DSIGUU=DSIGUU*CUSTOM_BIAS(PP,DSIGUU,1,1)
@@ -360,6 +364,7 @@ C Continue only if IMODE is 0, 4 or 5
STOP
ENDIF

+ CALL COUNTERS_START_COUNTER( 4, VECSIZE_USED ) ! FortranPDFs=2
DO CURR_WARP=1, NB_WARP_USED
IF(IMIRROR_VEC(CURR_WARP).EQ.1)THEN
IB(1) = 1
@@ -382,6 +387,7 @@ C LP=SIGN(1,LPP(IB(2)))
ENDIF
ENDDO ! IWARP LOOP
ENDDO ! CURRWARP LOOP
+ CALL COUNTERS_STOP_COUNTER( 4 ) ! FortranPDFs=2
ALL_PD(0,:) = 0D0
IPROC = 0
IPROC=IPROC+1 ! g g > t t~
@@ -426,7 +432,9 @@ C Select a flavor combination (need to do here for right sign)
CHANNEL = SUBDIAG(1)


+ CALL COUNTERS_START_COUNTER( 6, 1 ) ! FortranReweight=6
ALL_RWGT(IVEC) = REWGT(ALL_PP(0,1,IVEC), IVEC)
+ CALL COUNTERS_STOP_COUNTER( 6 ) ! FortranReweight=6

IF(FRAME_ID.NE.6)THEN
CALL BOOST_TO_FRAME(ALL_PP(0,1,IVEC), FRAME_ID, P_MULTI(0
@@ -482,11 +490,13 @@ C Set sign of dsig based on sign of PDF and matrix element
ALL_OUT(IVEC)=0D0
ENDIF
C Generate events only if IMODE is 0.
+ CALL COUNTERS_START_COUNTER( 7, 1 ) ! FortranUnweight=7
IF(IMODE.EQ.0.AND.DABS(ALL_OUT(IVEC)).GT.0D0)THEN
C Call UNWGT to unweight and store events
CALL UNWGT(ALL_PP(0,1,IVEC), ALL_OUT(IVEC)*ALL_WGT(IVEC),1,
$ SELECTED_HEL(IVEC), SELECTED_COL(IVEC), IVEC)
ENDIF
+ CALL COUNTERS_STOP_COUNTER( 7 ) ! FortranUnweight=7
ENDDO

END
@@ -555,7 +565,7 @@ C Call UNWGT to unweight and store events

IF( FBRIDGE_MODE .LE. 0 ) THEN ! (FortranOnly=0 or BothQuiet=-1 or BothDebug=-2)
#endif
- CALL COUNTERS_SMATRIX1MULTI_START( -1, VECSIZE_USED ) ! fortranMEs=-1
+ CALL COUNTERS_START_COUNTER( 9, VECSIZE_USED ) ! FortranMEs=9
DO IVEC=1, VECSIZE_USED
CALL SMATRIX1(P_MULTI(0,1,IVEC),
& hel_rand(IVEC),
@@ -571,7 +581,7 @@ C ======================================================
C *START* Included from CUDACPP template smatrix_multi.f
C (into function smatrix$i_multi in auto_dsig$i.f)
C ======================================================
- CALL COUNTERS_SMATRIX1MULTI_STOP( -1 ) ! fortranMEs=-1
+ CALL COUNTERS_STOP_COUNTER( 9 ) ! FortranMEs=9
#ifdef MG5AMC_MEEXPORTER_CUDACPP
ENDIF

@@ -581,7 +591,7 @@ C ======================================================
STOP
ENDIF
IF ( FIRST ) THEN ! exclude first pass (helicity filtering) from timers (#461)
- CALL COUNTERS_SMATRIX1MULTI_START( 1, VECSIZE_USED ) ! cudacppHEL=1
+ CALL COUNTERS_START_COUNTER( 11, 0 ) ! 11=CudaCpp-Initialise (was CudaCpp-HEL; counter set to 1 on bridge creation, do not increment it further)
CALL FBRIDGESEQUENCE_NOMULTICHANNEL( FBRIDGE_PBRIDGE, ! multi channel disabled for helicity filtering
& P_MULTI, ALL_G, HEL_RAND, COL_RAND, OUT2,
& SELECTED_HEL2, SELECTED_COL2, .TRUE.) ! quit after computing helicities
@@ -602,9 +612,9 @@ C ENDIF
ENDIF
WRITE (6,*) 'NGOODHEL =', NGOODHEL
WRITE (6,*) 'NCOMB =', NCOMB
- CALL COUNTERS_SMATRIX1MULTI_STOP( 1 ) ! cudacppHEL=1
+ CALL COUNTERS_STOP_COUNTER( 11 ) ! 11=CudaCpp-Initialise (was CudaCpp-HEL)
ENDIF
- CALL COUNTERS_SMATRIX1MULTI_START( 0, VECSIZE_USED ) ! cudacppMEs=0
+ CALL COUNTERS_START_COUNTER( 19, VECSIZE_USED ) ! CudaCppMEs=19
IF ( .NOT. MULTI_CHANNEL ) THEN
CALL FBRIDGESEQUENCE_NOMULTICHANNEL( FBRIDGE_PBRIDGE, ! multi channel disabled
& P_MULTI, ALL_G, HEL_RAND, COL_RAND, OUT2,
@@ -618,7 +628,7 @@ C ENDIF
& HEL_RAND, COL_RAND, CHANNELS, OUT2,
& SELECTED_HEL2, SELECTED_COL2, .FALSE.) ! do not quit after computing helicities
ENDIF
- CALL COUNTERS_SMATRIX1MULTI_STOP( 0 ) ! cudacppMEs=0
+ CALL COUNTERS_STOP_COUNTER( 19 ) ! CudaCppMEs=19
ENDIF

IF( FBRIDGE_MODE .LT. 0 ) THEN ! (BothQuiet=-1 or BothDebug=-2)
diff --git b/epochX/cudacpp/gg_tt.mad/SubProcesses/P1_gg_ttx/driver.f a/epochX/cudacpp/gg_tt.mad/SubProcesses/P1_gg_ttx/driver.f
index 1124a9164..27a6e4674 100644
index ecd11b239..4650934b2 100644
--- b/epochX/cudacpp/gg_tt.mad/SubProcesses/P1_gg_ttx/driver.f
+++ a/epochX/cudacpp/gg_tt.mad/SubProcesses/P1_gg_ttx/driver.f
@@ -74,13 +74,77 @@ c common/to_colstats/ncols,ncolflow,ncolalt,ic
@@ -76,16 +76,95 @@ c common/to_colstats/ncols,ncolflow,ncolalt,ic

include 'coupl.inc' ! needs VECSIZE_MEMMAX (defined in vector.inc)
INTEGER VECSIZE_USED
Expand All @@ -27,7 +160,19 @@ index 1124a9164..27a6e4674 100644
+ CALL OMPNUMTHREADS_NOT_SET_MEANS_ONE_THREAD()
+#endif
+ CALL COUNTERS_INITIALISE()
+
+c Use null-terminated C-string in COUNTERS_REGISTER_COUNTER calls (maybe it is not needed, but it does not harm)
+ CALL COUNTERS_REGISTER_COUNTER( 1, 'Fortran Initialise(I/O)'//char(0) )
+ CALL COUNTERS_REGISTER_COUNTER( 3, 'Fortran PhaseSpaceSampling'//char(0) ) ! uniform [0,1] + vegas to [0,1] + map to momenta
+ CALL COUNTERS_REGISTER_COUNTER( 4, 'Fortran PDFs'//char(0) )
+ CALL COUNTERS_REGISTER_COUNTER( 5, 'Fortran UpdateScaleCouplings'//char(0) )
+ CALL COUNTERS_REGISTER_COUNTER( 6, 'Fortran Reweight'//char(0) )
+ CALL COUNTERS_REGISTER_COUNTER( 7, 'Fortran Unweight(LHE-I/O)'//char(0) )
+ CALL COUNTERS_REGISTER_COUNTER( 8, 'Fortran SamplePutPoint'//char(0) )
+ CALL COUNTERS_REGISTER_COUNTER( 9, 'Fortran MEs'//char(0) )
+ CALL COUNTERS_REGISTER_COUNTER( 11, 'CudaCpp Initialise'//char(0) )
+ CALL COUNTERS_REGISTER_COUNTER( 12, 'CudaCpp Finalise'//char(0) )
+ CALL COUNTERS_REGISTER_COUNTER( 19, 'CudaCpp MEs'//char(0) )
+c CALL COUNTERS_REGISTER_COUNTER( 21, 'TEST SampleGetX'//char(0) )
+#ifdef MG5AMC_MEEXPORTER_CUDACPP
+ fbridge_mode = 1 ! CppOnly=1, default for CUDACPP
+#else
Expand Down Expand Up @@ -71,24 +216,39 @@ index 1124a9164..27a6e4674 100644
+ endif
+
+#ifdef MG5AMC_MEEXPORTER_CUDACPP
+ CALL COUNTERS_START_COUNTER( 11, 1 ) ! 11=CudaCpp-Initialise
+ CALL FBRIDGECREATE(FBRIDGE_PBRIDGE, VECSIZE_USED, NEXTERNAL, 4) ! this must be at the beginning as it initialises the CUDA device
+ FBRIDGE_NCBYF1 = 0
+ FBRIDGE_CBYF1SUM = 0
+ FBRIDGE_CBYF1SUM2 = 0
+ FBRIDGE_CBYF1MAX = -1D100
+ FBRIDGE_CBYF1MIN = 1D100
+ CALL COUNTERS_STOP_COUNTER( 11 ) ! 11=CudaCpp-Initialise
+#endif
c
c Read process number
c
@@ -208,8 +272,33 @@ c call sample_result(xsec,xerr)
+ CALL COUNTERS_START_COUNTER( 1, 1 ) ! FortranInitialise=1
call open_file(lun+1, 'dname.mg', fopened)
if (.not.fopened)then
goto 11
@@ -156,6 +235,7 @@ c If CKKW-type matching, read IS Sudakov grid
print *,'Running CKKW as lower mult sample'
endif
endif
+ CALL COUNTERS_STOP_COUNTER( 1 ) ! FortranInitialise=1

c
c Get user input
@@ -216,8 +296,35 @@ c call sample_result(xsec,xerr)
c write(*,*) 'Final xsec: ',xsec

rewind(lun)
-
close(lun)
+
+#ifdef MG5AMC_MEEXPORTER_CUDACPP
+ CALL COUNTERS_START_COUNTER( 12, 1 ) ! 12=CudaCpp-Finalise
+ CALL FBRIDGEDELETE(FBRIDGE_PBRIDGE) ! this must be at the end as it shuts down the CUDA device
+ IF( FBRIDGE_MODE .LE. -1 ) THEN ! (BothQuiet=-1 or BothDebug=-2)
+ WRITE(*,'(a,f10.8,a,e8.2)')
Expand All @@ -111,12 +271,13 @@ index 1124a9164..27a6e4674 100644
+ & FBRIDGE_CBYF1SUM / FBRIDGE_NCBYF1, ' +- ',
+ & SQRT( FBRIDGE_CBYF1SUM2 ) / FBRIDGE_NCBYF1 ! ~standard error
+ ENDIF
+ CALL COUNTERS_STOP_COUNTER( 12 ) ! 12=CudaCpp-Finalise
+#endif
+ CALL COUNTERS_FINALISE()
end

c $B$ get_user_params $B$ ! tag for MadWeight
@@ -387,7 +476,7 @@ c
@@ -400,7 +507,7 @@ c
fopened=.false.
tempname=filename
fine=index(tempname,' ')
Expand All @@ -126,7 +287,7 @@ index 1124a9164..27a6e4674 100644
open(unit=lun,file=tempname,status='old',ERR=20)
fopened=.true.
diff --git b/epochX/cudacpp/gg_tt.mad/SubProcesses/P1_gg_ttx/matrix1.f a/epochX/cudacpp/gg_tt.mad/SubProcesses/P1_gg_ttx/matrix1.f
index 1acba8200..069c74ef4 100644
index bf488e4b0..707ea4032 100644
--- b/epochX/cudacpp/gg_tt.mad/SubProcesses/P1_gg_ttx/matrix1.f
+++ a/epochX/cudacpp/gg_tt.mad/SubProcesses/P1_gg_ttx/matrix1.f
@@ -71,7 +71,10 @@ C
Expand All @@ -141,7 +302,7 @@ index 1acba8200..069c74ef4 100644
C
C This is just to temporarily store the reference grid for
C helicity of the DiscreteSampler so as to obtain its number of
@@ -211,6 +214,17 @@ C ----------
@@ -224,6 +227,17 @@ C update.
ENDIF
IF(NTRY(1).EQ.MAXTRIES)THEN
ISHEL=MIN(ISUM_HEL,NGOOD)
Expand Down
Loading
Loading