tau memory_instrumentation (v4.0.x) #7

Open: wants to merge 147 commits into base v4.0.x

147 commits
d7053a3
btl/openib: delay UCX warning to add_procs()
ggouaillardet Nov 29, 2018
c58c774
btl/openib: have add_proc() return immediately when the port is disab…
ggouaillardet Dec 11, 2018
8da4605
btl/openib: immediately release the device when no port is allowed
ggouaillardet Dec 12, 2018
c44821a
pml/ob1: fixed local handle sent during PUT control message
EmmanuelBRELLE Apr 1, 2019
48f8243
Fix the leak of fragments for persistent sends.
bosilca Apr 26, 2019
8f82c89
Coll/hcoll: don't init opal memhooks unless explicitely requested by …
May 20, 2019
11cb0f2
btl/uct: check for support before disabling UCX memory hooks
hjelmn May 15, 2019
c142605
SSHMEM/COLL: added sshmem/mpi implementation for shmem_collect call
May 21, 2019
f75d46f
ALLOC_WITH_HINT: added implace realloc
May 15, 2019
748a5f5
SHADOW ALLOCATOR: minor code optimization
May 16, 2019
456c5b9
OSHMEM: minor optimization of realloc in shadow allocator
May 24, 2019
69923e7
SPML/UCX: added synchronized flush on quiet
May 20, 2019
5f79dfa
shmat/shmdt additions for patcher
markalle Mar 27, 2019
cadf315
Fixed SPC/MPI_T initialization error.
bosilca May 17, 2019
3fd5c84
Merge pull request #6718 from hoopoepg/topic/pci-flush-on-quiet-v4.0
hppritcha May 31, 2019
dbf8940
Fix the SPC initialization.
bosilca May 17, 2019
a8d5da6
Fix the man pages for some of the MPI_T_* functions.
bosilca May 29, 2019
4083800
Use the correct counter name in the example.
bosilca May 29, 2019
76f01b9
Merge pull request #6696 from gpaulsen/topic/v4.0.x/btl_uct_from_6668
hppritcha Jun 3, 2019
6c74d40
Merge pull request #6720 from markalle/patcher_additions_v40x
hppritcha Jun 3, 2019
a04f5f0
Merge pull request #6692 from vspetrov/v4.0.x
gpaulsen Jun 3, 2019
18f1037
Merge pull request #6152 from ggouaillardet/topic/v4.0.x/ucx_warning
gpaulsen Jun 3, 2019
6c2cd10
Fix tree spawn at scale
rhc54 Jun 4, 2019
e07f127
Ignore generated file
rhc54 Jun 4, 2019
e6e09c6
shmem/c: Fix shmem type for calls to shmem_test and shmem_wait_until …
May 30, 2019
bd602cc
Merge pull request #6701 from hoopoepg/topic/sshmem-mpi-coll-collect-…
gpaulsen Jun 5, 2019
5acaf00
regx/base: fix an integer overflow
Jun 5, 2019
a42977f
Merge pull request #6707 from hoopoepg/topic/alloc-with-hint-realloc-…
hppritcha Jun 6, 2019
5dd8830
mpiext/pcollreq: Add `_f08` to procedure names
yanagibashi May 31, 2019
900f0fa
OSC rdma: make sure accumulating in shared memory is safe
devreal May 10, 2019
07b97bf
Merge pull request #6745 from yanagibashi/pr/v4.0.x/add-f08-procedure…
gpaulsen Jun 7, 2019
105bfed
Merge pull request #6741 from open-mpi/smiller_shmem_wait_types_v4.0.x
gpaulsen Jun 7, 2019
630af10
Merge pull request #6739 from ggouaillardet/topic/regx_atoi
gpaulsen Jun 7, 2019
0cd5a5a
Merge pull request #6714 from rhc54/cmr40/routed
gpaulsen Jun 7, 2019
b5428aa
btl/uct: add support for UCX 1.6.x
hjelmn May 16, 2019
adba7f5
COLL/BASE: Fix linear sync all2all
brminich Jun 6, 2019
cb8dd56
Merge pull request #6747 from devreal/rdma-fetchop-local-v4.0.x
hppritcha Jun 14, 2019
e01005a
Merge pull request #6748 from gpaulsen/topic/v4.0.x/fix_ucx_1.6_issue…
gpaulsen Jun 17, 2019
73c4aac
Merge pull request #6750 from brminich/topic/all2all_linear_sync_fix_…
hppritcha Jun 17, 2019
05fa584
Fix finalize of flux component
rhc54 Jun 19, 2019
c5cf343
OSC rdma win allocate: synchronize error codes across shared memory g…
devreal Jun 5, 2019
28d3009
Merge pull request #6725 from bosilca/cherrypick/6683
hppritcha Jun 24, 2019
3da9236
Merge pull request #6765 from rhc54/cmr4/flux
hppritcha Jun 24, 2019
16e1d74
Suggestion to fix division by zero in file view.
haraldkl Jun 13, 2019
b353639
Update to PMIx v3.1.3rc4
rhc54 Jun 25, 2019
9d0adbc
Update to track 32-bit support commit
rhc54 Jun 26, 2019
ba2368b
Merge pull request #6774 from hppritcha/topic/pr_6759_for_v4.0.x
hppritcha Jun 26, 2019
6424857
Merge pull request #6634 from jsquyres/pr/v4.0.x/ob1-fixes
hppritcha Jun 26, 2019
7f26c6d
Merge pull request #6776 from rhc54/cmr40/pmix
gpaulsen Jun 26, 2019
514e273
Merge pull request #6770 from devreal/osc_winalloc_err_v4.0.x
gpaulsen Jun 28, 2019
1d0e055
v4.0.x: Update PMIx to official v3.1.3 release
rhc54 Jul 2, 2019
b6da090
pml/ucx: Fix the max tag and context id values
nysal Jul 3, 2019
a7608a0
Merge pull request #6787 from rhc54/cmr40/pmix
gpaulsen Jul 8, 2019
2df46ac
Merge pull request #6792 from hoopoepg/topic/ucx_maxtag_fix-v4.0
gpaulsen Jul 8, 2019
667fe3f
Fix oob_tcp tcp_component_close segfault with active listeners
orivej Jul 4, 2019
d3a7360
Merge pull request #6800 from jsquyres/pr/v4.0.x/tcp-oob-active-liste…
hppritcha Jul 9, 2019
9499dcf
Manage errors in NBC collective ops
abouteiller Jan 26, 2018
c9e4240
mpi: retain operation and datatype in non blocking collectives
ggouaillardet Apr 9, 2019
71f240f
btl/openib: fix issue 6785
hppritcha Jul 12, 2019
63605fc
v4.0.x OSC: Reset external request to NULL to avoid double request
Jul 12, 2019
507fcc9
Merge pull request #6806 from ggouaillardet/topic/v4.0.x/nbc_retain
gpaulsen Jul 16, 2019
368da00
Merge pull request #6804 from hppritcha/topic/swat_issue_6785
gpaulsen Jul 16, 2019
667aba9
Merge pull request #6810 from janjust/v4.0.x
hppritcha Jul 23, 2019
aae73d9
fortran/mpif-h: fix C to Fortran error code conversion
ggouaillardet Jul 13, 2019
fbf7d31
fortran/mpif-h: fix MPI_[I]Alltoallw() binding
ggouaillardet Jul 13, 2019
5ab26e4
fortran/mpif-h: fix [i]alltoallw bindings
ggouaillardet Jul 17, 2019
13ba2b0
pcollreq/mpif-h: fix MPIX_Alltoallw_init() binding
ggouaillardet Jul 17, 2019
1ffb9b1
pcollreq/mpif-h: fix MPIX_Alltoallw_init() binding
kawashima-fj Jul 22, 2019
0422b23
Try to prevent the compiler from optimizing out MPIR_Breakpoint().
awlauria Jul 18, 2019
86794e5
Merge pull request #6843 from awlauria/mpir_opt_v4.0.x
hppritcha Jul 26, 2019
3d5b7b4
COLL/TUNED: Update alltoall selection rule for mlx
brminich Jul 2, 2019
b9998a1
COLL/TUNED: Minor var names/comments fixes
brminich Jul 24, 2019
359cdf2
osc/ucx: Fix data corruption with non-contiguous accumulates
nysal Jul 24, 2019
673ddae
Merge pull request #6845 from brminich/tuned_all2all_select-v4.0.x
hppritcha Jul 30, 2019
31aa52f
Merge pull request #6846 from nysal/topic/v4.0.x/ucx_accumulate_fix
hppritcha Aug 2, 2019
e547a2b
Merge pull request #6838 from ggouaillardet/topic/v4.0.x/misc_fortran…
hppritcha Aug 2, 2019
f68b06e
Fix incorrect behavior with length == 0
bosilca May 9, 2019
4f754d0
Optimized datatype description.
bosilca May 16, 2019
8b79423
Update the datatype dump to match the actual types.
bosilca May 8, 2019
4cdc215
Optimize the raw representation.
bosilca May 20, 2019
0a00b02
Small improvements on the test.
bosilca May 18, 2019
012a004
Clean and sync the pack and unpack functions.
bosilca May 20, 2019
78cc0ff
Disable checksum.
bosilca May 28, 2019
d5cdfe7
Optimize the position placement.
bosilca Jun 21, 2019
fad707d
Rework the datatype commit.
bosilca Jun 26, 2019
87299e0
Get rid of the division in the critical path.
bosilca Jul 10, 2019
f78d3d5
Optimize the pack/unpack.
bosilca Jul 10, 2019
9898332
Allow individual jobs to set their map/rank/bind policies
rhc54 Aug 6, 2019
f0f25b6
Fix typos
rhc54 Aug 7, 2019
6d62fb0
Merge pull request #6874 from rhc54/cmr40/lsf
gpaulsen Aug 8, 2019
2fa112c
UCX: added PPN hint for UCX context
Aug 5, 2019
cb3ed47
Merge pull request #6882 from hoopoepg/topic/ucx-ppn-hint-v4.0
gpaulsen Aug 9, 2019
167ca31
Update PMIx to official v3.1.4 release
rhc54 Aug 9, 2019
b37c85d
coll/libnbc: fixes ompi ompi_coll_libnbc_request_t parent
ggouaillardet Aug 8, 2019
ae26957
coll/base: cleanup ompi_coll_base_nbc_request_t elements
ggouaillardet Aug 8, 2019
39ec580
coll/base: only retain datatypes/op if the request has not yet completed
ggouaillardet Aug 9, 2019
e9a0343
osc/ucx: Fix possible win creation/destruction race condition
Jun 28, 2019
8087147
Merge pull request #6885 from rhc54/cmr40/pmix314
gpaulsen Aug 12, 2019
14f3fbb
Provide locality for all procs on node
rhc54 Jul 22, 2019
e17203b
Silence Coverity warning
rhc54 Aug 12, 2019
d72d39b
io_ompio_file_open: fix offset calculation with SEEK_END
edgargabriel Aug 5, 2019
83d40c1
Fix the stack displacement.
bosilca Aug 14, 2019
8e6e826
Fix the variable names used for the datatype dump.
bosilca Aug 14, 2019
c9f48e2
Whitespace cleanup
bosilca Aug 14, 2019
fd13b27
Merge pull request #6889 from ggouaillardet/topic/v4.0.x/nbc_fixes
hppritcha Aug 19, 2019
7b09c15
Merge pull request #6892 from janjust/v4.0.x-osc_fix
hppritcha Aug 19, 2019
f96994b
Merge pull request #6865 from rhc54/cmr40/locality
hppritcha Aug 19, 2019
f49c22a
Run-as-root env vars in orterun.c
simonbyrne Aug 12, 2019
549abea
orterun: remove duplicate code
jsquyres Aug 19, 2019
d3587f5
Merge pull request #6911 from jsquyres/pr/v4.0/mpirun-as-root-as-cont…
hppritcha Aug 20, 2019
390e0bc
Merge pull request #6863 from bosilca/topic/backport_6695
gpaulsen Aug 21, 2019
66e1856
SPML/UCX: fixed hang in SHMEM_FINALIZE
Aug 21, 2019
1f9fce8
SPML/UCX: fixed comment
Aug 21, 2019
e4adbee
Merge pull request #6905 from edgargabriel/pr/file-seek-end-fix-v4.0.x
hppritcha Aug 23, 2019
b07d58a
Updating VERSION for v4.0.2rc1
gpaulsen Aug 23, 2019
be67734
Merge pull request #6922 from hoopoepg/topic/fixed-hand-on-shmem-fina…
gpaulsen Aug 26, 2019
1b0cfdf
v4.0.x: regx/naive: add regx/naive component
Aug 20, 2019
197607c
Updating NEWS for v4.0.2
gpaulsen Aug 24, 2019
5744811
Merge pull request #6925 from gpaulsen/v4.0.x_VERSION_rc1
gpaulsen Aug 26, 2019
83f6c57
Merge pull request #6926 from gpaulsen/v4.0.2_NEWS
gpaulsen Aug 26, 2019
8efc6e1
Remove unnecessary error log
rhc54 Aug 27, 2019
83a2518
Coll/hcoll: fixes hcoll non-blocking colls support
Aug 27, 2019
8b3fd56
Update OPAL DDT variable names
jsquyres Aug 27, 2019
061574f
Merge pull request #6935 from vspetrov/v4.0.x_coll_hcoll_nbc_request_…
hppritcha Aug 27, 2019
8d130e1
UCX osc: properly release exclusive lock to avoid lockup
devreal Aug 27, 2019
78b8b01
Merge pull request #6938 from jsquyres/pr/v4.0.x/fix-ddt-variable-nam…
gpaulsen Aug 28, 2019
2d515f7
Merge pull request #6934 from devreal/osc-ucx-excl-lock-v4.0.x
gpaulsen Aug 29, 2019
4c1160e
Fix tree spawn routed component issue
jjhursey Aug 29, 2019
fbbacc1
Fix mmap infinite recurse in memory patcher
hkuno Jul 30, 2019
989461f
Merge pull request #6915 from sam6258/smiller_regx_none
hppritcha Aug 30, 2019
c6fe859
Merge pull request #6946 from hkuno/intercept_mmap_fix
hppritcha Aug 30, 2019
893ea3f
Merge pull request #6929 from rhc54/cmr40/pmix314
gpaulsen Aug 30, 2019
9bf1873
Fix osc sm posts when only 32 bit atomics support
guserav Jul 9, 2019
6912e09
Merge pull request #6942 from guserav/v4-fix-osc-sm-post-32-bit-atomics
hppritcha Sep 3, 2019
e2b1543
Small optimization on the datatype commit.
bosilca Aug 29, 2019
8f16780
Add a test for datatypes composed by multiple predefined
bosilca Aug 30, 2019
95cc53e
Be a little less restrictive on interface requirements
rhc54 Sep 6, 2019
287ee15
Merge pull request #6967 from rhc54/cmr40x/oob
gpaulsen Sep 9, 2019
ce228d2
Merge pull request #6952 from jsquyres/pr/v4.0.x/ddt-opt-and-fix
gpaulsen Sep 9, 2019
a482edc
Merge pull request #6944 from jjhursey/v4/fix-tree-launch
gpaulsen Sep 9, 2019
49a2558
Reving VERSION to v4.0.2rc2
gpaulsen Sep 9, 2019
84e4af5
Merge pull request #6969 from gpaulsen/topic/v4.0.x_VERSION_rc2
gpaulsen Sep 10, 2019
93dbc16
WIP: carry Tau memory_instrumentation bits to master
naughtont3 Oct 2, 2018
6d0adca
update to new opal_infosubscriber_t structure
naughtont3 Jan 4, 2019
3325308
move profile funcs/stubs out to memprof file
naughtont3 Jan 15, 2019
a6eeecf
use macros in code and add comments + fix include
naughtont3 Jan 16, 2019
2ac51fe
fix bad merge to avoid old header include
naughtont3 Mar 12, 2019
20 changes: 20 additions & 0 deletions .gitignore
@@ -199,6 +199,8 @@ ompi/mca/rte/orte/mpirun.1

ompi/mca/sharedfp/addproc/mca_sharedfp_addproc_control

ompi/mca/topo/treematch/config.h

ompi/mpi/c/profile/p*.c

ompi/mpi/fortran/configure-fortran-output.h
@@ -399,6 +401,24 @@ opal/mca/pmix/ext3x/ext3x_client.c
opal/mca/pmix/ext3x/ext3x_component.c
opal/mca/pmix/ext3x/ext3x_server_north.c
opal/mca/pmix/ext3x/ext3x_server_south.c
opal/mca/pmix/pmix3x/pmix/config/mca_library_paths.txt
opal/mca/pmix/pmix3x/pmix/config/test-driver
opal/mca/pmix/pmix3x/pmix/test/run_tests00.pl
opal/mca/pmix/pmix3x/pmix/test/run_tests01.pl
opal/mca/pmix/pmix3x/pmix/test/run_tests02.pl
opal/mca/pmix/pmix3x/pmix/test/run_tests03.pl
opal/mca/pmix/pmix3x/pmix/test/run_tests04.pl
opal/mca/pmix/pmix3x/pmix/test/run_tests05.pl
opal/mca/pmix/pmix3x/pmix/test/run_tests06.pl
opal/mca/pmix/pmix3x/pmix/test/run_tests07.pl
opal/mca/pmix/pmix3x/pmix/test/run_tests08.pl
opal/mca/pmix/pmix3x/pmix/test/run_tests09.pl
opal/mca/pmix/pmix3x/pmix/test/run_tests10.pl
opal/mca/pmix/pmix3x/pmix/test/run_tests11.pl
opal/mca/pmix/pmix3x/pmix/test/run_tests12.pl
opal/mca/pmix/pmix3x/pmix/test/run_tests13.pl
opal/mca/pmix/pmix3x/pmix/test/run_tests14.pl
opal/mca/pmix/pmix3x/pmix/test/run_tests15.pl

opal/tools/opal-checkpoint/opal-checkpoint
opal/tools/opal-checkpoint/opal-checkpoint.1
47 changes: 46 additions & 1 deletion NEWS
@@ -14,7 +14,7 @@ Copyright (c) 2006 Sun Microsystems, Inc. All rights reserved.
Use is subject to license terms.
Copyright (c) 2006-2018 Los Alamos National Security, LLC. All rights
reserved.
-Copyright (c) 2010-2018 IBM Corporation. All rights reserved.
+Copyright (c) 2010-2019 IBM Corporation. All rights reserved.
Copyright (c) 2012 Oak Ridge National Labs. All rights reserved.
Copyright (c) 2012 Sandia National Laboratories. All rights reserved.
Copyright (c) 2012 University of Houston. All rights reserved.
@@ -57,6 +57,51 @@ included in the vX.Y.Z section and be denoted as:
(** also appeared: A.B.C) -- indicating that this item was previously
included in release version vA.B.C.

4.0.2 -- September, 2019
------------------------
- Update embedded PMIx to 3.1.4
- Fix an issue with env processing when running as root.
Thanks to Simon Byrne for reporting and providing a fix.
- Fix Fortran MPI_FILE_GET_POSITION return code bug.
Thanks to Wei-Keng Liao for reporting.
- Fix user defined datatypes/ops leak in nonblocking base collective
component. Thanks to Andrey Maslennikov for verifying fix.
- Fixed shared memory not working with spawned processes.
Thanks to @rodarima for reporting.
- Fix data corruption of overlapping datatypes on sends.
Thanks to DKRZ for reporting.
- Fix segfault in oob_tcp component on close with active listeners.
Thanks to Orivej Desh for reporting and providing a fix.
- Fix divide by zero segfault in ompio.
Thanks to @haraldkl for reporting and providing a fix.
- Fix finalize of flux compnents.
Thanks to Stephen Herbein and Jim Garlick for providing a fix.
- Fix osc_rdma_acc_single_intrinsic regression.
Thanks to Joseph Schuchart for reporting and providing a fix.
- Fix hostnames with large integers.
Thanks to @perrynzhou for reporting and providing a fix.
- Fix Deadlock in MPI_Fetch_and_op when using UCX
Thanks to Joseph Schuchart for reporting.
- Fix the SLURM plm for mpirun-based launching.
Thanks to Jordon Hayes for reporting and providing a fix.
- Prevent grep failure in rpmbuild from aborting.
Thanks to Daniel Letai for reporting.
- Fix btl/vader finalize sequence.
Thanks to Daniel Vollmer for reporting.
- Fix pml/ob1 local handle sent during PUT control message.
Thanks to @EmmanuelBRELLE for reporting and providing a fix.
- Fix Memory leak with persistent MPI sends and the ob1 "get" protocol.
Thanks to @s-kuberski for reporting.
- v4.0.x: mpi: mark MPI_COMBINER_{HVECTOR,HINDEXED,STRUCT}_INTEGER
removed unless configured with --enable-mpi1-compatibility
- Fix make-authors.pl when run in a git submodule.
Thanks to Michael Heinz for reporting and providing a fix.
- Fix deadlock with mpi_assert_allow_overtaking in MPI_Issend.
Thanks to Joseph Schuchart and George Bosilca for reporting.
- Add compilation flag to allow unwinding through files that are
present in the stack when attaching with MPIR.
Thanks to James A Clark for reporting and providing a fix.

4.0.1 -- March, 2019
--------------------

24 changes: 13 additions & 11 deletions VERSION
@@ -5,7 +5,9 @@
# All rights reserved.
# Copyright (c) 2017 Los Alamos National Security, LLC. All rights
# reserved.
-# Copyright (c) 2018 IBM Corporation. All rights reserved.
+# Copyright (c) 2019 Triad National Security, LLC. All rights
+# reserved.
+# Copyright (c) 2018-2019 IBM Corporation. All rights reserved.
# Copyright (c) 2018 Research Organization for Information Science
# and Technology (RIST). All rights reserved.

@@ -28,7 +30,7 @@ release=2
# requirement is that it must be entirely printable ASCII characters
# and have no white space.

-greek=a1
+greek=rc2

# If repo_rev is empty, then the repository version number will be
# obtained during "make dist" via the "git describe --tags --always"
@@ -86,16 +88,16 @@ date="Unreleased developer copy"
# Version numbers are described in the Libtool current:revision:age
# format.

-libmpi_so_version=60:1:20
-libmpi_cxx_so_version=60:0:20
-libmpi_mpifh_so_version=60:1:20
+libmpi_so_version=60:2:20
+libmpi_cxx_so_version=60:1:20
+libmpi_mpifh_so_version=60:2:20
libmpi_usempi_tkr_so_version=60:0:20
libmpi_usempi_ignore_tkr_so_version=60:0:20
-libmpi_usempif08_so_version=60:0:20
-libopen_rte_so_version=60:1:20
-libopen_pal_so_version=60:1:20
+libmpi_usempif08_so_version=60:1:20
+libopen_rte_so_version=60:2:20
+libopen_pal_so_version=60:2:20
libmpi_java_so_version=60:0:20
-liboshmem_so_version=61:0:21
+liboshmem_so_version=62:0:22
libompitrace_so_version=60:0:20

# "Common" components install standalone libraries that are run-time
@@ -104,7 +106,7 @@ libompitrace_so_version=60:0:20
# components-don't-affect-the-build-system abstraction.

# OMPI layer
-libmca_ompi_common_ompio_so_version=60:1:19
+libmca_ompi_common_ompio_so_version=60:2:19
libmca_ompi_common_monitoring_so_version=60:0:10

# ORTE layer
@@ -113,6 +115,6 @@ libmca_orte_common_alps_so_version=60:0:20
# OPAL layer
libmca_opal_common_cuda_so_version=60:0:20
libmca_opal_common_sm_so_version=60:0:20
-libmca_opal_common_ucx_so_version=60:0:20
+libmca_opal_common_ucx_so_version=60:1:20
libmca_opal_common_ugni_so_version=60:0:20
libmca_opal_common_verbs_so_version=60:0:20
5 changes: 4 additions & 1 deletion config/ompi_check_ucx.m4
@@ -120,7 +120,8 @@ AC_DEFUN([OMPI_CHECK_UCX],[
UCP_ATOMIC_POST_OP_XOR,
UCP_ATOMIC_FETCH_OP_FAND,
UCP_ATOMIC_FETCH_OP_FOR,
-UCP_ATOMIC_FETCH_OP_FXOR],
+UCP_ATOMIC_FETCH_OP_FXOR,
+UCP_PARAM_FIELD_ESTIMATED_NUM_PPN],
[], [],
[#include <ucp/api/ucp.h>])
AC_CHECK_DECLS([UCP_WORKER_ATTR_FIELD_ADDRESS_FLAGS],
@@ -135,9 +136,11 @@ AC_DEFUN([OMPI_CHECK_UCX],[
[$1_CPPFLAGS="[$]$1_CPPFLAGS $ompi_check_ucx_CPPFLAGS"
$1_LDFLAGS="[$]$1_LDFLAGS $ompi_check_ucx_LDFLAGS"
$1_LIBS="[$]$1_LIBS $ompi_check_ucx_LIBS"
+AC_DEFINE([HAVE_UCX], [1], [have ucx])
$2],
[AS_IF([test ! -z "$with_ucx" && test "$with_ucx" != "no"],
[AC_MSG_ERROR([UCX support requested but not found. Aborting])])
+AC_DEFINE([HAVE_UCX], [0], [have ucx])
$3])
$3])

OPAL_VAR_SCOPE_POP
1 change: 1 addition & 0 deletions config/opal_configure_options.m4
@@ -120,6 +120,7 @@ else
fi
AC_DEFINE_UNQUOTED(OPAL_ENABLE_MEM_PROFILE, $WANT_MEM_PROFILE,
[Whether we want the memory profiling or not])
+AM_CONDITIONAL([OPAL_ENABLE_MEM_PROFILE], [test "$WANT_MEM_PROFILE" = "1"])

#
# Developer picky compiler options
5 changes: 5 additions & 0 deletions contrib/platform/mellanox/optimized.conf
@@ -78,3 +78,8 @@ opal_event_include=epoll

bml_r2_show_unreach_errors = 0

+# alltoall algorithm selection settings for tuned coll mca
+coll_tuned_alltoall_large_msg = 250000
+coll_tuned_alltoall_min_procs = 2048
+coll_tuned_alltoall_algorithm_max_requests = 8

3 changes: 3 additions & 0 deletions ompi/communicator/comm_init.c
@@ -35,6 +35,7 @@

#include "opal/util/bit_ops.h"
#include "opal/util/info_subscriber.h"
+#include "opal/util/memprof.h"
#include "opal/mca/pmix/pmix.h"
#include "ompi/constants.h"
#include "ompi/mca/pml/pml.h"
@@ -242,6 +243,7 @@ ompi_communicator_t *ompi_comm_allocate ( int local_size, int remote_size )

/* create new communicator element */
new_comm = OBJ_NEW(ompi_communicator_t);
+OPAL_MEMPROF_START_ALLOC(new_comm->super.s_base.obj_class->cls_name, 0, 0);
new_comm->super.s_info = NULL;
new_comm->c_local_group = ompi_group_allocate ( local_size );
if ( 0 < remote_size ) {
@@ -258,6 +260,7 @@ ompi_communicator_t *ompi_comm_allocate ( int local_size, int remote_size )

/* fill in the inscribing hyper-cube dimensions */
new_comm->c_cube_dim = opal_cube_dim(local_size);
+OPAL_MEMPROF_STOP_ALLOC(new_comm->super.s_base.obj_class->cls_name, 0);

return new_comm;
}
2 changes: 1 addition & 1 deletion ompi/datatype/ompi_datatype.h
@@ -1,6 +1,6 @@
/* -*- Mode: C; c-basic-offset:4 ; indent-tabs-mode:nil -*- */
/*
-* Copyright (c) 2009-2013 The University of Tennessee and The University
+* Copyright (c) 2009-2019 The University of Tennessee and The University
* of Tennessee Research Foundation. All rights
* reserved.
* Copyright (c) 2009 Oak Ridge National Labs. All rights reserved.
13 changes: 6 additions & 7 deletions ompi/datatype/ompi_datatype_create_contiguous.c
@@ -3,7 +3,7 @@
* Copyright (c) 2004-2006 The Trustees of Indiana University and Indiana
* University Research and Technology
* Corporation. All rights reserved.
-* Copyright (c) 2004-2013 The University of Tennessee and The University
+* Copyright (c) 2004-2019 The University of Tennessee and The University
* of Tennessee Research Foundation. All rights
* reserved.
* Copyright (c) 2004-2006 High Performance Computing Center Stuttgart,
@@ -29,13 +29,12 @@ int32_t ompi_datatype_create_contiguous( int count, const ompi_datatype_t* oldTy
{
ompi_datatype_t* pdt;

-if( 0 == count ) {
-pdt = ompi_datatype_create( 0 );
-ompi_datatype_add( pdt, &ompi_mpi_datatype_null.dt, 0, 0, 0 );
-} else {
-pdt = ompi_datatype_create( oldType->super.desc.used + 2 );
-opal_datatype_add( &(pdt->super), &(oldType->super), count, 0, (oldType->super.ub - oldType->super.lb) );
+if( (0 == count) || (0 == oldType->super.size) ) {
+return ompi_datatype_duplicate( &ompi_mpi_datatype_null.dt, newType);
}
+
+pdt = ompi_datatype_create( oldType->super.desc.used + 2 );
+opal_datatype_add( &(pdt->super), &(oldType->super), count, 0, (oldType->super.ub - oldType->super.lb) );
*newType = pdt;
return OMPI_SUCCESS;
}
4 changes: 1 addition & 3 deletions ompi/datatype/ompi_datatype_create_darray.c
@@ -192,9 +192,7 @@ int32_t ompi_datatype_create_darray(int size,
if (ndims < 1) {
/* Don't just return MPI_DATATYPE_NULL as that can't be
MPI_TYPE_FREE()ed, and that seems bad */
-*newtype = ompi_datatype_create(0);
-ompi_datatype_add(*newtype, &ompi_mpi_datatype_null.dt, 0, 0, 0);
-return MPI_SUCCESS;
+return ompi_datatype_duplicate( &ompi_mpi_datatype_null.dt, newtype);
}

rc = ompi_datatype_type_extent(oldtype, &orig_extent);