<!doctype html>
<html>
<head>
<meta charset="utf-8">
<meta http-equiv="X-UA-Compatible" content="chrome=1">
<title>OP DSL - Papers</title>
<link rel="stylesheet" href="stylesheets/styles.css">
<link rel="stylesheet" href="stylesheets/pygment_trac.css">
<meta name="viewport" content="width=device-width, initial-scale=1, user-scalable=no">
<!--[if lt IE 9]>
<script src="//html5shiv.googlecode.com/svn/trunk/html5.js"></script>
<![endif]-->
</head>
<body>
<div class="wrapper">
<header>
<h1>
<a href="https://op-dsl.github.io/">
<img src="https://op-dsl.github.io/images/op-dsl_logo_small.png" width="300px">
</a>
</h1>
<hr/>
<h1><a href="index.html">&lt; Home</a></h1>
<hr />
<h1><a href="codes.html">Users &amp; Codes</a></h1>
<hr />
<h1><a href="projects.html">Projects</a></h1>
<hr />
<h1><a href="papers.html">Papers</a></h1>
<hr />
<h1><a href="about.html">About</a></h1>
<hr />
<p class="view"><a href="https://github.com/OP-DSL">View My GitHub Profile</a></p>
</header>
<section>
<h1>OP DSL - Papers</h1><hr />
<ul>
<li>I.Z. Reguly, A.M.B. Owenson, A. Powell, S.A. Jarvis, and G.R. Mudalige, <em>Under the Hood of SYCL – An Initial Performance Analysis With an Unstructured-mesh CFD Application</em>, International Supercomputing Conference (ISC 2021), June 2021. (<a href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/cameraready_13april2021-pap153s3-file1.pdf">PDF</a>)</li>
<li>I.Z. Reguly and G.R. Mudalige, <em>Productivity, Performance, and Portability for Computational Fluid Dynamics Applications</em>, Computers and Fluids, 2020, 104425, ISSN 0045-7930. doi: <a href="https://doi.org/10.1016/j.compfluid.2020.104425">10.1016/j.compfluid.2020.104425</a>. (<a href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/regulymudalige_computersandfluids2020.pdf">PDF</a>)</li>
<li>A.A. Sulyok, G.D. Balogh, I.Z. Reguly, G.R. Mudalige, (2019) <em>Locality Optimized Unstructured Mesh Algorithms on GPUs</em>. Journal of Parallel and Distributed Computing. Volume 134, pp 50-64. doi: <a href="https://doi.org/10.1016/j.jpdc.2019.07.011">10.1016/j.jpdc.2019.07.011</a> (<a href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/jpdc2019_acceptedpreprint.pdf">PDF</a>)</li>
<li>G.R. Mudalige, I.Z. Reguly, S.P. Jammy, C.T. Jacobs, M.B. Giles and N.D. Sandham, (2019) <em>Large-scale performance of a DSL-based multi-block structured-mesh application for direct numerical simulation</em>. Journal of Parallel and Distributed Computing. Volume 131, pp 130-146. doi:<a href="http://dx.doi.org/10.1016/j.jpdc.2019.04.019">10.1016/j.jpdc.2019.04.019</a> (<a href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/ops-opensbli.pdf">PDF</a>)</li>
<li>I.Z. Reguly, G.R. Mudalige, M.B. Giles and S. Maheswaran (2019) <em>Improving resilience of scientific software through a domain-specific approach</em>. Journal of Parallel and Distributed Computing, Volume 128, pp. 99-114. doi:
<a href="http://dx.doi.org/10.1016/j.jpdc.2019.01.015">10.1016/j.jpdc.2019.01.015</a> (<a href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/paper.pdf">PDF</a>)</li>
<li>I.Z. Reguly, B. Moore, T. Schmielau, J. du Toit, G.R. Mudalige. (2019) <em>Batch Solution of Small PDEs with the OPS DSL</em>. In: M. Weiland, G. Juckeland, S. Alam, H. Jagode. (eds) High Performance Computing. ISC High Performance 2019. Lecture Notes in Computer Science, vol 11887. Springer, Cham. (<a href="http://wrap.warwick.ac.uk/136839/8/WRAP-batch-solution-small-PDEs-with-OPS-DSL-Mudalige-2020.pdf">PDF</a>)</li>
<li>G. D. Balogh, G. R. Mudalige, I. Z. Reguly, S. F. Antao and C. Bertolli, <em>OP2-Clang: A Source-to-Source Translator Using Clang/LLVM LibTooling</em>, 2018 IEEE/ACM 5th Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC), Dallas, TX, USA, 2018, pp. 59-70. doi: <a href="https://doi.org/10.1109/LLVM-HPC.2018.8639205">10.1109/LLVM-HPC.2018.8639205</a> (<a href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/balogh_op2clang_cameraready.pdf">PDF</a>)</li>
<li>B. Siklosi, I.Z. Reguly and G.R. Mudalige. <em>Heterogeneous CPU-GPU Execution of Stencil Applications.</em> In proceedings of the International Workshop on Performance, Portability and Productivity in HPC (P3HPC) held as part of SC18, The International Conference for High Performance Computing, Networking, Storage and Analysis (<a href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/heterogeneous-cpu-gpu_cameraready.pdf">PDF</a>)</li>
<li>I.Z. Reguly, G.R. Mudalige, M.B. Giles. <em>Loop Tiling in Large-Scale Stencil Codes at Run-time with OPS</em>. (2017) IEEE Transactions on Parallel and Distributed Systems, doi:10.1109/TPDS.2017.2778161 (<a href="https://arxiv.org/abs/1704.00693">arXiv preprint</a> / accepted <a href="http://wrap.warwick.ac.uk/95559/9/WRAP-loop-tiling-large-scale-stencil-codes-run-time-OPS-Mudalige-2017.pdf">PDF</a> and <a href="http://wrap.warwick.ac.uk/95559/2/WRAP-supplementary-material.pdf">supplementary material</a>)</li>
<li>G.D. Balogh, I.Z. Reguly, G.R. Mudalige. <em>Comparison of Parallelisation Approaches, Languages, and Compilers for Unstructured Mesh Algorithms on GPUs</em>. In proceedings of the 8th International Workshop on Performance Modeling, Benchmarking, and Simulation (PMBS 2017), held in conjunction with IEEE/ACM Supercomputing 2017(SC'17) Denver, CO, USA, November 13, 2017 (<a href="https://arxiv.org/abs/1711.01845">arXiv preprint</a> / <a href="https://arxiv.org/pdf/1711.01845">PDF</a>)</li>
<li>I.Z. Reguly, G.R. Mudalige and M.B. Giles. (2017). <em>Beyond 16GB: Out-of-Core Stencil Computations</em>. In Proceedings of the Workshop on Memory Centric Programming for HPC (MCHPC'17). Held in conjunction with IEEE/ACM Supercomputing 2017(SC'17) Denver, CO, USA, November 12, 2017. ACM, New York, NY, USA, 20-29. DOI: <a href="https://doi.org/10.1145/3145617.3145619">10.1145/3145617.3145619</a> (<a href="https://arxiv.org/pdf/1709.02125">preprint PDF</a>) <br />
</li>
<li>R.O. Kirk, G.R. Mudalige, I.Z. Reguly, S.A. Wright, M.J. Martineau and S.A. Jarvis, <em>Achieving Performance Portability for a Heat Conduction Solver Mini-Application on Modern Multi-core Systems</em>, 2017 IEEE International Conference on Cluster Computing (CLUSTER), Honolulu, HI, 2017, pp. 834-841. doi: <a href="https://doi.org/10.1109/CLUSTER.2017.122">10.1109/CLUSTER.2017.122</a> (<a href="http://wrap.warwick.ac.uk/91037/">preprint PDF</a>) </li>
<li>S.P. Jammy, G.R. Mudalige, I.Z. Reguly, N.D. Sandham, M.B. Giles. <em>Block-structured compressible Navier–Stokes solution using the OPS high-level abstraction</em>. (2016) International Journal of Computational Fluid Dynamics 30 (6), 450-454, doi:10.1080/10618562.2016.1243663 (<a href="http://wrap.warwick.ac.uk/86959/6/WRAP_Block-structured-compressible.pdf">PDF</a>)
</li>
<li>
<div>G.R. Mudalige, I.Z. Reguly and M.B. Giles. <em> Auto-Vectorizing a Large-scale Production Unstructured-mesh CFD Application. </em> In proceedings of the 3rd Workshop on Programming Models for SIMD/Vector Processing (WPMVP 2016) held as part of ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP16), Barcelona, Spain, March 2016 (<a href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/grm-wpmvp.pdf" style="background-color: #ffffff;">PDF</a>)</div>
</li>
<li>
<div><span class="author"> I. Z. Reguly </span> , E. <span class="author"> László </span> , G.R. <span class="author"> Mudalige, </span> and M.B. <span class="author"> Giles. </span> (<span class="pubYear">2015</span>) <span class="articleTitle"> Vectorizing unstructured mesh computations for many-core architectures </span> . <span class="journalTitle"> Concurrency Computat.: Pract. Exper.</span>, doi: <a class="accessionId" title="Link to external resource: 10.1002/cpe.3621" shape="rect" href="http://dx.doi.org/10.1002/cpe.3621"> 10.1002/cpe.3621 </a> .</div>
</li>
<li>
<div>I. Z. Reguly, G. R. Mudalige, C. Bertolli, M. B. Giles, A. Betts, P. H. J. Kelly, and D. Radford. <em> Acceleration of a Full-scale Industrial CFD Application with OP2 </em> . (2015) IEEE Transactions on Parallel and Distributed Systems, doi:10.1109/TPDS.2015.2453972 (<a href="http://arxiv.org/abs/1403.7209">arXiv</a> / <a href="http://ora.ox.ac.uk/objects/uuid:1e2274c5-1301-4f5b-8060-30c91d9609ca">data</a> / pre-print <a href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/op2-hydra.pdf"> PDF</a>)</div>
</li>
<li>I.Z. Reguly, G.R. Mudalige and M.B. Giles, <em>Design and Development of Domain Specific Active Libraries with Proxy Applications</em>, In proceedings of the Workshop on Representative Applications (WRAp), 2015 IEEE International Conference on Cluster Computing, Chicago, IL, 2015, pp. 738-745. doi: 10.1109/CLUSTER.2015.128 (<a href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/representative-v1.pdf">PDF</a>)</li>
<li>
<div>S.P. Jammy, N.D. Sandham, G.R. Mudalige, I.Z. Reguly, M.B. Giles. <em> Block Structured Compressible Navier Stokes Solution Using the OPS High-Level Abstraction </em> , 27th International Conference on Parallel CFD, May 2015, Montreal, Canada (<a href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/parcfd_2015.pdf">PDF</a>)</div>
</li>
<li>
<div> G.R. Mudalige, I. Reguly, M. Giles, A. Mallinson, W. Gaudin, and J. Herdman, In proceedings of the 5th International Workshop on Performance Modeling, Benchmarking, and Simulation (PMBS 2014), Held in conjunction with IEEE/ACM Supercomputing 2014(SC'14) New Orleans, LA, USA, November 16, 2014. Revised Selected Papers. Springer, 2015, vol. 8966, no. 1, ch. <em> Performance Analysis of a High-level Abstractions-based Hydrocode on Future Computing Systems </em> , pp. 85–104. ISBN 978-3-319-17248-4 (pre-print <a href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/ops-pmbs2014.pdf" style="line-height: 15.6000003814697px;"> PDF</a>)</div>
</li>
<li>
<div>I.Z. Reguly, G.R. Mudalige, M.B. Giles, D. Curran and S. McIntosh-Smith, <em> The OPS Domain Specific Abstraction for Multi-Block Structured Grid Computations </em> . In proceedings of the 4th international workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing (WOLFHPC '14). Held in conjunction with IEEE/ACM Supercomputing 2014 (SC'14). (<a href="docs/OPS/wolfhpc_ops.pdf"> PDF</a>)</div>
</li>
<li>
<div>I.Z. Reguly, E. Laszlo, G.R. Mudalige, M.B. Giles, <em> Vectorizing Unstructured Mesh Computations for Many-core Architectures </em> , PMAM’14, Proceedings of Programming Models and Applications on Multicores and Manycores, February 2014 ( <a href="http://dl.acm.org/citation.cfm?id=2560686"> online </a> ) (pre-print <a href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/p39-reguly.pdf"> PDF</a>)</div>
</li>
<li>
<div> G.R. Mudalige, M.B. Giles, J. Thiyagalingam, I. Reguly, C. Bertolli, P.H.J. Kelly and A.E. Trefethen, <em> Design and Initial Performance of a High-level Unstructured Mesh Framework on Heterogeneous Parallel Systems. </em> Parallel Comput. (2013). doi: <a href="http://dx.doi.org/10.1016/j.parco.2013.09.004">10.1016/j.parco.2013.09.004</a> (<a class="external-link" href="http://authors.elsevier.com/sd/article/S0167819113001166">online</a>) (<a title="PARCO-OP2-Preprint" href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/parco-final-op2.pdf">PDF</a>)</div>
</li>
<li>
<div>C. Bertolli, A. Betts, N. Loriant, G.R. Mudalige, D. Radford, D.A. Ham, M.B. Giles, and P.H.J. Kelly. <em> Compiler Optimizations for Industrial Unstructured Mesh CFD Applications on GPUs </em> , Languages and Compilers for Parallel Computing, pp.112-126, Springer, 2013. (<a href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/op2-lcpc.pdf">PDF</a>)</div>
</li>
<li>
<div>M.B. Giles, G.R. Mudalige, B. Spencer, C. Bertolli, I. Reguly, <em> Designing OP2 for GPU architectures </em> , Journal of Parallel and Distributed Computing, Volume 73, Issue 11, November 2013, Pages 1451-1460, ISSN 0743-7315. ( <a class="external-link" href="http://dx.doi.org/10.1016/j.jpdc.2012.07.008"> PDF</a>) (pre-print <a href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/jpdc-op2.pdf"> PDF</a>)</div>
</li>
<li>
<div>M.B. Giles, G.R. Mudalige, C. Bertolli, P.H.J. Kelly, E. Laszlo, and I. Reguly, <em> An Analytical Study of Loop Tiling for a Large-Scale Unstructured Mesh Application </em> . SC Companion 2012: 477-482. (<a href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/gmbklr-pmbs2012.pdf">PDF</a>)</div>
</li>
<li>
<div> G.R. Mudalige, I. Reguly, M.B. Giles, C. Bertolli and P.H.J. Kelly. <em> OP2: An Active Library Framework for Solving Unstructured Mesh-based Applications on Multi-Core and Many-Core Architectures. </em> In Proceedings of Innovative Parallel Computing (InPar), 2012, pp.1-12, 13-14 May 2012. (<a href="docs/OPS/InPar2012.pdf"><span class="internal-link">PDF</span></a>)</div></li>
<li>
<div> G.R. Mudalige, M.B. Giles, C. Bertolli, and P.H.J. Kelly. <em> Predictive Modeling and Analysis of OP2 on Distributed Memory GPU Clusters </em> . SIGMETRICS Perform. Eval. Rev. 40, 2 :61-67 (2012). (<a href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/per2012.pdf"><span class="internal-link">PDF</span></a>)</div>
</li>
<li>
<div>M.B. Giles, G.R. Mudalige, Z. Sharif, G. Markall, P.H.J. Kelly. <em> Performance Analysis and Optimization of the OP2 Framework on Many-core Architectures </em> (2012) Computer Journal, 55 (2), pp. 168-180. ISSN 0010-4620. (<a class="external-link" href="http://comjnl.oxfordjournals.org/cgi/reprint/bxr062?%20ijkey=m3uIelhqOe6ggoq&keytype=ref" style="line-height: 1.2;">PDF</a>) (pre-print <a href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/op2-cj.pdf"> PDF</a>)</div>
</li>
<li>
<div>C. Bertolli, A. Betts, P.H.J. Kelly, G.R. Mudalige, and M.B. Giles. <em> Mesh Independent Loop Fusion for Unstructured Mesh Applications </em> in Proceedings of the 9th Conference on Computing Frontiers (CF ’12). pp.43-52 ACM, 2012. (<a href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/cff84-bertolli.pdf">PDF</a>)</div>
</li>
<li>
<div>C. Bertolli, A. Betts, G.R. Mudalige, M.B. Giles, P.H.J. Kelly. <em> Design and Performance of the OP2 Library for Unstructured Mesh Applications </em> , Euro-Par 2011 Parallel Processing Workshops, Lecture Notes in Computer Science, Springer, 2012. (<a href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/op2-cgws.pdf">PDF</a>)</div>
</li>
<li>
<div> G.R. Mudalige, M.B. Giles, C. Bertolli, and P.H.J. Kelly. 2011. <em> Predictive Modeling and analysis of OP2 on distributed memory GPU clusters </em> . In Proceedings of the second international workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computing Systems (PMBS '11). ACM, New York, NY, USA, 3-4. Held in conjunction with IEEE/ACM Supercomputing 2011 (SC'11), Seattle, WA, USA
(<a href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/pmbs2011_predictivemodellingop2.pdf">PDF</a>)
</div>
</li>
<li>
<div>M.B. Giles, G.R. Mudalige, Z. Sharif, G. Markall, P.H.J. Kelly. <em> Performance Analysis of the OP2 Framework on Many-core Architectures </em> . (2011). ACM SIGMETRICS Perform. Eval. Rev. 38 (4) ISSN 0163-5999. (<a title="PER-March-2011" href="https://warwick.ac.uk/fac/sci/dcs/people/gihan_mudalige/per2011-cameraready.pdf" style="line-height: 1.2;">PDF</a>)</div>
</li>
</ul>
<hr />
<p><small>Hosted on GitHub Pages — Theme by <a href="https://github.com/orderedlist">orderedlist</a></small></p>
</section>
</div>
<script src="javascripts/scale.fix.js"></script>
<!-- <script type="text/javascript">
var gaJsHost = (("https:" == document.location.protocol) ? "https://ssl." : "http://www.");
document.write(unescape("%3Cscript src='" + gaJsHost + "google-analytics.com/ga.js' type='text/javascript'%3E%3C/script%3E"));
</script>
<script type="text/javascript">
try {
var pageTracker = _gat._getTracker("UA-36190086-1");
pageTracker._trackPageview();
} catch(err) {}
</script>
-->
</body>
</html>