From 19c425d9fedd61c3a008b77dd6dbbad6b839d893 Mon Sep 17 00:00:00 2001 From: xwuupb Date: Sun, 5 Apr 2020 21:40:51 +0200 Subject: [PATCH] finish 05_saxpy --- README.md | 23 +++-------------------- 1 file changed, 3 insertions(+), 20 deletions(-) diff --git a/README.md b/README.md index ad3a857..76ee2e2 100644 --- a/README.md +++ b/README.md @@ -49,27 +49,10 @@ Paderborn University. The sub-directories are generally organized as: `scalarAddition` adds two integers on host and accelerator, and also compares the performance. -* 05_saxpy_v1 +* 05_saxpy - `saxpy` performs the `axpy` operation on host as well as accelerator and then - compares the FLOPS performance. - - The `axpy` operation is defined as: - - $$ y := a * x + y $$ - - where: - - - `a` is a scalar. - - `x` and `y` are vectors each with n elements. - - The initial value of `a` and elements of `x[]` and `y[]` are specially - designed, so that the floating-point calculations on host and accelerator - can be compared _exactly_. - - Please note that only _one GPU thread_ is used for the `axpy` calculation on - accelerator in this version. This can be verified by uncomment the `CFLAGS` - line in `configure.ac`. + `saxpy` performs the `saxpy` operation on host as well as accelerator. + The performance (in MB/s) for different implementations is also compared. * 08_distThreads