Sindhu/bfloat16 support #399

Merged: 29 commits merged into master from sindhu/bfloat16_support on Jan 30, 2020

Conversation

@sindhu-nervana (Contributor) commented Dec 20, 2019

Add support in the bridge for TF graphs with Ops that take bfloat16 inputs.

  • The bfloat16 Ops are by default assigned the XLA CPU device by TF. We register dummy kernels for the CPU device for bfloat16 data types, which makes TF assign the CPU device to these Ops; a minimal sketch of this registration follows below.
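A minimal sketch of the stub-kernel mechanism described above, assuming a no-op NGStubOp kernel and reusing one of the op names registered later in this PR; it is illustrative only, not the bridge's exact implementation:

#include "tensorflow/core/framework/op_kernel.h"
#include "tensorflow/core/lib/core/errors.h"

using namespace tensorflow;

// A do-nothing kernel: it exists only so that TF's placer sees a CPU kernel
// for bfloat16 and assigns DEVICE_CPU to the Op. The bridge later encapsulates
// the Op into an nGraph cluster, so Compute() is never expected to run.
class NGStubOp : public OpKernel {
 public:
  explicit NGStubOp(OpKernelConstruction* context) : OpKernel(context) {}
  void Compute(OpKernelContext* context) override {
    OP_REQUIRES(context, false,
                errors::Internal("NGStubOp::Compute should never be called"));
  }
};

// Register the stub on the CPU device, constrained to bfloat16 inputs.
REGISTER_KERNEL_BUILDER(
    Name("NGraphAssignAdd").Device(DEVICE_CPU).TypeConstraint<bfloat16>("T"),
    NGStubOp);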

sindhu-nervana and others added 4 commits December 11, 2019 14:30
- Enabled --var build to use parallel executor integrating weights-on-device and data pipelining
- moved ngraph_var files outside the var build
@dbonner commented Jan 19, 2020

Will your work help with this issue #447?
The PlaidML backend won't build because the BFLOAT16 enum case is not handled in switch (dt).

REGISTER_NGRAPH_STUB_KERNEL("NGraphApplyMomentum");
REGISTER_NGRAPH_STUB_KERNEL(
"NGraphAssignAdd"); //*input[0] = *input[0] + input[1]
REGISTER_NGRAPH_STUB_KERNEL(

REGISTER_NGRAPH_STUB_KERNEL registers only for the bfloat16 type? Since its definition is:

#define REGISTER_NGRAPH_STUB_KERNEL(optype)                          \
  REGISTER_KERNEL_BUILDER(                                           \
      Name(optype).Device(DEVICE_CPU).TypeConstraint<bfloat16>("T"), \
      NGStubOp);

In that case, is this replacement okay?

@shresthamalik (Contributor) commented Jan 29, 2020

Changed the macros; the names were not matching the definition.
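For reference, the TypeConstraint<bfloat16>("T") clause in the definition above is what restricts the stub registration to the bfloat16 type. A hypothetical type-parameterized variant (purely illustrative, not part of this PR) would look like:

// Hypothetical variant taking the dtype as a macro parameter, so the same
// stub kernel could be registered for types other than bfloat16 if needed.
#define REGISTER_NGRAPH_STUB_KERNEL_FOR_TYPE(optype, type)      \
  REGISTER_KERNEL_BUILDER(                                       \
      Name(optype).Device(DEVICE_CPU).TypeConstraint<type>("T"), \
      NGStubOp);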

@@ -223,6 +223,9 @@ Status TensorToStream(std::ostream& ostream, const Tensor& tensor) {
case DT_BOOL:
TensorDataToStream<bool>(ostream, n_elements, data);
break;
case DT_BFLOAT16:
TensorDataToStream<bool>(ostream, n_elements, data);
@sayantan-nervana (Contributor) commented Jan 29, 2020

It says <bool> in the template; a copy-paste error, perhaps.


Good catch. Not sure what the corresponding data type for bfloat16 is.


We can throw an error or return a bad status for now, I guess.
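A minimal sketch of the "return a bad status" option discussed here, written as a standalone helper; the function name CheckDumpableDType and the exact header paths are assumptions, not the PR's code:

#include "tensorflow/core/framework/types.pb.h"
#include "tensorflow/core/lib/core/errors.h"
#include "tensorflow/core/lib/core/status.h"

using namespace tensorflow;

// Mirrors the switch in TensorToStream: dtypes without a supported
// TensorDataToStream instantiation (including DT_BFLOAT16 for now) produce a
// non-OK Status instead of silently reusing another type's instantiation.
Status CheckDumpableDType(DataType dt) {
  switch (dt) {
    case DT_FLOAT:
    case DT_DOUBLE:
    case DT_INT32:
    case DT_BOOL:
      return Status::OK();
    case DT_BFLOAT16:
      return errors::Unimplemented(
          "TensorToStream does not support DT_BFLOAT16 yet");
    default:
      return errors::Unimplemented("Unsupported DataType: ",
                                   DataType_Name(dt));
  }
}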


Done

@kanvi-nervana (Contributor) left a comment

Minor comments

Comment on lines 59 to 64
// Since nGraph-bridge OPs work on TF DEVICE_CPU we are registering stub
// bfloat16
// kernels here. The expectation is when we register the stub kernels for
// bfloat16
// TF is going to assign DEVICE_CPU to the respective Ops and we will
// encapsulate them
Suggested rewording:

// Since nGraph-bridge OPs work on TF DEVICE_CPU we are registering stub
// bfloat16 kernels here. The expectation is when we register the stub kernels
// for bfloat16 TF is going to assign DEVICE_CPU to the respective Ops and
// we will encapsulate them

@@ -0,0 +1,131 @@
# ==============================================================================
# Copyright 2019 Intel Corporation
Suggested change:

Copyright 2019-2020 Intel Corporation

@sayantan-nervana added the "ready to merge" label (This PR is the next in the queue.) on Jan 30, 2020
@shresthamalik removed their request for review on January 30, 2020 20:08
@shresthamalik (Contributor) left a comment

LGTM

@shresthamalik merged commit 310ca25 into master on Jan 30, 2020
@shresthamalik deleted the sindhu/bfloat16_support branch on January 30, 2020 20:09
Labels: ready to merge (This PR is the next in the queue.)
5 participants