GDCN implementation #716

an-tran528 · 2024-05-05T02:17:58Z

I'm trying to search around for the implementation GDCN, an updated version for DCN but seems like it's not yet supported.

I'm trying to tweak the Cross layer implementation by adding gate layers with sigmoid activation:

      self._gate_u = tf.keras.layers.Dense(
          self._projection_dim,
          kernel_initializer=_clone_initializer(self._kernel_initializer),
          kernel_regularizer=self._kernel_regularizer,
          use_bias=False,
          dtype=self.dtype,
      )
      self._gate_v = tf.keras.layers.Dense(
          last_dim,
          kernel_initializer=_clone_initializer(self._kernel_initializer),
          bias_initializer=self._bias_initializer,
          kernel_regularizer=self._kernel_regularizer,
          bias_regularizer=self._bias_regularizer,
          use_bias=self._use_bias,
          dtype=self.dtype,
          activation="sigmoid",
      )
    ....
def call:
    return x0 * prod_output + self._gate_v(self._gate_u(x)) + x

But loss doesn't converge for my use case. Is the implementation correct?

The text was updated successfully, but these errors were encountered:

zhangfan555 · 2024-08-26T20:45:57Z

From the paper, it should be "x0 * prod_output * self._gate_v(self._gate_u(x)) + x" ?

rlcauvin · 2024-08-27T20:45:49Z

I'd be interested in seeing the full code for the GDCN once you get it working. Hopefully, the correction from @zhangfan555 will make the loss converge.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GDCN implementation #716

GDCN implementation #716

an-tran528 commented May 5, 2024

zhangfan555 commented Aug 26, 2024

rlcauvin commented Aug 27, 2024

GDCN implementation #716

GDCN implementation #716

Comments

an-tran528 commented May 5, 2024

zhangfan555 commented Aug 26, 2024

rlcauvin commented Aug 27, 2024