
apply_gradients AttributeError: 'ResourceVariable' object has no attribute 'overwrite_with_gradient' #20517

Closed
andrewl36 opened this issue Nov 19, 2024 · 5 comments · Fixed by #20534

@andrewl36

When I have a mix of tf.Variable and KerasVariable objects, I get the following error:

--> 632     if v.overwrite_with_gradient:
    633         if self.gradient_accumulation_steps:
    634             # Utilize a stateless manner for JAX compatibility
    635             steps = self.gradient_accumulation_steps

AttributeError: 'ResourceVariable' object has no attribute 'overwrite_with_gradient'

I suspect this is because my list of variables is [KerasVariables] + [tf.Variables],
and the following line only checks the first variable in the list to decide whether overwrite_with_gradient can be used:

if not hasattr(vars[0], "overwrite_with_gradient"):
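
A minimal sketch of that hypothesis (a hypothetical standalone script, not the actual Keras optimizer source): with a mixed list, a hasattr check on the first element passes, but iterating the whole list still reaches the plain tf.Variable.

import tensorflow as tf
import keras

keras_var = keras.Variable(0.0)  # Keras 3 variable; exposes overwrite_with_gradient
tf_var = tf.Variable(0.0)        # plain ResourceVariable; does not

variables = [keras_var, tf_var]

# The guard passes because the first element is a KerasVariable...
if hasattr(variables[0], "overwrite_with_gradient"):
    for v in variables:
        # ...but the loop then hits the tf.Variable and raises
        # AttributeError, matching the reported traceback.
        print(v.overwrite_with_gradient)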

@sachinprasadhs
Collaborator

Could you please provide a sample reproducible script to replicate the reported behavior? Thanks!

@andrewl36
Author

> Could you please provide a sample reproducible script to replicate the reported behavior? Thanks!

@sachinprasadhs Sure, please try this cut-down example showing the problem:

import numpy as np
import tensorflow as tf

class MyModel(tf.keras.Model):
    def __init__(self):
        super().__init__()
        
        # Keras model Layers
        self.hidden_layers = [tf.keras.layers.Dense(32, activation='tanh') for _ in range(2)]
        self.output_layer = tf.keras.layers.Dense(1)
        
        # Custom variable
        self.my_var = tf.Variable(0.1, trainable=True, dtype=tf.float32, name="my_var")

    def call(self, inputs):
        x = inputs
        for layer in self.hidden_layers:
            x = layer(x)
        return self.output_layer(x)
    
data = np.array([
    [0.0,    10.4],
    [900.0,  21.1],
    [3900.0, 64.2],
])

model   = MyModel()
inputs  = data[:, 0:1]
outputs = data[:, 1:]

epochs = 1000
learning_rate = 0.005
optimizer = tf.keras.optimizers.Adam(learning_rate=learning_rate)

for epoch in range(epochs):
    with tf.GradientTape() as tp:
        y_pred = model(inputs)
        loss   = tf.reduce_mean(tf.square((outputs - y_pred)))
    
    # Mixed list of the layers' Keras variables and the raw tf.Variable
    params = model.trainable_variables + [model.my_var]
        
    gradients = tp.gradient(loss, params)
    optimizer.apply_gradients(zip(gradients, params))
    del tp

@james77777778
Contributor

james77777778 commented Nov 22, 2024

@andrewl36
Try this one:

import numpy as np
import tensorflow as tf

import keras


class MyModel(keras.Model):
    def __init__(self):
        super().__init__()

        # Keras model Layers
        self.hidden_layers = [
            keras.layers.Dense(32, activation="tanh") for _ in range(2)
        ]
        self.output_layer = keras.layers.Dense(1)

        # Custom variable
        self.my_var = self.add_weight(shape=(), dtype="float32", name="my_var")
        self.my_var.assign(0.1)

    def call(self, inputs):
        x = inputs
        for layer in self.hidden_layers:
            x = layer(x)
        return self.output_layer(x)


data = np.array(
    [
        [0.0, 10.4],
        [900.0, 21.1],
        [3900.0, 64.2],
    ]
)

model = MyModel()
inputs = data[:, 0:1]
outputs = data[:, 1:]
epochs = 1000
learning_rate = 0.005
optimizer = keras.optimizers.Adam(learning_rate=learning_rate)

pbar = keras.utils.Progbar(epochs)
for epoch in range(epochs):
    with tf.GradientTape() as tp:
        y_pred = model(inputs)
        loss = tf.reduce_mean(tf.square((outputs - y_pred)))
    gradients = tp.gradient(loss, model.trainable_variables)
    optimizer.apply_gradients(zip(gradients, model.trainable_variables))
    values = {"loss": loss.numpy()}
    pbar.add(1, values.items())

The key is to use self.add_weight for creating custom variables.
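
A quick way to see the difference (a standalone sketch, assuming Keras 3 with the TensorFlow backend): a variable created through add_weight exposes overwrite_with_gradient, while a raw tf.Variable does not, which is exactly the attribute the optimizer trips over.

import tensorflow as tf
import keras

layer = keras.layers.Layer()
keras_var = layer.add_weight(shape=(), dtype="float32", name="keras_var")
raw_var = tf.Variable(0.1, name="raw_var")

# Expected: True for the add_weight variable, False for the raw tf.Variable
print(hasattr(keras_var, "overwrite_with_gradient"))
print(hasattr(raw_var, "overwrite_with_gradient"))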

EDITED:
I have submitted a PR to fix this.

@andrewl36
Author

@james77777778 thank you, yes that does work now, cheers
