Leaky relu #468

Merged 39 commits into NeuralNetworkVerification:master from leaky-relu on Feb 6, 2024
Conversation

wu-haoze (Collaborator) commented Jul 5, 2021

Adding leaky relu support:

  1. Support leaky relu in DeepPoly
  2. Support solving the leaky relu constraint in the MILP encoding

----update----
  3. Also support solving leaky relu in DeepSoI
  4. Added regression tests on leaky relu networks.
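
For reference, leaky relu behaves like relu except that negative inputs are scaled by a small slope instead of being clamped to zero. A minimal standalone sketch of the activation (illustrative only, not the Marabou constraint class; the slope value is an arbitrary example):

#include <iostream>

// y = x for x >= 0, and y = slope * x for x < 0.
// A positive slope no greater than 1 is assumed here, matching the
// restriction discussed later in this thread.
double leakyRelu( double x, double slope )
{
    return x >= 0 ? x : slope * x;
}

int main()
{
    std::cout << leakyRelu( 3.0, 0.01 ) << " " << leakyRelu( -3.0, 0.01 ) << std::endl; // prints "3 -0.03"
    return 0;
}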

@wu-haoze wu-haoze requested a review from guykatzz July 5, 2021 18:54
xPts,
yPts );
}
catch (GRBException e )
Collaborator:
space after (.
Please refrain from abbreviations: pts --> points

Collaborator Author:
fixed

}
}


Collaborator:
remove empty line

Collaborator Author:
fixed

@@ -71,6 +71,12 @@ class GurobiWrapper
// Add a new EQ constraint, e.g. 3x + 4y = -5
void addEqConstraint( const List<Term> &terms, double scalar );

// Add a piece-wise linear constraint
Collaborator:
Is this a new Gurobi interface we're starting to use? Perhaps all existing encodings should use it, too?

Collaborator Author:
I agree it's worth studying whether we should switch to Gurobi PWL constraints altogether.
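
For context, the earlier snippet with xPts/yPts and the new addPiecewiseLinearConstraint declaration presumably wrap Gurobi's piecewise-linear general constraints. A minimal standalone sketch of that Gurobi API (the bounds, breakpoints, and 0.1 slope below are made-up illustration values, not taken from the PR):

#include "gurobi_c++.h"

int main()
{
    GRBEnv env;
    GRBModel model( env );

    GRBVar x = model.addVar( -5.0, 5.0, 0.0, GRB_CONTINUOUS, "x" );
    GRBVar y = model.addVar( -GRB_INFINITY, GRB_INFINITY, 0.0, GRB_CONTINUOUS, "y" );

    // Breakpoints of y = LeakyReLU( x ) with slope 0.1 over the input range [-5, 5].
    double xPoints[] = { -5.0, 0.0, 5.0 };
    double yPoints[] = { -0.5, 0.0, 5.0 };
    model.addGenConstrPWL( x, y, 3, xPoints, yPoints, "leaky_relu_pwl" );

    model.optimize();
    return 0;
}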

@@ -23,6 +23,8 @@ class GlobalConfiguration
public:
static void print();

static const double LEAKY_RELU_SLOPE;
Collaborator:
Is it a single slope for all leaky relus in the query? Because the Python interface suggested each could have a separate slope.

Collaborator Author:
I agree that we should allow each leaky relu to have its own slope.


List<NeuronInformation> newNeurons;

// Look for ReLUs where all b variables have already been handled
Collaborator:
leaky relus?

Collaborator Author:
fixed

@@ -91,7 +94,7 @@ void MILPEncoder::encodeEquation( GurobiWrapper &gurobi, const Equation &equatio
}
}

void MILPEncoder::encodeReLUConstraint( GurobiWrapper &gurobi, ReluConstraint *relu)
void MILPEncoder::encodeReLUConstraint( GurobiWrapper &gurobi, ReluConstraint *relu )
{

Collaborator:
remove empty line

@@ -49,6 +49,7 @@ class MarabouError : public Error
NETWORK_LEVEL_REASONER_ACTIVATION_NOT_SUPPORTED = 24,
NETWORK_LEVEL_REASONER_NOT_AVAILABLE = 24,
REQUESTED_NONEXISTENT_CASE_SPLIT= 25,
INVALID_LEAKY_RELU_SLOPE= 26,
Collaborator:
Space before =. For the previous line also.

_layer = layer;
_size = layer->getSize();
_layerIndex = layer->getLayerIndex();
_alpha = layer->getAlpha();
Collaborator:
Again, I think slope is better than alpha.
Are we sure all leaky relus in the layer have the same slope?

Collaborator Author:
Renamed slope to _alpha in the DeepPoly LeakyReLUElement.

NeuronIndex sourceIndex = *_neuronToActivationSources[i].begin();
double inputValue = _layerOwner->getLayer( sourceIndex._layer )->getAssignment( sourceIndex._neuron );

_assignment[i] = FloatUtils::max( inputValue, slope * inputValue );
Collaborator:
Unclear: is slope always <= 1?
Even if so, let's make this more comprehensible.

Collaborator Author:
Yes, currently we need the slope to be <= 1, otherwise the symbolic bounds would be incorrect. I added an assertion, and we also throw an error when this requirement is not met while constructing a LeakyReLU constraint.
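
To make the exchange above concrete: FloatUtils::max( inputValue, slope * inputValue ) only computes the leaky relu value under this slope restriction. A standalone illustration (not Marabou code; the function name is hypothetical):

#include <algorithm>
#include <cassert>

// For 0 < slope <= 1:
//   x >= 0  implies  x >= slope * x, so the max picks x           (active phase)
//   x <  0  implies  x <= slope * x, so the max picks slope * x   (inactive phase)
// For slope > 1 both comparisons flip and the max returns the wrong phase,
// which is why the assertion mentioned above is needed.
double leakyReluViaMax( double x, double slope )
{
    assert( slope > 0 && slope <= 1 );
    return std::max( x, slope * x );
}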

NeuronIndex sourceIndex = *_neuronToActivationSources[i].begin();
const Vector<double> &simulations = ( *( _layerOwner->getLayer( sourceIndex._layer )->getSimulations() ) ).get( sourceIndex._neuron );
for ( unsigned j = 0; j < simulationSize; ++j )
_simulations[i][j] = FloatUtils::max( simulations.get( j ), slope * simulations.get( j ) );
Collaborator:
Same comment: slope > 1?

Collaborator Author:
added an assertion.

@MatthewDaggitt (Collaborator):
Before merging, this should get a CHANGELOG entry added.

@wu-haoze wu-haoze requested a review from guykatzz January 5, 2024 19:21
@@ -472,6 +489,8 @@ def evaluateWithMarabou(self, inputValues, filename="evaluateWithMarabou.log", o
ipq.setLowerBound(k, inputDict[k])
ipq.setUpperBound(k, inputDict[k])

MarabouCore.saveQuery(ipq, "test.ipq")
Collaborator:
Is this intended? Or a debugging thing?

Collaborator Author:
Removed.


List<PiecewiseLinearConstraint::Fix> LeakyReluConstraint::getPossibleFixes() const
{
return List<PiecewiseLinearConstraint::Fix>();
Collaborator:
Should we complete this bit? It's pretty straightforward, no?

}
else
{
// Default: start with the inactive case, because it doesn't
Collaborator:
This seems stale; neither case adds new equations now, right?
In general, are the heuristics in this function needed, or are they just leftovers?

Collaborator Author:
Agreed. The heuristics in the function are used during the search. I removed this comment though.

Collaborator Author:
I also implemented the getPossibleFixes() method above (I can't seem to reply to that comment of yours for some reason).
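
A rough standalone sketch of the fix candidates such a method could propose for a violated constraint f = leakyRelu( b ); the actual implementation added in the PR may differ, and the names below are illustrative:

#include <utility>
#include <vector>

// Given variable indices b and f, their current assignment values, and a slope
// in (0, 1], propose assignments that would repair the constraint.
std::vector<std::pair<unsigned, double>> possibleLeakyReluFixes(
    unsigned b, double bValue, unsigned f, double fValue, double slope )
{
    std::vector<std::pair<unsigned, double>> fixes;

    // Option 1: keep the input, repair the output to leakyRelu( bValue ).
    double correctF = bValue >= 0 ? bValue : slope * bValue;
    fixes.push_back( { f, correctF } );

    // Option 2: keep the output, repair the input so that leakyRelu( input ) equals fValue.
    double correctB = fValue >= 0 ? fValue : fValue / slope;
    fixes.push_back( { b, correctB } );

    return fixes;
}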


// If we have existing knowledge about the assignment, use it to
// influence the order of splits
if ( existsAssignment( _f ) )
Collaborator:
Same: is this comment correct, or stale?

Collaborator Author:
This comment is correct: we can still heuristically decide which case split to try first based on the value of the leaky relu output.
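
A tiny sketch of the heuristic described here (illustrative names, not the PR's API): the sign of the current output assignment suggests which phase to explore first.

// If the leaky relu output f currently has a positive value, try the active
// phase (f = b) first; otherwise try the inactive phase (f = slope * b) first.
enum class Phase { Active, Inactive };

Phase phaseToTryFirst( double fValue )
{
    return fValue > 0 ? Phase::Active : Phase::Inactive;
}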

inactiveAuxUpperBound = getUpperBound( _inactiveAux );
}


Collaborator:
remove one empty line

Collaborator Author:
done

_score = std::abs( computePolarity() );
}

// Not supporting proof production yet
Collaborator:
After we merge this, please ping Omri in case he wants to add proof support here

Collaborator Author:
Will do

}
else
{
if ( relax ) {
Collaborator:
new line for {

Collaborator Author:
fixed

terms.append( GurobiWrapper::Term( lambda, Stringf( "x%u", sourceVariable ) ) );
terms.append( GurobiWrapper::Term( -1, Stringf( "x%u", targetVariable ) ) );
gurobi.addGeqConstraint( terms, ( lambda - 1 ) * sourceUb );
} else {
Collaborator:
}
else
{

Collaborator Author:
fixed
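
For readers of the relaxation snippet above: the appended terms encode lambda * x_source - x_target >= ( lambda - 1 ) * sourceUb, which rearranges to x_target <= lambda * ( x_source - sourceUb ) + sourceUb. The bound is tight at x_source = sourceUb, so this appears to be an upper facet of the LP relaxation used when relax is set; how lambda is computed is defined elsewhere in the PR and not shown here.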

@@ -125,6 +126,9 @@ class Layer
double *getLbs() const;
double *getUbs() const;

void setAlpha( double alpha ) { _alpha = alpha; }
double getAlpha() const { return _alpha; }
Collaborator:
This seems very very specific to appear in "Layer". I see there is nothing specific to any of the other kinds of constraints. Is this really needed?

Collaborator Author:
I agree this is not ideal, but I'm not sure there is an easy alternative. I think the Layer class might need a refactoring in general. Right now we have parameters specific to certain layer types stored as member variables (like weights for the WeightedSum layer). I think the approach moving forward is to have different types of layers as subclasses of an abstract Layer class. Alternatively, we could keep a parameter map (e.g., Map<String, void *>) in the Layer class to store things like "weights" and "alpha". I think the latter is the way ONNX handles it.
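
A hypothetical sketch of the parameter-map alternative mentioned above (not what the PR implements, and simplified to double-valued parameters only):

#include <map>
#include <string>

// Layer-specific parameters stored by name instead of as dedicated fields;
// a leaky relu layer would store { "alpha", 0.01 }, for example. Weights and
// other non-scalar parameters would need their own containers or a variant type.
class LayerParameters
{
public:
    void set( const std::string &name, double value ) { _scalars[name] = value; }
    double get( const std::string &name ) const { return _scalars.at( name ); }

private:
    std::map<std::string, double> _scalars;
};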

Collaborator:
Okay. Let's at least add a comment that these belong to leaky relus.

Collaborator Author:
I had a comment to that effect when declaring _alpha in Layer.h, but I also added that this can be optimized in the future.

@wu-haoze wu-haoze requested a review from guykatzz February 6, 2024 06:55
@wu-haoze wu-haoze merged commit b6999a4 into NeuralNetworkVerification:master Feb 6, 2024
7 checks passed
@wu-haoze wu-haoze deleted the leaky-relu branch February 6, 2024 16:24
wu-haoze added a commit that referenced this pull request Mar 4, 2024
* add leaky relu class

* leaky relu in nlr construction

* leaky relu layer

* add test for nlr construction

* deeppoly leaky relu

* add test

* add file

* add test for evalution

* leaky relu

* leaky relu

* minor

* multiple-outputs

* minor

* leaky relu

* bug fix

* leaky relu

* handle case where leaky relu's inactive phase has slope greater than 1

* turn off logging

* minor

* update leaky ReLU

* fix merge conflict

* fix leaky relu

* add leaky relu

* add regression test for leaky relu

* add test

* relaxation

* fix merge conflict

* tab width

* add one more regression test

* addressing Guy's comments

* fix merge conflict

* Update Layer.h

Add a note about alpha in Layer.h