address feedback
Signed-off-by: Pranav Gaikwad <[email protected]>
pranavgaikwad committed Feb 13, 2024
1 parent 9177885 commit b1d9970
1 changed file with 63 additions and 40 deletions: enhancements/analyzer-rules-testing/README.md
@@ -21,15 +21,15 @@ status: implementable

As of today, testing an analyzer rule involves creating a provider config file, running analysis using the rule against a sample application, and finally verifying the results by inspecting the output file manually.

This enhancement proposes a new test runner for YAML rules. It will enable rule authors to write tests for rules declaratively, run analysis using the rules against sample data, verify that the expected output in the test matches the actual output, and finally produce a result detailing passed / failed tests.

## Motivation

The biggest drawback of testing rules right now is that the output must be verified by the rule author manually. They have to go through all incidents / violations and check that they see the desired result. Once they do verify the output, the only way to share this expected output is to share the entire _output.yaml_ file generated after analysis. Comparing the expected output with the actual results is done by computing a _diff_ between the expected and actual output files. While this works great in end-to-end CI tests that run on a single rules file, there are limitations. First, it's very hard to understand what exactly is failing by only looking at diff output. And second, you have to generate that very first "expected output" file by running the analyzer. You cannot expect someone to write that file by hand.

Assume that you're a rule author creating rules for a certain technology. You write your first rule, run it against a sample application, and manually verify that your rule matched in the `output.yaml` file. You count the number of incidents and check whether the produced messages are correct. You check for any tags created, if applicable. All good! You move on to writing your second rule, run the rules file again, and get an output file. This time, you ensure that the first rule still produces the same output as before in addition to verifying your new rule. In every iteration, you manually verify every rule in the ruleset. Ideally, we want the rule author to write a test first and then iteratively work towards perfecting their rule until it produces the desired output. It's necessary that they can write the test by hand, and on top of that, they should be able to run a command to verify the results.

In addition to the UX concern above, there is another important concern. As analyzer-lsp adds more and more providers, contributions to rulesets will increase. For the long-term health of the project, it is essential that any new rules added to rulesets have associated tests and sample data. A manual testing process will only discourage rule authors from submitting tests.

Lastly, the project has mainly been relying on rules from the Windup project, all of which have associated test cases. At some point, we will discontinue the Windup shim and move on to the new YAML format for good. What happens to the existing test cases then? We need to make sure we can convert the existing tests into this new format and continue to have coverage for those old rules.

@@ -65,9 +65,15 @@ tests:
  mode: source-only
  depLabelSelector: "!konveyor.io/source=open-source"
  hasIncidents:
    and:
    - fileURI: "<file/uri/where/expected/incidents/should/be/present>"
      exactly: 10
      atLeast: 5
    - fileURI: "<file/uri/of/expected/incident/with/custom/variables>"
      locations:
      - lineNumber: 8
        messageContains: "incident on line number 8 should contain this"
        codeSnipContains: "code snippet should contain this text"
  hasTags:
  - Category=tag1
```
@@ -81,10 +87,17 @@ tests:
* analysisParams: Additional _optional_ parameters to configure analysis when running this test. By default, these won't be set.
  * mode: Analysis mode; defaults to the provider's default mode as set by analyzer-lsp.
  * depLabelSelector: Dependency label selector; defaults to nil.
* hasIncidents: Defines how incidents should be verified. Verification can be done in multiple ways for a test case, so this is a nested structure, much like the _when_ block of a rule. It contains exactly one _IncidentCondition_.
  * An _IncidentCondition_ is defined for a _fileURI_ and applies to that specific file. Within a given file, incidents can be verified in one of several ways:
    * _Simple verification_: checks whether the given file contains the expected number of incidents. Exactly one of the following parameters can be defined for this type of verification:
      * exactly: there must be exactly this many incidents.
      * atLeast: there must be at least this many incidents; more are OK.
      * atMost: there must be at most this many incidents; fewer are OK.
    * _Per-incident verification_: defined on a per-incident basis; it takes a list of code locations as input. Each code location takes the following parameters:
      * lineNumber: the incident should be found on this line.
      * messageContains: the message should contain this string.
      * codeSnipContains: the code snippet should contain this string.
    * _and_ and _or_ are two special types of _IncidentCondition_ that can be used to compose a complex test (see the sketch following this list).
* hasTags: A list of tags to verify. The test will FAIL when one or more tags in the list are not found in the output.
Note that a test can contain more than one entry under _testCases_, each with different _analysisParams_. This is useful when you want to test the same rule under different analysis parameters, as shown in the sketch below.
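
For illustration, here is a minimal sketch of such a test: one rule exercised by two test cases with different analysis parameters, using the _or_ condition described above. The rule ID, file URIs, counts and the nested `analysisParams` key are illustrative assumptions based on the Go structs further below, not a definitive format:

```yaml
tests:
- ruleID: example-rule-001          # hypothetical rule ID
  testCases:
  - description: "default analysis should flag at least one of the two sample files"
    hasIncidents:
      or:
      - fileURI: "<file/uri/of/first/sample/file>"
        atLeast: 1
      - fileURI: "<file/uri/of/second/sample/file>"
        atMost: 3
  - description: "source-only analysis should still flag the first sample file"
    analysisParams:
      mode: source-only
      depLabelSelector: "!konveyor.io/source=open-source"
    hasIncidents:
      and:
      - fileURI: "<file/uri/of/first/sample/file>"
        exactly: 2
    hasTags:
    - Category=tag1
```

Only the analysis parameters and the expectations differ between the two test cases; the rule under test stays the same.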
@@ -114,6 +127,8 @@ testConfig:
location: /path/to/java/
```

> When a specific provider config is defined in a test file, that config will take precedence over the ruleset-level config.
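
As a sketch of this precedence rule — assuming a hypothetical test file name and reusing the `name` / `dataPath` provider fields from the Go structs further below — a test file can point a provider at its own sample data:

```yaml
# rule-1000_test.yaml -- hypothetical file name
# This provider block overrides the ruleset-level testConfig shown above for this file's tests only.
providers:
- name: java
  dataPath: ./data/rule-1000/   # rule-specific sample data used instead of the shared path
tests:
- ruleID: rule-1000
  testCases: []                 # test cases elided for brevity
```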

#### Running the tests

We are proposing a new CLI that takes the tests as input.
@@ -147,25 +162,22 @@ The test runner will recursively find and run all tests in `./rulesets/`. In thi
The output for a test should look something like:

```sh
--- rules_test.yaml 1/2 PASSED
  - rule-1000 2/2 PASSED
    - testCase[0] PASS
    - testCase[1] PASS
  - rule-1001 1/2 PASSED
    - testCase[1] FAIL
      - Log created in ./tmp/analyzer-test-099912/analysis.log
      - Output created in ./tmp/analyzer-test-099912/output.yaml
Total: 2, 1/2 PASSED, Pass Rate: 50%
```

The output is produced per test file, with the total number of passed / failed rules aggregated at the top.

Within a test file, the output is further broken down into results for individual rules, and then into results for each test case of a rule.

For any failed test case, the log and output locations are printed so that users can debug. _We will also grab the provider logs and place them in the output directory so that users can debug the providers themselves._

Finally, a summary line will be printed showing overall statistics.

@@ -192,38 +204,49 @@ This output will be later used by the CI. When a new PR is opened in _konveyor/r
The Go structs are defined below:

```go
type Tests struct {
	Providers []Provider `yaml:"providers,omitempty" json:"providers,omitempty"`
	Tests     []Test     `yaml:"tests,omitempty" json:"tests,omitempty"`
}

type Provider struct {
	Name     string `yaml:"name" json:"name"`
	DataPath string `yaml:"dataPath" json:"dataPath"`
}

type Test struct {
	RuleID    string     `yaml:"ruleID" json:"ruleID"`
	TestCases []TestCase `yaml:"testCases" json:"testCases"`
}

type TestCase struct {
	Description    string            `yaml:"description,omitempty" json:"description,omitempty"`
	AnalysisParams `yaml:"analysisParams,omitempty" json:"analysisParams,omitempty"`
	HasIncidents   IncidentCondition `yaml:"hasIncidents,omitempty" json:"hasIncidents,omitempty"`
	HasTags        []string          `yaml:"hasTags,omitempty" json:"hasTags,omitempty"`
}

type AnalysisParams struct {
	Mode             *provider.AnalysisMode `yaml:"mode,omitempty"`
	DepLabelSelector *string                `yaml:"depLabelSelector"`
}

type IncidentCondition interface {
	Schema() openapi3.SchemaRef
	Verify([]konveyor.Incident) bool
}

type SimpleVerification struct {
	FileURI string `yaml:"fileURI" json:"fileURI"`
	Exactly *int   `yaml:"exactly,omitempty" json:"exactly,omitempty"`
	AtLeast *int   `yaml:"atLeast,omitempty" json:"atLeast,omitempty"`
	AtMost  *int   `yaml:"atMost,omitempty" json:"atMost,omitempty"`
}

type PerIncidentVerification struct {
	FileURI   string                 `yaml:"fileURI" json:"fileURI"`
	Locations []LocationVerification `yaml:"locations" json:"locations"`
}

type LocationVerification struct {
	LineNumber       string `yaml:"lineNumber" json:"lineNumber"`
	MessageContains  string `yaml:"messageContains" json:"messageContains"`
	CodeSnipContains string `yaml:"codeSnipContains" json:"codeSnipContains"`
}
```

Expand All @@ -236,9 +259,9 @@ Also notice that _MessageContains_ and _CodeSnipContains_ checks in the testcase
Here are options for the test runner CLI:

* --provider-settings: Required. Points to provider configuration.
* --output-stats: Optional. Print stats to the given YAML file in addition to stdout. Defaults to stdout only.
* --log-dir: Optional. Temporary directory for storing logs & output. Defaults to system's temp dir.
* --verbose: Optional. Increase verbosity of output.
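
A rough, hypothetical sketch of what the `--output-stats` file might contain, simply mirroring the stdout summary above; the exact schema is not specified by this proposal:

```yaml
# Illustrative only -- field names are assumptions, not part of the proposal.
rules_test.yaml:
  rules: 2
  passed: 1
  failed: 1
  passRate: "50%"
```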

#### Running Tests

