graphdatascience testing

This document describes how to test the implementation of graphdatascience as well as how to check that its code style is maintained.

Tests can be found in graphdatascience/tests. In each of the folders there, unit and integration, the tests of that respective type reside.

Please see the section Specifically for this project of our contribution guidelines for how to set up an environment for testing and style checking.

NOTE: This document does not cover documentation testing. Please see the documentation README for that.

Unit testing

The unit tests are run without a connection a database. Typically the CollectingQueryRunner is used to mock running queries.

To run the unit tests (with default options), simply call:

pytest graphdatascience/tests/unit

Integration testing

In order to run the integration tests one must have a Neo4j DBMS with the Neo4j Graph Data Science library installed running.

Configuring

The tests will through the Neo4j Python driver connect to a Neo4j database based on the environment variables:

NEO4J_URI (defaulting to "bolt://localhost:7687" if unset),
NEO4J_USER,
NEO4J_PASSWORD (defaulting to "neo4j" if unset),
NEO4J_DB (defaulting to "neo4j" if unset).

However, if NEO4J_USER is not set the tests will try to connect without authentication.

Once the driver connects successfully to the Neo4j DBMS the tests will go on to execute against the NEO4J_DB database.

Running

To run the integration tests (with default options), simply call:

pytest graphdatascience/tests/integration

To include tests that require the Enterprise Edition of the Neo4j Graph Data Science library, you must specify the option --include-enterprise. Naturally, this requires access to a valid Neo4j GDS license key, which can be acquired via the Neo4j GDS product website.

To include tests that exercise storing and loading models, you must specify the option --include-model-store-location. Note however that this also requires you to have specified a valid path for the gds.model.store_location configuration key of your database.

If the database you are targeting is an AuraDS instance, you should use the option --target-aura which makes sure that tests of operations not supported on AuraDS are skipped.

Running tests that require encrypted connections

In order to run integration tests that test encryption features, you must setup the Neo4j server accordingly:

# Enable the Arrow Flight Server (necessary if you run integration tests that require Arrow)
gds.arrow.enabled=true

# Allow bolt connections either encrypted or unencrypted
dbms.connector.bolt.tls_level=OPTIONAL
dbms.ssl.policy.bolt.enabled=true
dbms.ssl.policy.bolt.base_directory=<absolute-path-to-graph-datascience-client>/graphdatascience/tests/integration/resources
dbms.ssl.policy.bolt.public_certificate=arrow-flight-gds-test.crt
dbms.ssl.policy.bolt.private_key=arrow-flight-gds-test.key
dbms.ssl.policy.bolt.client_auth=NONE

To run only integration tests that are marked as encrypted_only, call:

pytest graphdatascience/tests/integration --encrypted-only

GDS library versions

There are integration tests that are only compatible with certain versions of the GDS library. For example, a procedure (which does not follow the standard algorithm procedure pattern) introduced in version 2.1.0 of the library will not exist in version 2.0.3, and so any client side integration tests that call this procedure should not run when testing against server library version 2.0.3. For this reason only tests compatible with the GDS library server version you are running against will run.

Style guide

The code and examples use ruff to format and lint. You can check all code using all the below mentioned code checking tools by running the scripts/checkstyle bash script. There's also a scripts/makestyle to do formatting. Use SKIP_NOTEBOOKS=true to only format the code.

See pyproject.toml for the configuration.

Static typing

The code is annotated with type hints in order to provide documentation and allow for static type analysis with mypy. Please note that the typing library is used for annotation types in order to stay compatible with Python versions < 3.9. To run static analysis on the entire repository with mypy, just run:

mypy .

from the root. See mypy.ini for our custom mypy settings.

Notebook examples

The notebooks under /examples can be run using scripts/run_notebooks.

Cell Tags

Verify version If you only want to let CI run the notebook given a certain condition, tag a given cell in the notebook with verify-version. As the name suggests, the tag was introduced to only run for given GDS server versions.

Teardown

To make sure certain cells are always run even in case of failure, tag the cell with teardown.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TESTING.md

TESTING.md

graphdatascience testing

Unit testing

Integration testing

Configuring

Running

Running tests that require encrypted connections

GDS library versions

Style guide

Static typing

Notebook examples

Cell Tags

Files

TESTING.md

Latest commit

History

TESTING.md

File metadata and controls

graphdatascience testing

Unit testing

Integration testing

Configuring

Running

Running tests that require encrypted connections

GDS library versions

Style guide

Static typing

Notebook examples

Cell Tags