Test vectors #37

jurraca · 2024-11-18T13:55:40Z

Stacked on #32

Two commits:

The first moves all util-style modules to their own folder/namespace.
The second adds test vectors/fixtures for testing network merging. It also adds an __init__.py to the tests folder so pytest can resolve modules.

Let me know if this is not ok, I struggled a bit with Python's import logic.

fjahr · 2024-11-24T16:06:24Z

kartograf/util/generate_data.py

+
+MAX_ASN = 33521664
+
+def generate_ip(ip_type="v4", subnet_size="16"):


How about having these test-specific util functions under a util folder in tests, similar to Bitcoin Core: https://github.com/bitcoin/bitcoin/tree/master/test? If we don't need these in the normal code, I would just put them in the test space and let them be shared there.

@jurraca When moving the file from the normal util to tests/util you could also get rid of the additional util folder and avoid the util/util namespace. I think we only need a util folder once we split up the util file, which would then create namespaces like util/foo, util/bar etc. If you want to split it up now that would also be fine with me, but I don't think it's needed yet.

jurraca · 2024-11-27

Thanks, I misunderstood.

jurraca · 2024-12-30T13:00:54Z

Cleaned this up a bit.
I reverted the change to format_pfx from 97f0fbe, since it just removed the leading zero of private networks and resulted in invalid IP networks (0.128.0.0/16 -> .128.0.0/15). The follow-ups will change how the parsing logic distinguishes a prefix that is invalid from one that is valid but poorly formatted.

edit: in fact, this 04953b8 commit is a bug fix -- tests are broken. I can split it out on its own if that's better. lmk.
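
To illustrate the failure described above: once the leading zero is dropped from a prefix, Python's ipaddress module no longer accepts the string as a network. This is a minimal sketch using only the standard library. The helper name parses_as_network is hypothetical and not part of kartograf, and lstrip only mimics the effect of the reverted change.

```python
import ipaddress


def parses_as_network(pfx: str) -> bool:
    """Return True if ipaddress accepts pfx as a network (hypothetical helper)."""
    try:
        ipaddress.ip_network(pfx)
        return True
    except ValueError:
        return False


original = "0.128.0.0/16"
# Mimic the effect of stripping the leading zero from the prefix string.
stripped = original.lstrip("0")

print(stripped)                      # .128.0.0/16
print(parses_as_network(original))   # True
print(parses_as_network(stripped))   # False
```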

fjahr

Test vectors part looks good at first pass but it seems to me there is something missing in the pfx part (or I am just not getting it ;)

fjahr · 2025-01-06T23:00:08Z

kartograf/util.py

            formatted_pfx = str(ipaddress.ip_network(pfx))
            return f"{formatted_pfx}"
        return str(ipaddress.ip_address(pfx))
    except ValueError:
        return pfx


+def is_valid_pfx(pfx):


If this function is only used in tests it should go into tests/util.py, or directly into the test file if it's just used in one file. But I'm not sure that is what was intended: the commit description makes it sound a bit like is_valid_pfx should be used in the actual code?

If this is run before format_pfx then the try block could be removed there, I think.
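
One way to read the suggestion above is to validate first and let the formatter assume valid input. The sketch below is an assumption about how that split could look, not kartograf's actual code: both function bodies are guesses based on the visible diff context, and the "/" check used to tell networks from single addresses is hypothetical.

```python
import ipaddress


def is_valid_pfx(pfx: str) -> bool:
    """Accept anything ipaddress can parse as a network or a single address."""
    try:
        if "/" in pfx:
            ipaddress.ip_network(pfx)
        else:
            ipaddress.ip_address(pfx)
        return True
    except ValueError:
        return False


def format_pfx(pfx: str) -> str:
    """Normalize a prefix that already passed is_valid_pfx; no try block needed."""
    if "/" in pfx:
        return str(ipaddress.ip_network(pfx))
    return str(ipaddress.ip_address(pfx))


raw = ["10.0.0.0/8", "2001:0DB8::/32", "1.1.1.1", "not-a-prefix", ".128.0.0/16"]
# Invalid entries are filtered out before formatting ever sees them.
clean = [format_pfx(p) for p in raw if is_valid_pfx(p)]
print(clean)  # ['10.0.0.0/8', '2001:db8::/32', '1.1.1.1']
```

With this ordering, invalid input is rejected in one place, and format_pfx no longer has to return the unformatted string as a fallback.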

jurraca · 2025-01-08

Yeah, thanks, that makes sense. Is there a reason we're checking ipaddress.ip_address(pfx), actually? Shouldn't we reject things that are not networks?

fjahr · 2025-01-12

> Is there a reason we're checking ipaddress.ip_address(pfx) actually?

Hm, not sure I understand the question correctly, but the idea in my old code was to use str -> ipaddress -> str to standardize the formatting. Things like leading zeros should be handled consistently this way, for example. At least that is the rationale I remember from taking a quick look.

The idea here is of course that I expected ipaddress to handle this better than I ever could by writing a custom parser. I.e. parsing to ipaddress should be able to read as many weird formats as possible and then stringifying ipaddress should always be consistent. But honestly I can't say that I have put in a lot of effort to verify that ipaddress is super solid.

jurraca · 2025-01-14

I just meant checking ipaddress.ip_address(pfx) vs ipaddress.ip_network(pfx) -- the first checks an address, the other a network. Don't we expect only networks?

fjahr · 2025-01-19

Ah ok, I misunderstood the question. If I remember correctly, single IPs are valid entries in RPKI repos and IRR. So there is nothing wrong with them per se, although I don't remember whether I looked into specific use cases behind them. This looks like a case where I encountered an error, looked at the spec, and then went with what the spec allowed.
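
The address/network distinction and the str -> ipaddress -> str round trip discussed in this thread can both be seen directly in the standard library. This is a small sketch with documentation-range example values, not kartograf code.

```python
import ipaddress

# A single IP parses as an address. ip_network also accepts it and
# treats it as a one-host network (/32 for IPv4).
addr = ipaddress.ip_address("192.0.2.1")
host_net = ipaddress.ip_network("192.0.2.1")
print(addr)      # 192.0.2.1
print(host_net)  # 192.0.2.1/32

# A string with a prefix length is not an address, so ip_address rejects it.
try:
    ipaddress.ip_address("192.0.2.0/24")
    accepted_as_address = True
except ValueError:
    accepted_as_address = False
print(accepted_as_address)  # False

# Round-tripping through ipaddress yields one canonical spelling.
canonical = str(ipaddress.ip_network("2001:0DB8:0000::/32"))
print(canonical)  # 2001:db8::/32
```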

kartograf/util.py

fjahr · 2025-01-12T21:52:31Z

> edit: in fact, this 04953b8 commit is a bug fix -- tests are broken. I can split it out on its own if that's better. lmk.

Would have been easier for me to keep it separate. It's not an issue if you open many small PRs, as long as you are explicit about which ones you want me to look at first. As I mentioned in the other one just now, you can make the others drafts or say "this depends on X, review that one first".

jurraca · 2025-01-14T14:32:51Z

> Would have been easier for me to keep it separate.

Yeah, sorry about this. I separated the bug fix out into #47 and rebased this branch on it.

fjahr · 2025-01-19T18:32:46Z

Looks good to me but needs rebase first

create tests/util folder and resolve local import

fjahr

ACK, leaving some suggestions for a follow-up but merging this for now since it's a good addition either way.

fjahr · 2025-01-21T00:00:45Z

tests/merge_test.py

+        l = f.readlines()
+        final_network_count = len(l)
+        expected_count = (len(base_nets) + len(extra_nets)) - (base_subnet_count + extra_subnet_count)
+        assert final_network_count == expected_count


It's fine to check the count, but it leaves room for uncaught bugs when two errors cancel out and the count lands on the correct number anyway. Imagine we add two networks, one that should go into the final result and one that should not (a simple case of one valid plus one invalid entry). If we don't check which one goes in, the behavior could be the opposite of what we expect (invalid in, valid out) and the test would still pass.

So as a follow-up I think it would be great to switch this from checking the count to checking that the actual content matches. That shouldn't be too big of a change: instead of counting in read_test_vectors, build a list of the included networks and then compare that list to a readout of the final_result. I hope it works as simply as I imagine it :)
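
A rough sketch of the follow-up suggested above, comparing content instead of counts. Everything here is illustrative: read_networks is a hypothetical helper, the sample lines stand in for the merged output file, and the line format (a prefix followed by whitespace-separated fields such as an ASN) is an assumption. The real test would read the fixtures and call general_merge.

```python
import ipaddress


def read_networks(lines):
    """Parse result lines into a set of network objects (hypothetical helper).

    Assumes each non-empty line starts with a prefix, optionally followed
    by whitespace-separated fields such as an ASN.
    """
    return {ipaddress.ip_network(line.split()[0]) for line in lines if line.strip()}


# Stand-in for the merged output file; real data would come from fixtures.
final_result = ["10.0.0.0/8 AS64496\n", "192.0.2.0/24 AS64497\n"]
expected = {ipaddress.ip_network("10.0.0.0/8"), ipaddress.ip_network("192.0.2.0/24")}

# One expected entry dropped and one unexpected entry kept: same count.
swapped = ["10.0.0.0/8 AS64496\n", "198.51.100.0/24 AS64498\n"]

assert len(swapped) == len(expected)            # a count-only check passes
assert read_networks(final_result) == expected  # content matches
assert read_networks(swapped) != expected       # a content check catches the swap
```

Comparing sets also makes the assertion independent of line order in the output file.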

fjahr · 2025-01-21T14:09:00Z

tests/merge_test.py

+def test_merge_from_fixtures(tmp_path):
+    '''
+    Assert that general_merge merges subnets correctly.
+    '''


formatting-nit: A bit inconsistent to use ''' here and """ above

fjahr reviewed Nov 24, 2024


jurraca force-pushed the test-vectors branch 2 times, most recently from 16e0d8a to cd535cf Compare December 3, 2024 12:39

jurraca force-pushed the test-vectors branch 2 times, most recently from a542492 to 04953b8 Compare December 30, 2024 12:56

fjahr reviewed Jan 6, 2025


jurraca force-pushed the test-vectors branch 3 times, most recently from 0c1c703 to e21877d Compare January 10, 2025 18:45

jurraca mentioned this pull request Jan 11, 2025

Bogon.py updates #45

Merged

jurraca force-pushed the test-vectors branch from e21877d to 6640e41 Compare January 14, 2025 14:32

add test merge from fixtures

37b87a6

create tests/util folder and resolve local import

jurraca force-pushed the test-vectors branch from 6640e41 to 37b87a6 Compare January 20, 2025 21:06

fjahr reviewed Jan 21, 2025


fjahr merged commit d6f675c into asmap:master Jan 21, 2025
1 check passed

jurraca mentioned this pull request Jan 22, 2025

merge_test: use networks sets instead of counts, updated comments #51

Open
