Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

refactor: introduce new internal representation for Opossum files #192

Open
wants to merge 11 commits into
base: main
Choose a base branch
from

Conversation

abraemer
Copy link
Contributor

@abraemer abraemer commented Jan 20, 2025

Summary of changes

  • introduced new internal opossum model
  • switch ScanCode pipeline to the new model

This PR does not contain tests for the new model. These are done separately see #189

Context and reason for change

See: #190

Closes: #195
Part1 of: #196 (tests come in a follow-up because they require #194 )

@abraemer abraemer force-pushed the refactor-consistent-model-pipeline branch 2 times, most recently from 1e1d053 to 057727c Compare January 21, 2025 11:23
@abraemer abraemer marked this pull request as ready for review January 21, 2025 11:24
@abraemer abraemer force-pushed the refactor-consistent-model-pipeline branch 2 times, most recently from 60b1e47 to 59f5dc7 Compare January 21, 2025 11:44
@Hellgartner Hellgartner self-requested a review January 21, 2025 11:54
src/opossum_lib/opossum_model.py Outdated Show resolved Hide resolved
src/opossum_lib/opossum_model.py Outdated Show resolved Hide resolved
src/opossum_lib/opossum_model.py Outdated Show resolved Hide resolved
src/opossum_lib/opossum_model.py Outdated Show resolved Hide resolved
src/opossum_lib/opossum_model.py Outdated Show resolved Hide resolved
src/opossum_lib/opossum_model.py Show resolved Hide resolved
src/opossum_lib/opossum_model.py Outdated Show resolved Hide resolved
src/opossum_lib/opossum_model.py Outdated Show resolved Hide resolved
src/opossum_lib/opossum_model.py Outdated Show resolved Hide resolved
src/opossum_lib/opossum_model.py Outdated Show resolved Hide resolved
@abraemer abraemer force-pushed the refactor-consistent-model-pipeline branch from 54f5c61 to bd3735f Compare January 22, 2025 11:59
@abraemer
Copy link
Contributor Author

Note: rebased onto current main

)
output_file: OpossumOutputFile | None = None

def to_opossum_file_format(self) -> opossum_file_content.OpossumFileContent:
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i think it would be nicer to read if at the top of the file you don't use namespace imports but import exactly the classes you need.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There is some duplication of names between opossum_file and opossum_model. That's why I wanted to explicit where which name comes from.
This duplication comes from the fact that opossum_model is quite similar to opossum_file generally but we chose to separate the implementations which meant copying some models verbatim.
The specific instance you marked could be removed though as there are only clashes between opossum_file and opossum_model.

src/opossum_lib/scancode/convert_scancode_to_opossum.py Outdated Show resolved Hide resolved
src/opossum_lib/scancode/resource_tree.py Outdated Show resolved Hide resolved
src/opossum_lib/scancode/resource_tree.py Outdated Show resolved Hide resolved
* this model encapsulates also the semantic relationships of resources, resourcesToAttribuions
 and externalAttributions. These are not enforced by the file structure alone.
* This will be used as a target for the other file format frontends and simplify their logic.
* It also allows for easier testing since it allows to check for semantic/structural
 equivalence among opossum files (e.g. the IDs of the attribution carry no semantic semantic
 information themselves i.e. are arbitrary labels)
* Opossum class should be able to carry all information that could be present in an
 .opossum file
* resources can now by added to the resource structure without knowledge about internals
* for this reason:
 - resources can be created with just a path (i.e. without type)
 - resources can now be merged together if the types are compatible.
   types are compatible if at least one is not set or types are identical
 - when converting to opossum file format, unset type defaults to folder.
   Maybe this should raise an error instead, but being more permissible probably just
   makes things more ergonomic without hurting correctness.
* default to treating a Resource as file if the type is undefined and no children present
* provide slightly more information in an error message
@abraemer abraemer force-pushed the refactor-consistent-model-pipeline branch from bd3735f to 60f4f48 Compare January 23, 2025 06:37
@abraemer abraemer requested a review from Hellgartner January 23, 2025 12:06
@abraemer abraemer requested a review from mstykow January 23, 2025 12:15
@mstykow mstykow removed their request for review January 23, 2025 14:30
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Introduce base model
3 participants