-
Notifications
You must be signed in to change notification settings - Fork 3
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #19 from eriknovak/feature/pattern-extractor
Adds a new Pattern Extractor and updated the package documentation
- Loading branch information
Showing
67 changed files
with
3,746 additions
and
4,386 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -139,6 +139,6 @@ data/**/ | |
!data/README.md | ||
|
||
notebooks | ||
!docs/documentation/notebooks | ||
!docs/how-to-guides/notebooks | ||
|
||
scripts |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,29 +1,23 @@ | ||
""" | ||
anonymize | ||
"""Module containing the anonymization modules and utility. | ||
The module provides a set of anonymization utilities. | ||
The `anonymize` module provides a set of anonymization modules and utility, | ||
including `extractors`, `generators`, and `strategies`. In addition, it provides | ||
methods for anonymizing text based on a list of replacements. | ||
Submodules | ||
---------- | ||
extractors : | ||
The module containing the extractor classes | ||
generators : | ||
The module containing the generator classes | ||
strategies : | ||
The module containing the strategy classes | ||
regex : | ||
The module containing the regex patterns | ||
Modules: | ||
extractors: The module containing the extractor classes. | ||
generators: The module containing the generator classes. | ||
strategies: The module containing the strategy classes. | ||
Methods | ||
------- | ||
anonymize() | ||
Methods: | ||
anonymize(text, replacements): | ||
Anonymize the text based on the replacements. | ||
""" | ||
|
||
from . import extractors | ||
from . import generators | ||
from . import strategies | ||
from . import regex | ||
from .helpers import anonymize | ||
|
||
__all__ = ["extractors", "generators", "strategies", "regex", "anonymize"] | ||
__all__ = ["extractors", "generators", "strategies", "anonymize"] |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,18 +1,19 @@ | ||
""" | ||
extractors | ||
"""Module containing the `extractors`. | ||
The module provides a set of extractors used in the library. | ||
The `extractors` module provides a set of extractors used to identify relevant | ||
information within a document. | ||
Classes | ||
------- | ||
ExtractorInterface : | ||
The class representing the extractor interface | ||
EntityExtractor : | ||
The class representing the entity extractor | ||
Classes: | ||
NERExtractor: The class representing the named entity recognition (NER) extractor. | ||
PatternExtractor: The class representing the pattern extractor. | ||
MultiExtractor: The class representing the multi extractor. | ||
""" | ||
|
||
from .interface import ExtractorInterface | ||
from .entity_extractor import EntityExtractor | ||
from .multi_extractor import MultiExtractor | ||
from .ner_extractor import NERExtractor | ||
from .pattern_extractor import PatternExtractor | ||
|
||
|
||
__all__ = ["ExtractorInterface", "EntityExtractor"] | ||
__all__ = ["ExtractorInterface", "MultiExtractor", "NERExtractor", "PatternExtractor"] |
Oops, something went wrong.