Commit

Update src/corppa/utils/filter.py
Co-authored-by: Laure Thompson <[email protected]>
rlskoeser and laurejt authored Mar 28, 2024
1 parent ec53b3f commit 29d8c18
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions src/corppa/utils/filter.py
@@ -1,6 +1,8 @@
"""
Utility for filtering PPA full-text corpus to work with a subset of
pages. Currently supports filtering by a list of PPA source ids.
Currently, there is no way to filter to a specific excerpt when
there are multiple.
Can be run via command-line or python code. Takes jsonl file (compressed or
not) as input, a filename for output, and a file with a list of
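
The behaviour the docstring describes (read a JSONL corpus, compressed or not, and keep only pages whose source id appears in a list) can be sketched roughly as follows. This is an illustrative sketch, not the code in `filter.py`: the helper names `open_text` and `filter_pages`, the `work_id` field name, and the one-id-per-line id file layout are all assumptions made for the example.

```python
import gzip
import json
from pathlib import Path


def open_text(path, mode="rt"):
    """Open a file as UTF-8 text, using gzip when the name ends in .gz."""
    path = Path(path)
    if path.suffix == ".gz":
        return gzip.open(path, mode, encoding="utf-8")
    return open(path, mode, encoding="utf-8")


def filter_pages(input_path, output_path, idfile_path, id_field="work_id"):
    """Copy pages whose source id is listed in idfile_path; return the count.

    `id_field` is a hypothetical field name chosen for this sketch; the
    real corpus may store the source id under a different key.
    """
    # Assumed id file layout: one source id per line, blank lines ignored.
    with open(idfile_path, encoding="utf-8") as idfile:
        wanted = {line.strip() for line in idfile if line.strip()}

    kept = 0
    with open_text(input_path) as infile, open_text(output_path, "wt") as outfile:
        for line in infile:
            if not line.strip():
                continue  # skip blank lines rather than fail on them
            page = json.loads(line)
            if page.get(id_field) in wanted:
                # Write the original line back out, newline-terminated.
                outfile.write(line if line.endswith("\n") else line + "\n")
                kept += 1
    return kept
```

Because this sketch matches on the source id alone, every page carrying a listed id is kept. If several excerpts shared one source id, all of them would pass the filter together, which is the kind of limitation the added docstring lines point out.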
