- docs: Updates README (
d7a77fd
)
- fix: Adds fast directory check (
db2e33c
)
- fix: Adds gdraw (
f871521
)
- fix: Adds gslides and gform (
f449511
)
- fix: Adds quotes to delete scripts (
061700d
)
- style: Fixes style issues (
1e31875
)
- feat: Adds ignore hash to CLI (
ff4d279
)
- ci: Adds dedup ignore to release (
9d0388c
)
-
fix: Fixes newlines for cross-platform (
db3aa35
) -
fix: Updates dedup ignore for google drive (
303c8aa
)
- fix: Adds logging to compute_sha256 (
ebb80be
)
- Merge branch 'master' of github.com:UCSD-E4E/e4e-deduplication (
8a41847
)
- fix: Adds hash error trap (
6d09e49
)
- fix: Adds progress bar to drop tree (
a6056d8
)
- fix: Fixes analyze cli linkage (
a1f2fc3
)
- fix: Allows multiple directory for analyze (
dd42401
)
- fix: Adds cache execution logic (
5625628
)
- chore: Merges 31 bug file path may contain commas (#32) (
ca3ed0d
)
- ci: Adds test for #31 (
ada7fcb
)
- feat: Switches to json based cache, splits cache upgrade scripts (
14f6f32
)
-
style: Fixes duplicate code (
c3082e8
) -
style: Removes unused imports (
c8275f7
) -
style: Adds docstrings (
ac087e4
)
- style: Fixes reopen warning (
df84d7e
)
- style: Fixes resource allocation (
fa0b2fb
)
- test: Fixes upgrade cache test (
2fb50d5
)
- feat: Adds list jobs (
cbe46b8
)
-
feat!: Switches cli to subcommands (
3b8ccaa
) -
feat!: WIP -adding subcommands (
02340be
) -
feat!: Enables hostname aware comparison (
55ea161
)
- ci: Adds windows and macos for testing (
568ce5a
)
- feat: allow for cross machine (#23) (
3f6fa7b
)
-
fix: Fixes *nix permissions (
39b6c4d
) -
fix: Adds logging, fixes cache rebuild after cache sort (
d200e15
)
- test: Adds tests for os delete scripts (
0745005
)
-
ci: Fixes env (
43a3fa0
) -
ci: Fixes poetry install (
43c45bd
) -
ci: Fixes pytest execution (
8ae5a2f
) -
ci: Adds nas_unzip (
6537c11
) -
ci: adding support for nas unzip (
2cc2509
)
-
fix: Faster job cache (#20) (
a113912
) -
fix: Fixes synology_api version (
67e5d9d
) -
fix: switching to internal offset dict (
00b02af
)
- style: Fixes style (
8e975e0
)
- fix: Fixes job cache handle initialization (
ca3246d
)
-
fix: Fixes cache behavior (
a9c4b3f
) -
fix: Removes extra prints (
cdf08c4
) -
fix: Implements file sorter (
1b2132b
) -
fix: progress bar (
fba9a28
)
- style: Removes unused import (
67fde55
)
- wip: Switching to filesort based duplication (
8820a1f
)
-
feat: Uses python parallel hasher (#13) (
c0351a5
) -
feat: Switching to python hasher (
11b14ca
) -
feat: Uses python parallel hasher (
40a9eb1
)
- fix: Removes pybind11 dependency (
7217838
)
- style: Refactors hash (
5cd4ce9
)
-
feat: sqitches to sqlite3 and thread pool (
1100593
) -
feat: Switches to a parallel thread hasher (
ee04599
) -
feat: Switching to threadpool and optimizing dataflow (
998fb60
) -
feat: Implements sqlite3 backend (
f87d9b8
)
- Merge remote-tracking branch 'refs/remotes/origin/sqlite3_testing'
Conflicts:
e4e_deduplication/analyzer.py (8c22f01
)
- Merge branch 'master' into 8-possible-memory-leak (
15427e4
)
- feat: Adds progress bars
Merge pull request #11 from UCSD-E4E/4-progress-bar-for-analysis-result-aggregation (6abf954
)
- feat: Adds progress bars (
c45e590
)
- Merge pull request #10 from UCSD-E4E/7-add-user-confirmation-for-clear_cache
feat: Adds clear cache check (8572e57
)
- feat: Adds clear cache check (
74d310b
)
- docs: Fixes readme (
495789d
)
- fix: Fixes naive job file
Merge pull request #3 from UCSD-E4E/naive_job_file (a9e57fb
)
- fix: Allows for empty dict in job cache (
98dfd56
)
- style: Fixes docstring (
e0f8192
)
- feat: Updates dedup for synology
Merge pull request #2 from UCSD-E4E/file_filter (404641f
)
- feat: Updates dedup for synology (
b14f926
)
- fix: Reduces console output to WARNING instead of DEBUG (
fcfbe79
)
- Merge branch 'master' of github.com:UCSD-E4E/e4e-deduplication (
f9cd7a5
)
- fix: Fixes release publish (
81bc878
)
-
ci: Fixes branch name (
22aae2c
) -
ci: Logging timing errors instead (
82e967a
) -
ci: Reduces mutiproc to 128MB (
f3f995c
) -
ci: Adds pytest for import (
666c7a9
) -
ci: Switching to pip install (
474808f
) -
ci: uses poetry to execute (
903b93a
) -
ci: Adds coverage (
13174d8
) -
ci: Disables venv (
535708c
) -
ci: Fixes install (
c2d9f3c
) -
ci: Fixes checkout (
4511bc1
) -
ci: Updates lock file (
3bccb81
) -
ci: Adds -v to poetry install for debug (
bdee9cf
) -
ci: Fixes pylint (
b6a1b85
)
-
feat: Merges pyFileHash for wheel (
5325524
) -
feat: Adds progress bar for file discovery (
0e3c773
) -
feat: Adds report (
d1b74a2
) -
feat: Updates behavior (
e3cec29
)
- fix: Fixes tests (
a1e0b12
)
- Merge pull request #1 from UCSD-E4E/gh_actions
Github Actions (69bfb16
)
-
add missing parameter to report.generate (
3cda02b
) -
handle root path (
5ada477
) -
fix bug with logging parameters (
a22e206
) -
support providing original paths (
4ecfd10
) -
remove printing that checksums are being calculated (
774e82d
) -
unpack mtime as a tuple from sqlite to address bug causing checksums to be recalculated for all files (
3cf892b
) -
print mtime of both files (
0a27b16
) -
print when we are running checksum (
a1050bd
) -
always update seen, even if mtime is the same (
e018423
) -
always print file, even if in cache (
8698499
) -
better handle excluded paths (
beee5bf
) -
reintroduce the in memory cache (
c5379a2
) -
fix no such column seen error (
74c593e
) -
support rechecking files already in cache (
381c3d9
) -
simplify dirty check for cache (
ee6f535
) -
handle only updating the db if cache is dirty (
4d09bee
) -
support in memory caching to speed up processing (
d367850
) -
correctly check to see if RootDir is in the db (
bc3bfcc
) -
cast cursor to list (
1a352b9
) -
handle pulling the metadata correctly (
cb292d8
) -
don't print path if already in cache (
c69d95e
) -
comment the code (
292aa68
) -
correct pylint errors (
098f68f
) -
use home directory from pathlib (
3c36dd8
) -
support older version of python (
7479acd
) -
only copy to nas when complete (
3d7eced
) -
commit every 10 items or every 10 minutes. whichever comes first. (
0fea6ec
) -
instead of committing every 10 items, commit every 10 minutes. (
2c118ce
) -
update cache to handle renames (
aa757ea
) -
only delete cache file if exists (
29ccaa8
) -
revert back to sha256 instead of file_digest to support older versions of python. (
1558466
) -
revert back to sha256 instead of file_digest to support older versions of python. (
d898021
) -
fixed bug in cache object with-statement (
d3495a5
) -
excluded files in Directory object (
ce60432
) -
support excluded files (
6b090e3
) -
commit to cache every 10 items (
ed66f63
) -
support cache with-statement. Store cache in local folder (
8df8853
) -
reverse previous commit. add some missing type hints (
8b061f5
) -
always commit after adding file (
20355eb
) -
print out current file name (
6028958
) -
move black and pylint to dev dependencies (
f6061e4
) -
move pylint and black to dev dependencies (
0fb7d20
) -
fix pylint errors (
d2474b1
) -
update cache to no longer use hard coded '/' and use file_digest to compute hash (
fdb7f29
) -
address Nathan's comments (
bdf88ca
) -
initial checkin of the e4e deduplication tool. (
e1d9c35
)