dupeguru

mirror of https://github.com/arsenetar/dupeguru.git synced 2026-07-02 19:17:52 +00:00

Author	SHA1	Message	Date
Andrew Senetar	58c04ff9ad	Switch from hsaudiotag to mutagen, close #440 - This opens up the ability to support more tags and audio information - Also makes progress on #333	2021-08-19 00:14:26 -05:00
Andrew Senetar	be10b462fc	Add portable mode If settings.ini is present next to the executable, will run in portable mode. This results in settings, data, and cache all being in same folder as dupeGuru.	2021-08-17 21:12:32 -05:00
Andrew Senetar	ffe6b7047c	Format all files with black correcting line length	2021-08-15 04:10:18 -05:00
Andrew Senetar	9446f37fad	Remove flake8 E731 Errors Note: black formatting is now applying correctly as well.	2021-08-15 03:53:43 -05:00
Andrew Senetar	e11f996dfc	Merge pull request #908 from glubsy/hash_sample_optimization Hash sample optimization	2021-08-13 23:41:17 -05:00
glubsy	e95306e58f	Fix flake 8	2021-08-14 02:52:00 +02:00
glubsy	891a875990	Cache constant expression Perhaps the python byte code is already optimized, but just in case it is not, keep pre-compute the constant expression.	2021-08-13 21:33:21 +02:00
glubsy	545a5a75fb	Fix for older python versions The "walrus" operator is only available in python 3.8 and later. Fall back to more traditional notation.	2021-08-13 20:56:33 +02:00
glubsy	7b764f183e	Avoid partially hashing small files Computing 3 hash samples for files less than 3MiB (3 * CHUNK_SIZE) is not efficient since spans of later samples would overlap a previous one. Therefore we can simply return the hash of the entire small file instead.	2021-08-13 20:47:01 +02:00
glubsy	3dccb686e2	Fix Directories regex test The entire path to the file would match unless another path separator is added.	2021-08-06 17:18:23 +02:00
Andrew Senetar	0db66baace	Merge pull request #907 from glubsy/missing_renamed_regex Missing renamed regex	2021-08-03 22:26:08 -05:00
glubsy	23c59787e5	Fix infinite recursion Force the Results to update its internal __dupes list whenever at least one group has re-prioritized and changed its dupes/ref.	2021-06-23 05:36:10 +02:00
glubsy	a51f263632	Fix refs appearing in dupes-only view * Some refs appeared in the dupes-only view after a re-prioritization was done a second time. * It seems the core.Results.__dupes list was not properly updated whenever core.app.Dupeguru.reprioritize_groups() -> core.Results.sort_dupes() was called. When a re-prioritization is done, some refs became dupe, and some dupes became ref in their place. So we need to update the new state of the internal list of dupes kept by the Results object, instead of relying on the outdated cached one. * Fix #757.	2021-06-22 22:57:57 +02:00
glubsy	718ca5b313	Remove unused import	2021-06-22 02:41:33 +02:00
glubsy	277bc3fbb8	Add unit tests for hash sample optimization * Instead of keeping md5 samples separate, merge them as one hash computed from the various selected chunks we picked. * We don't need to keep a boolean to see whether or not the user chose to optimize; we can simply compare the value of the threshold, since 0 means no optimization currently active.	2021-06-21 22:44:05 +02:00
glubsy	e07dfd5955	Add partial hashes optimization for big files * Big files above the user selected threshold can be partially hashed in 3 places. * If the user is willing to take the risk, we consider files with identical md5samples as being identical.	2021-06-21 19:03:21 +02:00
glubsy	a6f83ad3d7	Fix missing regexp after rename * Doing a full match should be safer to avoid partial results which would result in overly aggressive filtering. * Add new tests to test suite to cover this issue. * Fixes #903.	2021-06-19 02:00:25 +02:00
glubsy	ab8750eedb	Fix partial regex match yielding false positive	2021-06-17 03:49:59 +02:00
glubsy	22033211d6	Fix exception when deleting while in delta view	2021-05-31 23:49:21 +02:00
glubsy	f1ae478433	Fix including character at the border	2021-04-29 05:29:35 +02:00
glubsy	c4dcfd3d4b	Fix stripping (japanese) unicode characters * Accents are getting removed from Unicode characters to generate similar "words". * Non-latin characters which cannot be processed that way (eg. japanese, greek, russian, etc.) should not be filtered out at all otherwise files are erroneously skipped or detected as dupes if only some characters make it passed the filter. * Starting from an arbitrary unicode codepoint (converted to decimal), above which we know it is pointless to try any sort of processing, we leave the characters as is. * Fix #878.	2021-04-29 05:15:34 +02:00
Andrew Senetar	4a40b346a4	Update to 4.1.1	2021-03-21 22:50:33 -05:00
glubsy	528dedd813	Fix problematic string for translations Some languages have very different phrase syntaxes depending on which word is used. Better used two separate strings than a dynamically created one.	2021-02-09 01:40:00 +01:00
Sergey Zhuravlevich	32dcd90b50	Prioritize dialog: allow removing multiple prioritizations at once Removing prioritizations one-by-one can be tedious. This commit enables extended selection in the prioritizations list. Multiple items can be selected with conventional methods, such as holding down Ctrl or Shift key and clicking the items or holding down the left mouse button and hovering the cursor over the list. All items also can be selected with Ctrl+A. Multiple items drag-n-drop is also possible. To avoid confusion, the selection in the prioritizations list is cleared after the items are removed or drag-n-dropped. Signed-off-by: Sergey Zhuravlevich <sergey@zhur.xyz>	2021-01-07 17:42:30 +01:00
Sergey Zhuravlevich	c2fef8d624	Prioritize dialog: allow adding multiple criteria at once Adding criteria to the prioritizations list one-by-one can be tedious. This commit enables extended selection in the criteria list and implements adding multiple items. Multiple criteria can be selected with conventional methods, such as holding down Ctrl or Shift keys and clicking the items or holding down the left mouse button and hovering the cursor over the list. All items also can be selected with Ctrl+A. Signed-off-by: Sergey Zhuravlevich <sergey@zhur.xyz>	2021-01-07 17:42:07 +01:00
glubsy	b138dfad33	Fix exception when testing invalid regex * If a regex in the table is invalid and failed to compile, its "compiled" property is None. * Only test against the regex if its compilation worked.	2020-12-30 22:50:42 +01:00
glubsy	c1d94d6771	Merge branch 'master' into dev	2020-12-29 20:10:42 +01:00
glubsy	f0d3dec517	Fix exclude tests	2020-12-29 16:07:55 +01:00
glubsy	e533a396fb	Remove redundant check	2020-12-29 05:39:26 +01:00
glubsy	4b4cc04e87	Fix directories tests on Windows Regexes did not match properly because the separator for Windows is '\\'	2020-12-29 05:35:30 +01:00
glubsy	6bc619055e	Change version to 4.1.0	2020-12-06 20:13:03 +01:00
glubsy	680cb581c1	Merge branch 'master' into exclude_list	2020-10-28 03:58:05 +01:00
glubsy	32d66cd19b	Move up to 4.0.5 * Initial push to 4.0.5 milestone * Update changelog	2020-10-27 19:38:51 +01:00
glubsy	2875448c71	Merge branch 'save_directories' into dev	2020-10-27 16:23:49 +01:00
glubsy	424d34a7ed	Add desktop.ini to filter list	2020-09-04 19:07:07 +02:00
glubsy	2a032d24bc	Save/Load directories in Directories * Add the ability to save / load directories as XML, just like the last_directories.xml which get loaded on program start.	2020-09-04 18:56:25 +02:00
glubsy	ea11a566af	Highlight rows when testing regex string * Add testing feature to Exclusion dialog to allow users to test regexes against an arbitrary string. * Fixed test suites. * Improve comments and help dialog box.	2020-09-01 23:02:58 +02:00
glubsy	4a1641e39d	Add test suite, fix bugs	2020-08-31 20:35:56 +02:00
glubsy	9f223f3964	Concatenate regexes prio to compilation * Concatenating regexes into one Pattern might yield better performance under (un)certain conditions. * Filenames are tested against regexes with no os.sep in them. This may or may not be what we want to do. And alternative would be to test against the whole (absolute) path of each file, which would filter more agressively.	2020-08-20 02:46:06 +02:00
glubsy	2eaf7e7893	Implement exclude list dialog on the Qt side	2020-08-17 05:54:59 +02:00
glubsy	a26de27c47	Implement dialog and base classes for model/view	2020-08-14 20:19:47 +02:00
glubsy	470307aa3c	Ignore path and filename based on regex * Added initial draft for test suit * Fixed small logging bug	2020-08-03 16:19:27 +02:00
glubsy	5f5f9232c1	Properly wait for multiprocesses to exit * Fix for #693	2020-07-28 16:44:06 +02:00
glubsy	63b2f95cfa	Work around frozen progress dialog * It seems that matchblock.getmatches() returns too early and the (multi-)processes become zombies * This is a workaround which seems to work by sleeping for one second and avoid zombie processes	2020-07-25 23:37:41 +02:00
Andrew Senetar	5cc439d846	Clean up rest of DeprecationWarnings	2020-06-30 00:51:06 -05:00
Andrew Senetar	ee2671a5f3	More Test and Flake8 Cleanup - Allow flake8 to check more files as well.	2020-06-27 01:08:12 -05:00
Andrew Senetar	e05c72ad8c	Upgrade to latest pytest - Currently some incompatibility in the hscommon tests, commented out the ones with issues temporarily - Also updated some deprecation warnings, still more to do	2020-06-25 23:26:48 -05:00
glubsy	bcb26507fe	Remove superfluous argument	2020-06-25 01:23:03 +02:00
glubsy	ed64428c80	Add missing file class for folder type. * results.py doesn't set the proper type for dupes at the line "file = get_file(path)" so we add it on top * Perhap it could have been added to _get_fileclasses() in core.app.py too but I have not tested it	2020-06-24 23:32:04 +02:00
glubsy	e89156e55c	Add temporary workaround for bug #676 * In standard mode, for folder comparison, dupe type is wrongly set as core.fs.Folder while it should be core.se.fs.Folder. * Catching the NotImplementedError exception redirects to the appropriate handler * This is only a temporary workaround until a better fix is implemented	2020-06-24 22:01:30 +02:00

1 2 3 4 5 ...

327 Commits