dupeguru/core
glubsy c4dcfd3d4b Fix stripping (japanese) unicode characters
* Accents are getting removed from Unicode characters to generate similar "words".
* Non-latin characters which cannot be processed that way (eg. japanese, greek, russian, etc.) should not be filtered out at all otherwise files are erroneously skipped or detected as dupes if only some characters make it passed the filter.
* Starting from an arbitrary unicode codepoint (converted to decimal), above which we know it is pointless to try any sort of processing, we leave the characters as is.
* Fix #878.
2021-04-29 05:15:34 +02:00
..
gui Prioritize dialog: allow removing multiple prioritizations at once 2021-01-07 17:42:30 +01:00
me Format files with black 2019-12-31 20:16:27 -06:00
pe Properly wait for multiprocesses to exit 2020-07-28 16:44:06 +02:00
se Format files with black 2019-12-31 20:16:27 -06:00
tests Fix stripping (japanese) unicode characters 2021-04-29 05:15:34 +02:00
__init__.py Update to 4.1.1 2021-03-21 22:50:33 -05:00
app.py Fix problematic string for translations 2021-02-09 01:40:00 +01:00
directories.py Highlight rows when testing regex string 2020-09-01 23:02:58 +02:00
engine.py Fix stripping (japanese) unicode characters 2021-04-29 05:15:34 +02:00
exclude.py Remove redundant check 2020-12-29 05:39:26 +01:00
export.py Format files with black 2019-12-31 20:16:27 -06:00
fs.py Add test suite, fix bugs 2020-08-31 20:35:56 +02:00
ignore.py Format files with black 2019-12-31 20:16:27 -06:00
markable.py Format files with black 2019-12-31 20:16:27 -06:00
prioritize.py Format files with black 2019-12-31 20:16:27 -06:00
results.py Clean up rest of DeprecationWarnings 2020-06-30 00:51:06 -05:00
scanner.py Format files with black 2019-12-31 20:16:27 -06:00
util.py Format files with black 2019-12-31 20:16:27 -06:00