pruning does too much work
Currently pruning the database will delete records of files that do not exist and also rehash every image that still does. The second step is both costly and unnecessary in a simple workflow:
- generate phashes for all images in a large library
- get list of dupes
- delete some dupes
- remove deleted dupes from database