search for *similar* pictures (not duplicates...)

(10 posts)
  • hi there,

    my suggest is a possibility to search for similar -not necessarily duplicate- images.

    t+ can do that pretty fine, several shareware does that, too.

    i can't imagine acdsee 2 pro (!) can't arrange that.

    regards-antoine

    casio exilim - p 700

    centrino duo 1,6 mhz
    2 gb ram
    ati mobility x1300
    windows xp mce
    acdsee pro 2.019

    Posted On October 29, 2007 - 09:43 AM (2 years ago) (Permalink to this post)
  • Marc Sabatella
    Moderator

    Sounds great to me, although it also sounds like magic :-) What constitutes "similar"? I'm not saying this because I don't believe there are possible answers, but because I'm curious about *your* answer. To one person, that might mean two files that started out as the exact same image but have been saved in different formats or with different editing applied. In that case, there is probably info hidden in the EXIF info that would help ACDSee figure out they were related. To another person, it might mean images shot within a few seconds of each other that were basically of the same scene, in which, ACDSee might do some AI to try to guess which images were of the same scene. A solution that satisfied one of these two people wouldn't help the other at all. And you might have something else in mind entirely...

    Posted On October 30, 2007 - 05:43 PM (2 years ago) (Permalink to this post)
  • Sam Dring
    Moderator

    Sure I had answered this but must have forgotten to press Add Reply!
    There is a number of apps (some like Image Comparer are standalone) which use image content for comparison and you can set the level of similarity.
    I do not remember many people asking for this particular facility and would be interested to know what your main use would be please.

    Posted On October 31, 2007 - 03:56 AM (2 years ago) (Permalink to this post)
  • samdring said:

    Sure I had answered this but must have forgotten to press Add Reply!
    There is a number of apps (some like Image Comparer are standalone) which use image content for comparison and you can set the level of similarity.
    I do not remember many people asking for this particular facility and would be interested to know what your main use would be please.

    my problem is: after a "pc-technican" has formated my data partition instead of system partition, i have 13.000 pictures on my hd from recovery named "fil12365.jpg" and so on. of course, the recovery tool could not recover filenames but recovered more than 95 % of my files.

    pics have been saved before pc-crash as 1. raw dumps from memory card; 2. highres jpegs from memorycard; 3. edited tif (lzw) and 4. small jpgs for webpurposes.

    additionally, i played with lighing and exposure data.

    means, i have up to 20 versions of one motif - how can i sort them?

    t+ has a nice function called "find by query" --> "search similar images".
    you can adjust several data; tolerance, color similarity, shape similarity.

    regards-antoine

    casio exilim - p 700

    centrino duo 1,6 mhz, 2 gb ram, ati mobility x1300

    windows xp mce, acdsee pro 2.019

    Posted On October 31, 2007 - 09:42 AM (2 years ago) (Permalink to this post)
  • Sam Dring
    Moderator

    Antoine
    Wow - that is some problem! I am almost sorry for having asked.
    Best of luck in sorting them out - have you now got a backup facility?

    Posted On October 31, 2007 - 10:27 AM (2 years ago) (Permalink to this post)
  • Have you tried sorting by EXIF date/time original? (And/or use Batch Rename to give them names starting with the EXIF date/time in the order Year Month Day Time so that sorting by filename is meaningful.)

    This should bring all versions of the same image together (or at least those that still have embedded EXIF information). It might also be helpful to display the full EXIF date/time under the thumbnails, as you should then be able to see where each group of image files begins/ends (if you have images that are similar but not from the same original picture).

    Not an enviable task!

    Posted On November 1, 2007 - 08:55 AM (2 years ago) (Permalink to this post)
  • Marc Sabatella
    Moderator

    John_R said:

    Have you tried sorting by EXIF date/time original? (And/or use Batch Rename to give them names starting with the EXIF date/time in the order Year Month Day Time so that sorting by filename is meaningful.)

    Assuming the EXIF data was there in the first place is has not been stripped out, I think this is likely to be the ticket. In any case, it definitely seems that we are talking about different versions of the same image, so hopefully *something* in the metadata will allow you to connect them without needing any application to guess which pictures seem similar.

    Posted On November 1, 2007 - 09:44 AM (2 years ago) (Permalink to this post)
  • Marc Sabatella said:

    Assuming the EXIF data was there in the first place is has not been stripped out, I think this is likely to be the ticket. In any case, it definitely seems that we are talking about different versions of the same image, so hopefully *something* in the metadata will allow you to connect them without needing any application to guess which pictures seem similar.

    thankyou for all your suggests, and your interest!

    the exif-data is dead and gone, original filenames too, and sorting filesize doesn't succeed, too:
    the dumps from exilim-memory card have all the same size 'coz they have not been altered nor compressed. they are "raw", but named tif. strange policy of casio...

    but thumbs+ did a nice job; and i really wondered how fast > 10.000 images have been thumbd. no, i didn't keep any td4-data from last years installation. all directories have been new "thumbd".

    btw: i don't think searching for similar not duplicate pics is an exotic feature - several shareware does that. some people seem to need it ;-)

    regards-antoine

    casio exilim - p 700

    centrino duo 1,6 mhz, 2 gb ram, ati mobility x1300

    windows xp mce, acdsee pro 2.019

    Posted On November 1, 2007 - 05:28 PM (2 years ago) (Permalink to this post)
  • @ Marc Sabatella

    marc, to give you a clue what i'm doing:

    i'm shooting all the movieposters i obtained the last 30 years for an online-collection. i'm not able to do a "repro" but try to let interested people see how that posters did look like

    regards, www.filmplakat-oldies.de

    Posted On November 1, 2007 - 05:40 PM (2 years ago) (Permalink to this post)
  • Marc Sabatella
    Moderator

    antoinebrisebard said:

    the exif-data is dead and gone

    Hopefully you will not make the mistake of stripping that away in the future!

    btw: i don't think searching for similar not duplicate pics is an exotic feature - several shareware does that. some people seem to need it ;-)

    I don't think anyone called it "exotic". I was primarily observing that different people might have entirely different notions of what constitutes "similar", so it's important to define that. Since not many people have requested this feature that I'm aware of, the folks at ACD probably don't already have an idea of what the msot common use for this would be. And of course, there is no way of knowing whether your particular use is common. Someone at ACD reading your original request might have thought to themselves, "I know - we'll use EXIF data to define what is similar", and that wouldn't have helped you a bit.

    So now it's clearer we are talking about looking at the image data itself. This might not be an "exotic" thing, but it's also basically a form of artificial intelligence (which is why I used the term "magic" and bound to be something one can not ascertain with any real precision. Sort of like spell checking, and the way a spell checker will often flag perfectly good words as errors because it doesn't know them, or miss spelling mistakes because you happened to spell another word, or make sometimes amusing suggestions as to what word it thinks you were trying to spell. In the case of algorithm to identify similar pictures, I'd expect potential issues to include things such as rotation & cropping, changes in the lighting in photos taken of the same scene, pictures using the same pose but different models, etc. Obviously, in a position like yours, you'll take whatever you can get. But again, that's why it is important define what exactly would constitute "similar" for you, and to get an idea of how that compares with what others might expect, so anyone considering implementing this type of feature would b sure to try to address the types of issues that would be of the most benefit.

    Posted On November 3, 2007 - 07:45 PM (2 years ago) (Permalink to this post)

Subscribe to this topic via RSS

Reply

You must log in to post.