Seeking advice on a complex image cataloging project
Posted: 2014-02-21T10:01:36-07:00
I'm a linux user (Arch) who has done a bit of tinkering with image files using imagemagick over the years. So, though I'm not an IM power user by any means, I'm not a complete novice, either.
I've now been called on to assist an elderly artist who is in the early stages of Alzheimers to catalog his work. For the past 10-15 years this artist has worked almost exclusively on a computer and has made extensive use of Photoshop in producing his visual art. Probably 75% of the corpus has been written to CD's, and it looks like he was taking a bit more of an organized approach toward the beginning of that period. But during the last 5-7 years organizational attempts have tapered off, and it is unclear what material is on which CD. I am currently tackling this most recent set of CD's, trying to determine what on them is artwork worthy of cataloging.
My first task has been to copy the content of those more poorly organized CD's to an external hard drive so they'll be more readily accessible. I'm about halfway through the copying process, so I am experimenting with ways to process the data that's now been transferred and to make it more "perusable" so that we can better distinguish which of the data belongs in the catalog. There are a number of digital formats involved, mainly tif, jpg, psd, with maybe a few gifs and bmps.
I've experimented a bit with creating contact sheets, thinking this might be a good way to generate a visual record of images contained on those CD's and thus allowing us to discern the artwork from other types of images (from, say, personal photos or photos that might have been intended as elements in some artwork, but which were never used). Having now looked over more carefully the montage documentation at http://www.imagemagick.org/Usage/montage/#html I'm beginning to think an alternate approach, i.e., visual index images with clickable links (such as you get from running convert 'vid:../img_photos/*_orig.*' vid_index.html) might be a good initial approach to take in helping us discern which images are artwork worthy of cataloging. So my initial questions will be relating to that solution.
A test I ran using that convert 'vid:../img_photos/*_orig.*' vid_index.html command produced what seem to me satisfactory results, with a few caveats. First, some of the file names, which can be fairly long, tend to get a little jumbled as their width exceeds the width of the thumbnails produced and as they obtrude a bit into the adjacent image's caption: is there some way to cause these file names better to break across lines when they are long like this? We may be able to contend with this jumbling since this is really just a preliminary phase in the cataloging process, but if there is some simple way to rectify it, that would help this phase go a bit more smoothly.
Next, in relation to the visual index images approach, the following. It seems to me the varying range of image formats with which we're dealing should be no problem with respect to generating the montage--does that sound correct? However, when it comes to the fuller-resolution images to which the clickable links point, those will likely need to be converted to something typical web browsers can display, correct? So, though I might be able to produce a montage from tif's, jpg's, and even psd's, the browser will be unable to display some of those images (e.g., the tif's and psd's), once a relevant segment of the montage has been clicked, correct?
If I'm understanding that aspect of how visual index images works, then I'll be needing to convert all images to something like png or jpg in order to use that solution. That introduces a new layer of complexity to this task, since I'll need to determine a way of trawling all sub-directories on the external hard drive to which I'm copying, for relevant image files, then converting, very likely reducing them in size, then copying them into a distinct sub-directory, before generating the montage. If you'll bear with me, I'll pose inquiries later about that process.
With this note, I'd just like to get feedback as to whether it seems I'm taking a sound approach at the outset of this project, whether visual index images seems a good solution to implement at this initial phase and, finally, whether I've correctly understood how visual index images works. Your input will be appreciated.
I've now been called on to assist an elderly artist who is in the early stages of Alzheimers to catalog his work. For the past 10-15 years this artist has worked almost exclusively on a computer and has made extensive use of Photoshop in producing his visual art. Probably 75% of the corpus has been written to CD's, and it looks like he was taking a bit more of an organized approach toward the beginning of that period. But during the last 5-7 years organizational attempts have tapered off, and it is unclear what material is on which CD. I am currently tackling this most recent set of CD's, trying to determine what on them is artwork worthy of cataloging.
My first task has been to copy the content of those more poorly organized CD's to an external hard drive so they'll be more readily accessible. I'm about halfway through the copying process, so I am experimenting with ways to process the data that's now been transferred and to make it more "perusable" so that we can better distinguish which of the data belongs in the catalog. There are a number of digital formats involved, mainly tif, jpg, psd, with maybe a few gifs and bmps.
I've experimented a bit with creating contact sheets, thinking this might be a good way to generate a visual record of images contained on those CD's and thus allowing us to discern the artwork from other types of images (from, say, personal photos or photos that might have been intended as elements in some artwork, but which were never used). Having now looked over more carefully the montage documentation at http://www.imagemagick.org/Usage/montage/#html I'm beginning to think an alternate approach, i.e., visual index images with clickable links (such as you get from running convert 'vid:../img_photos/*_orig.*' vid_index.html) might be a good initial approach to take in helping us discern which images are artwork worthy of cataloging. So my initial questions will be relating to that solution.
A test I ran using that convert 'vid:../img_photos/*_orig.*' vid_index.html command produced what seem to me satisfactory results, with a few caveats. First, some of the file names, which can be fairly long, tend to get a little jumbled as their width exceeds the width of the thumbnails produced and as they obtrude a bit into the adjacent image's caption: is there some way to cause these file names better to break across lines when they are long like this? We may be able to contend with this jumbling since this is really just a preliminary phase in the cataloging process, but if there is some simple way to rectify it, that would help this phase go a bit more smoothly.
Next, in relation to the visual index images approach, the following. It seems to me the varying range of image formats with which we're dealing should be no problem with respect to generating the montage--does that sound correct? However, when it comes to the fuller-resolution images to which the clickable links point, those will likely need to be converted to something typical web browsers can display, correct? So, though I might be able to produce a montage from tif's, jpg's, and even psd's, the browser will be unable to display some of those images (e.g., the tif's and psd's), once a relevant segment of the montage has been clicked, correct?
If I'm understanding that aspect of how visual index images works, then I'll be needing to convert all images to something like png or jpg in order to use that solution. That introduces a new layer of complexity to this task, since I'll need to determine a way of trawling all sub-directories on the external hard drive to which I'm copying, for relevant image files, then converting, very likely reducing them in size, then copying them into a distinct sub-directory, before generating the montage. If you'll bear with me, I'll pose inquiries later about that process.
With this note, I'd just like to get feedback as to whether it seems I'm taking a sound approach at the outset of this project, whether visual index images seems a good solution to implement at this initial phase and, finally, whether I've correctly understood how visual index images works. Your input will be appreciated.