I refer to the -s option, providing the size of a text square area.
I have not grasped the placement algorithm as yet but I thought it tried to find an area with the largest area of background colour and place the text there - so that it covered least amount of image information.
If the `uncluttered' background was T shaped with the top and the height of the T about the same
as the -s dimension, then the algorithm might position the text square more or less straddling the entire T. However, if the text were rectangular, like a three digit number, it might fit entirely in the top of the T.
There is an example of a numbered grid called IndexedGrid.gif and an example
of an indexed map called Map-4x3-indexed.gif at
www.discway.com.au/cmad. There is a `T'
in the indexed grid, but it could have been made a little more `challenging'. Note that each
cell of the grid before indexing was cut out as a separate image and numbered using autocaption.
The individual numbered images were identical with the numbered grid, as one would hope.
For an uncluttered image (at least in the examples under consideration) it would look better
if the text was near the center of the grid. However the captioning algorithm default
position appears to be the top left corner of the image.
If I fully understood Fred's algorithm, I would adjust the text colour transparency number in accordance with a
measure of the image `clutter'.
Regards