Normally I am using this kind of matlab code to do a projective transformation with a single set of reference points:
fixedpoints = [0 0; 50 0; 50 100; 0 100];
movingpoints= [752 361; 888 361; 885 609; 736 609];
transformationtype='projective';
tform = fitgeotrans(movingpoints,fixedpoints,transformationtype);
Imagepr = imwarp(Image,tform);
Now I have a photo that shows some (for example 10) jumbled paper sheets, each with known size, situated on a plain large table. The distances between these sheets are unknown.
So I would have 10 4-reference-point sets to obtain the projective transformation matrix and later the projected image of the 2d table surface.
How can I produce an optimized transformation matrix (and later a projected image), which would include all 4-points sets as a kind of best fit?
Thanks in advance.
I feel the need to reformulate the problem as follows: you have a table with rectangular sheets of the same size* in arbitrary positions and an image taken by a camera in arbitrary position. You want to compute the homographic transform that maps the plane of the table to the image (or conversely). The coordinates of the four corners of every sheet are known in the image, but not on the table, and point correspondences are not available. The reference coordinate system of the table is arbitrary.
As hinted by the OP, the problem can be solved for every sheet independently by assigning "table" coordinates to the corners. The equations are linear. (Anyway two solutions are possible by swapping the long and short sides of the sheets.)
It appears to be impossible to combine the various per-sheet solutions, as the relative placement of the sheets is unknown, and the coefficients are unrelated.
Starting from an initial solution, one can think of using the least-squares approach to refine the solution, minimizing the total discrepancy between the known side lengths and those estimated by the transform. (We can use the known sides and diagonals, even though one of these six elements can be derived from the other five. The orientation ambiguity is solved by taking the orientation that gives the minimum discrepancy.)
Unfortunately, the equations aren't nice, calling for Levenberg-Marquardt or constrained minimization: sledge hammers.
The OP made an excellent suggestion: solve the projection problem for all sheets and compute the discrepancy metric for each and keep the solution with the lowest discrepancy. This reminds of RANSAC, where instead of random attempts, an exhaustive search is made**.
If one wants to further improve this solution (presumably an already pretty good one), one can think of derivative-less methods like the simplex (Nelder–Mead), Hooke-Jeeves patterns or sequential descent, trying to improve the discrepancy metric.
Taking the few best solutions has the benefit of giving an order of magnitude for the increments to apply to the coefficients. An extra difficulty appears, due to the fact that the "table" coordinate system isn't fixed. To avoid a drift of the solutions, one should add constrains, such as distances to the corners of the reference sheet.
*If the sheets are of different sizes, we can use the same approach if the sheet correspondence is known (but exact corner correspondence is unnecessary).
**If necessary, for increased robustenss, it should be possible to integrate an outlier rejection process in the minimization.
The projection of a plane onto the image is described by an homographic relation
There are eight unknown parameters, and it takes four point correspondences to determine them.
These equations are easily linearized as