getting a transformation matrix from a specific point of view of a 3d plot, matplotlib vs pytorch3d

101 Views Asked by At

I've been working on a problem where numerous 2d projections of a 3d plot are created based on spherical coordinates around the 3d plot. I have been using matplotlib to successfully generate these 2d projections. Now I would like to switch from matplotlib to pytorch3d to generate the exact same 2d projections, as with the pytorch3d library it is possible to use the functions for backpropagation.

I have the following matplotlib code with a minimal example:

    
import matplotlib.pyplot as plt
import numpy as np
    
# random coords
coords_3d = np.random.rand(10, 3)
# viewpoint where viewpoint[0] is elevation, viewpoint[1] is azimuth and viewpoint[2] is distance
viewpoints_spherical = np.array([ -0., -90.,   1.])

print(coords_3d)
print(viewpoints_spherical)

fig = plt.figure()
ax = fig.add_subplot(111, projection='3d')

ax.scatter(coords_3d[:, 0], coords_3d[:, 1], coords_3d[:, 2], s = 10)

ax.set_xlabel('x')
ax.set_ylabel('y')
ax.set_zlabel('z')

# ensure equal aspect ratio:
max = 1
ax.set_xlim(-max, max)
ax.set_ylim(-max, max)
ax.set_zlim(-max, max)

# rotate the axes and update
ax.set_box_aspect((4, 4, 4), zoom=1.2)

# set the elevation and azimuth
ax.view_init(viewpoints_spherical[0], viewpoints_spherical[1])
# get the transformation matrix
TM = np.linalg.inv(ax.get_proj())
print(TM)

The transformation matrix I receive here has the following values:

[[ 1.57792186e+00  0.00000000e+00 -1.05110711e-16 -1.05110711e-16]
 [ 1.05110711e-16  0.00000000e+00  1.57792186e+00  1.57792186e+00]
 [ 0.00000000e+00  1.57792186e+00  0.00000000e+00  0.00000000e+00]
 [-0.00000000e+00 -0.00000000e+00 -1.00000000e-01 -0.00000000e+00]]

Now, I would like to acquire the exact same transformation matrix using the pytorch3d library.

from pytorch3d.renderer import (look_at_view_transform, get_world_to_view_transform,)
distance = 1
elevation = -0
azimuth = -90
device = 'cpu'

R, T = look_at_view_transform(distance, elevation, azimuth, device=device)
res = get_world_to_view_transform(R, T).get_matrix()[0]
print(res)

The transformation matrix I get with pytorch3d has the following values:

tensor([[ 4.3711e-08,  0.0000e+00,  1.0000e+00,  0.0000e+00],
        [ 0.0000e+00,  1.0000e+00,  0.0000e+00,  0.0000e+00],
        [-1.0000e+00,  0.0000e+00,  4.3711e-08,  0.0000e+00],
        [ 0.0000e+00,  0.0000e+00,  1.0000e+00,  1.0000e+00]])

There is a difference between these projection matrices which will lead to different 2d projections from the original 3d coordinates.

I have been inspecting the get_proj() function from matplotlib (https://github.com/matplotlib/matplotlib/blob/v3.8.0/lib/mpl_toolkits/mplot3d/axes3d.py#L896-L958) and I have replicated each step within the function. It seems that here the projection matrix computation takes into account the box aspect. I suspect that this may lead to some differences between the get_proj() function from matplotlib and the functions from pytorch3d. The problem here is that I don't want to change the code written using matplotlib, as this code has been used to generate the projection matrices already. I am simply looking for a way to replicate the transformation matrix using pytorch3d.

I could also be mistaken and that I have been misusing/misunderstanding the pytorch3d functions look_at_view_transform() and get_world_to_view_transform().

Any help is appreciated.

0

There are 0 best solutions below