I'm trying to create a nested record array, but I am having trouble with the dimensions. I tried following the example at how to set dtype for nested numpy ndarray?, but I am misunderstanding something. Below is an MRE. The arrays are generated in a script, not imported from CSV.
arr1 = np.array([4, 5, 4, 5])
arr2 = np.array([0, 0, -1, -1])
arr3 = np.array([0.51, 0.89, 0.59, 0.94])
arr4 = np.array(
[[0.52, 0.80, 0.62, 1.1], [0.41, 0.71, 0.46, 0.77], [0.68, 1.12, 0.78, 1.19]]
).T
arr5 = np.repeat(np.array([0.6, 0.2, 0.2]), 4).reshape(3, 4).T
arrs = (arr1, arr2, arr3, arr4, arr5)
for i in arrs:
print(i.shape, i)
For which the print statement returns:
(4,) [4 5 4 5]
(4,) [ 0 0 -1 -1]
(4,) [0.51 0.89 0.59 0.94]
(4, 3) [[0.52 0.41 0.68]
[0.8 0.71 1.12]
[0.62 0.46 0.78]
[1.1 0.77 1.19]]
(4, 3) [[0.6 0.2 0.2]
[0.6 0.2 0.2]
[0.6 0.2 0.2]
[0.6 0.2 0.2]]
However, the ans
line throws an error:
dtypes = [
("state", "f8"),
("variability", "f8"),
("target", "f8"),
("measured", [("mean", "f8"), ("low", "f8"), ("hi", "f8")], (4,)),
("var", [("mid", "f8"), ("low", "f8"), ("hi", "f8")], (4,)),
]
ans = np.column_stack(arrs).view(dtype=dtypes)
ValueError: When changing to a larger dtype, its size must be a divisor of the total size in bytes of the last axis of the array.
Problem 1: How do I get the desired array output?
print(np.column_stack(arrs))
returns
[[ 4. 0. 0.51 0.52 0.41 0.68 0.6 0.2 0.2 ]
[ 5. 0. 0.89 0.8 0.71 1.12 0.6 0.2 0.2 ]
[ 4. -1. 0.59 0.62 0.46 0.78 0.6 0.2 0.2 ]
[ 5. -1. 0.94 1.1 0.77 1.19 0.6 0.2 0.2 ]]
But the desired output looks like this:
[[4 0 0.51 (0.52, 0.41, 0.68) (0.6, 0.2, 0.2)]
[5 -1 0.89 (0.8, 0.71, 1.12) (0.6, 0.2, 0.2)]
[4 0 0.59 (0.62, 0.46, 0.78) (0.6, 0.2, 0.2)]
[5 -1 0.94 (1.1, 0.77, 1.19) (0.6, 0.2, 0.2)]]
Problem 2: How do I include the dtype.names?
print(rec_array.dtype.names)
should return:
('state', 'variability', 'target', 'measured', 'var')
and print(rec_array['measured'].dtype.names)
should return:
('mean', 'low', 'high')
and similarly for the names of the other nested array.
With your dtype:
A 2 element zeros array looks like:
Using
recfunctions
I can map that to a unstructured array:That says that your dtypes has 27 fields, not the 9 that seem to think (from your column stack).
Making a new (2,27) array, I can create a structured array:
view
still has problems with this. In some simple casesview
does work, though it can require some dimensions adjustment. But I have not explored its limitations:edit
removing the (4,) from dtypes:
and via a csv and genfromtxt
view
still does not work.with your data