Is there a way to resize a chunked dataset in HDF5 using Julia's HDF5.jl? I didn't see anything in the documentation. Looking through the source, all I found was set_dims!()
, but that cannot extend a dataset (only shrink it). Does HDF5.jl have the ability to enlarge an existing (chunked) dataset? This is a very important feature for me, and I would rather not have to call into another language.
Resize HDF5 dataset in Julia
504 Views Asked by Brendan At
2
There are 2 best solutions below
0

I believe I've got it figured out. The issue is that I forgot to give the dataspace a large enough max_dims. Doing that required digging into the lower-level API. The solution I found was:
dspace = HDF5.dataspace((6,20)::Dims, max_dims=(6,typemax(Int64)))
dtype = HDF5.datatype(Float64)
dset = HDF5.d_create(prt, "trajectory", dtype, dspace, "chunk", (6,10))
Once I created a dataset that can be resized appropriately, the set_dims!
function resizes the dataset correctly.
I think I located a few minor issues with the API, which I had to work around or change in my local version. I will get in touch with the HDF5.jl owner regarding those. For those interested:
- The constant
H5S_UNLIMITED
is of typeUint64
, but thedataspace
function will only accept tuples ofInt64
, hence why I usedtypemax(Int64)
for mymax_dims
to imitate howH5S_UNLIMITED
is derived. - The form of
d_create
which I used callsh5d_create
incorrectly; it passesparent
instead ofcheckvalid(parent).id
(can be seen by comparison with other forms ofd_create
).
The docs have a brief mention of extendible dimensions in
hdf5.md
excerpted below.You can use extendible dimensions,
where dims is a tuple of integers. For example