Over the past year or so I've noticed a few C++-related answers on StackOverflow refer to mdspan
's - but I've never actually seen these in C++ code. I tried looking for them in my C++ compiler's standard library directory and in the C++ coding guidelines - but couldn't find them. I did find std::span
's; I'm guessing they're related - but how? And what does this addition of "md" stand for?
Please explain what this mysterious entity is about, and when I might want to use it.
TL;DR:
mdspan
is an extension ofstd::span
for multiple dimensions - with a lot of (unavoidable) flexibile configurability w.r.t. memory layout and modes of access.Before you read this answer, you should make sure you're clear on what a
span
is and what it's used for. Now that that's out of the way: Sincemdspan
's can be rather complex beasts (typically ~7x or more source code as anstd::span
implementation), we'll start with a simplified description, and keep the advanced capabilities for further below."What is it?" (simple version)
An
mdspan<T>
is:T
elements).std::span<T>
, from a uni-dimensional/linear sequence of elements to multiple dimensions.T
in memory, interpreted as a multi-dimensional array.struct { T * ptr; size_type extents[d]; }
with some convenience methods (ford
dimensions determined at run-time).Illustration of
mdspan
-interpreted layoutIf we have:
we can view the data of
v
as a 1D array of 12 elements, similar to its original definition:or a 2D array of extents 2 x 6:
or a 3D array 2 x 3 x 2:
and we could also consider it as a 3 x 2 x 2 or 2 x 2 x 3 array, or 3 x 4 and so on.
"When should I use it?"
(C++23 and later) When you want to use the multi-dimensional
operator[]
on some buffer you get from somewhere. Thus in the example above,ms3[1, 2, 0]
is11
andms3[0, 1, 1]
is4
.When you want to pass multi-dimensional data without separating the raw data pointer and the dimensions. You've gotten a bunch of elements in memory, and want to refer to them using more than one dimension. Thus instead of:
you could write:
As the right type for passing multidimensional C arrays around:
C supports multidimensional arrays perfectly... as long as their dimensions are given at compile time, and you don't try passing them to functions. Doing that is a bit tricky because the outermost dimension experiences decay, so you would actually be passing a pointer. But with mdspans, you can write this:
Standardization status
While
std::span
was standardized in C++20,std::mdspan
was not. However, it is part of C++23, which is almost-finalized (awaiting final ballot).You can already use a reference implementation. It is part of the US' Sandia National Laboratory's "Kokkos performance portability ecosystem".
"What are those 'extra capabilities' which
mdspan
offers?"An
mdspan
actually has 4 template parameters, not just the element type and the extents:This answer is already rather long, so we won't give the full details, but:
Some of the extents can be "static" rather than "dynamic", specified in compile-time, and thus not stored in instance data members. Only the "dynamic" instances are stored. For example, this:
... is an extents objects corresponding to
dextents<size_t>{ 2, 3, 4 }
, but which only stores the values2
and4
in the class instance; with the compiler knowing it needs to plug in3
whenever the second dimension is used.You can have the dimensions go from-minor-to-major, in Fortran style instead of from-major-to-minor like in C. Thus, if you set
LayoutPolicy = layout_left
, thenmds[x,y]
is atmds.data[mds.extent(0) * y + x]
instead of the usualmds.data[mds.extent(1) * x + y]
.You can "reshape" your
mdspan
into anothermdspan
with different dimensions but the same overall size.You can define a layout policy with "strides": Have consecutive elements in the mdspan be at a fixed distance in memory; have extra offsets and the beginning and/or the end of each line or dimensional slice; etc.
You can "cut up" your
mdspan
with offsets in every dimension (e.g. take a submatrix of a matrix) - and the result is still anmdspan
! ... that's because you can have anmdspan
with aLayoutPolicy
which incorporates these offsets. This functionality is not available in C++23 IIANM.Using the
AccessorPolicy
, you can makemdspan
's which actually do own the data they refer to, individually or collectively.Further reading
std::mdspan
proposal, accepted into C++23.std::mdspan
page on cppreference.commdspan
's, by Asher Macinelli.(some examples were adapted from these sources.)