xref: /aosp_15_r20/external/eigen/doc/UnalignedArrayAssert.dox (revision bf2c37156dfe67e5dfebd6d394bad8b2ab5804d4)
1*bf2c3715SXin Linamespace Eigen {
2*bf2c3715SXin Li
3*bf2c3715SXin Li/** \eigenManualPage TopicUnalignedArrayAssert Explanation of the assertion on unaligned arrays
4*bf2c3715SXin Li
5*bf2c3715SXin LiHello! You are seeing this webpage because your program terminated on an assertion failure like this one:
6*bf2c3715SXin Li<pre>
7*bf2c3715SXin Limy_program: path/to/eigen/Eigen/src/Core/DenseStorage.h:44:
8*bf2c3715SXin LiEigen::internal::matrix_array<T, Size, MatrixOptions, Align>::internal::matrix_array()
9*bf2c3715SXin Li[with T = double, int Size = 2, int MatrixOptions = 2, bool Align = true]:
10*bf2c3715SXin LiAssertion `(reinterpret_cast<size_t>(array) & (sizemask)) == 0 && "this assertion
11*bf2c3715SXin Liis explained here: http://eigen.tuxfamily.org/dox-devel/group__TopicUnalignedArrayAssert.html
12*bf2c3715SXin Li**** READ THIS WEB PAGE !!! ****"' failed.
13*bf2c3715SXin Li</pre>
14*bf2c3715SXin Li
15*bf2c3715SXin LiThere are 4 known causes for this issue.
16*bf2c3715SXin LiIf you can target \cpp17 only with a recent compiler (e.g., GCC>=7, clang>=5, MSVC>=19.12), then you're lucky: enabling c++17 should be enough (if not, please <a href="http://eigen.tuxfamily.org/bz/">report</a> to us).
17*bf2c3715SXin LiOtherwise, please read on to understand those issues and learn how to fix them.
18*bf2c3715SXin Li
19*bf2c3715SXin Li\eigenAutoToc
20*bf2c3715SXin Li
21*bf2c3715SXin Li\section where Where in my own code is the cause of the problem?
22*bf2c3715SXin Li
23*bf2c3715SXin LiFirst of all, you need to find out where in your own code this assertion was triggered from. At first glance, the error message doesn't look helpful, as it refers to a file inside Eigen! However, since your program crashed, if you can reproduce the crash, you can get a backtrace using any debugger. For example, if you're using GCC, you can use the GDB debugger as follows:
24*bf2c3715SXin Li\code
25*bf2c3715SXin Li$ gdb ./my_program          # Start GDB on your program
26*bf2c3715SXin Li> run                       # Start running your program
27*bf2c3715SXin Li...                         # Now reproduce the crash!
28*bf2c3715SXin Li> bt                        # Obtain the backtrace
29*bf2c3715SXin Li\endcode
30*bf2c3715SXin LiNow that you know precisely where in your own code the problem is happening, read on to understand what you need to change.
31*bf2c3715SXin Li
32*bf2c3715SXin Li\section c1 Cause 1: Structures having Eigen objects as members
33*bf2c3715SXin Li
34*bf2c3715SXin LiIf you have code like this,
35*bf2c3715SXin Li
36*bf2c3715SXin Li\code
37*bf2c3715SXin Liclass Foo
38*bf2c3715SXin Li{
39*bf2c3715SXin Li  //...
40*bf2c3715SXin Li  Eigen::Vector4d v;
41*bf2c3715SXin Li  //...
42*bf2c3715SXin Li};
43*bf2c3715SXin Li//...
44*bf2c3715SXin LiFoo *foo = new Foo;
45*bf2c3715SXin Li\endcode
46*bf2c3715SXin Li
47*bf2c3715SXin Lithen you need to read this separate page: \ref TopicStructHavingEigenMembers "Structures Having Eigen Members".
48*bf2c3715SXin Li
49*bf2c3715SXin LiNote that here, Eigen::Vector4d is only used as an example, more generally the issue arises for all \ref TopicFixedSizeVectorizable "fixed-size vectorizable Eigen types".
50*bf2c3715SXin Li
51*bf2c3715SXin Li\section c2 Cause 2: STL Containers or manual memory allocation
52*bf2c3715SXin Li
53*bf2c3715SXin LiIf you use STL Containers such as std::vector, std::map, ..., with %Eigen objects, or with classes containing %Eigen objects, like this,
54*bf2c3715SXin Li
55*bf2c3715SXin Li\code
56*bf2c3715SXin Listd::vector<Eigen::Matrix2d> my_vector;
57*bf2c3715SXin Listruct my_class { ... Eigen::Matrix2d m; ... };
58*bf2c3715SXin Listd::map<int, my_class> my_map;
59*bf2c3715SXin Li\endcode
60*bf2c3715SXin Li
61*bf2c3715SXin Lithen you need to read this separate page: \ref TopicStlContainers "Using STL Containers with Eigen".
62*bf2c3715SXin Li
63*bf2c3715SXin LiNote that here, Eigen::Matrix2d is only used as an example, more generally the issue arises for all \ref TopicFixedSizeVectorizable "fixed-size vectorizable Eigen types" and \ref TopicStructHavingEigenMembers "structures having such Eigen objects as member".
64*bf2c3715SXin Li
65*bf2c3715SXin LiThe same issue will be exhibited by any classes/functions by-passing operator new to allocate memory, that is, by performing custom memory allocation followed by calls to the placement new operator. This is for instance typically the case of \c `std::make_shared` or `std::allocate_shared` for which is the solution is to use an \ref aligned_allocator "aligned allocator" as detailed in the \ref TopicStlContainers "solution for STL containers".
66*bf2c3715SXin Li
67*bf2c3715SXin Li\section c3 Cause 3: Passing Eigen objects by value
68*bf2c3715SXin Li
69*bf2c3715SXin LiIf some function in your code is getting an %Eigen object passed by value, like this,
70*bf2c3715SXin Li
71*bf2c3715SXin Li\code
72*bf2c3715SXin Livoid func(Eigen::Vector4d v);
73*bf2c3715SXin Li\endcode
74*bf2c3715SXin Li
75*bf2c3715SXin Lithen you need to read this separate page: \ref TopicPassingByValue "Passing Eigen objects by value to functions".
76*bf2c3715SXin Li
77*bf2c3715SXin LiNote that here, Eigen::Vector4d is only used as an example, more generally the issue arises for all \ref TopicFixedSizeVectorizable "fixed-size vectorizable Eigen types".
78*bf2c3715SXin Li
79*bf2c3715SXin Li\section c4 Cause 4: Compiler making a wrong assumption on stack alignment (for instance GCC on Windows)
80*bf2c3715SXin Li
81*bf2c3715SXin LiThis is a must-read for people using GCC on Windows (like MinGW or TDM-GCC). If you have this assertion failure in an innocent function declaring a local variable like this:
82*bf2c3715SXin Li
83*bf2c3715SXin Li\code
84*bf2c3715SXin Livoid foo()
85*bf2c3715SXin Li{
86*bf2c3715SXin Li  Eigen::Quaternionf q;
87*bf2c3715SXin Li  //...
88*bf2c3715SXin Li}
89*bf2c3715SXin Li\endcode
90*bf2c3715SXin Li
91*bf2c3715SXin Lithen you need to read this separate page: \ref TopicWrongStackAlignment "Compiler making a wrong assumption on stack alignment".
92*bf2c3715SXin Li
93*bf2c3715SXin LiNote that here, Eigen::Quaternionf is only used as an example, more generally the issue arises for all \ref TopicFixedSizeVectorizable "fixed-size vectorizable Eigen types".
94*bf2c3715SXin Li
95*bf2c3715SXin Li
96*bf2c3715SXin Li\section explanation General explanation of this assertion
97*bf2c3715SXin Li
98*bf2c3715SXin Li\ref TopicFixedSizeVectorizable "Fixed-size vectorizable Eigen objects" must absolutely be created at properly aligned locations, otherwise SIMD instructions addressing them will crash.
99*bf2c3715SXin LiFor instance, SSE/NEON/MSA/Altivec/VSX targets will require 16-byte-alignment, whereas AVX and AVX512 targets may require up to 32 and 64 byte alignment respectively.
100*bf2c3715SXin Li
101*bf2c3715SXin Li%Eigen normally takes care of these alignment issues for you, by setting an alignment attribute on them and by overloading their `operator new`.
102*bf2c3715SXin Li
103*bf2c3715SXin LiHowever there are a few corner cases where these alignment settings get overridden: they are the possible causes for this assertion.
104*bf2c3715SXin Li
105*bf2c3715SXin Li\section getrid I don't care about optimal vectorization, how do I get rid of that stuff?
106*bf2c3715SXin Li
107*bf2c3715SXin LiThree possibilities:
108*bf2c3715SXin Li<ul>
109*bf2c3715SXin Li  <li>Use the \c DontAlign option to Matrix, Array, Quaternion, etc. objects that gives you trouble. This way %Eigen won't try to over-align them, and thus won"t assume any special alignment. On the down side, you will pay the cost of unaligned loads/stores for them, but on modern CPUs, the overhead is either null or marginal. See \link StructHavingEigenMembers_othersolutions here \endlink for an example.</li>
110*bf2c3715SXin Li  <li>Define \link TopicPreprocessorDirectivesPerformance EIGEN_MAX_STATIC_ALIGN_BYTES \endlink to 0. That disables all 16-byte (and above) static alignment code, while keeping 16-byte (or above) heap alignment. This has the effect of
111*bf2c3715SXin Li      vectorizing fixed-size objects (like Matrix4d) through unaligned stores (as controlled by \link TopicPreprocessorDirectivesPerformance EIGEN_UNALIGNED_VECTORIZE \endlink), while keeping unchanged the vectorization of dynamic-size objects
112*bf2c3715SXin Li      (like MatrixXd). On 64 bytes systems, you might also define it 16 to disable only 32 and 64 bytes of over-alignment. But do note that this breaks ABI compatibility with the default behavior of static alignment.</li>
113*bf2c3715SXin Li  <li>Or define both \link TopicPreprocessorDirectivesPerformance  EIGEN_DONT_VECTORIZE \endlink and `EIGEN_DISABLE_UNALIGNED_ARRAY_ASSERT`. This keeps the
114*bf2c3715SXin Li      16-byte (or above) alignment code and thus preserves ABI compatibility, but completely disables vectorization.</li>
115*bf2c3715SXin Li</ul>
116*bf2c3715SXin Li
117*bf2c3715SXin LiIf you want to know why defining `EIGEN_DONT_VECTORIZE` does not by itself disable 16-byte (or above) alignment and the assertion, here's the explanation:
118*bf2c3715SXin Li
119*bf2c3715SXin LiIt doesn't disable the assertion, because otherwise code that runs fine without vectorization would suddenly crash when enabling vectorization.
120*bf2c3715SXin LiIt doesn't disable 16-byte (or above) alignment, because that would mean that vectorized and non-vectorized code are not mutually ABI-compatible. This ABI compatibility is very important, even for people who develop only an in-house application, as for instance one may want to have in the same application a vectorized path and a non-vectorized path.
121*bf2c3715SXin Li
122*bf2c3715SXin Li\section checkmycode How can I check my code is safe regarding alignment issues?
123*bf2c3715SXin Li
124*bf2c3715SXin LiUnfortunately, there is no possibility in c++ to detect any of the aforementioned shortcoming at compile time (though static analyzers are becoming more and more powerful and could detect some of them).
125*bf2c3715SXin LiEven at runtime, all we can do is to catch invalid unaligned allocation and trigger the explicit assertion mentioned at the beginning of this page.
126*bf2c3715SXin LiTherefore, if your program runs fine on a given system with some given compilation flags, then this does not guarantee that your code is safe. For instance, on most 64 bits systems buffer are aligned on 16 bytes boundary and so, if you do not enable AVX instruction set, then your code will run fine. On the other hand, the same code may assert if moving to a more exotic platform, or enabling AVX instructions that required 32 bytes alignment by default.
127*bf2c3715SXin Li
128*bf2c3715SXin LiThe situation is not hopeless though. Assuming your code is well covered by unit test, then you can check its alignment safety by linking it to a custom malloc library returning 8 bytes aligned buffers only. This way all alignment shortcomings should pop-up. To this end, you must also compile your program with \link TopicPreprocessorDirectivesPerformance EIGEN_MALLOC_ALREADY_ALIGNED=0 \endlink.
129*bf2c3715SXin Li
130*bf2c3715SXin Li
131*bf2c3715SXin Li*/
132*bf2c3715SXin Li
133*bf2c3715SXin Li}
134