You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
hi guys
In our previous project, we were using MKL's fft, and during the migration process, we encountered two issues with FFTW.
The first issue is that when using fft's c2r multiple times, the input data is defaulted to be destroyed, and we need to use the FFTW_DESTROY_INPUT flag to make it work correctly. This behavior is not present in MKL, but we had to add this flag during the migration. When I read the documentation, I found that using the PRESERVE_FLAG may lead to NaN results in the plan. We believe that having the input destroyed by default is not safe, as we want to preserve the input data as well as ensure the plan returns valid results. What would be a more reasonable way to handle this situation?
The second issue arises when using the inplace mode; the results do not match our expectations, while they are correct with MKL. Here is a relevant portion of the code:
fftwf_complex *a = (fftwf_complex *)malloc(4 * 4 * (4 / 2 + 1) * 2 * sizeof(float));
for(int i = 0; i < 64; i++)
{
((float *)a)[i] = i;
}
fftwf_plan plan1 = fftwf_plan_dft_r2c_3d(4, 4, 4, (float *)a, a, FFTW_ESTIMATE);
for(int i = 0; i < 96; i++)
{
printf("%d : %f | ",i, ((float *)a)[i]);
}
printf("\n===========\n");
fftwf_plan plan2 = fftwf_plan_dft_c2r_3d(4, 4, 4, a, (float *)a, FFTW_ESTIMATE);
fftwf_execute(plan1);
fftwf_execute(plan2);
for(int i = 0; i < 96; i++)
{
printf("%d : %f | ", i, ((float *)a)[i]);
}
printf("\n===========\n");
// and this get worse
float *b = (float *)malloc(4 * 4 * 4 * sizeof(float));
for(int i = 0; i < 64; i++)
{
b[i] = i;
}
fftwf_execute_dft_r2c(plan1, b, a);
fftwf_execute_dft_c2r(plan2, a, b);
for(int i = 0; i < 64; i++)
{
printf("%d : %f | ",i, b[i]);
}
printf("\n===========\n");
return;
In consule, we can find results at positions 4 and 5 is not what we expect. In this case, the data processed by the FFT is not from indices 0 to 63, but rather fromy 0 ~ 3, 6 ~ 9, 12 ~ 15, ect. It appears to treat the array as a 4 * 4 * 6 cube and processes 4 elements per line. However, we need it to be treated as a linear array rather than a cube. And using another input will get worse.
Although we can handle this by using out-of-place transforms, we are curious if there is a more reasonable way to achieve the desired behavior with inplace transforms, perhaps by changing a flag or build configuration. Additionally, considering that the default setting leads to unexpected results, should the default behavior prioritize safety over efficiency?
The text was updated successfully, but these errors were encountered:
hi guys
In our previous project, we were using MKL's fft, and during the migration process, we encountered two issues with FFTW.
The first issue is that when using fft's c2r multiple times, the input data is defaulted to be destroyed, and we need to use the FFTW_DESTROY_INPUT flag to make it work correctly. This behavior is not present in MKL, but we had to add this flag during the migration. When I read the documentation, I found that using the PRESERVE_FLAG may lead to NaN results in the plan. We believe that having the input destroyed by default is not safe, as we want to preserve the input data as well as ensure the plan returns valid results. What would be a more reasonable way to handle this situation?
The second issue arises when using the inplace mode; the results do not match our expectations, while they are correct with MKL. Here is a relevant portion of the code:
In consule, we can find results at positions 4 and 5 is not what we expect. In this case, the data processed by the FFT is not from indices 0 to 63, but rather fromy 0 ~ 3, 6 ~ 9, 12 ~ 15, ect. It appears to treat the array as a 4 * 4 * 6 cube and processes 4 elements per line. However, we need it to be treated as a linear array rather than a cube. And using another input will get worse.
Although we can handle this by using out-of-place transforms, we are curious if there is a more reasonable way to achieve the desired behavior with inplace transforms, perhaps by changing a flag or build configuration. Additionally, considering that the default setting leads to unexpected results, should the default behavior prioritize safety over efficiency?
The text was updated successfully, but these errors were encountered: