Update manual about DFTI vs FFTW3 with MKL
Using DFTI is better than FFTW3 interface with MKL. The performance difference may not be caused by MKL but some inefficiency in the OpenMP threaded part used by FFTW3 code path. Even when I use ESSL on BG/Q, the current code works better avoiding FFTW3.