Better CUDA complex division.
The original produced NaNs when dividing 0/b for subnormal b. The complex_divide_stable was changed to use the more common Smith's algorithm.
The original produced NaNs when dividing 0/b for subnormal b. The complex_divide_stable was changed to use the more common Smith's algorithm.