SYCL: Add PackedFloat3 for AMD CDNA2 devices

added SYCL label

Tested this too with ROCm 5.3,5.4,5.5,5.6, and 5.7, nothing much has changed (wrt 84dc9ef9 which is the last version I tested). I suggest to remove the draft tag.

added 1 commit

fd660059 - Add asserts and inlining attributes

Compare with previous version

added 1 commit

04adcba1 - Add asserts and inlining attributes

Compare with previous version

marked this merge request as ready

resolved all threads

approved this merge request

A couple of sentences in the commit message for the git history would be useful.

changed the description

added 1 commit

3155dea1 - Add a macro to toggle use of packed float3

Compare with previous version

added 1 commit

ddb78ba2 - Move the struct to sycl_kernel_utils

Compare with previous version

approved this merge request

added 13 commits

ddb78ba2...336eea4e - 5 commits from branch main
5da5e884 - Add FastFloat3 class, don't use it yet
40ff2ccb - Use FastFloat3 for fCiBuf
26f2e241 - Fix build
d0b8b629 - Rename the class, add docs
6f3f634c - Silence compiler warnings
7c397454 - Add asserts and inlining attributes
99f08040 - Add a macro to toggle use of packed float3
3914b5dc - Move the struct to sycl_kernel_utils

Compare with previous version

resolved all threads

SYCL: Add PackedFloat3 for AMD CDNA2 devices

Merge request reports

Activity