• Ondrej Mosnáček's avatar
    [CUDA,OpenCL] Use RAM buffer for in/out blocks · 58141506
    Ondrej Mosnáček authored
    This allows to do the CPU pre-/post-processing of a password batch to
    be done in parallel with the GPU computation. This means we can now
    assume the BLAKE2 computation cost to be hidden behind the GPU
    computation time (for real).
    
    This only adds the overhead of copying the data from/to the RAM buffer
    to the GPU computation time, but this is fast thanks to the rectangular
    copy operations that are used. This should significantly affect only
    hashes with low cost parameters. For these the benchmark tool was
    reporting too optimistic times before this commit.
    58141506
Name
Last commit
Last update
data/kernels Loading commit data...
ext Loading commit data...
include Loading commit data...
lib Loading commit data...
scripts Loading commit data...
src Loading commit data...
.gitattributes Loading commit data...
.gitignore Loading commit data...
.gitlab-ci.yml Loading commit data...
.gitmodules Loading commit data...
CMakeLists.txt Loading commit data...
LICENSE Loading commit data...
README.md Loading commit data...