Tags

Tags give the ability to mark specific points in history as being important.

master-37e257c

37e257c4 · make : clean *.so files (#1857) · Jun 15, 2023
master-64cc19b

64cc19b4 · Fix the validation of main device (#1872) · Jun 15, 2023
master-4bfcc85

4bfcc855 · metal : parallel command buffer encoding (#1860) · Jun 15, 2023
master-6b8312e

6b8312e7 · Better error when using both LoRA + GPU layers (#1861) · Jun 15, 2023
master-254a7a7

254a7a7a · CUDA full GPU acceleration, KV cache in VRAM (#1827) · Jun 14, 2023
master-9254920

92549202 · baby-llama : fix operator!= (#1821) · Jun 13, 2023
master-e32089b

e32089b2 · train : improved training-from-scratch example (#1652) · Jun 13, 2023
master-2347e45

2347e45e · llama : do a warm-up eval at start for better timings (#1824) · Jun 13, 2023
master-74d4cfa

74d4cfa3 · Allow "quantizing" to f16 and f32 (#1787) · Jun 13, 2023
master-74a6d92

74a6d922 · Metal implementation for all k_quants (#1807) · Jun 12, 2023
master-e4caa8d

e4caa8da · ci : run when changing only the CUDA sources (#1800) · Jun 12, 2023
master-58970a4

58970a4c · Leverage mmap for offloading tensors to GPU (#1597) · Jun 12, 2023
master-fa84c4b

fa84c4b3 · Fix issue where interactive mode crashes when input exceeds ctx size (#1789) · Jun 11, 2023
master-4de0334

4de0334f · cmake : fix Metal build (close #1791) · Jun 10, 2023
master-3f12231

3f122315 · k-quants : GCC12 compilation fix (#1792) · Jun 10, 2023
master-303f580

303f5809 · metal : fix issue with ggml-metal.metal path. Closes #1769 (#1782) · Jun 10, 2023
master-17c10ac

17c10acf · ggml : force no_alloc == false when creating opt tensors (close #1699) · Jun 10, 2023
master-4f0154b

4f0154b0 · llama : support requantizing models instead of only allowing quantization from 16/32bit (#1691) · Jun 10, 2023
master-ef3171d

ef3171d1 · ggml : workaround for missing _mm256_setr_m128i in GCC < 8 (#1638) · Jun 10, 2023
master-555275a

555275a6 · make : add SSSE3 compilation use case (#1659) · Jun 10, 2023
