Tags

Tags mark specific points in the repository's history as important.

master-81844fb
81844fbc · tests : Fix compilation warnings (Linux/GCC) (#2451) · Aug 02, 2023

master-86aeb27
86aeb277 · server : Support dark mode (#2414) · Aug 01, 2023

master-49e7cb5
49e7cb5b · CUDA: fixed LLAMA_FAST compilation option (#2473) · Jul 31, 2023

master-b772bba
b772bba4 · CUDA: fixed cmake F16 option (#2471) · Jul 31, 2023

master-0728c5a
0728c5a8 · CUDA: mmq CLI option, fixed mmq build issues (#2453) · Jul 31, 2023

master-1215ed7
1215ed7d · CUDA: Implemented row flattening for non-glm RoPE (#2468) · Jul 31, 2023

master-2dbf518
2dbf5189 · CUDA: fewer memory bank conflicts for mul_mat_q (#2458) · Jul 31, 2023

master-9d2382b
9d2382b3 · Fix Metal backend broken from the allocator changes (#2455) · Jul 31, 2023

master-a113689
a1136895 · ggml : add graph tensor allocator (#2411) · Jul 30, 2023

master-11f3ca0
11f3ca06 · CUDA: Quantized matrix matrix multiplication (#2160) · Jul 29, 2023

master-9baf9ef
9baf9ef3 · CUDA: faster multi GPU synchronization (#2448) · Jul 29, 2023

master-8a88e58
8a88e585 · perplexity : add Hellaswag calculation (#2389) · Jul 28, 2023

master-a9559bf
a9559bf7 · ggml : workaround for missing _mm256_setr_m128i in GCC < 8 in k_quants.c (#2405) · Jul 28, 2023

master-ee1b497
ee1b497c · llama : support more diverse tokenizers? (#2420) · Jul 28, 2023

master-1a94186
1a941869 · metal : disable graph concurrency optimization due to bug (#2413) · Jul 27, 2023

master-b5472ea
b5472ea0 · ggml : fix assert in ggml_set_unary_op (#2410) · Jul 26, 2023

master-6df1f59
6df1f594 · make : build with -Wmissing-prototypes (#2394) · Jul 26, 2023

master-5488fb7
5488fb78 · ggml : allocate graphs in a context (#2392) · Jul 26, 2023

master-eb542d3
eb542d39 · Add LLAMA_DEFAULT_RMS_EPS so we can change the default (#2384) · Jul 25, 2023

master-07aaa0f
07aaa0f6 · ggml : fix ggml_flash_attn to use op_params (#2387) · Jul 25, 2023
