Georgi Gerganov
e0dbec0bc6
llama : refactor llama_context, llama_kv_cache, llm_build_context (#12181)
* llama : refactor llama_context, llama_kv_cache, llm_build_context
ggml-ci
* graph : don't mutate the KV cache during defrag
ggml-ci
* context : reduce virtuals + remove test function
ggml-ci
* context : move interface implementation to source file + factory
ggml-ci
* graph : move KV cache build functions to llama_context impl
ggml-ci
* graph : remove model reference from build_pooling
ggml-ci
* graph : remove llama_model reference
ggml-ci
* kv_cache : provide rope factors
ggml-ci
* graph : rework inputs to use only unique_ptr, remove attn input abstraction
ggml-ci
* context : remove llama_context_i abstraction
ggml-ci
* context : clean-up
ggml-ci
* graph : clean-up
ggml-ci
* llama : remove redundant keywords (struct, enum)
ggml-ci
* model : adapt gemma3
ggml-ci
* graph : restore same attention ops as on master
ggml-ci
* llama : remove TODO + fix indent
ggml-ci
2025-03-13 12:35:44 +02:00
..
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-12 09:30:24 +01:00
2025-03-12 09:30:24 +01:00
2025-01-03 10:18:53 +02:00
2025-03-13 12:35:44 +02:00
2025-03-04 18:53:26 +02:00
2025-02-02 09:48:46 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-01-03 10:18:53 +02:00
2025-01-03 10:18:53 +02:00
2025-03-05 13:05:13 +00:00
2025-03-05 13:05:13 +00:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-01-10 09:58:08 +08:00
2025-01-12 11:32:42 +02:00
2025-01-07 18:01:58 +01:00
2025-02-12 10:06:53 -04:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-03-05 06:30:31 +01:00
2025-02-10 20:58:18 +02:00
2025-01-27 14:42:09 +01:00
2025-01-16 13:54:08 +01:00
2025-03-13 12:35:44 +02:00
2025-03-13 12:35:44 +02:00
2025-01-16 13:54:08 +01:00
2025-01-03 10:18:53 +02:00
2025-03-05 13:05:13 +00:00
2025-01-12 11:32:42 +02:00
2025-03-04 18:53:26 +02:00
2025-01-12 12:15:53 +02:00
2025-03-13 12:35:44 +02:00
2024-10-08 13:27:04 +02:00
2024-10-02 15:49:55 +02:00
2025-02-15 16:40:57 +02:00
2024-12-16 12:31:45 +02:00