llama.cpp/src
Piotr Wilkin (ilintar) 746f9ee889 Override SSM_A op for Qwen3 Next to reduce splits (#17587)
* Override SSM_A op for Qwen3 Next to reduce splits

* New tensor mapping SSM_A_NOSCAN for SSM_A used outside of OP_SSM_SCAN context.

* Update src/llama-model.cpp

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

* Update src/llama-model.cpp

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

---------

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>
2025-12-02 00:43:13 +01:00