llama.cpp

mirror of https://github.com/ggml-org/llama.cpp.git synced 2026-05-13 12:34:05 +00:00

Files

Trivikram Reddy 856c3adac1 hexagon: eliminate scalar VTCM loads via HVX splat helpers (#22993 )

* hexagon: add hvx_vec_repl helpers and use those for splat-from-vtcm usecase

* hmx-mm: optimize per-group scale handling

* hmx-fa: optimize slope load from vtcm

* hmx-fa: use aligned access where possible in hmx-utils

* hexagon: add hvx_vec_repl_2x_f16 helper and consolidate repl helpers

---------

Co-authored-by: Max Krasnyansky <maxk@qti.qualcomm.com>

2026-05-12 17:28:02 -07:00

adb

hexagon: eliminate scalar VTCM loads via HVX splat helpers (#22993 )

2026-05-12 17:28:02 -07:00

qdc

Enable testing on Snapdragon devices (#21051 )

2026-04-23 13:08:10 -07:00

windows

hexagon: add support for basic and extended Op profiling (#22269 )

2026-04-23 14:17:21 -07:00

ggml-hexagon-profile.py

hexagon: add support for basic and extended Op profiling (#22269 )

2026-04-23 14:17:21 -07:00