Skip to content

feat(llama-cpp-localai-paged): paged KV cache llama.cpp backend + cross-request prefix sharing + GB10 decode optimization [WIP]#10462

Open
localai-bot wants to merge 321 commits into
masterfrom
worktree-feat+paged-attention
Open

feat(llama-cpp-localai-paged): paged KV cache llama.cpp backend + cross-request prefix sharing + GB10 decode optimization [WIP]#10462
localai-bot wants to merge 321 commits into
masterfrom
worktree-feat+paged-attention

Commits

This pull request is big! We're only showing the most recent 250 commits

Commits on Jun 23, 2026

Commits on Jun 24, 2026

Commits on Jun 25, 2026

Commits on Jun 26, 2026

Commits on Jun 27, 2026

Commits on Jun 28, 2026

Commits on Jun 30, 2026

Commits on Jul 1, 2026

Commits on Jul 2, 2026