Skip to content(if available)orjump to list(if available)

Kimi Linear: An Expressive, Efficient Attention Architecture

Ethan312

Kimi Linear looks solid. Efficient attention with strong results could make large models faster without much accuracy loss.

nekofneko

Let's GO!