Technology · 1 articles · Asked 0×

State Space Models and Linear Attention

Alternative architectures with fixed memory layers offering better scaling profiles for long context windows

Articles in this topic

Showing 1–1 of 1