FFN vs KV-cache Bottleneck Analysis
Context-length sweep for FFN weight reads, KV-cache reads, measured component shares, and bottleneck movement.
research.altifigence.com > notebooks
Runnable research notebooks with Colab launch links, downloadable notebook files, and CSV-backed benchmark charts.
Context-length sweep for FFN weight reads, KV-cache reads, measured component shares, and bottleneck movement.
Benchmark methodology notebook with decode latency, component timing, thread scaling, and CSV export paths.