research.altifigence.com
Altifigence™ Research
Research records for on-device Small Language Models, Transformer bottlenecks, and hardware/software optimization work.
Current focus
On-device model limits and optimization routes.
The first publication and notebooks center on decode-stage memory pressure, FFN versus KV-cache attention behavior, and repeatable latency-benchmark methodology.