Llm-Inference
The C64 Runs a Transformer and You're Still Paying $25/Million Tokens
You Could Run a Language Model on a Bucket of Water (And That Should Bother You)