llm (8 posts)
- Building & Benchmarking: LLMs on a 16GB Jetson Orin NX for Hermes Agent
- 2x GH200 for LLM inference, Part 2: vLLM, DeepSeek V4 Flash, and MTP
- What 2x GH200 delivers: memory paths for LLM inference
- LLM Neuroanatomy III: Why RYS Works — The Language-Agnostic Middle
- LLM Neuroanatomy II: Modern LLM Hacking and hints of a Universal Language?
- LLM Neuroanatomy: How I Topped the LLM Leaderboard Without Changing a Single Weight
- Optimising a 2× GH200 system for Claude Code
- Building a High-End AI Desktop