Post
185
134,614 tok/sec input prefil max
1031 tokens/sec out gen max
At these local AI speeds, there is no User Interface for humans. My human UI is the Radicle distributed Git issues queue
On my GPU workstation:
- Z8 Fury G5 4x A6000
- MiniMax-M2.5
- Claude Code to localhost:8000
1031 tokens/sec out gen max
At these local AI speeds, there is no User Interface for humans. My human UI is the Radicle distributed Git issues queue
On my GPU workstation:
- Z8 Fury G5 4x A6000
- MiniMax-M2.5
- Claude Code to localhost:8000