Post
44
Got to 1199.8 tokens/sec with Devstral Small -2 on my desktop GPU workstation. vLLM nightly.
Works out of the box with Mistral Vibe. Next is time to test the big one.
Works out of the box with Mistral Vibe. Next is time to test the big one.