so, how did it do?

#2
by robbiemu - opened

Now that you've published the full-fat version, I have to ask -- did you measure perplexity, or do you have any measurement (or a feel-based recommendation) for which quantization is "still SOTA" but smaller?
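(For anyone wanting to check this themselves: perplexity is just exp of the mean per-token negative log-likelihood on held-out text. A minimal sliding-window sketch with Hugging Face transformers is below; the model id, eval file, and window sizes are placeholders, not anything from this model card, and the token accounting at window edges is approximate.)

```python
import math
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "your/model-or-quantized-checkpoint"  # placeholder
device = "cuda" if torch.cuda.is_available() else "cpu"

tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id).to(device).eval()

text = open("heldout.txt").read()  # placeholder held-out corpus
ids = tok(text, return_tensors="pt").input_ids

max_len, stride = 2048, 512  # context window and hop size (placeholders)
nll_sum, n_tokens, prev_end = 0.0, 0, 0
for begin in range(0, ids.size(1), stride):
    end = min(begin + max_len, ids.size(1))
    trg_len = end - prev_end  # tokens newly scored in this window
    input_ids = ids[:, begin:end].to(device)
    target_ids = input_ids.clone()
    target_ids[:, :-trg_len] = -100  # mask tokens already scored earlier

    with torch.no_grad():
        # .loss is the mean NLL over the unmasked target tokens
        loss = model(input_ids, labels=target_ids).loss

    # weight by token count so each token contributes once overall
    nll_sum += loss.item() * trg_len
    n_tokens += trg_len
    prev_end = end
    if end == ids.size(1):
        break

print("perplexity:", math.exp(nll_sum / n_tokens))
```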

No, I haven't. It's very costly.
In the past I've tested my fine-tuned models with:
https://github.com/EleutherAI/lm-evaluation-harness
It uses the same test sets as the Open LLM Leaderboard, so it gives you a good comparison.
So far I haven't tested the quantized models.
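(For reference, a sketch of driving lm-evaluation-harness from Python. The model id and task names are placeholders, and the exact API can vary between harness versions; this follows the `simple_evaluate` entry point from the repo's README.)

```python
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",  # the Hugging Face transformers backend
    model_args="pretrained=your/model,dtype=float16",  # placeholder
    tasks=["hellaswag", "arc_challenge"],  # placeholder task list
    batch_size=8,
)
print(results["results"])  # per-task metrics, keyed by task name
```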

Yes, that makes sense. I really wanted to use this model; I do a fair bit with language learning in AI, dating back to my master's capstone in 2019, but it turns out I would be stuck in the 2-bit regime here :(

It'd be great if you provided these too :) Thanks for answering my question.

robbiemu changed discussion status to closed
