view article Article Activation Steering With Mean Response Probes : A Case Study In Suppressing Sycophancy In Language Models During TTC 12 days ago • 1
view article Article Exploring Direct Tensor Manipulation in Language Models: A Case Study in Binary-Level Model Enhancement Nov 7 • 4