Papers
Selective Steering: Norm-Preserving Control Through Discriminative Layer Selection
RedBench: A Universal Dataset for Comprehensive Red Teaming of Large Language Models
Datasets (9)
knoveleng/redbench
• 29.4k • 692
knoveleng/sing
• 103k • 268
knoveleng/ms
• 20k • 58
knoveleng/open-s1
• 18.6k • 27 • 4
knoveleng/open-rs
• 7k • 188 • 11
knoveleng/open-deepscaler
• 21k • 34 • 4
knoveleng/AMC-23
• 40 • 6.24k • 1
knoveleng/OlympiadBench
• 675 • 1.36k • 1
knoveleng/Minerva-Math
• 272 • 1.43k • 1