Rom's picture

Rom

wrom

·

wr0om

AI & ML interests

LLM Security

Recent Activity

upvoted a paper 27 days ago

Extracting Recurring Vulnerabilities from Black-Box LLM-Generated Software

authored a paper 29 days ago

Step-Wise Refusal Dynamics in Autoregressive and Diffusion Language Models

upvoted a paper 29 days ago

Step-Wise Refusal Dynamics in Autoregressive and Diffusion Language Models

View all activity

Organizations

wrom 's models

None public yet