Diwank Tomer PRO

diwank

https://diwank.name

AI & ML interests

None yet

Recent Activity

updated a collection 1 day ago

liked a model 1 day ago

fastino/gliner2-multi-v1

liked a model 1 day ago

mistralai/Ministral-3-14B-Instruct-2512

upvoted an article 9 days ago

Norm-Preserving Biprojected Abliteration

Article • Nov 6 • 50

upvoted 2 papers 17 days ago

Step-Audio-R1 Technical Report

Paper • 2511.15848 • Published 19 days ago • 51

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published 27 days ago • 104

upvoted a collection 20 days ago

The Bestiary

Decensored language models made using Heretic (https://github.com/p-e-w/heretic) • 6 items • Updated 22 days ago • 68

upvoted a collection 28 days ago

Nemotron RAG

12 items • Updated 4 days ago • 47

upvoted a paper 29 days ago

Drax: Speech Recognition with Discrete Flow Matching

Paper • 2510.04162 • Published Oct 5 • 27

upvoted 3 papers about 1 month ago

Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs

Paper • 2510.11062 • Published Oct 13 • 28

Reinforcing Diffusion Models by Direct Group Preference Optimization

Paper • 2510.08425 • Published Oct 9 • 11

Training-Free Group Relative Policy Optimization

Paper • 2510.08191 • Published Oct 9 • 44

upvoted 2 articles about 1 month ago

What makes good reasoning data

Article • Oct 30 • 34

Projected Abliteration

Article • Oct 25 • 30

upvoted a paper about 2 months ago

The Markovian Thinker

Paper • 2510.06557 • Published Oct 8 • 30

upvoted 5 papers 2 months ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28 • 173

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30 • 535

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30 • 55

Thinking Sparks!: Emergent Attention Heads in Reasoning Models During Post Training

Paper • 2509.25758 • Published Sep 30 • 22

DeepScientist: Advancing Frontier-Pushing Scientific Findings Progressively

Paper • 2509.26603 • Published Sep 30 • 16

upvoted 2 collections 2 months ago

Qwen3-Omni

6 items • Updated Oct 9 • 168

SDLM

Sequential Diffusion Language Models • 9 items • Updated Oct 3 • 8

upvoted a paper 2 months ago

Thinking Augmented Pre-training

Paper • 2509.20186 • Published Sep 24 • 23