Sukmin Cho

zomss

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 11 days ago

Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed

Paper • 2512.14067 • Published 12 days ago • 12

upvoted 2 papers about 1 month ago

Budget-Aware Tool-Use Enables Effective Agent Scaling

Paper • 2511.17006 • Published Nov 21 • 29

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published Nov 11 • 105

upvoted 2 papers about 2 months ago

Adaptive Multi-Agent Response Refinement in Conversational Systems

Paper • 2511.08319 • Published Nov 11 • 41

LUT-LLM: Efficient Large Language Model Inference with Memory-based Computations on FPGAs

Paper • 2511.06174 • Published Nov 9 • 6

upvoted a paper 2 months ago

KORMo: Korean Open Reasoning Model for Everyone

Paper • 2510.09426 • Published Oct 10 • 83

liked a model 2 months ago

KORMo-Team/KORMo-10B-sft

Text Generation • 11B • Updated 13 days ago • 2.11k • 122

upvoted 3 papers 3 months ago

When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs

Paper • 2510.07499 • Published Oct 8 • 48

Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Paper • 2510.03215 • Published Oct 3 • 97

EpiCache: Episodic KV Cache Management for Long Conversational Question Answering

Paper • 2509.17396 • Published Sep 22 • 19

upvoted a paper 4 months ago

DINOv3

Paper • 2508.10104 • Published Aug 13 • 291

upvoted 2 papers 8 months ago

System Prompt Optimization with Meta-Learning

Paper • 2505.09666 • Published May 14 • 71

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Paper • 2504.17192 • Published Apr 24 • 120

upvoted a paper 10 months ago

Autellix: An Efficient Serving Engine for LLM Agents as General Programs

Paper • 2502.13965 • Published Feb 19 • 19

authored a paper 11 months ago

Lossless Acceleration of Large Language Models with Hierarchical Drafting based on Temporal Locality in Speculative Decoding

Paper • 2502.05609 • Published Feb 8 • 18

upvoted a paper 11 months ago

Lossless Acceleration of Large Language Models with Hierarchical Drafting based on Temporal Locality in Speculative Decoding

Paper • 2502.05609 • Published Feb 8 • 18

commented on a paper 11 months ago

Lossless Acceleration of Large Language Models with Hierarchical Drafting based on Temporal Locality in Speculative Decoding

Paper • 2502.05609 • Published Feb 8 • 18

upvoted a paper 11 months ago

Typos that Broke the RAG's Back: Genetic Attack on RAG Pipeline by Simulating Documents in the Wild via Low-level Perturbations

Paper • 2404.13948 • Published Apr 22, 2024 • 2

authored 2 papers 11 months ago

Test-Time Self-Adaptive Small Language Models for Question Answering

Paper • 2310.13307 • Published Oct 20, 2023

Discrete Prompt Optimization via Constrained Generation for Zero-shot Re-ranker

Paper • 2305.13729 • Published May 23, 2023 • 1

zomss's activity