junha um
JUNE0123
AI & ML interests
None yet
Organizations
None yet
paperbox
- Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization
  Paper • 2411.10442 • Published • 87
- VisionZip: Longer is Better but Not Necessary in Vision Language Models
  Paper • 2412.04467 • Published • 117
- NVILA: Efficient Frontier Visual Language Models
  Paper • 2412.04468 • Published • 59
- PaliGemma 2: A Family of Versatile VLMs for Transfer
  Paper • 2412.03555 • Published • 133
models
0
None public yet
datasets
0
None public yet