Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
DONGFANG ZIHAO's picture
4 3

DONGFANG ZIHAO

UUUserna
·
  • UUUserna

AI & ML interests

None yet

Recent Activity

upvoted a paper 14 days ago
JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation
upvoted a paper 3 months ago
Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks
upvoted a paper 3 months ago
Are We Using the Right Benchmark: An Evaluation Framework for Visual Token Compression Methods
View all activity

Organizations

None yet

upvoted a paper 14 days ago

JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation

Paper • 2512.22905 • Published 20 days ago • 18
upvoted 2 papers 3 months ago

Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks

Paper • 2510.25760 • Published Oct 29, 2025 • 16

Are We Using the Right Benchmark: An Evaluation Framework for Visual Token Compression Methods

Paper • 2510.07143 • Published Oct 8, 2025 • 12
upvoted a paper 4 months ago

PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era

Paper • 2509.12989 • Published Sep 16, 2025 • 28
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs