8 24 324

Gaurang Bharti PRO

gbharti

https://gaurangbharti.netlify.app/

AI & ML interests

GPTs, Computer Vision, NLP

Recent Activity

new activity 16 days ago

gbharti/finance-alpaca:Add LICENSE file

liked a Space 26 days ago

depth-anything/depth-anything-3

liked a dataset about 1 month ago

nvidia/PhysicalAI-Robotics-GR00T-X-Embodiment-Sim

View all activity

Organizations

New activity in gbharti/finance-alpaca 16 days ago

Add LICENSE file

🤝 1

#6 opened 16 days ago by

jewittje

liked a Space 26 days ago

Depth Anything 3

🏢

307

Generate depth maps from images using GPU acceleration

liked a dataset about 1 month ago

nvidia/PhysicalAI-Robotics-GR00T-X-Embodiment-Sim

Updated about 15 hours ago • 505k • 175

upvoted a paper about 2 months ago

OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs

Paper • 2510.10689 • Published Oct 12 • 46

New activity in gbharti/finance-alpaca 2 months ago

good

#5 opened 2 months ago by

Jackrong

liked a Space 4 months ago

OmniAvatar

🐨

264

Generate podcast and tiktok style video avatars

liked a dataset 5 months ago

Vchitect/ShotBench

Viewer • Updated Jul 1 • 3.57k • 141 • 10

liked a model 5 months ago

Vchitect/ShotVL-7B

Image-Text-to-Text • 8B • Updated Sep 19 • 1.46k • 15

upvoted a paper 5 months ago

VideoPrism: A Foundational Visual Encoder for Video Understanding

Paper • 2402.13217 • Published Feb 20, 2024 • 38

liked a model 5 months ago

google/videoprism-base-f16r288

Video Classification • Updated Jul 29 • 115k • 89

upvoted a paper 5 months ago

Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations

Paper • 2506.18898 • Published Jun 23 • 33

liked a model 6 months ago

ByteDance/LatentSync-1.6

Updated Jun 12 • 64.6k • 50

liked a dataset 6 months ago

opencompass/MMBench-Video

Preview • Updated Oct 9, 2024 • 404 • 9

liked a Space 7 months ago

Keysync Demo

📈

Generate synchronized video from audio and video inputs

liked a model 7 months ago

chancharikm/qwen2.5-vl-7b-cam-motion

Video-Text-to-Text • 8B • Updated Sep 19 • 418 • 16

upvoted 4 papers 7 months ago

Towards Understanding Camera Motions in Any Video

Paper • 2504.15376 • Published Apr 21 • 157

NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks

Paper • 2504.19854 • Published Apr 28 • 7

TesserAct: Learning 4D Embodied World Models

Paper • 2504.20995 • Published Apr 29 • 22

The Leaderboard Illusion

Paper • 2504.20879 • Published Apr 29 • 72

liked a Space 8 months ago

Skyreels A1 Talking Head

😻

197

Audio to Talking Face