Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Hengyi Wang's picture
2 5

Hengyi Wang

aaronwhy
https://www.linkedin.com/in/hengyi-wang-86605b175/
  • AaronWhy

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 2 months ago

R-WoM: Retrieval-augmented World Model For Computer-use Agents

Paper • 2510.11892 • Published Oct 13 • 21
upvoted a paper 10 months ago

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published Mar 6 • 96
upvoted a paper 12 months ago

Unifying Specialized Visual Encoders for Video Language Models

Paper • 2501.01426 • Published Jan 2 • 20
upvoted 2 papers over 1 year ago

Probabilistic Conceptual Explainers: Trustworthy Conceptual Explanations for Vision Foundation Models

Paper • 2406.12649 • Published Jun 18, 2024 • 16

Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models

Paper • 2406.11230 • Published Jun 17, 2024 • 33
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs