Chaoyou Fu

BradyFU

https://bradyfu.github.io/

AI & ML interests

Multimodal LLMs

Recent Activity

liked a model about 9 hours ago

MiG-NJU/OmniVideo-7B_VITA-1.5

8B • Updated about 11 hours ago • 13 • 2

liked 2 datasets 1 day ago

MiG-NJU/OmniVideo-Test

Viewer • Updated about 14 hours ago • 505 • 887 • 4

MiG-NJU/OmniVideo-100K

Preview • Updated about 14 hours ago • 750 • 7

upvoted a paper 3 days ago

OmniVideo-100K: A Dataset for Audio-Visual Reasoning through Structured Scripts and Evidence Chains

Paper • 2606.14702 • Published 6 days ago • 27

upvoted a paper 7 days ago

Agentic Environment Engineering for Large Language Models: A Survey of Environment Modeling, Synthesis, Evaluation, and Application

Paper • 2606.12191 • Published 8 days ago • 63

upvoted a paper about 1 month ago

UniPrefill: Universal Long-Context Prefill Acceleration via Block-wise Dynamic Sparsification

Paper • 2605.06221 • Published May 7 • 22

liked 3 datasets about 2 months ago

upvoted a paper about 2 months ago

PersonaVLM: Long-Term Personalized Multimodal LLMs

Paper • 2604.13074 • Published Mar 20 • 46

commented on a paper about 2 months ago

PersonaVLM: Long-Term Personalized Multimodal LLMs

Paper • 2604.13074 • Published Mar 20 • 46

submitted a paper to Daily Papers about 2 months ago

PersonaVLM: Long-Term Personalized Multimodal LLMs

Paper • 2604.13074 • Published Mar 20 • 46

upvoted 2 papers 2 months ago

Act Wisely: Cultivating Meta-Cognitive Tool Use in Agentic Multimodal Models

Paper • 2604.08545 • Published Apr 9 • 41

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published Apr 6 • 236

authored 6 papers 2 months ago

MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models

Paper • 2306.13394 • Published Jun 23, 2023

A Survey on Multimodal Large Language Models

Paper • 2306.13549 • Published Jun 23, 2023 • 1

VITA: Towards Open-Source Interactive Omni Multimodal LLM

Paper • 2408.05211 • Published Aug 9, 2024 • 50

MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs

Paper • 2411.15296 • Published Nov 22, 2024 • 21

Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

Paper • 2411.00774 • Published Nov 1, 2024

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Paper • 2501.01957 • Published Jan 3, 2025 • 48
