Yichen You

youyc22

AI & ML interests

None yet

Recent Activity

updated a dataset 5 days ago

youyc22/am-team-4B

published a dataset 5 days ago

youyc22/am-team-4B

authored a paper 20 days ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

Organizations

upvoted a paper 20 days ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

Paper • 2605.18643 • Published 21 days ago • 30

upvoted a paper about 2 months ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 109

upvoted a paper 3 months ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 59

upvoted an article 6 months ago

Continuous batching from first principles

Article • ror, ArthurZ, mcpotato • Nov 25, 2025 • 402

upvoted a collection 7 months ago

TaH

Collection • Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models • 9 items • Updated Apr 12 • 2

upvoted 2 papers 7 months ago

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published Nov 11, 2025 • 110

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 135

upvoted a paper 8 months ago

Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Paper • 2510.03215 • Published Oct 3, 2025 • 99

upvoted a paper 9 months ago

Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification

Paper • 2509.15591 • Published Sep 19, 2025 • 45

upvoted a paper 12 months ago

PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models

Paper • 2506.16054 • Published Jun 19, 2025 • 60

upvoted a paper about 1 year ago

R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing

Paper • 2505.21600 • Published May 27, 2025 • 71
