Yichen You

youyc22

AI & ML interests

None yet

Recent Activity

updated a dataset 7 days ago

youyc22/am-team-4B

Viewer • Updated 7 days ago • 348k • 33

published a dataset 7 days ago

youyc22/am-team-4B

Viewer • Updated 7 days ago • 348k • 33

authored a paper 22 days ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

Paper • 2605.18643 • Published 23 days ago • 30

upvoted a paper 22 days ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

Paper • 2605.18643 • Published 23 days ago • 30

upvoted a paper about 2 months ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 110

updated a dataset about 2 months ago

youyc22/amteam-8b-121k-top16

Viewer • Updated Apr 12 • 83.9k • 14

published a dataset about 2 months ago

youyc22/amteam-8b-121k-top16

Viewer • Updated Apr 12 • 83.9k • 14

updated a collection about 2 months ago

TaH

Collection

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models • 9 items • Updated Apr 12 • 2

published a dataset about 2 months ago

youyc22/amteam-121k-8k

Updated Apr 11 • 52

updated a dataset about 2 months ago

youyc22/amteam-121k-8k

Updated Apr 11 • 52

updated a collection 2 months ago

TaH

Collection

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models • 9 items • Updated Apr 12 • 2

updated a collection 3 months ago

TaH

Collection

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models • 9 items • Updated Apr 12 • 2

upvoted a paper 3 months ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 60

liked a dataset 5 months ago

Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b

Viewer • Updated Jan 31 • 306k • 1.57k • 348

published a model 5 months ago

nics-efc/Standard-1.7B

Text Generation • 2B • Updated Jan 12 • 4

updated a model 5 months ago

nics-efc/Standard-1.7B

Text Generation • 2B • Updated Jan 12 • 4

liked a model 6 months ago

Nanbeige/Nanbeige4-3B-Thinking-2511

Text Generation • 4B • Updated Dec 17, 2025 • 796 • 208
