Tech Buzz China · Researcher profileandChina Research CollectiveAI Proem China Research CollectiveAs of Jul 14, 2026

WU Yuxin (吴育昕)

Co-founder & research lead, Moonshot AI

FounderReturneeTsinghua

Known for

Moonshot AI co-founder · Group Normalization (GN) co-author with Kaiming He

Current org

Kimi (月之暗面)

Schools

Tsinghua (清华大学) ·CMU

Articles

Videos

Links

Career path

Studied

Tsinghua (清华大学)CMU

→

Previously

Meta FAIRGoogle Brain

→

Now

Kimi (月之暗面)

Tsinghua BS, CMU MS 2016

Ex-Meta FAIR (Detectron2 author, Group Norm w/ Kaiming He, ECCV 2018 best paper nom, ICCV 2021 Mark Everingham Prize)

Also ex-Google Brain

In their words

Reinforcement learning infrastructure is indeed a huge challenge. We strive to achieve high efficiency while maintaining good flexibility. In terms of efficiency, we try to fully consider the actual application scenarios of reinforcement learning when developing training and inference systems, so that we can reuse all the heavy computation work and achieve scaling. The deployment logic of the agent swarm is particularly complex, but our system is highly flexible, allowing us to integrate different frameworks and sub-agent setups into the training process.

强化学习基础设施的确是一项巨大的挑战，我们力求在保持良好灵活性的同时实现高效率。在效率方面，我们尝试在开发训练和推理系统时充分考虑强化学习的实际应用场景，以便复用所有繁重的计算工作，从而实现规模化扩展。智能体蜂群的部署逻辑尤其复杂，但我们的系统具有极高的灵活性，允许我们将不同的框架和子智能体设置集成到训练过程中。

月之暗面三位联创深夜回应一切！3小时答全球网友23问· 2026-01-29

Managing hallucinations remains a huge challenge for all large models. We have improved this by increasing data quality (more verified knowledge, fewer low-quality claims) and reward mechanisms (e.g., penalizing the model when it hallucinates), but we believe there are still many ways to further improve.

对于所有大模型来说，管理幻觉仍然是一个巨大的挑战。我们已经通过提高数据质量（更多经过验证的知识，更少低质量的说法）和奖励机制（例如，当模型出现幻觉时进行惩罚）来改善这种情况，但我们认为仍然有很多方法可以进一步改进。

月之暗面三位联创深夜回应一切！3小时答全球网友23问· 2026-01-29

I'm not sure if 1:1 optimality still holds, but in that sense, we do 'waste' some training compute. Otherwise the model would be larger and would 'waste' a lot of inference compute compared to our current model.

我不确定1:1最优性是否仍然成立，但从这个意义上讲，我们确实会'浪费'一些训练计算资源。否则模型会更大，并且与我们现在的模型相比，会'浪费'大量的推理计算资源。

月之暗面三位联创深夜回应一切！3小时答全球网友23问· 2026-01-29

Before Sam builds his trillion-dollar data center.

在 Sam 价值万亿美元的数据中心建成之前。

月之暗面杨植麟、周昕宇、吴育昕罕见回应一切：打假 460 万美元、调侃 OpenAI - IT之家· 2025-11-12

Profile

WU Yuxin (吴育昕) is a co-founder and research lead at Moonshot AI (月之暗面). He graduated from both Tsinghua University and Carnegie Mellon University (like Yang Zhilin), with a research focus on computer vision detection and recognition.

After graduation, he worked at Meta's Fundamental AI Research (FAIR) lab, where he co-proposed Group Normalization (GN) with Kaiming He and created Detectron2, one of Meta's most popular AI projects. His papers have received over 19,000 Google Scholar citations, and he was nominated for Best Paper at ECCV 2018.

In 2018, his team IYSWIM was the only one to crack a facial recognition algorithm at the GeekPwn international security competition.

Known for

Moonshot AI co-founderGroup Normalization (GN) co-author with Kaiming HeDetectron2 creator19,000+ Google Scholar citations

Education

Carnegie Mellon University

Graduate

Tsinghua University

B.S.

Articles / interviews

InterviewChinese

月之暗面三位联创深夜回应一切！3小时答全球网友23问

Moonshot AI's Three Co-Founders Respond to Everything Late at Night, Answering 23 Questions from Global Users in Three Hours