title: "2026-03-16 · AI 每日简报" date: 2026-03-16T17:40:48.752446+08:00 draft: false summary: "RobotDaily 2026-03-16:共 7 篇,含 具身智能 2 篇,表征学习 3 篇,强化学习 2 篇。"
Hugo 归档版,来源于 RobotDaily 当日 Markdown 简报。
RobotDaily 2026-03-16:共 7 篇,含 具身智能 2 篇,表征学习 3 篇,强化学习 2 篇。
偏应用导向精选,按方向整理成短卡片式 Markdown 归档。
提出PhysMoDPO,采用偏好优化解决人类动作生成,实现真实部署
- 作者:Yangsong Zhang, Anujith Muraleedharan, Rikhat Akizhanov, Abdul Ahad Butt 等另外4人
- 标签:
扩散模型偏好优化动作生成零样本- 中文摘要:【LLM 暂不可用,先保留英文摘要要点】Recent progress in text-conditioned human motion generation has been largely driven by diffusion models trained on large-scale human motion data. Building on this progress, recent methods attempt to transfer such models…
- 链接:DOI | arXiv | PDF
提出HandelBot,采用强化学习解决音乐演奏,实现真实部署
- 作者:Amber Xie, Haozhi Qi, Dorsa Sadigh
- 标签:
强化学习自适应机器人操作灵巧操作- 中文摘要:【LLM 暂不可用,先保留英文摘要要点】Mastering dexterous manipulation with multi-fingered hands has been a grand challenge in robotics for decades. Despite its potential, the difficulty of collecting high-quality data remains a primary bottleneck for high-…
- 链接:DOI | arXiv | PDF
提出自监督学习框架,采用自监督学习解决物理系统模拟,实现性能优化
- 作者:Helen Qu, Rudy Morel, Michael McCabe, Alberto Bietti 等另外3人
- 标签:
自监督学习表征学习潜在空间世界模型- 中文摘要:【LLM 暂不可用,先保留英文摘要要点】Machine learning approaches to spatiotemporal physical systems have primarily focused on next-frame prediction, with the goal of learning an accurate emulator for the system's evolution in time. However, these emulators…
- 链接:DOI | arXiv | PDF
提出强化学习框架,采用强化学习解决导航控制,实现性能优化
- 作者:Reza T. Batley, Apurba Sarker, Rajib Mostakim, Andrew Klichine 等另外1人
- 标签:
强化学习导航建图表征学习- 中文摘要:【LLM 暂不可用,先保留英文摘要要点】Intelligent systems across physics, language and perception often exhibit factorisable structure, yet are typically modelled by monolithic neural architectures that do not explicitly exploit this structure. The separabl…
- 链接:DOI | arXiv | PDF
提出VIRD,采用多种技术解决自动驾驶,实现首次提出
- 作者:Juhye Park, Wooju Lee, Dasol Hong, Changki Sung 等另外3人
- 标签:
自动驾驶定位表征学习潜在空间- 中文摘要:【LLM 暂不可用,先保留英文摘要要点】Accurate global localization is crucial for autonomous driving and robotics, but GNSS-based approaches often degrade due to occlusion and multipath effects. As an emerging alternative, cross-view pose estimation predict…
- 链接:DOI | arXiv | PDF
提出强化学习框架,采用残差策略优化解决自动驾驶,实现真实部署
- 作者:Raphael Trumpp, Denis Hoornaert, Mirco Theile, Marco Caccamo
- 标签:
强化学习残差策略优化自动驾驶零样本- 中文摘要:【LLM 暂不可用,先保留英文摘要要点】Residual policy learning (RPL), in which a learned policy refines a static base policy using deep reinforcement learning (DRL), has shown strong performance across various robotic applications. Its effectiveness is part…
- 链接:DOI | arXiv | PDF
提出Beyond Imitation,采用扩散模型解决机器人导航,实现零样本泛化
- 作者:Junhe Sheng, Ruofei Bai, Kuan Xu, Ruimeng Liu 等另外4人
- 标签:
扩散模型强化学习模仿学习自适应- 中文摘要:【LLM 暂不可用,先保留英文摘要要点】Diffusion-based robot navigation policies trained on large-scale imitation learning datasets, can generate multi-modal trajectories directly from the robot's visual observations, bypassing the traditional localization-m…
- 链接:DOI | arXiv | PDF