---
title: "2026-03-16 · AI 每日简报"
date: 2026-03-16T16:31:30.888993+08:00
draft: false
summary: "RobotDaily 2026-03-16：共 7 篇，含 具身智能 2 篇，表征学习 3 篇，强化学习 2 篇。"
tags: ["robotdaily", "ai-daily", "具身智能", "表征学习", "强化学习"]
---

> Hugo 归档版，来源于 RobotDaily 当日 Markdown 简报。
>
> RobotDaily 2026-03-16：共 7 篇，含 具身智能 2 篇，表征学习 3 篇，强化学习 2 篇。

偏应用导向精选，按方向整理成短卡片式 Markdown 归档。

## 具身智能（2 篇）

### 1. PhysMoDPO: Physically-Plausible Humanoid Motion with Preference Optimization
> PhysMoDPO，采用扩散模型解决机器人操作，实现真实部署
- 作者：Yangsong Zhang, Anujith Muraleedharan, Rikhat Akizhanov, Abdul Ahad Butt 等另外4人
- 标签：`具身智能` `机器人` `真实部署` `操控`
- 中文摘要：【LLM 暂不可用，先保留英文摘要要点】Recent progress in text-conditioned human motion generation has been largely driven by diffusion models trained on large-scale human motion data. Building on this progress, recent methods attempt to transfer such models…
- 链接：[DOI](https://doi.org/10.48550/arXiv.2603.13228) | [arXiv](https://arxiv.org/abs/2603.13228v1) | [PDF](https://arxiv.org/pdf/2603.13228v1)

### 2. HandelBot: Real-World Piano Playing via Fast Adaptation of Dexterous Robot Policies
> HandelBot，采用强化学习解决机器人操作，实现真实部署
- 作者：Amber Xie, Haozhi Qi, Dorsa Sadigh
- 标签：`具身智能` `机器人` `真实部署` `操控`
- 中文摘要：【LLM 暂不可用，先保留英文摘要要点】Mastering dexterous manipulation with multi-fingered hands has been a grand challenge in robotics for decades. Despite its potential, the difficulty of collecting high-quality data remains a primary bottleneck for high-…
- 链接：[DOI](https://doi.org/10.48550/arXiv.2603.12243) | [arXiv](https://arxiv.org/abs/2603.12243v1) | [PDF](https://arxiv.org/pdf/2603.12243v1)

## 表征学习（3 篇）

### 1. Representation Learning for Spatiotemporal Physical Systems
> Representation Learning for Spatiotemporal Physical Systems，采用自监督学习解决相关任务，实现性能优化
- 作者：Helen Qu, Rudy Morel, Michael McCabe, Alberto Bietti 等另外3人
- 标签：`表征学习` `潜在空间` `世界模型` `预训练`
- 中文摘要：【LLM 暂不可用，先保留英文摘要要点】Machine learning approaches to spatiotemporal physical systems have primarily focused on next-frame prediction, with the goal of learning an accurate emulator for the system's evolution in time. However, these emulators…
- 链接：[DOI](https://doi.org/10.48550/arXiv.2603.13227) | [arXiv](https://arxiv.org/abs/2603.13227v1) | [PDF](https://arxiv.org/pdf/2603.13227v1)

### 2. Separable neural architectures as a primitive for unified predictive and generative intelligence
> Separable neural architectures as a primitive for unified predictive and generative i…
- 作者：Reza T. Batley, Apurba Sarker, Rajib Mostakim, Andrew Klichine 等另外1人
- 标签：`表征学习` `潜在空间` `世界模型` `预训练`
- 中文摘要：【LLM 暂不可用，先保留英文摘要要点】Intelligent systems across physics, language and perception often exhibit factorisable structure, yet are typically modelled by monolithic neural architectures that do not explicitly exploit this structure. The separabl…
- 链接：[DOI](https://doi.org/10.48550/arXiv.2603.12244) | [arXiv](https://arxiv.org/abs/2603.12244v1) | [PDF](https://arxiv.org/pdf/2603.12244v1)

### 3. VIRD: View-Invariant Representation through Dual-Axis Transformation for Cross-View Pose Estimation
> VIRD，采用表征学习解决机器人操作，实现首次提出
- 作者：Juhye Park, Wooju Lee, Dasol Hong, Changki Sung 等另外3人
- 标签：`表征学习` `潜在空间` `世界模型` `预训练`
- 中文摘要：【LLM 暂不可用，先保留英文摘要要点】Accurate global localization is crucial for autonomous driving and robotics, but GNSS-based approaches often degrade due to occlusion and multipath effects. As an emerging alternative, cross-view pose estimation predict…
- 链接：[DOI](https://doi.org/10.48550/arXiv.2603.12918) | [arXiv](https://arxiv.org/abs/2603.12918v1) | [PDF](https://arxiv.org/pdf/2603.12918v1)

## 强化学习（2 篇）

### 1. Efficient Real-World Autonomous Racing via Attenuated Residual Policy Optimization
> Efficient Real-World Autonomous Racing via Attenuated Residual Policy Optimization，采用…
- 作者：Raphael Trumpp, Denis Hoornaert, Mirco Theile, Marco Caccamo
- 标签：`强化学习` `策略优化` `奖励设计` `离线 RL`
- 中文摘要：【LLM 暂不可用，先保留英文摘要要点】Residual policy learning (RPL), in which a learned policy refines a static base policy using deep reinforcement learning (DRL), has shown strong performance across various robotic applications. Its effectiveness is part…
- 链接：[DOI](https://doi.org/10.48550/arXiv.2603.12960) | [arXiv](https://arxiv.org/abs/2603.12960v1) | [PDF](https://arxiv.org/pdf/2603.12960v1)

### 2. Beyond Imitation: Reinforcement Learning Fine-Tuning for Adaptive Diffusion Navigation Policies
> Beyond Imitation，采用扩散模型解决机器人操作，实现零样本泛化
- 作者：Junhe Sheng, Ruofei Bai, Kuan Xu, Ruimeng Liu 等另外4人
- 标签：`强化学习` `策略优化` `奖励设计` `离线 RL`
- 中文摘要：【LLM 暂不可用，先保留英文摘要要点】Diffusion-based robot navigation policies trained on large-scale imitation learning datasets, can generate multi-modal trajectories directly from the robot's visual observations, bypassing the traditional localization-m…
- 链接：[DOI](https://doi.org/10.48550/arXiv.2603.12868) | [arXiv](https://arxiv.org/abs/2603.12868v1) | [PDF](https://arxiv.org/pdf/2603.12868v1)