It's Here! A Roundup of ECCV 2024 Autonomous Driving Papers


自动驾驶之心 (Auto Driving Heart) · Published 2024-07-21 00:01:30


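The ECCV 2024 acceptance results have been out for a while! Auto Driving Heart has been compiling them all along, and today we are sharing the outstanding work related to autonomous driving.

Editor | Auto Driving Heart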

Collection link: https://github.com/autodriving-heart/ECCV-2024-Papers-Autonomous-Driving

We will promptly include more related works in this repository. Please stay tuned!!!

We also kindly invite you to our platform, Auto Driving Heart, for paper interpretation and sharing. If you would like to promote your work, please feel free to contact me.

1) End-to-End Autonomous Driving

GenAD: Generative End-to-End Autonomous Driving

paper: https://arxiv.org/pdf/2402.11502
code: https://github.com/wzzheng/GenAD

2) LLM Agent

DriveLM: Driving with Graph Visual Question Answering

paper: https://arxiv.org/pdf/2312.14150
code: https://github.com/OpenDriveLab/DriveLM

ELM: Embodied Understanding of Driving Scenarios

paper: https://arxiv.org/pdf/2403.04593
code: https://github.com/OpenDriveLab/ELM

Controllable Navigation Instruction Generation with Chain of Thought Prompting

paper: coming soon
code: https://github.com/refkxh/C-Instructor

Asynchronous Large Language Model Enhanced Planner for Autonomous Driving

paper: coming soon
code: coming soon

TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes

paper: https://arxiv.org/pdf/2403.19589
code: https://github.com/jxbbb/TOD3Cap

Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection

paper: coming soon
code: https://github.com/GradiusTwinbee/GLIS

3) SSC: Semantic Scene Completion

Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion

paper: https://arxiv.org/pdf/2407.02077
code: https://github.com/Arlo0o/HTCL

4) OCC: Occupancy Prediction

Fully Sparse 3D Occupancy Prediction

paper: https://arxiv.org/pdf/2312.17118
code: https://github.com/MCG-NJU/SparseOcc

GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction

paper: https://arxiv.org/pdf/2405.17429
code: https://github.com/huang-yh/GaussianFormer

Occupancy as Set of Points

paper: https://arxiv.org/pdf/2407.04049
code: https://github.com/hustvl/osp

5) World Model

OccWorld: 3D World Model for Autonomous Driving

paper: https://arxiv.org/pdf/2311.16038
code: https://github.com/wzzheng/OccWorld

Modelling Competitive Behaviors in Autonomous Driving Under Generative World Model

paper: coming soon
code: coming soon

DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving

paper: https://arxiv.org/pdf/2309.09777
code: https://github.com/JeffWang987/DriveDreamer

6) HD-Mapping

MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD Mapping

paper: https://arxiv.org/pdf/2403.15951
code: https://github.com/woodfrog/maptracker

ADMap: Anti-disturbance framework for reconstructing online vectorized HD map

paper: coming soon
code: https://github.com/hht1996ok/ADMap

Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature Attention

paper: coming soon
code: https://github.com/alfredgu001324/MapBEVPrediction

Leveraging Enhanced Queries of Point Sets for Vectorized Map Construction

paper: https://arxiv.org/pdf/2402.17430
code: https://github.com/HXMap/MapQR

7) Foundation Model

PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects

paper: coming soon
code: coming soon

8) Robust Perception

Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather

paper: https://arxiv.org/pdf/2407.02286
code: https://github.com/engineerJPark/LiDAR-DataAug4Weather

R2-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations

paper: coming soon
code: https://github.com/lxa9867/r2bench

9) 3D Object Detection

Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance

paper: https://arxiv.org/pdf/2312.07530
code: https://github.com/KuanchihHuang/VG-W3D

GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection

paper: https://arxiv.org/pdf/2403.11848
code: https://github.com/adept-thu/GraphBEV

RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection

paper: coming soon
code: https://github.com/lucifer443/RecurrentBEV

Ray Denoising: Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection

paper: https://arxiv.org/pdf/2402.03634
code: https://github.com/LiewFeng/RayDN

MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection

paper: coming soon
code: https://github.com/VisualAIKHU/MonoWAD

DualBEV: CNN is All You Need in View Transformation

paper: https://arxiv.org/pdf/2403.05402
code: https://github.com/PeidongLi/DualBEV

OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection

paper: coming soon
code: https://github.com/AlmoonYsl/OPEN

Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression

paper: coming soon
code: coming soon

SEED: A Simple and Effective 3D DETR in Point Clouds

paper: coming soon
code: coming soon

Towards Stable 3D Object Detection

paper: https://arxiv.org/pdf/2407.04305
code: https://github.com/jbwang1997/StabilityIndex

10) Domain Adaptation & Test-Time Adaptation

Enhancing Source-Free Domain Adaptive Object Detection with Low-Confidence Pseudo-Label Distillation

paper: coming soon
code: https://github.com/junia3/LPLD

Fully Test-Time Adaptation for Monocular 3D Object Detection

paper: coming soon
code: https://github.com/Hongbin98/MonoTTA

Progressive Classifier and Feature Extractor Adaptation for Unsupervised Domain Adaptation on Point Clouds

paper: https://arxiv.org/pdf/2303.01276
code: https://github.com/xiaoyao3302/PCFEA

11) Cooperative Perception

Plug and Play: A Representation Enhanced Domain Adapter for Collaborative Perception

paper: coming soon
code: https://github.com/luotianyou349/PnPDA

12) SLAM

13) Scene Flow Estimation

4D Contrastive Superflows are Dense 3D Representation Learners

paper: coming soon
code: https://github.com/Xiangxu-0103/SuperFlow

14) Point Cloud

T-CorresNet: Template Guided 3D Point Cloud Completion with Correspondence Pooling Query Generation Strategy

paper: coming soon
code: https://github.com/df-boy/T-CorresNet

15) Efficient Network

16) Segmentation

17) Millimeter-Wave Radar

18) NeRF & Gaussian Splatting

Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting

paper: https://arxiv.org/pdf/2401.01339
code: https://github.com/zju3dv/street_gaussians

MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images

paper: https://arxiv.org/pdf/2403.14627
code: https://github.com/donydchen/mvsplat

GScream: Learning 3D Geometry and Feature Consistent Gaussian Splatting for Object Removal

paper: https://arxiv.org/pdf/2404.13679
code: https://github.com/W-Ted/GScream

BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream

paper: coming soon
code: https://github.com/WU-CVGL/BeNeRF

PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors

paper: https://arxiv.org/pdf/2403.09079
code: https://github.com/yuantianyuan01/PreSight

GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting

paper: https://arxiv.org/pdf/2403.08551
code: https://github.com/Xinjie-Q/GaussianImage

SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization

paper: coming soon
code: https://github.com/Iris-cyy/SG-NeRF

Disentangled Generation and Aggregation for Robust Radiance Fields

paper: coming soon
code: https://github.com/GaoHchen/Robust-Triplane

19) MOT: Multi-Object Tracking

Beyond MOT: Semantic Multi-Object Tracking

paper: coming soon
code: https://github.com/HengLan/SMOT

20) Multi-label Atomic Activity Recognition

21) Motion Prediction

22) Trajectory Prediction

23) Depth Estimation

Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation

paper: coming soon
code: https://github.com/zhyever/PatchRefiner

24) Event Camera

25) Odometry

DVLO: Deep Visual-LiDAR Odometry with Local-to-Global Feature Fusion and Bi-Directional Structure Alignment

paper: coming soon
code: https://github.com/IRMVLab/DVLO

Postscript

This list of papers is primarily curated by Rujia Wang.

If you have any questions about the paper list, please do not hesitate to email me or the Auto Driving Heart team, or open an issue on GitHub.
