Visual Representation Definition

Interactive Semantic Map Representation for Skill-Based Visual Object Navigation

Abstract: Visual object navigation is one of the key tasks in mobile robotics. One of the most important components of this task is the accurate semantic representation of the scene, which is needed ...

2025 in visual storytelling

Explore some favorite visual stories of designers, developers and art directors from The Washington Post’s Design, Graphics ...

IEEE

Semantically Consistent Visual Representation for Adversarial Robustness

Abstract: Deep neural networks have been widely used in various domains owing to the success of deep learning. However, recent studies have shown that these models are vulnerable to adversarial ...

GitHub

Don’t Blind Your VLA: Aligning Visual Representations for OOD Generalization

To address the degradation of visual-language (VL) representations during VLA supervised fine-tuning (SFT), we introduce Visual Representation Alignment. During SFT, we pull a VLA’s visual tokens ...

Microsoft

LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation

CLIP is one of the most important multimodal foundational models today. What powers CLIP’s capabilities? The rich supervision signals provided by natural language, the carrier of human knowledge, ...

Microsoft

LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation - Microsoft Research

CLIP is one of the most important multimodal foundational models today, aligning visual and textual signals into a shared feature space using a simple contrastive learning loss on large-scale ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results