Computer Vision

NRGS: Neural regularization for robust 3D semantic Gaussian splatting

TBA.

Jul 9, 2026

High-fidelity multi-view normal integration with scale-encoded neural surface representation

We propose a scale-encoded neural surface representation that incorporates the pixel coverage area into the neural representation.

Jul 1, 2026

BioVITA: Biological dataset, model, and benchmark for visual-textual-acoustic alignment
BioVITA: Biological dataset, model, and benchmark for visual-textual-acoustic alignment

We propose BioVITA, visual-textual-acoustic alignment framework for animal understanding.

Jun 3, 2026

PlantPose: Universal plant skeleton estimation via tree-constrained graph generation
PlantPose: Universal plant skeleton estimation via tree-constrained graph generation

We propose PlantPose, a universal plant skeleton estimator via tree-constrained graph generation.

Jun 1, 2026

Unsupervised 3D human pose estimation via conditional multi-view ancestral sampling

We propose conditional multi-view ancestral sampling (cMAS) for single view 3D human pose estimation without 3D supervision.

May 25, 2026

DP-SfM: Dual-pixel structure-from-motion without scale ambiguity
DP-SfM: Dual-pixel structure-from-motion without scale ambiguity

We integrate structure-from-motion (SfM) and dual-pixel (DP) imaging to resolving scale ambiguity.

May 5, 2026

Gaussian mesh renderer for lightweight differentiable rendering

We propose Gaussian Mesh Renderer (GMR), lightweight differentiable mesh renderer using 3DGS rasterizer.

May 4, 2026

AnimalCLAP: Taxonomy-aware language-audio pretraining for species recognition and trait inference

We introduce AnimalCLAP, a taxonomy-aware language-audio framework that incorporate hierarchical biological information.

May 4, 2026

Instance-wise distribution control of text-to-image diffusion models
Instance-wise distribution control of text-to-image diffusion models

We propose an instance-wise control the attribute distributions in the generated images of diffusion models.

Apr 1, 2026

Talking with Actionbits---A part-enhanced VLM for action and interaction recognition in animals

Mar 21, 2026