2025-01-14 |
Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers |
Efstathios Karypidis et.al. |
2501.08303 |
null |
2025-01-14 |
A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation |
Steven Landgraf et.al. |
2501.08188 |
null |
2025-01-14 |
Threshold Attention Network for Semantic Segmentation of Remote Sensing Images |
Wei Long et.al. |
2501.07984 |
null |
2025-01-14 |
Balance Divergence for Knowledge Distillation |
Yafei Qi et.al. |
2501.07804 |
null |
2025-01-13 |
Kolmogorov-Arnold Network for Remote Sensing Image Semantic Segmentation |
Xianping Ma et.al. |
2501.07390 |
null |
2025-01-13 |
Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion |
Li Liang et.al. |
2501.07260 |
link |
2025-01-12 |
LarvSeg: Exploring Image Classification Data For Large Vocabulary Semantic Segmentation via Category-wise Attentive Classifier |
Haojun Yu et.al. |
2501.06862 |
link |
2025-01-12 |
SAM-DA: Decoder Adapter for Efficient Medical Domain Adaptation |
Javier Gamazo Tejero et.al. |
2501.06836 |
null |
2025-01-11 |
Parking Space Detection in the City of Granada |
Crespo-Orti Luis et.al. |
2501.06651 |
link |
2025-01-06 |
The 2nd Place Solution from the 3D Semantic Segmentation Track in the 2024 Waymo Open Dataset Challenge |
Qing Wu et.al. |
2501.05472 |
null |
2025-01-09 |
Domain-Incremental Semantic Segmentation for Autonomous Driving under Adverse Driving Conditions |
Shishir Muralidhara et.al. |
2501.05246 |
null |
2025-01-09 |
Advancing ALS Applications with Large-Scale Pre-training: Dataset Development and Downstream Assessment |
Haoyi Xiu et.al. |
2501.05095 |
null |
2025-01-08 |
Test-Time Optimization for Domain Adaptive Open Vocabulary Segmentation |
Ulindu De Silva et.al. |
2501.04696 |
link |
2025-01-07 |
Superpixel Boundary Correction for Weakly-Supervised Semantic Segmentation on Histopathology Images |
Hongyi Wu et.al. |
2501.03891 |
null |
2025-01-07 |
Image Segmentation: Inducing graph-based learning |
Aryan Singh et.al. |
2501.03765 |
link |
2025-01-06 |
4D-CS: Exploiting Cluster Prior for 4D Spatio-Temporal LiDAR Semantic Segmentation |
Jiexi Zhong et.al. |
2501.02937 |
null |
2025-01-08 |
GLoG-CSUnet: Enhancing Vision Transformers with Adaptable Radiomic Features for Medical Image Segmentation |
Niloufar Eghbali et.al. |
2501.02788 |
link |
2025-01-04 |
Unsupervised Class Generation to Expand Semantic Segmentation Datasets |
Javier Montalvo et.al. |
2501.02264 |
null |
2025-01-03 |
Semantic Segmentation for Sequential Historical Maps by Learning from Only One Map |
Yunshuang Yuan et.al. |
2501.01845 |
null |
2025-01-03 |
IAM: Enhancing RGB-D Instance Segmentation with New Benchmarks |
Aecheon Jung et.al. |
2501.01685 |
link |
2025-01-03 |
Uncertainty and Energy based Loss Guided Semi-Supervised Semantic Segmentation |
Rini Smita Thakur et.al. |
2501.01640 |
null |
2025-01-02 |
A Multi-task Supervised Compression Model for Split Computing |
Yoshitomo Matsubara et.al. |
2501.01420 |
link |
2025-01-03 |
FGAseg: Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation |
Bingyu Li et.al. |
2501.00877 |
link |
2024-12-31 |
PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAM |
Runnan Chen et.al. |
2501.00352 |
null |
2024-12-31 |
OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies |
Runnan Chen et.al. |
2501.00326 |
null |
2024-12-30 |
HisynSeg: Weakly-Supervised Histopathological Image Segmentation via Image-Mixing Synthesis and Consistency Regularization |
Zijie Fang et.al. |
2412.20924 |
link |
2024-12-30 |
LiDAR-Camera Fusion for Video Panoptic Segmentation without Video Training |
Fardin Ayar et.al. |
2412.20881 |
null |
2024-12-29 |
Image Augmentation Agent for Weakly Supervised Semantic Segmentation |
Wangyu Wu et.al. |
2412.20439 |
null |
2024-12-27 |
Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP |
Zhongxing Xu et.al. |
2412.19650 |
null |
2024-12-27 |
An Actionable Hierarchical Scene Representation Enhancing Autonomous Inspection Missions in Unknown Environments |
Vignesh Kottayam Viswanathan et.al. |
2412.19582 |
null |
2024-12-27 |
Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation |
Chengyang Ye et.al. |
2412.19492 |
link |
2024-12-26 |
Impact of color and mixing proportion of synthetic point clouds on semantic segmentation |
Shaojie Zhou et.al. |
2412.19145 |
null |
2024-12-24 |
AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction |
Pufan Zou et.al. |
2412.18255 |
null |
2024-12-25 |
VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis |
Shicheng Yin et.al. |
2412.18178 |
link |
2024-12-24 |
UniPLV: Towards Label-Efficient Open-World 3D Scene Understanding by Regional Visual Language Supervision |
Yuru Wang et.al. |
2412.18131 |
null |
2024-12-24 |
LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding |
Hao Li et.al. |
2412.17635 |
null |
2024-12-25 |
AFANet: Adaptive Frequency-Aware Network for Weakly-Supervised Few-Shot Semantic Segmentation |
Jiaqi Ma et.al. |
2412.17601 |
link |
2024-12-24 |
Uncertainty-Participation Context Consistency Learning for Semi-supervised Semantic Segmentation |
Jianjian Yin et.al. |
2412.17331 |
link |
2024-12-22 |
Multi-Scale Foreground-Background Confidence for Out-of-Distribution Segmentation |
Samuel Marschall et.al. |
2412.16990 |
null |
2024-12-22 |
Detect Changes like Humans: Incorporating Semantic Priors for Improved Change Detection |
Yuhang Gan et.al. |
2412.16918 |
null |
2024-12-22 |
MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection |
Xu Zheng et.al. |
2412.16876 |
null |
2024-12-22 |
Adversarial Diffusion Model for Unsupervised Domain-Adaptive Semantic Segmentation |
Jongmin Yu et.al. |
2412.16859 |
null |
2024-12-21 |
A Novel Approach to Tomato Harvesting Using a Hybrid Gripper with Semantic Segmentation and Keypoint Detection |
Shahid Ansari et.al. |
2412.16755 |
null |
2024-12-21 |
IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks |
Yaming Zhang et.al. |
2412.16654 |
link |
2024-12-21 |
V"Mean"ba: Visual State Space Models only need 1 hidden dimension |
Tien-Yu Chi et.al. |
2412.16602 |
null |
2024-12-20 |
DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment |
Cijo Jose et.al. |
2412.16334 |
null |
2024-12-20 |
SegCol Challenge: Semantic Segmentation for Tools and Fold Edges in Colonoscopy data |
Xinwei Ju et.al. |
2412.16078 |
link |
2024-12-20 |
Enhancing Generalized Few-Shot Semantic Segmentation via Effective Knowledge Transfer |
Xinyue Chen et.al. |
2412.15835 |
link |
2024-12-19 |
GIRAFE: Glottal Imaging Dataset for Advanced Segmentation, Analysis, and Facilitative Playbacks Evaluation |
G. Andrade-Miranda et.al. |
2412.15054 |
link |
2024-12-19 |
Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation |
Zhenxin Lei et.al. |
2412.14587 |
null |
2024-12-18 |
Split Learning in Computer Vision for Semantic Segmentation Delay Minimization |
Nikos G. Evgenidis et.al. |
2412.14272 |
null |
2024-12-18 |
Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation |
Jianyu Zhang et.al. |
2412.14145 |
null |
2024-12-18 |
Prompt Categories Cluster for Weakly Supervised Semantic Segmentation |
Wangyu Wu et.al. |
2412.13823 |
null |
2024-12-18 |
Federated Source-free Domain Adaptation for Classification: Weighted Cluster Aggregation for Unlabeled Data |
Junki Mori et.al. |
2412.13757 |
null |
2024-12-18 |
Optical aberrations in autonomous driving: Physics-informed parameterized temperature scaling for neural network uncertainty calibration |
Dominik Werner Wolf et.al. |
2412.13695 |
null |
2024-12-18 |
GAGS: Granularity-Aware Feature Distillation for Language Gaussian Splatting |
Yuning Peng et.al. |
2412.13654 |
null |
2024-12-17 |
Efficient Event-based Semantic Segmentation with Spike-driven Lightweight Transformer-based Networks |
Xiaxin Zhu et.al. |
2412.12843 |
null |
2024-12-17 |
Open-World Panoptic Segmentation |
Matteo Sodano et.al. |
2412.12740 |
null |
2024-12-17 |
SemStereo: Semantic-Constrained Stereo Matching Network for Remote Sensing |
Chen Chen et.al. |
2412.12685 |
null |
2024-12-17 |
Structural Pruning via Spatial-aware Information Redundancy for Semantic Segmentation |
Dongyue Wu et.al. |
2412.12672 |
link |
2024-12-17 |
Adaptive Prototype Replay for Class Incremental Semantic Segmentation |
Guilin Zhu et.al. |
2412.12669 |
null |
2024-12-17 |
SEG-SAM: Semantic-Guided SAM for Unified Medical Image Segmentation |
Shuangping Huang et.al. |
2412.12660 |
null |
2024-12-16 |
Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation |
Hongwei Niu et.al. |
2412.12050 |
link |
2024-12-16 |
SAMIC: Segment Anything with In-Context Spatial Prompt Engineering |
Savinay Nagendra et.al. |
2412.11998 |
null |
2024-12-16 |
SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation |
Yunxiang Fu et.al. |
2412.11890 |
link |
2024-12-16 |
Towards Adversarial Robustness of Model-Level Mixture-of-Experts Architectures for Semantic Segmentation |
Svetlana Pavlitska et.al. |
2412.11608 |
null |
2024-12-15 |
MoRe: Class Patch Attention Needs Regularization for Weakly Supervised Semantic Segmentation |
Zhiwei Yang et.al. |
2412.11076 |
link |
2024-12-14 |
RapidNet: Multi-Level Dilated Convolution Based Mobile Backbone |
Mustafa Munir et.al. |
2412.10995 |
link |
2024-12-14 |
DCSEG: Decoupled 3D Open-Set Segmentation using Gaussian Splatting |
Luis Wiedmann et.al. |
2412.10972 |
link |
2024-12-14 |
SegACIL: Solving the Stability-Plasticity Dilemma in Class-Incremental Semantic Segmentation |
Jiaxu Li et.al. |
2412.10834 |
link |
2024-12-14 |
Neural Network Meta Classifier: Improving the Reliability of Anomaly Segmentation |
Jurica Runtas et.al. |
2412.10765 |
link |
2024-12-14 |
OmniHD-Scenes: A Next-Generation Multimodal Dataset for Autonomous Driving |
Lianqing Zheng et.al. |
2412.10734 |
null |
2024-12-13 |
A Universal Degradation-based Bridging Technique for Domain Adaptive Semantic Segmentation |
Wangkai Li et.al. |
2412.10339 |
null |
2024-12-13 |
SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians |
Siyun Liang et.al. |
2412.10231 |
null |
2024-12-13 |
Object-Focused Data Selection for Dense Prediction Tasks |
Niclas Popp et.al. |
2412.10032 |
null |
2024-12-12 |
Towards Open-Vocabulary Video Semantic Segmentation |
Xinhao Li et.al. |
2412.09329 |
link |
2024-12-16 |
FAMNet: Frequency-aware Matching Network for Cross-domain Few-shot Medical Image Segmentation |
Yuntian Bo et.al. |
2412.09319 |
link |
2024-12-12 |
VLMs meet UDA: Boosting Transferability of Open Vocabulary Segmentation with Unsupervised Domain Adaptation |
Roberto Alcover-Couso et.al. |
2412.09240 |
null |
2024-12-11 |
A Deep Semantic Segmentation Network with Semantic and Contextual Refinements |
Zhiyan Wang et.al. |
2412.08671 |
null |
2024-12-11 |
A feature refinement module for light-weight semantic segmentation network |
Zhiyan Wang et.al. |
2412.08670 |
null |
2024-12-11 |
SegFace: Face Segmentation of Long-Tail Classes |
Kartik Narayan et.al. |
2412.08647 |
link |
2024-12-11 |
EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation |
Hongwei Niu et.al. |
2412.08628 |
link |
2024-12-12 |
Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning |
Fan Lu et.al. |
2412.08614 |
link |
2024-12-11 |
Hierarchical Context Alignment with Disentangled Geometric and Temporal Modeling for Semantic Occupancy Prediction |
Bohan Li et.al. |
2412.08243 |
null |
2024-12-11 |
THUD++: Large-Scale Dynamic Indoor Scene Dataset and Benchmark for Mobile Robots |
Zeshun Li et.al. |
2412.08096 |
null |
2024-12-11 |
Static-Dynamic Class-level Perception Consistency in Video Semantic Segmentation |
Zhigang Cen et.al. |
2412.08034 |
null |
2024-12-09 |
SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception |
Yaniv Benny et.al. |
2412.06968 |
null |
2024-12-10 |
ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet |
Andrei-Robert Alexandrescu et.al. |
2412.06742 |
null |
2024-12-09 |
Active Learning with Context Sampling and One-vs-Rest Entropy for Semantic Segmentation |
Fei Wu et.al. |
2412.06470 |
null |
2024-12-09 |
GCUNet: A GNN-Based Contextual Learning Network for Tertiary Lymphoid Structure Semantic Segmentation in Whole Slide Image |
Lei Su et.al. |
2412.06129 |
null |
2024-12-08 |
Efficient Semantic Splatting for Remote Sensing Multi-view Segmentation |
Zipeng Qi et.al. |
2412.05969 |
null |
2024-12-08 |
CSG: A Context-Semantic Guided Diffusion Approach in De Novo Musculoskeletal Ultrasound Image Generation |
Elay Dahan et.al. |
2412.05833 |
null |
2024-12-10 |
RSUniVLM: A Unified Vision Language Model for Remote Sensing via Granularity-oriented Mixture of Experts |
Xu Liu et.al. |
2412.05679 |
link |
2024-12-06 |
FogROS2-FT: Fault Tolerant Cloud Robotics |
Kaiyuan Chen et.al. |
2412.05408 |
null |
2024-12-06 |
Generative Model-Based Fusion for Improved Few-Shot Semantic Segmentation of Infrared Images |
Junno Yun et.al. |
2412.05341 |
null |
2024-12-05 |
Assessing and Learning Alignment of Unimodal Vision and Language Models |
Le Zhang et.al. |
2412.04616 |
null |
2024-12-05 |
A Hitchhiker's Guide to Understanding Performances of Two-Class Classifiers |
Anaïs Halin et.al. |
2412.04377 |
null |
2024-12-05 |
Customize Segment Anything Model for Multi-Modal Semantic Segmentation with Mixture of LoRA Experts |
Chenyang Zhu et.al. |
2412.04220 |
null |
2024-12-05 |
Text Change Detection in Multilingual Documents Using Image Comparison |
Doyoung Park et.al. |
2412.04137 |
null |
2024-12-05 |
SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning |
Seokju Yun et.al. |
2412.04077 |
link |
2024-12-05 |
Quality Control in Open-Ended Crowdsourcing: A Survey |
Lei Chai et.al. |
2412.03991 |
null |
2024-12-05 |
LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model |
Yuan Xue et.al. |
2412.03841 |
null |
2024-12-04 |
Designing DNNs for a trade-off between robustness and processing performance in embedded devices |
Jon Gutiérrez-Zaballa et.al. |
2412.03682 |
null |
2024-12-04 |
Evaluating Single Event Upsets in Deep Neural Networks for Semantic Segmentation: an embedded system perspective |
Jon Gutiérrez-Zaballa et.al. |
2412.03630 |
link |
2024-12-04 |
FLAIR: VLM with Fine-grained Language-informed Image Representations |
Rui Xiao et.al. |
2412.03561 |
link |
2024-12-04 |
Benchmarking Pretrained Attention-based Models for Real-Time Recognition in Robot-Assisted Esophagectomy |
Ronald L. P. D. de Jong et.al. |
2412.03401 |
null |
2024-12-04 |
Task-driven Image Fusion with Learnable Fusion Loss |
Haowen Bai et.al. |
2412.03240 |
null |
2024-12-04 |
Biologically-inspired Semi-supervised Semantic Segmentation for Biomedical Imaging |
Luca Ciampi et.al. |
2412.03192 |
null |
2024-12-04 |
Is Foreground Prototype Sufficient? Few-Shot Medical Image Segmentation with Background-Fused Prototype |
Song Tang et.al. |
2412.02983 |
null |
2024-12-04 |
Progressive Vision-Language Prompt for Multi-Organ Multi-Class Cell Semantic Segmentation with Single Branch |
Qing Zhang et.al. |
2412.02978 |
null |
2024-12-04 |
Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution |
Jiahua Xiao et.al. |
2412.02960 |
null |
2024-12-03 |
SJTU:Spatial judgments in multimodal models towards unified segmentation through coordinate detection |
Joongwon Chae et.al. |
2412.02565 |
link |
2024-12-03 |
AH-OCDA: Amplitude-based Curriculum Learning and Hopfield Segmentation Model for Open Compound Domain Adaptation |
Jaehyun Choi et.al. |
2412.02280 |
null |
2024-12-03 |
Multi-robot autonomous 3D reconstruction using Gaussian splatting with Semantic guidance |
Jing Zeng et.al. |
2412.02249 |
null |
2024-12-02 |
INSIGHT: Explainable Weakly-Supervised Medical Image Analysis |
Wenbo Zhang et.al. |
2412.02012 |
null |
2024-12-02 |
Global Average Feature Augmentation for Robust Semantic Segmentation with Transformers |
Alberto Gonzalo Rodriguez Salgado et.al. |
2412.01941 |
null |
2024-12-02 |
COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training |
Sanghwan Kim et.al. |
2412.01814 |
link |
2024-12-02 |
Robust and Transferable Backdoor Attacks Against Deep Image Compression With Selective Frequency Prior |
Yi Yu et.al. |
2412.01646 |
null |
2024-12-02 |
Epipolar Attention Field Transformers for Bird's Eye View Semantic Segmentation |
Christian Witte et.al. |
2412.01595 |
null |
2024-12-01 |
Token Cropr: Faster ViTs for Quite a Few Tasks |
Benjamin Bergner et.al. |
2412.00965 |
link |
2024-12-01 |
2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification |
Jingwei Zhang et.al. |
2412.00678 |
link |
2024-11-30 |
Density-aware Global-Local Attention Network for Point Cloud Segmentation |
Chade Li et.al. |
2412.00489 |
null |
2024-11-30 |
TAROT: Targeted Data Selection via Optimal Transport |
Lan Feng et.al. |
2412.00420 |
link |
2024-11-30 |
GradiSeg: Gradient-Guided Gaussian Segmentation with Enhanced 3D Boundary Precision |
Zehao Li et.al. |
2412.00392 |
null |
2024-11-30 |
LMSeg: Unleashing the Power of Large-Scale Models for Open-Vocabulary Semantic Segmentation |
Huadong Tang et.al. |
2412.00364 |
null |
2024-11-29 |
LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention |
Zewen Du et.al. |
2411.19585 |
link |
2024-11-29 |
Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding |
Wenbo Zhang et.al. |
2411.19551 |
null |
2024-11-29 |
Retrieval-guided Cross-view Image Synthesis |
Hongji Yang et.al. |
2411.19510 |
null |
2024-11-28 |
MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers |
Jongseong Bae et.al. |
2411.18995 |
null |
2024-11-28 |
Textured As-Is BIM via GIS-informed Point Cloud Segmentation |
Mohamed S. H. Alabassy et.al. |
2411.18898 |
null |
2024-11-27 |
The Last Mile to Supervised Performance: Semi-Supervised Domain Adaptation for Semantic Segmentation |
Daniel Morales-Brotons et.al. |
2411.18728 |
null |
2024-11-27 |
HoliSDiP: Image Super-Resolution via Holistic Semantics and Diffusion Prior |
Li-Yuan Tsao et.al. |
2411.18662 |
link |
2024-11-26 |
Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation |
Sudarshan Rajagopalan et.al. |
2411.17814 |
null |
2024-12-02 |
Efficient Multi-modal Large Language Models via Visual Token Grouping |
Minbin Huang et.al. |
2411.17773 |
null |
2024-11-26 |
Rapid Deployment of Domain-specific Hyperspectral Image Processors with Application to Autonomous Driving |
Jon Gutiérrez-Zaballa et.al. |
2411.17543 |
null |
2024-11-26 |
Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning |
Hoàng-Ân Lê et.al. |
2411.17536 |
link |
2024-11-26 |
TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba |
Xiaowen Ma et.al. |
2411.17473 |
link |
2024-11-26 |
MRIFE: A Mask-Recovering and Interactive-Feature-Enhancing Semantic Segmentation Network For Relic Landslide Detection |
Juefei He et.al. |
2411.17167 |
null |
2024-11-26 |
Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation |
Chanyoung Kim et.al. |
2411.17150 |
null |
2024-11-26 |
ΩSFormer: Dual-Modal Ω-like Super-Resolution Transformer Network for Cross-scale and High-accuracy Terraced Field Vectorization Extraction |
Chang Li et.al. |
2411.17088 |
null |
2024-11-26 |
SCASeg: Strip Cross-Attention for Efficient Semantic Segmentation |
Guoan Xu et.al. |
2411.17061 |
null |
2024-11-25 |
Deformable Mamba for Wide Field of View Segmentation |
Jie Hu et.al. |
2411.16481 |
link |
2024-11-25 |
A Study on Unsupervised Domain Adaptation for Semantic Segmentation in the Era of Vision-Language Models |
Manuel Schwonberg et.al. |
2411.16407 |
null |
2024-11-25 |
A Performance Increment Strategy for Semantic Segmentation of Low-Resolution Images from Damaged Roads |
Rafael S. Toledo et.al. |
2411.16295 |
link |
2024-11-25 |
Learn from Foundation Model: Fruit Detection Model without Manual Annotation |
Yanan Wang et.al. |
2411.16196 |
link |
2024-11-25 |
Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training |
Man Yao et.al. |
2411.16061 |
link |
2024-11-24 |
Deep Learning for automated multi-scale functional field boundaries extraction using multi-date Sentinel-2 and PlanetScope imagery: Case Study of Netherlands and Pakistan |
Saba Zahid et.al. |
2411.15923 |
null |
2024-11-24 |
Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation |
Sule Bai et.al. |
2411.15869 |
link |
2024-11-24 |
ResCLIP: Residual Attention for Training-free Dense Vision-language Inference |
Yuhang Yang et.al. |
2411.15851 |
link |
2024-11-24 |
Integrating Deep Metric Learning with Coreset for Active Learning in 3D Segmentation |
Arvind Murari Vepa et.al. |
2411.15763 |
link |
2024-11-22 |
Effective SAM Combination for Open-Vocabulary Semantic Segmentation |
Minhyeok Lee et.al. |
2411.14723 |
null |
2024-11-21 |
Revisiting the Integration of Convolution and Attention for Vision Backbone |
Lei Zhu et.al. |
2411.14429 |
link |
2024-11-21 |
CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation |
Lin Sun et.al. |
2411.13836 |
link |
2024-11-21 |
Segment Any Class (SAC): Multi-Class Few-Shot Semantic Segmentation via Class Region Proposals |
Hussni Mohd Zakir et.al. |
2411.13774 |
null |
2024-11-20 |
FAST-Splat: Fast, Ambiguity-Free Semantics Transfer in Gaussian Splatting |
Ola Shorinwa et.al. |
2411.13753 |
null |
2024-11-20 |
BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation |
Umamaheswaran Raman Kumar et.al. |
2411.13251 |
null |
2024-11-20 |
XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation |
Ziyi Wang et.al. |
2411.13243 |
link |
2024-11-20 |
Automating Sonologists USG Commands with AI and Voice Interface |
Emad Mohamed et.al. |
2411.13006 |
null |
2024-11-19 |
A Multimodal Approach Combining Structural and Cross-domain Textual Guidance for Weakly Supervised OCT Segmentation |
Jiaqi Yang et.al. |
2411.12615 |
link |
2024-11-19 |
SAM Carries the Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical Segmentation |
Ron Keuth et.al. |
2411.12602 |
link |
2024-11-15 |
ULTra: Unveiling Latent Token Interpretability in Transformer Based Understanding |
Hesam Hosseini et.al. |
2411.12589 |
null |
2024-11-19 |
ADV2E: Bridging the Gap Between Analogue Circuit and Discrete Frames in the Video-to-Events Simulator |
Xiao Jiang et.al. |
2411.12250 |
null |
2024-11-18 |
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements |
M. Arda Aydın et.al. |
2411.12044 |
link |
2024-11-18 |
Calibrated and Efficient Sampling-Free Confidence Estimation for LiDAR Scene Semantic Segmentation |
Hanieh Shojaei Miandashti et.al. |
2411.11935 |
null |
2024-11-18 |
MAIRA-Seg: Enhancing Radiology Report Generation with Segmentation-Aware Multimodal Large Language Models |
Harshita Sharma et.al. |
2411.11362 |
null |
2024-11-18 |
Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications |
Scarlett Raine et.al. |
2411.11287 |
null |
2024-11-16 |
Attention-based U-Net Method for Autonomous Lane Detection |
Mohammadhamed Tangestanizadeh et.al. |
2411.10902 |
null |
2024-11-16 |
Automatic Discovery and Assessment of Interpretable Systematic Errors in Semantic Segmentation |
Jaisidh Singh et.al. |
2411.10845 |
null |
2024-11-15 |
Y-MAP-Net: Real-time depth, normals, segmentation, multi-label captioning and 2D human pose in RGB images |
Ammar Qammaz et.al. |
2411.10334 |
null |
2024-11-15 |
CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation |
Dengke Zhang et.al. |
2411.10086 |
null |
2024-11-14 |
OneNet: A Channel-Wise 1D Convolutional U-Net |
Sanghyun Byun et.al. |
2411.09838 |
link |
2024-11-14 |
Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks |
Zengyi Yang et.al. |
2411.09387 |
null |
2024-11-14 |
Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation |
Yuheng Shi et.al. |
2411.09219 |
link |
2024-11-14 |
Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery |
Ashim Dahal et.al. |
2411.09101 |
link |
2024-11-13 |
CoMiX: Cross-Modal Fusion with Deformable Convolutions for HSI-X Semantic Segmentation |
Xuming Zhang et.al. |
2411.09023 |
null |
2024-11-14 |
Masked Image Modeling Boosting Semi-Supervised Semantic Segmentation |
Yangyang Li et.al. |
2411.08756 |
null |
2024-11-13 |
Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model |
Jun Xie et.al. |
2411.08592 |
null |
2024-11-12 |
Isometric Transformations for Image Augmentation in Mueller Matrix Polarimetry |
Christopher Hahne et.al. |
2411.07918 |
link |
2024-11-11 |
SIESEF-FusionNet: Spatial Inter-correlation Enhancement and Spatially-Embedded Feature Fusion Network for LiDAR Point Cloud Semantic Segmentation |
Jiale Chen et.al. |
2411.06991 |
null |
2024-11-14 |
Can KAN Work? Exploring the Potential of Kolmogorov-Arnold Networks in Computer Vision |
Yueyang Cang et.al. |
2411.06727 |
null |
2024-11-10 |
Few-shot Semantic Learning for Robust Multi-Biome 3D Semantic Mapping in Off-Road Environments |
Deegan Atha et.al. |
2411.06632 |
null |
2024-11-09 |
Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing |
Kaixuan Lu et.al. |
2411.06091 |
null |
2024-11-08 |
Joint-Optimized Unsupervised Adversarial Domain Adaptation in Remote Sensing Segmentation with Prompted Foundation Model |
Shuchang Lyu et.al. |
2411.05878 |
link |
2024-11-08 |
Revisiting Network Perturbation for Semi-Supervised Semantic Segmentation |
Sien Li et.al. |
2411.05307 |
link |
2024-11-07 |
In the Era of Prompt Learning with Vision-Language Models |
Ankit Jha et.al. |
2411.04892 |
null |
2024-11-11 |
ZAHA: Introducing the Level of Facade Generalization and the Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset |
Olaf Wysocki et.al. |
2411.04865 |
link |
2024-11-06 |
Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model |
Yansong Qu et.al. |
2411.03672 |
null |
2024-11-05 |
Enhancing Weakly Supervised Semantic Segmentation for Fibrosis via Controllable Image Generation |
Zhiling Yue et.al. |
2411.03551 |
null |
2024-11-05 |
SynthSet: Generative Diffusion Model for Semantic Segmentation in Precision Agriculture |
Andrew Heschl et.al. |
2411.03505 |
link |
2024-11-05 |
Rethinking Decoders for Transformer-based Semantic Segmentation: Compression is All You Need |
Qishuai Wen et.al. |
2411.03033 |
link |
2024-11-05 |
Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation |
Xavier Timoneda et.al. |
2411.02969 |
null |
2024-11-05 |
Mapping Africa Settlements: High Resolution Urban and Rural Map by Deep Learning and Satellite Imagery |
Mohammad Kakooei et.al. |
2411.02935 |
link |
2024-11-05 |
CIT: Rethinking Class-incremental Semantic Segmentation with a Class Independent Transformation |
Jinchao Ge et.al. |
2411.02715 |
link |
2024-11-04 |
Deep Learning on 3D Semantic Segmentation: A Detailed Review |
Thodoris Betsas et.al. |
2411.02104 |
null |
2024-11-04 |
Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models |
Sharat Agarwal et.al. |
2411.01925 |
null |
2024-11-04 |
DiffuMask-Editor: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing to Improve Segmentation Ability |
Bo Gao et.al. |
2411.01819 |
null |
2024-11-04 |
Toward Integrating Semantic-aware Path Planning and Reliable Localization for UAV Operations |
Thanh Nguyen Canh et.al. |
2411.01816 |
null |
2024-11-03 |
PreCM: The Padding-based Rotation Equivariant Convolution Mode for Semantic Segmentation |
Xinyu Xu et.al. |
2411.01624 |
null |
2024-11-01 |
Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions |
Lixiao Yang et.al. |
2411.01039 |
null |
2024-11-01 |
Event-guided Low-light Video Semantic Segmentation |
Zhen Yao et.al. |
2411.00639 |
null |
2024-11-01 |
Cross-modal semantic segmentation for indoor environmental perception using single-chip millimeter-wave radar raw data |
Hairuo Hu et.al. |
2411.00499 |
null |
2024-11-01 |
Cityscape-Adverse: Benchmarking Robustness of Semantic Segmentation with Realistic Scene Modifications via Diffusion-Based Image Editing |
Naufal Suryanto et.al. |
2411.00425 |
link |
2024-10-31 |
A Recipe for Geometry-Aware 3D Mesh Transformers |
Mohammad Farazi et.al. |
2411.00164 |
null |
2024-10-31 |
Federated Black-Box Adaptation for Semantic Segmentation |
Jay N. Paranjape et.al. |
2410.24181 |
link |
2024-10-31 |
COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes |
Muhammad Ali et.al. |
2410.24139 |
link |
2024-10-31 |
Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model |
Hao Zhang et.al. |
2410.23905 |
link |
2024-11-04 |
S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving |
Maciej K. Wozniak et.al. |
2410.23085 |
null |
2024-10-31 |
CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation |
Ziyang Gong et.al. |
2410.22629 |
link |
2024-10-29 |
Hyperspectral Imaging-Based Perception in Autonomous Driving Scenarios: Benchmarking Baseline Semantic Segmentation Models |
Imad Ali Shah et.al. |
2410.22101 |
link |
2024-10-29 |
Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation |
Ruihao Xia et.al. |
2410.21708 |
link |
2024-10-28 |
Domain Adaptation with a Single Vision-Language Embedding |
Mohammad Fahes et.al. |
2410.21361 |
null |
2024-10-28 |
IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks |
Manjunath D et.al. |
2410.20953 |
link |
2024-11-01 |
A Framework for Real-Time Volcano-Seismic Event Recognition Based on Multi-Station Seismograms and Semantic Segmentation Models |
Camilo Espinosa-Curilem et.al. |
2410.20595 |
link |
2024-10-27 |
Unlocking Comics: The AI4VA Dataset for Visual Understanding |
Peter Grönquist et.al. |
2410.20459 |
link |
2024-10-27 |
Historical Test-time Prompt Tuning for Vision Foundation Models |
Jingyi Zhang et.al. |
2410.20346 |
null |
2024-10-25 |
OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery |
Philipe Dias et.al. |
2410.19965 |
null |
2024-10-25 |
IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation |
Kaixian Qu et.al. |
2410.19697 |
null |
2024-10-25 |
Fusion-then-Distillation: Toward Cross-modal Positive Distillation for Domain Adaptive 3D Semantic Segmentation |
Yao Wu et.al. |
2410.19446 |
link |
2024-10-25 |
Context-Based Visual-Language Place Recognition |
Soojin Woo et.al. |
2410.19341 |
link |
2024-10-24 |
Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks |
Alexander Jaus et.al. |
2410.18684 |
null |
2024-10-24 |
Unsupervised semantic segmentation of urban high-density multispectral point clouds |
Oona Oinonen et.al. |
2410.18520 |
null |
2024-10-26 |
CARLA2Real: a tool for reducing the sim2real gap in CARLA simulator |
Stefanos Pasios et.al. |
2410.18238 |
link |
2024-10-23 |
Towards Safer Planetary Exploration: A Hybrid Architecture for Terrain Traversability Analysis in Mars Rovers |
Achille Chiuchiarelli et.al. |
2410.17738 |
null |
2024-10-22 |
EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding |
Zhiyi Pan et.al. |
2410.17207 |
null |
2024-10-22 |
SERN: Simulation-Enhanced Realistic Navigation for Multi-Agent Robotic Systems in Contested Environments |
Jumman Hossain et.al. |
2410.16686 |
null |
2024-10-21 |
TIPS: Text-Image Pretraining with Spatial Awareness |
Kevis-Kokitsi Maninis et.al. |
2410.16512 |
null |
2024-10-21 |
GenGMM: Generalized Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation |
Nazanin Moradinasab et.al. |
2410.16485 |
null |
2024-10-21 |
LiOn-XA: Unsupervised Domain Adaptation via LiDAR-Only Cross-Modal Adversarial Training |
Thomas Kreutz et.al. |
2410.15833 |
link |
2024-10-21 |
TALoS: Enhancing Semantic Scene Completion via Test-time Adaptation on the Line of Sight |
Hyun-Kurl Jang et.al. |
2410.15674 |
link |
2024-10-21 |
Deep Learning and Machine Learning -- Object Detection and Semantic Segmentation: From Theory to Applications |
Jintao Ren et.al. |
2410.15584 |
null |
2024-10-22 |
Multi-Layer Feature Fusion with Cross-Channel Attention-Based U-Net for Kidney Tumor Segmentation |
Fnu Neha et.al. |
2410.15472 |
null |
2024-10-18 |
On the Influence of Shape, Texture and Color for Learning Semantic Segmentation |
Annika Mütze et.al. |
2410.14878 |
null |
2024-10-18 |
Automated Road Extraction from Satellite Imagery Integrating Dense Depthwise Dilated Separable Spatial Pyramid Pooling with DeepLabV3+ |
Arpan Mahara et.al. |
2410.14836 |
null |
2024-10-17 |
ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding |
Guangda Ji et.al. |
2410.13924 |
null |
2024-10-22 |
EP-SAM: Weakly Supervised Histopathology Segmentation via Enhanced Prompt with Segment Anything |
Joonhyeon Song et.al. |
2410.13621 |
link |
2024-10-17 |
Day-Night Adaptation: An Innovative Source-free Adaptation Framework for Medical Image Segmentation |
Ziyang Chen et.al. |
2410.13472 |
null |
2024-10-17 |
SiamSeg: Self-Training with Contrastive Learning for Unsupervised Domain Adaptation in Remote Sensing |
Bin Wang et.al. |
2410.13471 |
link |
2024-10-17 |
Railway LiDAR semantic segmentation based on intelligent semi-automated data annotation |
Florian Wulff et.al. |
2410.13383 |
null |
2024-10-17 |
Adversarial Neural Networks in Medical Imaging Advancements and Challenges in Semantic Segmentation |
Houze Liu et.al. |
2410.13099 |
null |
2024-10-16 |
Task Consistent Prototype Learning for Incremental Few-shot Semantic Segmentation |
Wenbo Xu et.al. |
2410.13094 |
null |
2024-10-16 |
Risk Assessment for Autonomous Landing in Urban Environments using Semantic Segmentation |
Jesús Alejandro Loera-Ponce et.al. |
2410.12988 |
null |
2024-10-16 |
VividMed: Vision Language Model with Versatile Visual Grounding for Medicine |
Lingxiao Luo et.al. |
2410.12694 |
link |
2024-10-16 |
Cascade learning in multi-task encoder-decoder networks for concurrent bone segmentation and glenohumeral joint assessment in shoulder CT scans |
Luca Marsilio et.al. |
2410.12641 |
null |
2024-10-17 |
SAM-Guided Masked Token Prediction for 3D Scene Understanding |
Zhimin Chen et.al. |
2410.12158 |
null |
2024-10-15 |
Development and Testing of a Wood Panels Bark Removal Equipment Based on Deep Learning |
Rijun Wang et.al. |
2410.11913 |
null |
2024-10-15 |
RClicks: Realistic Click Simulation for Benchmarking Interactive Segmentation |
Anton Antonov et.al. |
2410.11722 |
link |
2024-10-15 |
InvSeg: Test-Time Prompt Inversion for Semantic Segmentation |
Jiayi Lin et.al. |
2410.11473 |
null |
2024-10-15 |
MANet: Fine-Tuning Segment Anything Model for Multimodal Remote Sensing Semantic Segmentation |
Xianping Ma et.al. |
2410.11160 |
link |
2024-10-14 |
Locality Alignment Improves Vision-Language Models |
Ian Covert et.al. |
2410.11087 |
null |
2024-10-14 |
Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes |
Tim Broedermann et.al. |
2410.10791 |
null |
2024-10-14 |
UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation |
Lihe Yang et.al. |
2410.10777 |
link |
2024-10-14 |
Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation |
Daniel Fusaro et.al. |
2410.10510 |
link |
2024-10-14 |
LKASeg:Remote-Sensing Image Semantic Segmentation with Large Kernel Attention and Full-Scale Skip Connections |
Xuezhi Xiang et.al. |
2410.10433 |
null |
2024-10-14 |
V2M: Visual 2-Dimensional Mamba for Image Representation Learning |
Chengkun Wang et.al. |
2410.10382 |
link |
2024-10-14 |
GlobalMamba: Global Image Serialization for Vision Mamba |
Chengkun Wang et.al. |
2410.10316 |
link |
2024-10-13 |
AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model |
Yuchen Li et.al. |
2410.09714 |
null |
2024-10-12 |
An Expeditious Spatial Mean Radiant Temperature Mapping Framework using Visual SLAM and Semantic Segmentation |
Wei Liang et.al. |
2410.09443 |
null |
2024-10-11 |
Parallel Watershed Partitioning: GPU-Based Hierarchical Image Segmentation |
Varduhi Yeghiazaryan et.al. |
2410.08946 |
null |
2024-10-11 |
Uncertainty Estimation and Out-of-Distribution Detection for LiDAR Scene Semantic Segmentation |
Hanieh Shojaei et.al. |
2410.08687 |
null |
2024-10-11 |
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention |
Nguyen Huu Bao Long et.al. |
2410.08582 |
link |
2024-10-10 |
Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? |
Samir Abou Haidar et.al. |
2410.08365 |
null |
2024-10-10 |
Distribution Guidance Network for Weakly Supervised Point Cloud Semantic Segmentation |
Zhiyi Pan et.al. |
2410.08091 |
null |
2024-10-10 |
3D Vision-Language Gaussian Splatting |
Qucheng Peng et.al. |
2410.07577 |
null |
2024-10-11 |
Bridge the Points: Graph-based Few-shot Segment Anything Semantically |
Anqi Zhang et.al. |
2410.06964 |
link |
2024-10-09 |
Rethinking the Evaluation of Visible and Infrared Image Fusion |
Dayan Guan et.al. |
2410.06811 |
link |
2024-10-10 |
QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model |
Fei Xie et.al. |
2410.06806 |
link |
2024-10-09 |
Transesophageal Echocardiography Generation using Anatomical Models |
Emmanuel Oladokun et.al. |
2410.06781 |
null |
2024-10-09 |
Evaluating the Impact of Point Cloud Colorization on Semantic Segmentation Accuracy |
Qinfeng Zhu et.al. |
2410.06725 |
null |
2024-10-09 |
Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments |
Meng Yu et.al. |
2410.06626 |
null |
2024-10-09 |
Towards Natural Image Matting in the Wild via Real-Scenario Prior |
Ruihao Xia et.al. |
2410.06593 |
link |
2024-10-08 |
Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions |
Mateus Karvat et.al. |
2410.06380 |
null |
2024-10-08 |
Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading |
Fang Gao et.al. |
2410.05762 |
null |
2024-10-08 |
Advancements in Road Lane Mapping: Comparative Fine-Tuning Analysis of Deep Learning-based Semantic Segmentation Methods Using Aerial Imagery |
Xuanchen et.al. |
2410.05717 |
null |
2024-10-08 |
Remote Sensing Image Segmentation Using Vision Mamba and Multi-Scale Multi-Frequency Feature Fusion |
Yice Cao et.al. |
2410.05624 |
null |
2024-10-07 |
Low-Rank Continual Pyramid Vision Transformer: Incrementally Segment Whole-Body Organs in CT with Light-Weighted Adaptation |
Vince Zhu et.al. |
2410.04689 |
null |
2024-10-04 |
SpecSAR-Former: A Lightweight Transformer-based Network for Global LULC Mapping Using Integrated Sentinel-1 and Sentinel-2 |
Hao Yu et.al. |
2410.03962 |
null |
2024-10-04 |
Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features |
Benyuan Meng et.al. |
2410.03558 |
link |
2024-10-04 |
HRVMamba: High-Resolution Visual State Space Model for Dense Prediction |
Hao Zhang et.al. |
2410.03174 |
null |
2024-10-03 |
HiFiSeg: High-Frequency Information Enhanced Polyp Segmentation with Global-Local Vision Transformer |
Jingjing Ren et.al. |
2410.02528 |
null |
2024-10-04 |
Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation |
Muzhi Zhu et.al. |
2410.02369 |
link |
2024-10-03 |
RESSCAL3D++: Joint Acquisition and Semantic Segmentation of 3D Point Clouds |
Remco Royen et.al. |
2410.02323 |
link |
2024-10-03 |
Efficient Semantic Segmentation via Lightweight Multiple-Information Interaction Network |
Yangyang Qiu et.al. |
2410.02224 |
null |
2024-10-03 |
Adapting Segment Anything Model to Melanoma Segmentation in Microscopy Slide Images |
Qingyuan Liu et.al. |
2410.02207 |
null |
2024-10-02 |
SegEarth-OV: Towards Traning-Free Open-Vocabulary Segmentation for Remote Sensing Images |
Kaiyu Li et.al. |
2410.01768 |
link |
2024-10-02 |
One-Shot Robust Imitation Learning for Long-Horizon Visuomotor Tasks from Unsegmented Demonstrations |
Shaokang Wu et.al. |
2410.01630 |
null |
2024-10-02 |
Cognition Transferring and Decoupling for Text-supervised Egocentric Semantic Segmentation |
Zhaofeng Shi et.al. |
2410.01341 |
null |
2024-10-02 |
VectorGraphNET: Graph Attention Networks for Accurate Segmentation of Complex Technical Drawings |
Andrea Carrara et.al. |
2410.01336 |
null |
2024-10-01 |
RobustEMD: Domain Robust Matching for Cross-domain Few-shot Medical Image Segmentation |
Yazhou Zhu et.al. |
2410.01110 |
link |
2024-10-01 |
Semantic Segmentation of Unmanned Aerial Vehicle Remote Sensing Images using SegFormer |
Vlatko Spasev et.al. |
2410.01092 |
null |
2024-10-01 |
Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time |
Chiao-An Yang et.al. |
2410.01083 |
link |
2024-10-01 |
DeepAerialMapper: Deep Learning-based Semi-automatic HD Map Creation for Highly Automated Vehicles |
Robert Krajewski et.al. |
2410.00769 |
link |
2024-10-01 |
Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection |
Pengxi Zeng et.al. |
2410.00582 |
null |
2024-10-01 |
Precise Workcell Sketching from Point Clouds Using an AR Toolbox |
Krzysztof Zieliński et.al. |
2410.00479 |
null |
2024-10-01 |
Deep Multimodal Fusion for Semantic Segmentation of Remote Sensing Earth Observation Data |
Ivica Dimitrovski et.al. |
2410.00469 |
null |
2024-10-01 |
AARK: An Open Toolkit for Autonomous Racing Research |
James Bockman et.al. |
2410.00358 |
null |
2024-09-30 |
Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation |
Aleyna Kütük et.al. |
2410.00266 |
null |
2024-09-30 |
AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation |
Boyu Han et.al. |
2409.20398 |
link |
2024-09-30 |
Leveraging CAM Algorithms for Explaining Medical Semantic Segmentation |
Tillmann Rheude et.al. |
2409.20287 |
link |
2024-09-30 |
Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model |
Fulong Ma et.al. |
2409.20164 |
null |
2024-09-30 |
Segmenting Wood Rot using Computer Vision Models |
Roland Kammerbauer et.al. |
2409.20137 |
null |
2024-09-30 |
Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels |
Heeseong Shin et.al. |
2409.19846 |
null |
2024-09-27 |
Excavating in the Wild: The GOOSE-Ex Dataset for Semantic Segmentation |
Raphael Hagmanns et.al. |
2409.18788 |
null |
2024-09-27 |
Learning from Pattern Completion: Self-supervised Controllable Generation |
Zhiqiang Chen et.al. |
2409.18694 |
link |
2024-10-01 |
Get It For Free: Radar Segmentation without Expert Labels and Its Application in Odometry and Localization |
Siru Li et.al. |
2409.18434 |
null |
2024-09-26 |
Hierarchical End-to-End Autonomous Driving: Integrating BEV Perception with Deep Reinforcement Learning |
Siyi Lu et.al. |
2409.17659 |
null |
2024-09-26 |
Global-Local Medical SAM Adaptor Based on Full Adaption |
Meng Wang et.al. |
2409.17486 |
null |
2024-09-25 |
VL4AD: Vision-Language Models Improve Pixel-wise Anomaly Detection |
Liangyu Zhong et.al. |
2409.17330 |
null |
2024-09-25 |
WasteGAN: Data Augmentation for Robotic Waste Sorting through Generative Adversarial Networks |
Alberto Bacchin et.al. |
2409.16999 |
link |
2024-09-24 |
A novel open-source ultrasound dataset with deep learning benchmarks for spinal cord injury localization and anatomical segmentation |
Avisha Kumar et.al. |
2409.16441 |
link |
2024-09-24 |
Instance Segmentation of Reinforced Concrete Bridges with Synthetic Point Clouds |
Asad Ur Rahman et.al. |
2409.16381 |
null |
2024-09-24 |
Fields of The World: A Machine Learning Benchmark Dataset For Global Agricultural Field Boundary Segmentation |
Hannah Kerner et.al. |
2409.16252 |
link |
2024-09-24 |
Deep Learning for Precision Agriculture: Post-Spraying Evaluation and Deposition Estimation |
Harry Rogers et.al. |
2409.16213 |
link |
2024-09-24 |
Potential Field as Scene Affordance for Behavior Change-Based Visual Risk Object Identification |
Pang-Yuan Pao et.al. |
2409.15846 |
null |
2024-09-24 |
DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation |
Soojin Jang et.al. |
2409.15801 |
null |
2024-09-23 |
ZeroSCD: Zero-Shot Street Scene Change Detection |
Shyam Sundar Kannan et.al. |
2409.15255 |
null |
2024-09-27 |
Diffusion-based RGB-D Semantic Segmentation with Deformable Attention Transformer |
Minh Bui et.al. |
2409.15117 |
null |
2024-09-23 |
The BRAVO Semantic Segmentation Challenge Results in UNCV2024 |
Tuan-Hung Vu et.al. |
2409.15107 |
link |
2024-09-21 |
MOSE: Monocular Semantic Reconstruction Using NeRF-Lifted Noisy Priors |
Zhenhua Du et.al. |
2409.14019 |
null |
2024-09-21 |
Enhanced Semantic Segmentation for Large-Scale and Imbalanced Point Clouds |
Haoran Gong et.al. |
2409.13983 |
null |
2024-09-21 |
CUS3D :CLIP-based Unsupervised 3D Segmentation via Object-level Denoise |
Fuyang Yu et.al. |
2409.13982 |
null |
2024-09-20 |
Efficient Domain Augmentation for Autonomous Driving Testing Using Diffusion Models |
Luciano Baresi et.al. |
2409.13661 |
null |
2024-09-20 |
Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-Tuning |
Daniele Rege Cambrin et.al. |
2409.13641 |
link |
2024-09-20 |
Towards Semi-supervised Dual-modal Semantic Segmentation |
Qiulei Dong et.al. |
2409.13325 |
null |
2024-09-19 |
Automated Linear Disturbance Mapping via Semantic Segmentation of Sentinel-2 Imagery |
Andrew M. Nagel et.al. |
2409.12817 |
null |
2024-09-20 |
Autonomous Visual Fish Pen Inspections for Estimating the State of Biofouling Buildup Using ROV -- Extended Abstract |
Matej Fabijanić et.al. |
2409.12813 |
null |
2024-09-17 |
Uncertainty and Prediction Quality Estimation for Semantic Segmentation via Graph Neural Networks |
Edgar Heinert et.al. |
2409.11373 |
link |
2024-09-17 |
MSDNet: Multi-Scale Decoder for Few-Shot Semantic Segmentation via Transformer-Guided Prototyping |
Amirreza Fateh et.al. |
2409.11316 |
link |
2024-09-17 |
Generalized Few-Shot Semantic Segmentation in Remote Sensing: Challenge and Benchmark |
Clifford Broni-Bediako et.al. |
2409.11227 |
link |
2024-09-17 |
HS3-Bench: A Benchmark and Strong Baseline for Hyperspectral Semantic Segmentation in Driving Scenarios |
Nick Theisen et.al. |
2409.11205 |
link |
2024-09-16 |
Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning |
Amin Karimi Monsefi et.al. |
2409.10362 |
null |
2024-09-16 |
BAFNet: Bilateral Attention Fusion Network for Lightweight Semantic Segmentation of Urban Remote Sensing Images |
Wentao Wang et.al. |
2409.10269 |
null |
2024-09-15 |
Semantic2D: A Semantic Dataset for 2D Lidar Semantic Segmentation |
Zhanteng Xie et.al. |
2409.09899 |
null |
2024-09-15 |
Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation |
Qilong Zhangli et.al. |
2409.09893 |
null |
2024-09-15 |
High Definition Map Mapping and Update: A General Overview and Future Directions |
Benny Wijaya et.al. |
2409.09726 |
null |
2024-09-14 |
Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation |
Hugo Porta et.al. |
2409.09497 |
null |
2024-09-13 |
AWF: Adaptive Weight Fusion for Enhanced Class Incremental Semantic Segmentation |
Zechao Sun et.al. |
2409.08516 |
null |
2024-09-13 |
VistaFormer: Scalable Vision Transformers for Satellite Image Time Series Segmentation |
Ezra MacDonald et.al. |
2409.08461 |
link |
2024-09-12 |
Bayesian Self-Training for Semi-Supervised 3D Segmentation |
Ozan Unal et.al. |
2409.08102 |
null |
2024-09-12 |
Depth Matters: Exploring Deep Interactions of RGB-D for Semantic Segmentation in Traffic Scenes |
Siyu Chen et.al. |
2409.07995 |
null |
2024-09-12 |
SURGIVID: Annotation-Efficient Surgical Video Object Discovery |
Çağhan Köksal et.al. |
2409.07801 |
null |
2024-09-12 |
Lagrange Duality and Compound Multi-Attention Transformer for Semi-Supervised Medical Image Segmentation |
Fuchen Zheng et.al. |
2409.07793 |
link |
2024-09-12 |
ASSNet: Adaptive Semantic Segmentation Network for Microtumors and Multi-Organ Segmentation |
Fuchen Zheng et.al. |
2409.07779 |
link |
2024-09-12 |
Open-Vocabulary Remote Sensing Image Semantic Segmentation |
Qinglong Cao et.al. |
2409.07683 |
link |
2024-09-11 |
Token Turing Machines are Efficient Vision Models |
Purvish Jajal et.al. |
2409.07613 |
null |
2024-09-11 |
AC-IND: Sparse CT reconstruction based on attenuation coefficient estimation and implicit neural distribution |
Wangduo Xie et.al. |
2409.07171 |
null |
2024-09-11 |
Brain-Inspired Stepwise Patch Merging for Vision Transformers |
Yonghao Yu et.al. |
2409.06963 |
null |
2024-09-10 |
Cross-Modal Self-Supervised Learning with Effective Contrastive Units for LiDAR Point Clouds |
Mu Cai et.al. |
2409.06827 |
link |
2024-09-10 |
PPMamba: A Pyramid Pooling Local Auxiliary SSM-Based Model for Remote Sensing Image Semantic Segmentation |
Yin Hu et.al. |
2409.06309 |
null |
2024-09-10 |
EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation |
Nischal Khanal et.al. |
2409.06183 |
link |
2024-09-09 |
SVS-GAN: Leveraging GANs for Semantic Video Synthesis |
Khaled M. Seyam et.al. |
2409.06074 |
null |
2024-09-12 |
Enhanced Generative Data Augmentation for Semantic Segmentation via Stronger Guidance |
Quang-Huy Che et.al. |
2409.06002 |
null |
2024-09-09 |
Segmentation by Factorization: Unsupervised Semantic Segmentation for Pathology by Factorizing Foundation Model Features |
Jacob Gildenblat et.al. |
2409.05697 |
null |
2024-09-09 |
ICPR 2024 Competition on Safe Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather Conditions |
Furqan Ahmed Shaik et.al. |
2409.05327 |
null |
2024-09-08 |
RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network |
Zhiwei Lin et.al. |
2409.04979 |
null |
2024-09-06 |
Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation |
Björn Michele et.al. |
2409.04409 |
link |
2024-09-05 |
Foundation Model or Finetune? Evaluation of few-shot semantic segmentation for river pollution |
Marga Don et.al. |
2409.03754 |
link |
2024-09-05 |
LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones |
Moritz Nottebaum et.al. |
2409.03460 |
link |
2024-09-05 |
Training-free Conversion of Pretrained ANNs to SNNs for Low-Power and High-Performance Applications |
Tong Bu et.al. |
2409.03368 |
null |
2024-09-05 |
UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking |
Md. Mahfuzur Rahman et.al. |
2409.03245 |
null |
2024-09-05 |
Labeled-to-Unlabeled Distribution Alignment for Partially-Supervised Multi-Organ Medical Image Segmentation |
Xixi Jiang et.al. |
2409.03228 |
link |
2024-09-06 |
iSeg: An Iterative Refinement-based Framework for Training-free Segmentation |
Lin Sun et.al. |
2409.03209 |
link |
2024-09-04 |
iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation |
Hayeon Jo et.al. |
2409.02838 |
null |
2024-09-04 |
CLDA: Collaborative Learning for Enhanced Unsupervised Domain Adaptation |
Minhee Cho et.al. |
2409.02699 |
null |
2024-09-04 |
SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction |
Sumin Son et.al. |
2409.02513 |
null |
2024-09-03 |
K-Origins: Better Colour Quantification for Neural Networks |
Lewis Mason et.al. |
2409.02281 |
link |
2024-09-03 |
AllWeatherNet:Unified Image enhancement for autonomous driving under adverse weather and lowlight-conditions |
Chenghao Qian et.al. |
2409.02045 |
link |
2024-09-03 |
Segmenting Object Affordances: Reproducibility and Sensitivity to Scale |
Tommaso Apicella et.al. |
2409.01814 |
link |
2024-09-03 |
Efficiently Expanding Receptive Fields: Local Split Attention and Parallel Aggregation for Enhanced Large-scale Point Cloud Semantic Segmentation |
Haodong Wang et.al. |
2409.01662 |
null |
2024-09-02 |
SOOD-ImageNet: a Large-Scale Dataset for Semantic Out-Of-Distribution Image Classification and Semantic Segmentation |
Alberto Bacchin et.al. |
2409.01109 |
link |
2024-09-02 |
From Bird's-Eye to Street View: Crafting Diverse and Condition-Aligned Images with Latent Diffusion Model |
Xiaojie Xu et.al. |
2409.01014 |
null |
2024-09-02 |
SeCo-INR: Semantically Conditioned Implicit Neural Representations for Improved Medical Image Super-Resolution |
Mevan Ekanayake et.al. |
2409.01013 |
null |
2024-09-02 |
IVGF: The Fusion-Guided Infrared and Visible General Framework |
Fangcen Liu et.al. |
2409.00973 |
null |
2024-09-01 |
Image-to-Lidar Relational Distillation for Autonomous Driving Data |
Anas Mahmoud et.al. |
2409.00845 |
null |
2024-09-01 |
Change-Aware Siamese Network for Surface Defects Segmentation under Complex Background |
Biyuan Liu et.al. |
2409.00589 |
link |
2024-08-31 |
Plant detection from ultra high resolution remote sensing images: A Semantic Segmentation approach based on fuzzy loss |
Shivam Pande et.al. |
2409.00513 |
null |
2024-08-30 |
Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes |
Li Zhang et.al. |
2408.17421 |
link |
2024-08-30 |
Structuring a Training Strategy to Robustify Perception Models with Realistic Image Augmentations |
Ahmed Hammam et.al. |
2408.17311 |
null |
2024-08-30 |
Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training |
Zizheng Huang et.al. |
2408.17081 |
link |
2024-08-30 |
Transient Fault Tolerant Semantic Segmentation for Autonomous Driving |
Leonardo Iurada et.al. |
2408.16952 |
link |
2024-08-29 |
SODAWideNet++: Combining Attention and Convolutions for Salient Object Detection |
Rohit Venkata Sai Dulam et.al. |
2408.16645 |
link |
2024-08-29 |
Multi-source Domain Adaptation for Panoramic Semantic Segmentation |
Jing Jiang et.al. |
2408.16469 |
link |
2024-08-29 |
EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More |
Kanghao Chen et.al. |
2408.16254 |
null |
2024-08-28 |
SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors |
Zhiqing Zhang et.al. |
2408.15887 |
null |
2024-08-28 |
DQFormer: Towards Unified LiDAR Panoptic Segmentation with Decoupled Queries |
Yu Yang et.al. |
2408.15813 |
null |
2024-08-28 |
TeFF: Tracking-enhanced Forgetting-free Few-shot 3D LiDAR Semantic Segmentation |
Junbao Zhou et.al. |
2408.15657 |
link |
2024-08-27 |
Handling Geometric Domain Shifts in Semantic Segmentation of Surgical RGB and Hyperspectral Images |
Silvia Seidlitz et.al. |
2408.15373 |
link |
2024-08-27 |
An Investigation on The Position Encoding in Vision-Based Dynamics Prediction |
Jiageng Zhu et.al. |
2408.15201 |
null |
2024-08-27 |
Applying ViT in Generalized Few-shot Semantic Segmentation |
Liyuan Geng et.al. |
2408.14957 |
link |
2024-08-27 |
Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack |
Naufal Suryanto et.al. |
2408.14879 |
link |
2024-08-27 |
MROVSeg: Breaking the Resolution Curse of Vision-Language Models in Open-Vocabulary Semantic Segmentation |
Yuanbing Zhu et.al. |
2408.14776 |
null |
2024-08-26 |
Physically Feasible Semantic Segmentation |
Shamik Basu et.al. |
2408.14672 |
link |
2024-08-25 |
OpenNav: Efficient Open Vocabulary 3D Object Detection for Smart Wheelchair Navigation |
Muhammad Rameez ur Rahman et.al. |
2408.13936 |
link |
2024-08-25 |
Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation |
Yuwen Pan et.al. |
2408.13838 |
null |
2024-08-25 |
TripleMixer: A 3D Point Cloud Denoising Model for Adverse Weather |
Xiongwei Zhao et.al. |
2408.13802 |
link |
2024-08-25 |
ICFRNet: Image Complexity Prior Guided Feature Refinement for Real-time Semantic Segmentation |
Xin Zhang et.al. |
2408.13771 |
null |
2024-08-25 |
Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation |
Zhaoyang Li et.al. |
2408.13752 |
null |
2024-08-24 |
ESA: Annotation-Efficient Active Learning for Semantic Segmentation |
Jinchao Ge et.al. |
2408.13491 |
link |
2024-08-23 |
Accuracy Improvement of Cell Image Segmentation Using Feedback Former |
Hinako Mitsuoka et.al. |
2408.12974 |
null |
2024-08-23 |
Image Segmentation in Foundation Model Era: A Survey |
Tianfei Zhou et.al. |
2408.12957 |
link |
2024-08-23 |
Symmetric masking strategy enhances the performance of Masked Image Modeling |
Khanh-Binh Nguyen et.al. |
2408.12772 |
null |
2024-08-22 |
Scribbles for All: Benchmarking Scribble Supervised Segmentation Across Datasets |
Wolfgang Boettcher et.al. |
2408.12489 |
link |
2024-08-22 |
The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation |
Tuyen Tran et.al. |
2408.12447 |
null |
2024-08-26 |
UNetMamba: An Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images |
Enze Zhu et.al. |
2408.11545 |
link |
2024-08-21 |
Exploring Scene Coherence for Semi-Supervised 3D Semantic Segmentation |
Chuandong Liu et.al. |
2408.11280 |
null |
2024-08-20 |
NeCo: Improving DINOv2's spatial representations in 19 GPU hours with Patch Neighbor Consistency |
Valentinos Pariza et.al. |
2408.11054 |
null |
2024-08-20 |
CO2Wounds-V2: Extended Chronic Wounds Dataset From Leprosy Patients |
Karen Sanchez et.al. |
2408.10827 |
link |
2024-08-20 |
Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended? |
Chen Liang et.al. |
2408.10627 |
null |
2024-08-20 |
Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation |
Jiawei Han et.al. |
2408.10537 |
link |
2024-08-19 |
Imbalance-Aware Culvert-Sewer Defect Segmentation Using an Enhanced Feature Pyramid Network |
Rasha Alshawi et.al. |
2408.10181 |
null |
2024-08-19 |
Dynamic Label Injection for Imbalanced Industrial Defect Segmentation |
Emanuele Caruso et.al. |
2408.10031 |
link |
2024-08-19 |
Detecting Adversarial Attacks in Semantic Segmentation via Uncertainty Estimation: A Deep Analysis |
Kira Maag et.al. |
2408.10021 |
null |
2024-08-19 |
Segment-Anything Models Achieve Zero-shot Robustness in Autonomous Driving |
Jun Yan et.al. |
2408.09839 |
link |
2024-08-18 |
OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras |
Muhammad Rameez Ur Rahman et.al. |
2408.09424 |
link |
2024-08-18 |
Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion and Cross-task Collaboration |
Hao Ai et.al. |
2408.09336 |
null |
2024-08-17 |
Cross-Species Data Integration for Enhanced Layer Segmentation in Kidney Pathology |
Junchao Zhu et.al. |
2408.09278 |
link |
2024-08-17 |
GoodSAM++: Bridging Domain and Capacity Gaps via Segment Anything Model for Panoramic Semantic Segmentation |
Weiming Zhang et.al. |
2408.09115 |
null |
2024-08-17 |
Depth-guided Texture Diffusion for Image Semantic Segmentation |
Wei Sun et.al. |
2408.09097 |
null |
2024-08-15 |
5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks |
Dongshuo Yin et.al. |
2408.08345 |
link |
2024-08-14 |
MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis |
Nimeesha Chan et.al. |
2408.07773 |
link |
2024-08-15 |
MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation |
Beoungwoo Kang et.al. |
2408.07576 |
link |
2024-08-19 |
MagicFace: Training-free Universal-Style Human Image Customized Synthesis |
Yibin Wang et.al. |
2408.07433 |
null |
2024-08-14 |
Segment Using Just One Example |
Pratik Vora et.al. |
2408.07393 |
null |
2024-08-14 |
Ensemble architecture in polyp segmentation |
Hao-Yun Hsu et.al. |
2408.07262 |
link |
2024-08-14 |
Leveraging Perceptual Scores for Dataset Pruning in Computer Vision Tasks |
Raghavendra Singh et.al. |
2408.07243 |
null |
2024-08-14 |
Enhancing Autonomous Vehicle Perception in Adverse Weather through Image Augmentation during Semantic Segmentation Training |
Ethan Kou et.al. |
2408.07239 |
link |
2024-08-13 |
ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation |
Jingyun Wang et.al. |
2408.06747 |
link |
2024-08-10 |
Dilated Convolution with Learnable Spacings |
Ismail Khalfaoui-Hassani et.al. |
2408.06383 |
null |
2024-08-12 |
Correlation Weighted Prototype-based Self-Supervised One-Shot Segmentation of Medical Images |
Siladittya Manna et.al. |
2408.06235 |
null |
2024-08-12 |
A-BDD: Leveraging Data Augmentations for Safe Autonomous Driving in Adverse Weather and Lighting |
Felix Assion et.al. |
2408.06071 |
null |
2024-08-12 |
Enhancing 3D Transformer Segmentation Model for Medical Image with Token-level Representation Learning |
Xinrong Hu et.al. |
2408.05889 |
link |
2024-08-11 |
Seg-CycleGAN : SAR-to-optical image translation guided by a downstream task |
Hannuo Zhang et.al. |
2408.05777 |
null |
2024-08-11 |
MacFormer: Semantic Segmentation with Fine Object Boundaries |
Guoan Xu et.al. |
2408.05699 |
null |
2024-08-10 |
Multimodal generative semantic communication based on latent diffusion model |
Weiqi Fu et.al. |
2408.05455 |
null |
2024-08-09 |
In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation |
Dahyun Kang et.al. |
2408.04961 |
link |
2024-08-09 |
ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation |
Mengcheng Lan et.al. |
2408.04883 |
link |
2024-08-09 |
Extracting Signal Electron Trajectories in the COMET Phase-I Cylindrical Drift Chamber Using Deep Learning |
Fumihiro Kaneko et.al. |
2408.04795 |
null |
2024-08-08 |
SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation |
Jieming Yu et.al. |
2408.04593 |
null |
2024-08-08 |
SegXAL: Explainable Active Learning for Semantic Segmentation in Driving Scene Scenarios |
Sriram Mandalika et.al. |
2408.04482 |
null |
2024-08-08 |
What could go wrong? Discovering and describing failure modes in computer vision |
Gabriela Csurka et.al. |
2408.04471 |
null |
2024-08-07 |
CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications |
Tianfang Zhang et.al. |
2408.03703 |
link |
2024-08-07 |
SAM2-PATH: A better segment anything model for semantic segmentation in digital pathology |
Mingya Zhang et.al. |
2408.03651 |
link |
2024-08-06 |
Post-Mortem Human Iris Segmentation Analysis with Deep Learning |
Afzal Hossain et.al. |
2408.03448 |
null |
2024-08-06 |
Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression |
Jonas Schmitt et.al. |
2408.03046 |
link |
2024-08-05 |
Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs |
Jeongkee Lim et.al. |
2408.02261 |
link |
2024-08-05 |
Curriculum learning based pre-training using Multi-Modal Contrastive Masked Autoencoders |
Muhammad Abdullah Jamal et.al. |
2408.02245 |
null |
2024-08-04 |
Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation |
Ye Du et.al. |
2408.02039 |
null |
2024-08-03 |
Bayesian Active Learning for Semantic Segmentation |
Sima Didari et.al. |
2408.01694 |
null |
2024-08-03 |
A Comparative Analysis of CNN-based Deep Learning Models for Landslide Detection |
Omkar Oak et.al. |
2408.01692 |
null |
2024-08-03 |
Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation |
Balázs Opra et.al. |
2408.01640 |
null |
2024-08-02 |
Balanced Residual Distillation Learning for 3D Point Cloud Class-Incremental Semantic Segmentation |
Yuanzhi Su et.al. |
2408.01356 |
null |
2024-08-02 |
StitchFusion: Weaving Any Visual Modalities to Enhance Multimodal Semantic Segmentation |
Bingyu Li et.al. |
2408.01343 |
null |
2024-08-02 |
Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach |
Yabin Zhu et.al. |
2408.00969 |
link |
2024-08-01 |
Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation |
Siyu Jiao et.al. |
2408.00744 |
link |
2024-08-01 |
Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function |
Matias Oscar Volman Stern et.al. |
2408.00707 |
null |
2024-08-01 |
AMAES: Augmented Masked Autoencoder Pretraining on Public Brain MRI Data for 3D-Native Segmentation |
Asbjørn Munk et.al. |
2408.00640 |
link |
2024-08-01 |
SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation |
Shengbo Tan et.al. |
2408.00496 |
link |
2024-07-31 |
Open-Vocabulary Audio-Visual Semantic Segmentation |
Ruohao Guo et.al. |
2407.21721 |
null |
2024-07-31 |
MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment |
Anurag Das et.al. |
2407.21654 |
null |
2024-07-31 |
Small Object Few-shot Segmentation for Vision-based Industrial Inspection |
Zilong Zhang et.al. |
2407.21351 |
link |
2024-07-31 |
On-the-fly Point Feature Representation for Point Clouds Analysis |
Jiangyi Wang et.al. |
2407.21335 |
null |
2024-07-31 |
Fine-grained Metrics for Point Cloud Semantic Segmentation |
Zhuheng Lu et.al. |
2407.21289 |
null |
2024-07-30 |
PLANesT-3D: A new annotated dataset for segmentation of 3D plant point clouds |
Kerem Mertoğlu et.al. |
2407.21150 |
null |
2024-07-30 |
Learning Ordinality in Semantic Segmentation |
Rafael Cristino et.al. |
2407.20959 |
null |
2024-07-29 |
Improving 2D Feature Representations by 3D-Aware Fine-Tuning |
Yuanwen Yue et.al. |
2407.20229 |
null |
2024-07-29 |
Background Semantics Matter: Cross-Task Feature Exchange Network for Clustered Infrared Small Target Detection With Sky-Annotated Dataset |
Yimian Dai et.al. |
2407.20078 |
link |
2024-07-29 |
Language-driven Grasp Detection with Mask-guided Attention |
Tuan Van Vo et.al. |
2407.19877 |
null |
2024-07-29 |
Rethinking RGB-D Fusion for Semantic Segmentation in Surgical Datasets |
Muhammad Abdullah Jamal et.al. |
2407.19714 |
null |
2024-07-29 |
ALEN: A Dual-Approach for Uniform and Non-Uniform Low-Light Image Enhancement |
Ezequiel Perez-Zarate et.al. |
2407.19708 |
link |
2024-07-28 |
ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding |
Zhen Chen et.al. |
2407.19435 |
link |
2024-07-27 |
Ensembling convolutional neural networks for human skin segmentation |
Patryk Kuban et.al. |
2407.19310 |
null |
2024-07-27 |
Sewer Image Super-Resolution with Depth Priors and Its Lightweight Network |
Gang Pan et.al. |
2407.19271 |
null |
2024-07-26 |
Sparse Refinement for Efficient High-Resolution Semantic Segmentation |
Zhijian Liu et.al. |
2407.19014 |
null |
2024-07-29 |
Learning Spectral-Decomposed Tokens for Domain Generalized Semantic Segmentation |
Jingjun Yi et.al. |
2407.18568 |
null |
2024-07-25 |
Taxonomy-Aware Continual Semantic Segmentation in Hyperbolic Spaces for Open-World Perception |
Julia Hindel et.al. |
2407.18145 |
null |
2024-07-25 |
TiCoSS: Tightening the Coupling between Semantic Segmentation and Stereo Matching within A Joint Learning Framework |
Guanfeng Tang et.al. |
2407.18038 |
null |
2024-07-25 |
Segmentation-guided MRI reconstruction for meaningfully diverse reconstructions |
Jan Nikolas Morshuis et.al. |
2407.18026 |
link |
2024-07-24 |
Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation |
Hyunwoo Yu et.al. |
2407.17261 |
link |
2024-07-25 |
Enhancing Environmental Monitoring through Multispectral Imaging: The WasteMS Dataset for Semantic Segmentation of Lakeside Waste |
Qinfeng Zhu et.al. |
2407.17028 |
link |
2024-07-24 |
Progressive Query Refinement Framework for Bird's-Eye-View Semantic Segmentation from Surrounding Images |
Dooseop Choi et.al. |
2407.17003 |
link |
2024-07-23 |
Deformable Convolution Based Road Scene Semantic Segmentation of Fisheye Images in Autonomous Driving |
Anam Manzoor et.al. |
2407.16647 |
null |
2024-07-23 |
Deep Bayesian segmentation for colon polyps: Well-calibrated predictions in medical imaging |
Daniela L. Ramos et.al. |
2407.16608 |
link |
2024-07-23 |
Augmented Efficiency: Reducing Memory Footprint and Accelerating Inference for 3D Semantic Segmentation through Hybrid Vision |
Aditya Krishnan et.al. |
2407.16102 |
null |
2024-07-22 |
Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond |
Silvio Galesso et.al. |
2407.15739 |
link |
2024-07-22 |
MSSPlace: Multi-Sensor Place Recognition with Visual and Text Semantics |
Alexander Melekhin et.al. |
2407.15663 |
link |
2024-07-22 |
Learning at a Glance: Towards Interpretable Data-limited Continual Semantic Segmentation via Semantic-Invariance Modelling |
Bo Yuan et.al. |
2407.15429 |
link |
2024-07-22 |
Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data |
Junha Song et.al. |
2407.15383 |
link |
2024-07-21 |
Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation |
Xiaoyang Wu et.al. |
2407.15282 |
null |
2024-07-20 |
Downstream-Pretext Domain Knowledge Traceback for Active Learning |
Beichen Zhang et.al. |
2407.14720 |
null |
2024-07-19 |
Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model |
Kun Zhao et.al. |
2407.14326 |
null |
2024-07-19 |
Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation |
Zhengyuan Xie et.al. |
2407.14142 |
link |
2024-07-19 |
GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV Segmentation |
Florian Chabot et.al. |
2407.14108 |
null |
2024-07-18 |
Many Perception Tasks are Highly Redundant Functions of their Input Data |
Rahul Ramesh et.al. |
2407.13841 |
null |
2024-07-18 |
GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model |
Abdelrahman Shaker et.al. |
2407.13772 |
link |
2024-07-18 |
SegPoint: Segment Any Point Cloud via Large Language Model |
Shuting He et.al. |
2407.13761 |
null |
2024-07-23 |
MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis |
Ziming Zhong et.al. |
2407.13675 |
link |
2024-07-18 |
Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models |
Xiaoyu Zhu et.al. |
2407.13642 |
null |
2024-07-18 |
FADE: A Task-Agnostic Upsampling Operator for Encoder-Decoder Architectures |
Hao Lu et.al. |
2407.13500 |
link |
2024-07-18 |
FREST: Feature RESToration for Semantic Segmentation under Multiple Adverse Conditions |
Sohyun Lee et.al. |
2407.13437 |
null |
2024-07-18 |
Learning from the Web: Language Drives Weakly-Supervised Incremental Learning for Semantic Segmentation |
Chang Liu et.al. |
2407.13363 |
link |
2024-07-18 |
Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation |
Shoumeng Qiu et.al. |
2407.13254 |
link |
2024-07-18 |
OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird's-eye-view Vehicle Semantic Segmentation |
Jian Sun et.al. |
2407.13137 |
null |
2024-07-18 |
Tree semantic segmentation from aerial image time series |
Venkatesh Ramesh et.al. |
2407.13102 |
null |
2024-07-17 |
ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders |
Carlos Hinojosa et.al. |
2407.13036 |
null |
2024-07-17 |
Weighting Pseudo-Labels via High-Activation Feature Index Similarity and Object Detection for Semi-Supervised Segmentation |
Prantik Howlader et.al. |
2407.12630 |
link |
2024-07-17 |
Instance-wise Uncertainty for Class Imbalance in Semantic Segmentation |
Luís Almeida et.al. |
2407.12609 |
null |
2024-07-18 |
Benchmarking Robust Self-Supervised Learning Across Diverse Downstream Tasks |
Antoni Kowalczuk et.al. |
2407.12588 |
link |
2024-07-17 |
Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation |
Ruijie Xu et.al. |
2407.12489 |
link |
2024-07-17 |
Progressive Proxy Anchor Propagation for Unsupervised Semantic Segmentation |
Hyun Seok Seong et.al. |
2407.12463 |
link |
2024-07-17 |
ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference |
Mengcheng Lan et.al. |
2407.12442 |
null |
2024-07-17 |
Serialized Point Mamba: A Serialized Point Cloud Mamba Segmentation Model |
Tao Wang et.al. |
2407.12319 |
null |
2024-07-16 |
FoodMem: Near Real-time and Precise Food Video Segmentation |
Ahmad AlMughrabi et.al. |
2407.12121 |
null |
2024-07-16 |
Mitigating Background Shift in Class-Incremental Semantic Segmentation |
Gilhan Park et.al. |
2407.11859 |
link |
2024-07-16 |
Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation |
Juncheng Ma et.al. |
2407.11820 |
link |
2024-07-16 |
XEdgeAI: A Human-centered Industrial Inspection Framework with Data-centric Explainable Edge AI Approach |
Truong Thanh Hung Nguyen et.al. |
2407.11771 |
link |
2024-07-16 |
OAM-TCD: A globally diverse dataset of high-resolution tree cover maps |
Josh Veitch-Michaelis et.al. |
2407.11743 |
link |
2024-07-16 |
SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds |
Yanbo Wang et.al. |
2407.11569 |
link |
2024-07-16 |
Leveraging Segment Anything Model in Identifying Buildings within Refugee Camps (SAM4Refugee) from Satellite Imagery for Humanitarian Operations |
Yunya Gao et.al. |
2407.11381 |
link |
2024-07-16 |
Learning Modality-agnostic Representation for Semantic Segmentation from Any Modalities |
Xu Zheng et.al. |
2407.11351 |
null |
2024-07-16 |
Centering the Value of Every Modality: Towards Efficient and Resilient Modality-agnostic Semantic Segmentation |
Xu Zheng et.al. |
2407.11344 |
null |
2024-07-16 |
TCFormer: Visual Recognition via Token Clustering Transformer |
Wang Zeng et.al. |
2407.11321 |
link |
2024-07-15 |
Distributed Semantic Segmentation with Efficient Joint Source and Task Decoding |
Danish Nazir et.al. |
2407.11224 |
null |
2024-07-15 |
Finding Meaning in Points: Weakly Supervised Semantic Segmentation for Event Cameras |
Hoonhee Cho et.al. |
2407.11216 |
link |
2024-07-15 |
No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations |
Walter Simoncini et.al. |
2407.10964 |
link |
2024-07-15 |
APC: Adaptive Patch Contrast for Weakly Supervised Semantic Segmentation |
Wangyu Wu et.al. |
2407.10649 |
null |
2024-07-15 |
Automated Label Unification for Multi-Dataset Semantic Segmentation with GNNs |
Rong Ma et.al. |
2407.10534 |
null |
2024-07-14 |
Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data |
Tuo Feng et.al. |
2407.10200 |
link |
2024-07-14 |
RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation |
Li Li et.al. |
2407.10159 |
link |
2024-07-14 |
HSFusion: A high-level vision task-driven infrared and visible image fusion network via semantic and geometric domain transformation |
Chengjie Jiang et.al. |
2407.10047 |
null |
2024-07-13 |
Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation |
Anqi Zhang et.al. |
2407.09838 |
null |
2024-07-13 |
3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance |
Xiaoxu Xu et.al. |
2407.09826 |
link |
2024-07-13 |
TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation |
Xiaopei Wu et.al. |
2407.09751 |
link |
2024-07-12 |
Uplifting Range-View-based 3D Semantic Segmentation in Real-Time with Multi-Sensor Fusion |
Shiqi Tan et.al. |
2407.09697 |
null |
2024-07-12 |
SPIN: Hierarchical Segmentation with Subpart Granularity in Natural Images |
Josh Myers-Dean et.al. |
2407.09686 |
null |
2024-07-12 |
FANet: Feature Amplification Network for Semantic Segmentation in Cluttered Background |
Muhammad Ali et.al. |
2407.09379 |
link |
2024-07-12 |
Salt & Pepper Heatmaps: Diffusion-informed Landmark Detection Strategy |
Julian Wyatt et.al. |
2407.09192 |
null |
2024-07-12 |
Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off |
Levente Halmosi et.al. |
2407.09150 |
link |
2024-07-12 |
Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation |
Wei Cong et.al. |
2407.09047 |
null |
2024-07-12 |
Textual Query-Driven Mask Transformer for Domain Generalized Segmentation |
Byeonghyun Pak et.al. |
2407.09033 |
link |
2024-07-12 |
Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation |
Zihao Li et.al. |
2407.08994 |
null |
2024-07-11 |
Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation |
Tong Shao et.al. |
2407.08268 |
link |
2024-07-11 |
Enrich the content of the image Using Context-Aware Copy Paste |
Qiushi Guo et.al. |
2407.08151 |
null |
2024-07-10 |
MambaVision: A Hybrid Mamba-Transformer Vision Backbone |
Ali Hatamizadeh et.al. |
2407.08083 |
link |
2024-07-10 |
Satellite Image Time Series Semantic Change Detection: Novel Architecture and Analysis of Domain Shift |
Elliot Vincent et.al. |
2407.07616 |
link |
2024-07-10 |
H-FCBFormer Hierarchical Fully Convolutional Branch Transformer for Occlusal Contact Segmentation with Articulating Paper |
Ryan Banks et.al. |
2407.07604 |
link |
2024-07-11 |
Trainable Highly-expressive Activation Functions |
Irit Chelly et.al. |
2407.07564 |
link |
2024-07-10 |
Deformable-Heatmap-Segmentation for Automobile Visual Perception |
Hongyu Jin et.al. |
2407.07493 |
null |
2024-07-10 |
Exploring the Untouched Sweeps for Conflict-Aware 3D Segmentation Pretraining |
Tianfang Sun et.al. |
2407.07465 |
null |
2024-07-11 |
HAFormer: Unleashing the Power of Hierarchy-Aware Features for Lightweight Semantic Segmentation |
Guoan Xu et.al. |
2407.07441 |
null |
2024-07-09 |
ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation |
Yuyuan Liu et.al. |
2407.07171 |
link |
2024-07-08 |
Training-free CryoET Tomogram Segmentation |
Yizhou Zhao et.al. |
2407.06833 |
link |
2024-07-09 |
CycleSAM: One-Shot Surgical Scene Segmentation using Cycle-Consistent Feature Matching to Prompt SAM |
Aditya Murali et.al. |
2407.06795 |
null |
2024-07-09 |
LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset based on Muti-sensor for Autonomous Exploration |
Jiayi Liu et.al. |
2407.06512 |
link |
2024-07-08 |
Leveraging image captions for selective whole slide image annotation |
Jingna Qiu et.al. |
2407.06363 |
link |
2024-07-08 |
Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots |
Siva Krishna Ravipati et.al. |
2407.06077 |
link |
2024-07-08 |
Test-time adaptation for geospatial point cloud semantic segmentation with distinct domain shifts |
Puzuo Wang et.al. |
2407.06043 |
null |
2024-07-08 |
RHRSegNet: Relighting High-Resolution Night-Time Semantic Segmentation |
Sarah Elmahdy et.al. |
2407.06016 |
link |
2024-07-07 |
Semantic Segmentation for Real-World and Synthetic Vehicle's Forward-Facing Camera Images |
Tuan T. Nguyen et.al. |
2407.05452 |
null |
2024-07-07 |
Self-supervised Learning via Cluster Distance Prediction for Operating Room Context Awareness |
Idris Hamoud et.al. |
2407.05448 |
null |
2024-07-06 |
A Study of Test-time Contrastive Concepts for Open-world, Open-vocabulary Semantic Segmentation |
Monika Wysoczańska et.al. |
2407.05061 |
null |
2024-07-06 |
BlessemFlood21: Advancing Flood Analysis with a High-Resolution Georeferenced Dataset for Humanitarian Aid Support |
Vladyslav Polushko et.al. |
2407.05007 |
null |
2024-07-05 |
Explainable Metric Learning for Deflating Data Bias |
Emma Andrews et.al. |
2407.04866 |
null |
2024-07-10 |
LMSeg: A deep graph message-passing network for efficient and accurate semantic segmentation of large-scale 3D landscape meshes |
Zexian Huang et.al. |
2407.04326 |
null |
2024-07-04 |
Relative Difficulty Distillation for Semantic Segmentation |
Dong Liang et.al. |
2407.03719 |
link |
2024-07-04 |
POSTURE: Pose Guided Unsupervised Domain Adaptation for Human Body Part Segmentation |
Arindam Dutta et.al. |
2407.03549 |
null |
2024-07-03 |
A Unified Framework for 3D Scene Understanding |
Wei Xu et.al. |
2407.03263 |
link |
2024-07-03 |
ISWSST: Index-space-wave State Superposition Transformers for Multispectral Remotely Sensed Imagery Semantic Segmentation |
Chang Li et.al. |
2407.03033 |
null |
2024-07-03 |
ShiftAddAug: Augment Multiplication-Free Tiny Neural Network with Hybrid Computation |
Yipin Guo et.al. |
2407.02881 |
null |
2024-07-03 |
Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation |
Tao Chen et.al. |
2407.02768 |
link |
2024-07-02 |
Open Panoramic Segmentation |
Junwei Zheng et.al. |
2407.02685 |
link |
2024-07-02 |
Holistically-Nested Structure-Aware Graph Neural Network for Road Extraction |
Tinghuai Wang et.al. |
2407.02639 |
null |
2024-07-02 |
Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather |
Junsung Park et.al. |
2407.02286 |
link |
2024-07-02 |
MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders |
Baijiong Lin et.al. |
2407.02228 |
link |
2024-07-02 |
Occlusion-Aware Seamless Segmentation |
Yihong Cao et.al. |
2407.02182 |
link |
2024-07-02 |
VRBiom: A New Periocular Dataset for Biometric Applications of HMD |
Ketan Kotwal et.al. |
2407.02150 |
null |
2024-07-02 |
Label Anything: Multi-Class Few-Shot Semantic Segmentation with Visual Prompts |
Pasquale De Marinis et.al. |
2407.02075 |
link |
2024-07-02 |
Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning |
Chengchao Shen et.al. |
2407.02014 |
link |
2024-07-01 |
Label-free Neural Semantic Image Synthesis |
Jiayi Wang et.al. |
2407.01790 |
null |
2024-07-01 |
PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction |
Xuan Yu et.al. |
2407.01349 |
null |
2024-07-01 |
CSFNet: A Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes |
Danial Qashqai et.al. |
2407.01328 |
link |
2024-06-29 |
SolarSAM: Building-scale Photovoltaic Potential Assessment Based on Segment Anything Model (SAM) and Remote Sensing for Emerging City |
Guohao Wang et.al. |
2407.00296 |
link |
2024-06-28 |
Assistive Image Annotation Systems with Deep Learning and Natural Language Capabilities: A Review |
Moseli Mots'oehli et.al. |
2407.00252 |
null |
2024-07-01 |
Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding |
Yifan Tang et.al. |
2406.19791 |
null |
2024-06-28 |
Precision matters: Precision-aware ensemble for weakly supervised semantic segmentation |
Junsung Park et.al. |
2406.19638 |
link |
2024-06-28 |
PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation |
Deyi Ji et.al. |
2406.19632 |
null |
2024-06-27 |
Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model |
Haobo Yuan et.al. |
2406.19369 |
link |
2024-06-27 |
ProtoGMM: Multi-prototype Gaussian-Mixture-based Domain Adaptation Model for Semantic Segmentation |
Nazanin Moradinasab et.al. |
2406.19225 |
null |
2024-06-30 |
Segment Anything Model for automated image data annotation: empirical studies using text prompts from Grounding DINO |
Fuseini Mumuni et.al. |
2406.19057 |
null |
2024-06-27 |
Divide, Ensemble and Conquer: The Last Mile on Unsupervised Domain Adaptation for On-Board Semantic Segmentation |
Tao Lian et.al. |
2406.18809 |
null |
2024-06-26 |
CAS: Confidence Assessments of classification algorithms for Semantic segmentation of EO data |
Nikolaos Dionelis et.al. |
2406.18279 |
link |
2024-06-26 |
The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval |
Meinardus Boris et.al. |
2406.18113 |
link |
2024-06-26 |
Few-Shot Medical Image Segmentation with High-Fidelity Prototypes |
Song Tang et.al. |
2406.18074 |
link |
2024-06-25 |
Local-to-Global Cross-Modal Attention-Aware Fusion for HSI-X Semantic Segmentation |
Xuming Zhang et.al. |
2406.17679 |
null |
2024-06-25 |
DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation |
Ahmad Mohammadshirazi et.al. |
2406.17591 |
link |
2024-06-25 |
Principal Component Clustering for Semantic Segmentation in Synthetic Data Generation |
Felix Stillger et.al. |
2406.17541 |
null |
2024-06-25 |
Investigating Self-Supervised Methods for Label-Efficient Learning |
Srinivasa Rao Nandam et.al. |
2406.17460 |
null |
2024-06-25 |
Pseudo Labelling for Enhanced Masked Autoencoders |
Srinivasa Rao Nandam et.al. |
2406.17450 |
null |
2024-06-25 |
Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model |
Zhuoyuan Li et.al. |
2406.17442 |
null |
2024-06-25 |
Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes |
Qi Ma et.al. |
2406.17438 |
link |
2024-06-24 |
Instance Consistency Regularization for Semi-Supervised 3D Instance Segmentation |
Yizheng Wu et.al. |
2406.16776 |
link |
2024-06-24 |
μ-Net: A Deep Learning-Based Architecture for μ-CT Segmentation |
Pierangela Bruno et.al. |
2406.16724 |
null |
2024-06-24 |
GATSBI: An Online GTSP-Based Algorithm for Targeted Surface Bridge Inspection and Defect Detection |
Harnaik Dhami et.al. |
2406.16625 |
link |
2024-06-24 |
LOGCAN++: Local-global class-aware network for semantic segmentation of remote sensing images |
Xiaowen Ma et.al. |
2406.16502 |
link |
2024-06-24 |
Cascade Reward Sampling for Efficient Decoding-Time Alignment |
Bolian Li et.al. |
2406.16306 |
link |
2024-06-24 |
SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments |
Neng Wang et.al. |
2406.16279 |
link |
2024-06-23 |
UDHF2-Net: An Uncertainty-diffusion-model-based High-Frequency TransFormer Network for High-accuracy Interpretation of Remotely Sensed Imagery |
Pengfei Zhang et.al. |
2406.16129 |
null |
2024-06-22 |
Fine-grained Background Representation for Weakly Supervised Semantic Segmentation |
Xu Yin et.al. |
2406.15755 |
link |
2024-06-20 |
Evaluation of Deep Learning Semantic Segmentation for Land Cover Mapping on Multispectral, Hyperspectral and High Spatial Aerial Imagery |
Ilham Adi Panuntun et.al. |
2406.14220 |
null |
2024-06-20 |
Trusting Semantic Segmentation Networks |
Samik Some et.al. |
2406.14201 |
null |
2024-06-20 |
EvSegSNN: Neuromorphic Semantic Segmentation for Event Data |
Dalia Hareb et.al. |
2406.14178 |
null |
2024-06-20 |
Seg-LSTM: Performance of xLSTM for Semantic Segmentation of Remotely Sensed Images |
Qinfeng Zhu et.al. |
2406.14086 |
link |
2024-06-19 |
Search-based DNN Testing and Retraining with GAN-enhanced Simulations |
Mohammed Oualid Attaoui et.al. |
2406.13359 |
null |
2024-06-19 |
Deep Learning-Based 3D Instance and Semantic Segmentation: A Review |
Siddiqui Muhammad Yasir et.al. |
2406.13308 |
null |
2024-06-18 |
Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation |
Guoyu Yang et.al. |
2406.12496 |
link |
2024-06-18 |
Agriculture-Vision Challenge 2024 -- The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble |
Wang Liu et.al. |
2406.12271 |
null |
2024-06-17 |
OoDIS: Anomaly Instance Segmentation Benchmark |
Alexey Nekrasov et.al. |
2406.11835 |
link |
2024-06-17 |
Multimodal Learning To Improve Segmentation With Intraoperative CBCT & Preoperative CT |
Maximilian E. Tschuchnig et.al. |
2406.11650 |
null |
2024-06-17 |
Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding |
Yunsong Wang et.al. |
2406.11283 |
null |
2024-06-17 |
Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation |
Bingfeng Zhang et.al. |
2406.11189 |
link |
2024-06-21 |
$α$ -SSC: Uncertainty-Aware Camera-based 3D Semantic Scene Completion |
Sanbao Su et.al. |
2406.11021 |
null |
2024-06-16 |
PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery |
Libo Wang et.al. |
2406.10828 |
link |
2024-06-15 |
GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR |
Bharat Singh et.al. |
2406.10722 |
null |
2024-06-15 |
A Late-Stage Bitemporal Feature Fusion Network for Semantic Change Detection |
Chenyao Zhou et.al. |
2406.10678 |
link |
2024-06-14 |
ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision Transformers |
Narges Norouzi et.al. |
2406.09936 |
link |
2024-06-14 |
Label-Efficient Semantic Segmentation of LiDAR Point Clouds in Adverse Weather Conditions |
Aldi Piroli et.al. |
2406.09906 |
null |
2024-06-17 |
Exploring the Benefits of Vision Foundation Models for Unsupervised Domain Adaptation |
Brunó B. Englert et.al. |
2406.09896 |
link |
2024-06-14 |
Open-Vocabulary Semantic Segmentation with Image Embedding Balancing |
Xiangheng Shan et.al. |
2406.09829 |
link |
2024-06-13 |
Instance-level quantitative saliency in multiple sclerosis lesion segmentation |
Federico Spagnolo et.al. |
2406.09335 |
link |
2024-06-13 |
APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation |
Weizhao He et.al. |
2406.08372 |
null |
2024-06-12 |
Dataset Enhancement with Instance-Level Augmentations |
Orest Kupyn et.al. |
2406.08249 |
link |
2024-06-13 |
A $^{2}$ -MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder |
Lixian Zhang et.al. |
2406.08079 |
null |
2024-06-12 |
OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding |
Yinan Deng et.al. |
2406.08009 |
link |
2024-06-12 |
SimSAM: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image Segmentation |
Chanda Grover Kamra et.al. |
2406.07986 |
link |
2024-06-12 |
Small Scale Data-Free Knowledge Distillation |
He Liu et.al. |
2406.07876 |
link |
2024-06-11 |
Beyond Bare Queries: Open-Vocabulary Object Retrieval with 3D Scene Graph |
Sergey Linok et.al. |
2406.07113 |
null |
2024-06-11 |
PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving |
Yining Shi et.al. |
2406.07037 |
null |
2024-06-12 |
LiSD: An Efficient Multi-Task Learning Framework for LiDAR Segmentation and Detection |
Jiahua Xu et.al. |
2406.07023 |
null |
2024-06-10 |
Stable Neighbor Denoising for Source-free Domain Adaptive Segmentation |
Dong Zhao et.al. |
2406.06813 |
link |
2024-06-09 |
Transforming Heart Chamber Imaging: Self-Supervised Learning for Whole Heart Reconstruction and Segmentation |
Abdul Qayyum et.al. |
2406.06643 |
null |
2024-06-10 |
Merlin: A Vision Language Foundation Model for 3D Computed Tomography |
Louis Blankemeier et.al. |
2406.06512 |
null |
2024-06-10 |
UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving |
Daniel Bogdoll et.al. |
2406.06370 |
null |
2024-06-09 |
Scaling Graph Convolutions for Mobile Vision |
William Avery et.al. |
2406.05850 |
link |
2024-06-09 |
Solution for CVPR 2024 UG2+ Challenge Track on All Weather Semantic Segmentation |
Jun Yu et.al. |
2406.05837 |
null |
2024-06-09 |
Convolution and Attention-Free Mamba-based Cardiac Image Segmentation |
Abbas Khan et.al. |
2406.05786 |
link |
2024-06-09 |
Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language |
Mark Hamilton et.al. |
2406.05629 |
link |
2024-06-08 |
A Two-Stage Adverse Weather Semantic Segmentation Method for WeatherProof Challenge CVPR 2024 Workshop UG2+ |
Jianzhao Wang et.al. |
2406.05513 |
null |
2024-06-08 |
Layered Image Vectorization via Semantic Simplification |
Zhenyu Wang et.al. |
2406.05404 |
null |
2024-06-08 |
1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR'24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation |
Qingfeng Liu et.al. |
2406.05352 |
null |
2024-06-07 |
USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation |
Xiaoqi Wang et.al. |
2406.05271 |
null |
2024-06-07 |
Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment |
Venkanna Babu Guthula et.al. |
2406.04949 |
null |
2024-06-06 |
Characterizing segregation in blast rock piles a deep-learning approach leveraging aerial image analysis |
Chengeng Liu et.al. |
2406.04149 |
null |
2024-06-06 |
Frequency-based Matcher for Long-tailed Semantic Segmentation |
Shan Li et.al. |
2406.03917 |
link |
2024-06-07 |
Enhanced Semantic Segmentation Pipeline for WeatherProof Dataset Challenge |
Nan Zhang et.al. |
2406.03799 |
link |
2024-06-06 |
DSNet: A Novel Way to Use Atrous Convolutions in Semantic Segmentation |
Zilu Guo et.al. |
2406.03702 |
link |
2024-06-05 |
Comparative Benchmarking of Failure Detection Methods in Medical Image Segmentation: Unveiling the Role of Confidence Aggregation |
Maximilian Zenk et.al. |
2406.03323 |
null |
2024-06-05 |
Learning Semantic Traversability with Egocentric Video and Automated Annotation Strategy |
Yunho Kim et.al. |
2406.02989 |
null |
2024-06-04 |
W-RIZZ: A Weakly-Supervised Framework for Relative Traversability Estimation in Mobile Robotics |
Andre Schreiber et.al. |
2406.02822 |
link |
2024-06-04 |
Window to Wall Ratio Detection using SegFormer |
Zoe De Simone et.al. |
2406.02706 |
link |
2024-06-04 |
Detecting Endangered Marine Species in Autonomous Underwater Vehicle Imagery Using Point Annotations and Few-Shot Learning |
Heather Doig et.al. |
2406.01932 |
null |
2024-06-03 |
EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding |
Thanh-Dat Truong et.al. |
2406.01429 |
null |
2024-06-03 |
TE-NeXt: A LiDAR-Based 3D Sparse Convolutional Network for Traversability Estimation |
Antonio Santo et.al. |
2406.01395 |
link |
2024-06-03 |
ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior Architectural Structures from Point Clouds |
Ka Lung Cheung et.al. |
2406.01337 |
link |
2024-06-03 |
LSKSANet: A Novel Architecture for Remote Sensing Image Semantic Segmentation Leveraging Large Selective Kernel and Sparse Attention Mechanism |
Miao Fu et.al. |
2406.01228 |
null |
2024-06-04 |
GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer |
Ding Jia et.al. |
2406.01210 |
link |
2024-06-03 |
S-CycleGAN: Semantic Segmentation Enhanced CT-Ultrasound Image-to-Image Translation for Robotic Ultrasonography |
Yuhan Song et.al. |
2406.01191 |
link |
2024-06-02 |
Diffusion Features to Bridge Domain Gap for Semantic Segmentation |
Yuxiang Ji et.al. |
2406.00777 |
link |
2024-06-02 |
Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation |
Yunheng Li et.al. |
2406.00670 |
link |
2024-06-02 |
Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW2024 |
Biao Wu et.al. |
2406.00587 |
null |
2024-06-01 |
Memory-guided Network with Uncertainty-based Feature Augmentation for Few-shot Semantic Segmentation |
Xinyue Chen et.al. |
2406.00545 |
null |
2024-06-01 |
2nd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation |
Biao Wu et.al. |
2406.00500 |
null |
2024-06-01 |
DSCA: A Digital Subtraction Angiography Sequence Dataset and Spatio-Temporal Model for Cerebral Artery Segmentation |
Qihang Xie et.al. |
2406.00341 |
null |
2024-06-01 |
Complex Style Image Transformations for Domain Generalization in Medical Images |
Nikolaos Spanos et.al. |
2406.00298 |
null |
2024-05-31 |
TotalVibeSegmentator: Full Torso Segmentation for the NAKO and UK Biobank in Volumetric Interpolated Breath-hold Examination Body Images |
Robert Graf et.al. |
2406.00125 |
link |
2024-05-31 |
Uncertainty Quantification for Bird's Eye View Semantic Segmentation: Methods and Benchmarks |
Linlin Yu et.al. |
2405.20986 |
null |
2024-05-31 |
Revisiting and Maximizing Temporal Knowledge in Semi-supervised Semantic Segmentation |
Wooseok Shin et.al. |
2405.20610 |
link |
2024-05-30 |
P-MSDiff: Parallel Multi-Scale Diffusion for Remote Sensing Image Segmentation |
Qi Zhang et.al. |
2405.20443 |
link |
2024-05-30 |
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow |
Chaoyang Wang et.al. |
2405.20282 |
link |
2024-05-30 |
MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion |
Angel Villar-Corrales et.al. |
2405.19921 |
link |
2024-05-30 |
Open-Set Domain Adaptation for Semantic Segmentation |
Seun-An Choe et.al. |
2405.19899 |
link |
2024-05-30 |
DenseSeg: Joint Learning for Semantic Segmentation and Landmark Detection Using Dense Image-to-Shape Representation |
Ron Keuth et.al. |
2405.19746 |
link |
2024-05-30 |
CRIS: Collaborative Refinement Integrated with Segmentation for Polyp Segmentation |
Ankush Gajanan Arudkar et.al. |
2405.19672 |
null |
2024-05-29 |
Organizing Background to Explore Latent Classes for Incremental Few-shot Semantic Segmentation |
Lianlei Shan et.al. |
2405.19568 |
null |
2024-05-29 |
Enabling Visual Recognition at Radio Frequency |
Haowen Lai et.al. |
2405.19516 |
null |
2024-05-29 |
Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models |
Tianrun Chen et.al. |
2405.19326 |
null |
2024-05-29 |
A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation |
Niclas Vödisch et.al. |
2405.19035 |
link |
2024-05-29 |
Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation |
Zelin Peng et.al. |
2405.18840 |
null |
2024-05-28 |
Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation |
JuneHyoung Kwon et.al. |
2405.18148 |
null |
2024-05-28 |
Edge-guided and Class-balanced Active Learning for Semantic Segmentation of Aerial Images |
Lianlei Shan et.al. |
2405.18078 |
null |
2024-05-28 |
RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields |
Mihnea-Bogdan Jurca et.al. |
2405.18033 |
link |
2024-05-28 |
DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture |
Shentong Mo et.al. |
2405.17995 |
link |
2024-05-28 |
The Binary Quantized Neural Network for Dense Prediction via Specially Designed Upsampling and Attention |
Xingyu Ding et.al. |
2405.17776 |
null |
2024-05-27 |
Evaluation of Multi-task Uncertainties in Joint Semantic Segmentation and Monocular Depth Estimation |
Steven Landgraf et.al. |
2405.17097 |
null |
2024-05-27 |
DSU-Net: Dynamic Snake U-Net for 2-D Seismic First Break Picking |
Hongtao Wang et.al. |
2405.16980 |
null |
2024-05-27 |
Collective Perception Datasets for Autonomous Driving: A Comprehensive Review |
Sven Teufel et.al. |
2405.16973 |
null |
2024-05-27 |
Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models |
Qian Wang et.al. |
2405.16947 |
link |
2024-05-27 |
A re-calibration method for object detection with multi-modal alignment bias in autonomous driving |
Zhihang Song et.al. |
2405.16848 |
null |
2024-05-25 |
BOLD: Boolean Logic Deep Learning |
Van Minh Nguyen et.al. |
2405.16339 |
null |
2024-05-25 |
Improving 3D Occupancy Prediction through Class-balancing Loss and Multi-scale Representation |
Huizhou Chen et.al. |
2405.16099 |
null |
2024-05-25 |
Intensity and Texture Correction of Omnidirectional Image Using Camera Images for Indirect Augmented Reality |
Hakim Ikebayashi et.al. |
2405.16008 |
null |
2024-05-24 |
Visualize and Paint GAN Activations |
Rudolf Herdt et.al. |
2405.15636 |
null |
2024-05-24 |
Leveraging knowledge distillation for partial multi-task learning from multiple remote sensing datasets |
Hoàng-Ân Lê et.al. |
2405.15394 |
link |
2024-05-24 |
U3M: Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation |
Bingyu Li et.al. |
2405.15365 |
link |
2024-05-24 |
Cross-Domain Few-Shot Semantic Segmentation via Doubly Matching Transformation |
Jiayi Chen et.al. |
2405.15265 |
link |
2024-05-23 |
Mamba-R: Vision Mamba ALSO Needs Registers |
Feng Wang et.al. |
2405.14858 |
null |
2024-05-23 |
Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation |
Daniel Kienzle et.al. |
2405.14467 |
link |
2024-05-23 |
MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models |
Jiuming Liu et.al. |
2405.14338 |
null |
2024-05-23 |
Tuning-free Universally-Supervised Semantic Segmentation |
Xiaobo Yang et.al. |
2405.14294 |
null |
2024-05-23 |
SCMix: Stochastic Compound Mixing for Open Compound Domain Adaptation in Semantic Segmentation |
Kai Yao et.al. |
2405.14278 |
null |
2024-05-23 |
Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual Representations |
Mohammed Baharoon et.al. |
2405.14239 |
link |
2024-05-24 |
Leveraging Semantic Segmentation Masks with Embeddings for Fine-Grained Form Classification |
Taylor Archibald et.al. |
2405.14162 |
null |
2024-05-23 |
Skip-SCAR: A Modular Approach to ObjectGoal Navigation with Sparsity and Adaptive Skips |
Yaotian Liu et.al. |
2405.14154 |
null |
2024-05-22 |
TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System |
Diogo Lavado et.al. |
2405.13989 |
null |
2024-05-22 |
Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer |
Qihang Fan et.al. |
2405.13337 |
link |
2024-05-22 |
Vision Transformer with Sparse Scan Prior |
Qihang Fan et.al. |
2405.13335 |
link |
2024-05-22 |
Deep Learning-Driven State Correction: A Hybrid Architecture for Radar-Based Dynamic Occupancy Grid Mapping |
Max Peter Ronecker et.al. |
2405.13307 |
null |
2024-05-21 |
Transparency Distortion Robustness for SOTA Image Segmentation Tasks |
Volker Knauthe et.al. |
2405.12864 |
null |
2024-05-20 |
A comprehensive overview of deep learning techniques for 3D point cloud classification and semantic segmentation |
Sushmita Sarker et.al. |
2405.11903 |
null |
2024-05-20 |
Salience-guided Ground Factor for Robust Localization of Delivery Robots in Complex Urban Environments |
Jooyong Park et.al. |
2405.11855 |
null |
2024-05-20 |
Universal Organizer of SAM for Unsupervised Semantic Segmentation |
Tingting Li et.al. |
2405.11742 |
link |
2024-05-19 |
Interpreting a Semantic Segmentation Model for Coastline Detection |
Conor O'Sullivan et.al. |
2405.11500 |
link |
2024-05-17 |
CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation |
Mushui Liu et.al. |
2405.10530 |
link |
2024-05-16 |
Towards Task-Compatible Compressible Representations |
Anderson de Andrade et.al. |
2405.10244 |
link |
2024-05-16 |
A Preprocessing and Postprocessing Voxel-based Method for LiDAR Semantic Segmentation Improvement in Long Distance |
Andrea Matteazzi et.al. |
2405.10046 |
null |
2024-05-16 |
Towards Realistic Incremental Scenario in Class Incremental Semantic Segmentation |
Jihwan Kwak et.al. |
2405.09858 |
link |
2024-05-15 |
Synth-to-Real Unsupervised Domain Adaptation for Instance Segmentation |
Guo Yachan et.al. |
2405.09682 |
null |
2024-05-14 |
CLIP with Quality Captions: A Strong Pretraining for Vision Tasks |
Pavan Kumar Anasosalu Vasu et.al. |
2405.08911 |
null |
2024-05-14 |
Rethinking Scanning Strategies with Vision Mamba in Semantic Segmentation of Remote Sensing Imagery: An Experimental Study |
Qinfeng Zhu et.al. |
2405.08493 |
null |
2024-05-14 |
TEDNet: Twin Encoder Decoder Neural Network for 2D Camera and LiDAR Road Detection |
Martín Bayón-Gutiérrez et.al. |
2405.08429 |
link |
2024-05-13 |
IMAFD: An Interpretable Multi-stage Approach to Flood Detection from time series Multispectral Data |
Ziyang Zhang et.al. |
2405.07916 |
null |
2024-05-12 |
Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception |
Haoming Chen et.al. |
2405.07201 |
link |
2024-05-10 |
GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs |
Mustafa Munir et.al. |
2405.06849 |
link |
2024-05-10 |
Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach |
Elham Ravanbakhsh et.al. |
2405.06586 |
null |
2024-05-10 |
Semantic and Spatial Adaptive Pixel-level Classifier for Semantic Segmentation |
Xiaowen Ma et.al. |
2405.06525 |
link |
2024-05-10 |
Multi-Target Unsupervised Domain Adaptation for Semantic Segmentation without External Data |
Yonghao Xu et.al. |
2405.06502 |
link |
2024-05-10 |
Multi-level Personalized Federated Learning on Heterogeneous and Long-Tailed Data |
Rongyu Zhang et.al. |
2405.06413 |
null |
2024-05-10 |
Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation |
Zhenliang Ni et.al. |
2405.06228 |
link |
2024-05-10 |
Zero-shot Degree of Ill-posedness Estimation for Active Small Object Change Detection |
Koji Takeda et.al. |
2405.06185 |
null |
2024-05-10 |
Prior-guided Diffusion Model for Cell Segmentation in Quantitative Phase Imaging |
Zhuchen Shao et.al. |
2405.06175 |
null |
2024-05-09 |
Mask-TS Net: Mask Temperature Scaling Uncertainty Calibration for Polyp Segmentation |
Yudian Zhang et.al. |
2405.05830 |
null |
2024-05-08 |
OpenESS: Event-based Semantic Scene Understanding with Open Vocabularies |
Lingdong Kong et.al. |
2405.05259 |
link |
2024-05-08 |
Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving |
Lingdong Kong et.al. |
2405.05258 |
link |
2024-05-08 |
Weakly-supervised Semantic Segmentation via Dual-stream Contrastive Learning of Cross-image Contextual Information |
Qi Lai et.al. |
2405.04913 |
null |
2024-05-08 |
DeepDamageNet: A two-step deep-learning model for multi-disaster building damage segmentation and classification using satellite imagery |
Irene Alisjahbana et.al. |
2405.04800 |
null |
2024-05-07 |
A New Dataset and Comparative Study for Aphid Cluster Detection and Segmentation in Sorghum Fields |
Raiyan Rahman et.al. |
2405.04305 |
null |
2024-05-07 |
ELiTe: Efficient Image-to-LiDAR Knowledge Transfer for Semantic Segmentation |
Zhibo Zhang et.al. |
2405.04121 |
null |
2024-05-06 |
PTQ4SAM: Post-Training Quantization for Segment Anything |
Chengtao Lv et.al. |
2405.03144 |
link |
2024-05-04 |
MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning |
Vishal Nedungadi et.al. |
2405.02771 |
link |
2024-05-04 |
Few-Shot Fruit Segmentation via Transfer Learning |
Jordan A. James et.al. |
2405.02556 |
link |
2024-05-03 |
DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model |
Peijin Jia et.al. |
2405.02008 |
null |
2024-05-02 |
Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey |
Guoping Xu et.al. |
2405.01725 |
link |
2024-05-02 |
Explainable AI (XAI) in Image Segmentation in Medicine, Industry, and Beyond: A Survey |
Rokas Gipiškis et.al. |
2405.01636 |
null |
2024-05-02 |
CromSS: Cross-modal pre-training with noisy labels for remote sensing image segmentation |
Chenying Liu et.al. |
2405.01217 |
null |
2024-05-02 |
Uncertainty-aware self-training with expectation maximization basis transformation |
Zijia Wang et.al. |
2405.01175 |
null |
2024-05-01 |
Exploring Self-Supervised Vision Transformers for Deepfake Detection: A Comparative Analysis |
Huy H. Nguyen et.al. |
2405.00355 |
link |
2024-04-30 |
Masked Multi-Query Slot Attention for Unsupervised Object Discovery |
Rishav Pramanik et.al. |
2404.19654 |
link |
2024-04-30 |
DELINE8K: A Synthetic Data Pipeline for the Semantic Segmentation of Historical Documents |
Taylor Archibald et.al. |
2404.19259 |
null |
2024-04-29 |
Swin2-MoSE: A New Single Image Super-Resolution Model for Remote Sensing |
Leonardo Rossi et.al. |
2404.18924 |
link |
2024-04-29 |
IPixMatch: Boost Semi-supervised Semantic Segmentation with Inter-Pixel Relation |
Kebin Wu et.al. |
2404.18891 |
null |
2024-04-29 |
Towards Long-term Robotics in the Wild |
Stephen Hausler et.al. |
2404.18477 |
null |
2024-04-27 |
Multi-Stream Cellular Test-Time Adaptation of Real-Time Models Evolving in Dynamic Environments |
Benoît Gérin et.al. |
2404.17930 |
link |
2024-04-27 |
CLFT: Camera-LiDAR Fusion Transformer for Semantic Segmentation in Autonomous Driving |
Junyi Gu et.al. |
2404.17793 |
link |
2024-04-26 |
Optimizing Universal Lesion Segmentation: State Space Model-Guided Hierarchical Networks with Feature Importance Adjustment |
Kazi Shahriar Sanjid et.al. |
2404.17235 |
null |
2024-04-25 |
Boosting Unsupervised Semantic Segmentation with Principal Mask Proposals |
Oliver Hahn et.al. |
2404.16818 |
link |
2024-04-26 |
Multi-Scale Representations by Varying Window Attention for Semantic Segmentation |
Haotian Yan et.al. |
2404.16573 |
link |
2024-04-25 |
360SFUDA++: Towards Source-free UDA for Panoramic Segmentation by Learning Reliable Category Prototypes |
Xu Zheng et.al. |
2404.16501 |
null |
2024-04-25 |
Semantic Segmentation Refiner for Ultrasound Applications with Zero-Shot Foundation Models |
Hedda Cohen Indelman et.al. |
2404.16325 |
null |
2024-04-29 |
A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic Segmentation |
Yifan Zhao et.al. |
2404.16266 |
link |
2024-04-24 |
3D Freehand Ultrasound using Visual Inertial and Deep Inertial Odometry for Measuring Patellar Tracking |
Russell Buchanan et.al. |
2404.15847 |
null |
2024-04-24 |
Vision Transformer-based Adversarial Domain Adaptation |
Yahan Li et.al. |
2404.15817 |
link |
2024-04-22 |
OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks |
Sophia Sirko-Galouchenko et.al. |
2404.14027 |
link |
2024-04-21 |
Semantic-Rearrangement-Based Multi-Level Alignment for Domain Generalized Segmentation |
Guanlong Jiao et.al. |
2404.13701 |
null |
2024-04-21 |
PV-S3: Advancing Automatic Photovoltaic Defect Detection using Semi-Supervised Semantic Segmentation of Electroluminescence Images |
Abhishek Jha et.al. |
2404.13693 |
null |
2024-04-21 |
A Complete System for Automated 3D Semantic-Geometric Mapping of Corrosion in Industrial Environments |
Rui Pimentel de Figueiredo et.al. |
2404.13691 |
null |
2024-04-21 |
LMFNet: An Efficient Multimodal Fusion Approach for Semantic Segmentation in High-Resolution Remote Sensing |
Tong Wang et.al. |
2404.13659 |
null |
2024-04-21 |
Towards Unified Representation of Multi-Modal Pre-training for 3D Understanding via Differentiable Rendering |
Ben Fei et.al. |
2404.13619 |
null |
2024-04-20 |
AMMUNet: Multi-Scale Attention Map Merging for Remote Sensing Image Segmentation |
Yang Yang et.al. |
2404.13408 |
link |
2024-04-19 |
BACS: Background Aware Continual Semantic Segmentation |
Mostafa ElAraby et.al. |
2404.13148 |
link |
2024-04-19 |
ToNNO: Tomographic Reconstruction of a Neural Network's Output for Weakly Supervised Segmentation of 3D Medical Images |
Marius Schmidt-Mengin et.al. |
2404.13103 |
null |
2024-04-19 |
Foundation Model assisted Weakly Supervised LiDAR Semantic Segmentation |
Yilong Chen et.al. |
2404.12861 |
null |
2024-04-19 |
COIN: Counterfactual inpainting for weakly supervised semantic segmentation for medical images |
Dmytro Shvetsov et.al. |
2404.12832 |
link |
2024-04-19 |
A Point-Based Approach to Efficient LiDAR Multi-Task Perception |
Christopher Lang et.al. |
2404.12798 |
null |
2024-04-19 |
Generalized Few-Shot Meets Remote Sensing: Discovering Novel Classes in Land Cover Mapping via Hybrid Semantic Segmentation Framework |
Zhuohong Li et.al. |
2404.12721 |
link |
2024-04-19 |
Improving Prediction Accuracy of Semantic Segmentation Methods Using Convolutional Autoencoder Based Pre-processing Layers |
Hisashi Shimodaira et.al. |
2404.12718 |
null |
2024-04-19 |
Show and Grasp: Few-shot Semantic Segmentation for Robot Grasping through Zero-shot Foundation Models |
Leonardo Barcellona et.al. |
2404.12717 |
null |
2024-04-18 |
A Perspective on Deep Vision Performance with Standard Image and Video Codecs |
Christoph Reich et.al. |
2404.12330 |
null |
2024-04-18 |
Deep Gaussian mixture model for unsupervised image segmentation |
Matthias Schwab et.al. |
2404.12252 |
link |
2024-04-18 |
Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training |
Jin Gao et.al. |
2404.12210 |
link |
2024-04-18 |
How to Benchmark Vision Foundation Models for Semantic Segmentation? |
Tommie Kerssies et.al. |
2404.12172 |
link |
2024-04-19 |
Tendency-driven Mutual Exclusivity for Weakly Supervised Incremental Semantic Segmentation |
Chongjie Si et.al. |
2404.11981 |
null |
2024-04-18 |
Group-On: Boosting One-Shot Segmentation with Supportive Query |
Hanjing Zhou et.al. |
2404.11871 |
null |
2024-04-17 |
Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach |
Mir Rayat Imtiaz Hossain et.al. |
2404.11732 |
null |
2024-04-17 |
A Semantic Segmentation-guided Approach for Ground-to-Aerial Image Matching |
Francesco Pro et.al. |
2404.11302 |
link |
2024-04-17 |
Learning from Unlabelled Data with Transformers: Domain Adaptation for Semantic Segmentation of High Resolution Aerial Images |
Nikolaos Dionelis et.al. |
2404.11299 |
link |
2024-04-16 |
A Concise Tiling Strategy for Preserving Spatial Context in Earth Observation Imagery |
Ellianna Abrahams et.al. |
2404.10927 |
link |
2024-04-16 |
Vocabulary-free Image Classification and Semantic Segmentation |
Alessandro Conti et.al. |
2404.10864 |
link |
2024-04-16 |
Gasformer: A Transformer-based Architecture for Segmenting Methane Emissions from Livestock in Optical Gas Imaging |
Toqi Tahamid Sarker et.al. |
2404.10841 |
link |
2024-04-16 |
Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark |
Jiangning Zhang et.al. |
2404.10760 |
link |
2024-04-16 |
ECLAIR: A High-Fidelity Aerial LiDAR Dataset for Semantic Segmentation |
Iaroslav Melekhov et.al. |
2404.10699 |
link |
2024-04-16 |
Contextrast: Contextual Contrastive Learning for Semantic Segmentation |
Changki Sung et.al. |
2404.10633 |
null |
2024-04-16 |
Label merge-and-split: A graph-colouring approach for memory-efficient brain parcellation |
Aaron Kujawa et.al. |
2404.10572 |
null |
2024-04-16 |
LAECIPS: Large Vision Model Assisted Adaptive Edge-Cloud Collaboration for IoT-based Perception System |
Shijing Hu et.al. |
2404.10498 |
null |
2024-04-16 |
Adversarial Identity Injection for Semantic Face Image Synthesis |
Giuseppe Tarollo et.al. |
2404.10408 |
null |
2024-04-16 |
Domain-Rectifying Adapter for Cross-Domain Few-Shot Segmentation |
Jiapeng Su et.al. |
2404.10322 |
link |
2024-04-15 |
Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL |
Fangwei Zhong et.al. |
2404.09857 |
null |
2024-04-15 |
In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation |
Han Xue et.al. |
2404.09633 |
null |
2024-04-15 |
The revenge of BiSeNet: Efficient Multi-Task Image Segmentation |
Gabriele Rosi et.al. |
2404.09570 |
null |
2024-04-16 |
Human-in-the-Loop Segmentation of Multi-species Coral Imagery |
Scarlett Raine et.al. |
2404.09406 |
link |
2024-04-14 |
Bridging Data Islands: Geographic Heterogeneity-Aware Federated Learning for Collaborative Remote Sensing Semantic Segmentation |
Jieyi Tan et.al. |
2404.09292 |
null |
2024-04-12 |
Analyzing Decades-Long Environmental Changes in Namibia Using Archival Aerial Photography and Deep Learning |
Girmaw Abebe Tadesse et.al. |
2404.08544 |
null |
2024-04-12 |
LaSagnA: Language-based Segmentation Assistant for Complex Queries |
Cong Wei et.al. |
2404.08506 |
link |
2024-04-12 |
Tackling Ambiguity from Perspective of Uncertainty Inference and Affinity Diversification for Weakly Supervised Semantic Segmentation |
Zhiwei Yang et.al. |
2404.08195 |
link |
2024-04-12 |
Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation |
Sina Hajimiri et.al. |
2404.08181 |
link |
2024-04-10 |
AI-Guided Feature Segmentation Techniques to Model Features from Single Crystal Diamond Growth |
Rohan Reddy Mekala et.al. |
2404.08017 |
null |
2024-04-11 |
Exploiting Object-based and Segmentation-based Semantic Features for Deep Learning-based Indoor Scene Classification |
Ricardo Pereira et.al. |
2404.07739 |
null |
2024-04-11 |
OpenTrench3D: A Photogrammetric 3D Point Cloud Dataset for Semantic Segmentation of Underground Utilities |
Lasse H. Hansen et.al. |
2404.07711 |
link |
2024-04-11 |
Implicit and Explicit Language Guidance for Diffusion-based Visual Perception |
Hefeng Wang et.al. |
2404.07600 |
null |
2024-04-11 |
Improving Shift Invariance in Convolutional Neural Networks with Translation Invariant Polyphase Sampling |
Sourajit Saha et.al. |
2404.07410 |
link |
2024-04-10 |
AI-Guided Defect Detection Techniques to Model Single Crystal Diamond Growth |
Rohan Reddy Mekala et.al. |
2404.07306 |
null |
2024-04-10 |
RESSCAL3D: Resolution Scalable 3D Semantic Segmentation of Point Clouds |
Remco Royen et.al. |
2404.06863 |
null |
2024-04-10 |
O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation |
Muer Tie et.al. |
2404.06836 |
null |
2024-04-10 |
Convolution-based Probability Gradient Loss for Semantic Segmentation |
Guohang Shan et.al. |
2404.06704 |
link |
2024-04-09 |
Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation |
Luca Barsellotti et.al. |
2404.06542 |
null |
2024-04-09 |
QueSTMaps: Queryable Semantic Topological Maps for 3D Scene Understanding |
Yash Mehan et.al. |
2404.06442 |
null |
2024-04-09 |
DaF-BEVSeg: Distortion-aware Fisheye Camera based Bird's Eye View Segmentation with Occlusion Reasoning |
Senthil Yogamani et.al. |
2404.06352 |
null |
2024-04-09 |
Hierarchical Insights: Exploiting Structural Similarities for Reliable 3D Semantic Segmentation |
Mariella Dreissig et.al. |
2404.06124 |
null |
2024-04-09 |
Improving Facial Landmark Detection Accuracy and Efficiency with Knowledge Distillation |
Zong-Wei Hong et.al. |
2404.06029 |
null |
2024-04-08 |
Evaluating the Efficacy of Cut-and-Paste Data Augmentation in Semantic Segmentation for Satellite Imagery |
Ionut M. Motoi et.al. |
2404.05693 |
link |
2024-04-08 |
AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation |
Jiannan Ge et.al. |
2404.05667 |
null |
2024-04-08 |
Impact of LiDAR visualisations on semantic segmentation of archaeological objects |
Raveerat Jaturapitpornchai et.al. |
2404.05512 |
null |
2024-04-08 |
Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance |
Dazhong Shen et.al. |
2404.05384 |
link |
2024-04-08 |
GPS-free Autonomous Navigation in Cluttered Tree Rows with Deep Semantic Segmentation |
Alessandro Navone et.al. |
2404.05338 |
null |
2024-04-08 |
Human Detection from 4D Radar Data in Low-Visibility Field Conditions |
Mikael Skog et.al. |
2404.05307 |
null |
2024-04-08 |
iVPT: Improving Task-relevant Information Sharing in Visual Prompt Tuning by Cross-layer Dynamic Connection |
Nan Zhou et.al. |
2404.05207 |
null |
2024-04-08 |
UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather |
Haimei Zhao et.al. |
2404.05145 |
null |
2024-04-07 |
D2SL: Decouple Defogging and Semantic Learning for Foggy Domain-Adaptive Segmentation |
Xuan Sun et.al. |
2404.04807 |
null |
2024-04-06 |
HawkDrive: A Transformer-driven Visual Perception System for Autonomous Driving in Night Scene |
Ziang Guo et.al. |
2404.04653 |
link |
2024-04-06 |
Panoptic Perception: A Novel Task and Fine-grained Dataset for Universal Remote Sensing Image Interpretation |
Danpei Zhao et.al. |
2404.04608 |
null |
2024-04-06 |
PIE: Physics-inspired Low-light Enhancement |
Dong Liang et.al. |
2404.04586 |
null |
2024-04-06 |
Frequency Decomposition-Driven Unsupervised Domain Adaptation for Remote Sensing Image Semantic Segmentation |
Xianping Ma et.al. |
2404.04531 |
link |
2024-04-05 |
Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation |
Zifu Wan et.al. |
2404.04256 |
link |
2024-04-05 |
Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation |
Ji-Jia Wu et.al. |
2404.04231 |
link |
2024-04-05 |
MarsSeg: Mars Surface Semantic Segmentation with Multi-level Extractor and Connector |
Junbo Li et.al. |
2404.04155 |
null |
2024-04-04 |
Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation |
Elham Amin Mansour et.al. |
2404.03799 |
null |
2024-04-04 |
Flattening the Parent Bias: Hierarchical Semantic Segmentation in the Poincaré Ball |
Simon Weber et.al. |
2404.03778 |
link |
2024-04-04 |
Background Noise Reduction of Attention Map for Weakly Supervised Semantic Segmentation |
Izumi Fujimori et.al. |
2404.03394 |
null |
2024-04-03 |
GPU-Accelerated RSF Level Set Evolution for Large-Scale Microvascular Segmentation |
Meher Niger et.al. |
2404.02813 |
null |
2024-04-03 |
RS-Mamba for Large Remote Sensing Image Dense Prediction |
Sijie Zhao et.al. |
2404.02668 |
link |
2024-04-03 |
A Satellite Band Selection Framework for Amazon Forest Deforestation Detection Task |
Eduardo Neto et.al. |
2404.02659 |
null |
2024-04-03 |
SG-BEV: Satellite-Guided BEV Fusion for Cross-View Semantic Segmentation |
Junyan Ye et.al. |
2404.02638 |
link |
2024-04-03 |
Active learning for efficient annotation in precision agriculture: a use-case on crop-weed semantic segmentation |
Bart M. van Marrewijk et.al. |
2404.02580 |
null |
2024-04-03 |
HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras |
Zhongyu Xia et.al. |
2404.02517 |
link |
2024-04-03 |
Optimizing traffic signs and lights visibility for the teleoperation of autonomous vehicles through ROI compression |
I. Dror et.al. |
2404.02481 |
null |
2024-04-03 |
RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation |
Xianping Ma et.al. |
2404.02457 |
link |
2024-04-02 |
Constrained Robotic Navigation on Preferred Terrains Using LLMs and Speech Instruction: Exploiting the Power of Adverbs |
Faraz Lotfi et.al. |
2404.02294 |
null |
2024-04-01 |
Versatile Navigation under Partial Observability via Value-guided Diffusion Policy |
Gengyu Zhang et.al. |
2404.02176 |
null |
2024-04-02 |
Multi-Level Label Correction by Distilling Proximate Patterns for Semi-supervised Semantic Segmentation |
Hui Xiao et.al. |
2404.02065 |
null |
2024-04-02 |
Synthetic Data for Robust Stroke Segmentation |
Liam Chalcroft et.al. |
2404.01946 |
link |
2024-04-02 |
Improving Bird's Eye View Semantic Segmentation by Task Decomposition |
Tianhao Zhao et.al. |
2404.01925 |
null |
2024-04-02 |
Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model |
Qinfeng Zhu et.al. |
2404.01705 |
link |
2024-04-04 |
Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss |
Jaeha Kim et.al. |
2404.01692 |
link |
2024-04-01 |
PDF: A Probability-Driven Framework for Open World 3D Point Cloud Semantic Segmentation |
Jinfeng Xu et.al. |
2404.00979 |
link |
2024-04-01 |
GOV-NeSF: Generalizable Open-Vocabulary Neural Semantic Fields |
Yunsong Wang et.al. |
2404.00931 |
link |
2024-04-02 |
Rethinking Saliency-Guided Weakly-Supervised Semantic Segmentation |
Beomyoung Kim et.al. |
2404.00918 |
link |
2024-03-31 |
Training-Free Semantic Segmentation via LLM-Supervision |
Wenfang Sun et.al. |
2404.00701 |
null |
2024-03-31 |
LAESI: Leaf Area Estimation with Synthetic Imagery |
Jacek Kałużny et.al. |
2404.00593 |
null |
2024-03-30 |
DHR: Dual Features-Driven Hierarchical Rebalancing in Inter- and Intra-Class Regions for Weakly-Supervised Semantic Segmentation |
Sanghyun Jo et.al. |
2404.00380 |
link |
2024-03-30 |
Efficient Multi-branch Segmentation Network for Situation Awareness in Autonomous Navigation |
Guan-Cheng Zhou et.al. |
2404.00366 |
null |
2024-03-30 |
Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation |
Yuan Wang et.al. |
2404.00262 |
null |
2024-03-29 |
Modeling Weather Uncertainty for Multi-weather Co-Presence Estimation |
Qi Bi et.al. |
2403.20092 |
null |
2024-03-29 |
MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection |
Ali Behrouz et.al. |
2403.19888 |
null |
2024-03-28 |
Segmentation Re-thinking Uncertainty Estimation Metrics for Semantic Segmentation |
Qitian Ma et.al. |
2403.19826 |
null |
2024-03-28 |
ENet-21: An Optimized light CNN Structure for Lane Detection |
Seyed Rasoul Hosseini et.al. |
2403.19782 |
null |
2024-03-29 |
Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers |
Pingcheng Dong et.al. |
2403.19591 |
link |
2024-03-28 |
DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs |
Donghyun Kim et.al. |
2403.19588 |
link |
2024-03-28 |
Learning Multiple Representations with Inconsistency-Guided Detail Regularization for Mask-Guided Matting |
Weihao Jiang et.al. |
2403.19213 |
null |
2024-03-27 |
Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D |
Mukund Varma T et.al. |
2403.18922 |
null |
2024-03-27 |
I2CKD : Intra- and Inter-Class Knowledge Distillation for Semantic Segmentation |
Ayoub Karine et.al. |
2403.18490 |
null |
2024-03-28 |
ViTAR: Vision Transformer with Any Resolution |
Qihang Fan et.al. |
2403.18361 |
null |
2024-03-27 |
Generating Diverse Agricultural Data for Vision-Based Farming Applications |
Mikolaj Cieslak et.al. |
2403.18351 |
null |
2024-03-27 |
Road Obstacle Detection based on Unknown Objectness Scores |
Chihiro Noguchi et.al. |
2403.18207 |
null |
2024-03-26 |
The Need for Speed: Pruning Transformers with One Recipe |
Samir Khaki et.al. |
2403.17921 |
link |
2024-03-26 |
Compressed Multi-task embeddings for Data-Efficient Downstream training and inference in Earth Observation |
Carlos Gomes et.al. |
2403.17886 |
link |
2024-03-26 |
PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition |
Chenhongyi Yang et.al. |
2403.17695 |
link |
2024-03-25 |
Optimizing LiDAR Placements for Robust Driving Perception in Adverse Conditions |
Ye Li et.al. |
2403.17009 |
link |
2024-03-25 |
DreamLIP: Language-Image Pre-training with Long Captions |
Kecheng Zheng et.al. |
2403.17007 |
link |
2024-03-25 |
TwinLiteNetPlus: A Stronger Model for Real-time Drivable Area and Lane Segmentation |
Quang-Huy Che et.al. |
2403.16958 |
null |
2024-03-25 |
HPL-ESS: Hybrid Pseudo-Labeling for Unsupervised Event-based Semantic Segmentation |
Linglin Jing et.al. |
2403.16788 |
null |
2024-03-25 |
SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation |
Aysim Toker et.al. |
2403.16605 |
null |
2024-03-25 |
Self-Supervised Learning for Medical Image Data with Anatomy-Oriented Imaging Planes |
Tianwei Zhang et.al. |
2403.16499 |
null |
2024-03-25 |
GoodSAM: Bridging Domain and Capacity Gaps via Segment Anything Model for Distortion-aware Panoramic Semantic Segmentation |
Weiming Zhang et.al. |
2403.16370 |
null |
2024-03-24 |
Dual-modal Prior Semantic Guided Infrared and Visible Image Fusion for Intelligent Transportation System |
Jing Li et.al. |
2403.16227 |
null |
2024-03-24 |
Segment Anything Model for Road Network Graph Extraction |
Congrui Hetang et.al. |
2403.16051 |
link |
2024-03-24 |
SM2C: Boost the Semi-supervised Segmentation for Medical Image by using Meta Pseudo Labels and Mixed Images |
Yifei Wang et.al. |
2403.16009 |
null |
2024-03-22 |
Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting |
Jun Guo et.al. |
2403.15624 |
null |
2024-03-22 |
A2DMN: Anatomy-Aware Dilated Multiscale Network for Breast Ultrasound Semantic Segmentation |
Kyle Lucke et.al. |
2403.15560 |
null |
2024-03-22 |
InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding |
Yi Wang et.al. |
2403.15377 |
link |
2024-03-22 |
Anytime, Anywhere, Anyone: Investigating the Feasibility of Segment Anything Model for Crowd-Sourcing Medical Image Annotations |
Pranav Kulkarni et.al. |
2403.15218 |
link |
2024-03-22 |
Your Image is My Video: Reshaping the Receptive Field via Image-To-Video Differentiable AutoAugmentation and Fusion |
Sofia Casarin et.al. |
2403.15194 |
null |
2024-03-22 |
Improve Cross-domain Mixed Sampling with Guidance Training for Adaptive Segmentation |
Wenlve Zhou et.al. |
2403.14995 |
link |
2024-03-21 |
WeatherProof: Leveraging Language Guidance for Semantic Segmentation in Adverse Weather |
Blake Gella et.al. |
2403.14874 |
null |
2024-03-21 |
Learning to Project for Cross-Task Knowledge Distillation |
Dylan Auty et.al. |
2403.14494 |
null |
2024-03-21 |
OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation |
Bohao Peng et.al. |
2403.14418 |
link |
2024-03-21 |
Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models |
Pablo Marcos-Manchón et.al. |
2403.14291 |
link |
2024-03-21 |
OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation |
Kwanyoung Kim et.al. |
2403.14183 |
link |
2024-03-21 |
Evidential Semantic Mapping in Off-road Environments with Uncertainty-aware Bayesian Kernel Inference |
Junyoung Kim et.al. |
2403.14138 |
null |
2024-03-21 |
Soft Masked Transformer for Point Cloud Processing with Skip Attention-Based Upsampling |
Yong He et.al. |
2403.14124 |
null |
2024-03-21 |
Semantics from Space: Satellite-Guided Thermal Semantic Segmentation Annotation for Aerial Field Robots |
Connor Lee et.al. |
2403.14056 |
null |
2024-03-20 |
When Cars meet Drones: Hyperbolic Federated Learning for Source-Free Domain Adaptation in Adverse Weather |
Giulia Rizzoli et.al. |
2403.13762 |
link |
2024-03-20 |
Next day fire prediction via semantic segmentation |
Konstantinos Alexis et.al. |
2403.13545 |
null |
2024-03-20 |
MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining |
Di Wang et.al. |
2403.13430 |
link |
2024-03-20 |
AMCO: Adaptive Multimodal Coupling of Vision and Proprioception for Quadruped Robot Navigation in Outdoor Environments |
Mohamed Elnoor et.al. |
2403.13235 |
null |
2024-03-20 |
Modeling the Label Distributions for Weakly-Supervised Semantic Segmentation |
Linshan Wu et.al. |
2403.13225 |
link |
2024-03-19 |
Reflectivity Is All You Need!: Advancing LiDAR Semantic Segmentation |
Kasi Viswanath et.al. |
2403.13188 |
link |
2024-03-19 |
As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks? |
Anjun Hu et.al. |
2403.12693 |
null |
2024-03-19 |
PCT: Perspective Cue Training Framework for Multi-Camera BEV Segmentation |
Haruya Ishikawa et.al. |
2403.12530 |
null |
2024-03-19 |
Semantics, Distortion, and Style Matter: Towards Source-free UDA for Panoramic Segmentation |
Xu Zheng et.al. |
2403.12505 |
null |
2024-03-18 |
Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation |
Wangbo Zhao et.al. |
2403.11808 |
link |
2024-03-18 |
LSKNet: A Foundation Lightweight Backbone for Remote Sensing |
Yuxuan Li et.al. |
2403.11735 |
link |
2024-03-18 |
TTT-KD: Test-Time Training for 3D Semantic Segmentation through Knowledge Distillation from Foundation Models |
Lisa Weijler et.al. |
2403.11691 |
null |
2024-03-18 |
OurDB: Ouroboric Domain Bridging for Multi-Target Domain Adaptive Semantic Segmentation |
Seungbeom Woo et.al. |
2403.11582 |
null |
2024-03-18 |
MCD: Diverse Large-Scale Multi-Campus Dataset for Robot Perception |
Thien-Minh Nguyen et.al. |
2403.11496 |
null |
2024-03-18 |
Uncertainty-Calibrated Test-Time Model Adaptation without Forgetting |
Mingkui Tan et.al. |
2403.11491 |
null |
2024-03-17 |
TAG: Guidance-free Open-Vocabulary Semantic Segmentation |
Yasufumi Kawano et.al. |
2403.11197 |
link |
2024-03-17 |
MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation |
Yasufumi Kawano et.al. |
2403.11194 |
link |
2024-03-17 |
DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation |
Yuanchen Wu et.al. |
2403.11184 |
link |
2024-03-17 |
Adaptive Semantic-Enhanced Denoising Diffusion Probabilistic Model for Remote Sensing Image Super-Resolution |
Jialu Sui et.al. |
2403.11078 |
link |
2024-03-16 |
Fuzzy Rank-based Late Fusion Technique for Cytology image Segmentation |
Soumyajyoti Dey et.al. |
2403.10884 |
null |
2024-03-16 |
Active Label Correction for Semantic Segmentation with Foundation Models |
Hoyoung Kim et.al. |
2403.10820 |
link |
2024-03-15 |
SwinMTL: A Shared Architecture for Simultaneous Depth Estimation and Semantic Segmentation from Monocular Camera Images |
Pardis Taghavi et.al. |
2403.10662 |
link |
2024-03-15 |
FeatUp: A Model-Agnostic Framework for Features at Any Resolution |
Stephanie Fu et.al. |
2403.10516 |
link |
2024-03-15 |
Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search |
Hongyuan Yu et.al. |
2403.10413 |
link |
2024-03-15 |
Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning |
Meixuan Li et.al. |
2403.10252 |
null |
2024-03-15 |
Exploring Optical Flow Inclusion into nnU-Net Framework for Surgical Instrument Segmentation |
Marcos Fernández-Rodríguez et.al. |
2403.10216 |
null |
2024-03-14 |
WeakSurg: Weakly supervised surgical instrument segmentation using temporal equivariance and semantic continuity |
Qiyuan Wang et.al. |
2403.09551 |
null |
2024-03-14 |
Annotation Free Semantic Segmentation with Vision Foundation Models |
Soroush Seifi et.al. |
2403.09307 |
null |
2024-03-14 |
When Semantic Segmentation Meets Frequency Aliasing |
Linwei Chen et.al. |
2403.09065 |
link |
2024-03-13 |
CART: Caltech Aerial RGB-Thermal Dataset in the Wild |
Connor Lee et.al. |
2403.08997 |
link |
2024-03-13 |
SLCF-Net: Sequential LiDAR-Camera Fusion for Semantic Scene Completion using a 3D Recurrent U-Net |
Helin Cao et.al. |
2403.08885 |
link |
2024-03-13 |
Segmentation of Knee Bones for Osteoarthritis Assessment: A Comparative Analysis of Supervised, Few-Shot, and Zero-Shot Learning Approaches |
Yun Xin Teoh et.al. |
2403.08761 |
null |
2024-03-13 |
Real-time 3D semantic occupancy prediction for autonomous vehicles using memory-efficient sparse convolution |
Samuel Sze et.al. |
2403.08748 |
null |
2024-03-13 |
Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation |
Zicheng Zhang et.al. |
2403.08426 |
null |
2024-03-13 |
LIX: Implicitly Infusing Spatial Geometric Prior Knowledge into Visual Semantic Segmentation for Autonomous Driving |
Sicen Guo et.al. |
2403.08215 |
null |
2024-03-13 |
Multiscale Low-Frequency Memory Network for Improved Feature Extraction in Convolutional Neural Networks |
Fuzhi Wu et.al. |
2403.08157 |
link |
2024-03-12 |
Mitigating the Impact of Attribute Editing on Face Recognition |
Sudipta Banerjee et.al. |
2403.08092 |
null |
2024-03-12 |
Hunting Attributes: Context Prototype-Aware Learning for Weakly Supervised Semantic Segmentation |
Feilong Tang et.al. |
2403.07630 |
link |
2024-03-12 |
PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution |
Honghao Chen et.al. |
2403.07589 |
null |
2024-03-12 |
Open-World Semantic Segmentation Including Class Similarity |
Matteo Sodano et.al. |
2403.07532 |
link |
2024-03-11 |
Average Calibration Error: A Differentiable Loss for Improved Reliability in Image Segmentation |
Theodore Barfoot et.al. |
2403.06759 |
link |
2024-03-11 |
Forest Inspection Dataset for Aerial Semantic Segmentation and Depth Estimation |
Bianca-Cerasela-Zelia Blaga et.al. |
2403.06621 |
link |
2024-03-11 |
OMH: Structured Sparsity via Optimally Matched Hierarchy for Unsupervised Semantic Segmentation |
Baran Ozaydin et.al. |
2403.06546 |
null |
2024-03-11 |
Point Mamba: A Novel Point Cloud Backbone Based on State Space Model with Octree-Based Ordering Strategy |
Jiuming Liu et.al. |
2403.06467 |
link |
2024-03-14 |
Towards the Uncharted: Density-Descending Feature Perturbation for Semi-supervised Semantic Segmentation |
Xiaoyang Wang et.al. |
2403.06462 |
link |
2024-03-11 |
Refining Segmentation On-the-Fly: An Interactive Framework for Point Cloud Semantic Segmentation |
Peng Zhang et.al. |
2403.06401 |
null |
2024-03-10 |
Style Blind Domain Generalized Semantic Segmentation via Covariance Alignment and Semantic Consistence Contrastive Learning |
Woo-Jin Ahn et.al. |
2403.06122 |
link |
2024-03-08 |
Attention-guided Feature Distillation for Semantic Segmentation |
Amir M. Mansourian et.al. |
2403.05451 |
link |
2024-03-08 |
Generalized Correspondence Matching via Flexible Hierarchical Refinement and Patch Descriptor Distillation |
Yu Han et.al. |
2403.05388 |
null |
2024-03-08 |
Embedded Deployment of Semantic Segmentation in Medicine through Low-Resolution Inputs |
Erik Ostrowski et.al. |
2403.05340 |
null |
2024-03-08 |
LVIC: Multi-modality segmentation by Lifting Visual Info as Cue |
Zichao Dong et.al. |
2403.05159 |
null |
2024-03-06 |
ECAP: Extensive Cut-and-Paste Augmentation for Unsupervised Domain Adaptive Semantic Segmentation |
Erik Brorsson et.al. |
2403.03854 |
link |
2024-03-06 |
Multi-Grained Cross-modal Alignment for Learning Open-vocabulary Semantic Segmentation from Text Supervision |
Yajie Liu et.al. |
2403.03707 |
null |
2024-03-06 |
Causal Prototype-inspired Contrast Adaptation for Unsupervised Domain Adaptive Semantic Segmentation of High-resolution Remote Sensing Imagery |
Jingru Zhu et.al. |
2403.03704 |
null |
2024-03-06 |
GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding |
Zi-Ting Chou et.al. |
2403.03608 |
null |
2024-03-06 |
Multi-task Learning for Real-time Autonomous Driving Leveraging Task-adaptive Attention Generator |
Wonhyeok Choi et.al. |
2403.03468 |
null |
2024-03-05 |
ActiveAD: Planning-Oriented Active Learning for End-to-End Autonomous Driving |
Han Lu et.al. |
2403.02877 |
null |
2024-03-05 |
DDF: A Novel Dual-Domain Image Fusion Strategy for Remote Sensing Image Semantic Segmentation with Unsupervised Domain Adaptation |
Lingyan Ran et.al. |
2403.02784 |
null |
2024-03-08 |
Learning without Exact Guidance: Updating Large-scale High-resolution Land Cover Maps from Low-resolution Historical Labels |
Zhuohong Li et.al. |
2403.02746 |
link |
2024-03-05 |
FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird's-Eye View and Perspective View |
Jiawei Hou et.al. |
2403.02710 |
null |
2024-03-05 |
Deep Common Feature Mining for Efficient Video Semantic Segmentation |
Yaoyan Zheng et.al. |
2403.02689 |
link |
2024-03-04 |
Self-Supervised Facial Representation Learning with Facial Region Awareness |
Zheng Gao et.al. |
2403.02138 |
null |
2024-03-04 |
Semi-Supervised Semantic Segmentation Based on Pseudo-Labels: A Survey |
Lingyan Ran et.al. |
2403.01909 |
null |
2024-03-04 |
Map-aided annotation for pole base detection |
Benjamin Missaoui et.al. |
2403.01868 |
null |
2024-03-06 |
AllSpark: Reborn Labeled Features from Unlabeled in Transformer for Semi-Supervised Semantic Segmentation |
Haonan Wang et.al. |
2403.01818 |
link |
2024-03-03 |
EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation |
Chanyoung Kim et.al. |
2403.01482 |
link |
2024-03-02 |
Benchmarking Segmentation Models with Mask-Preserved Attribute Editing |
Zijin Yin et.al. |
2403.01231 |
link |
2024-03-02 |
Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation |
Lian Xu et.al. |
2403.01156 |
null |
2024-03-01 |
Rethinking Few-shot 3D Point Cloud Semantic Segmentation |
Zhaochong An et.al. |
2403.00592 |
link |
2024-03-01 |
Small, Versatile and Mighty: A Range-View Perception Framework |
Qiang Meng et.al. |
2403.00325 |
null |
2024-03-01 |
YOLO-MED : Multi-Task Interaction Network for Biomedical Images |
Suizhi Huang et.al. |
2403.00245 |
null |
2024-02-29 |
FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything |
Safouane El Ghazouali et.al. |
2403.00175 |
link |
2024-02-29 |
RSAM-Seg: A SAM-based Approach with Prior Knowledge Integration for Remote Sensing Image Semantic Segmentation |
Jie Zhang et.al. |
2402.19004 |
null |
2024-02-28 |
Spatial Coherence Loss for Salient and Camouflaged Object Detection and Beyond |
Ziyun Yang et.al. |
2402.18698 |
null |
2024-02-29 |
Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation |
Zhiwei Yang et.al. |
2402.18467 |
link |
2024-02-29 |
A Modular System for Enhanced Robustness of Multimedia Understanding Networks via Deep Parametric Estimation |
Francesco Barbato et.al. |
2402.18402 |
link |
2024-02-28 |
Enhancing Roadway Safety: LiDAR-based Tree Clearance Analysis |
Miriam Louise Carnot et.al. |
2402.18309 |
null |
2024-02-28 |
Self-Supervised Learning in Electron Microscopy: Towards a Foundation Model for Advanced Image Analysis |
Bashir Kazimi et.al. |
2402.18286 |
null |
2024-02-28 |
PRCL: Probabilistic Representation Contrastive Learning for Semi-Supervised Semantic Segmentation |
Haoyu Xie et.al. |
2402.18117 |
null |
2024-02-28 |
Spannotation: Enhancing Semantic Segmentation for Autonomous Navigation with Efficient Image Annotation |
Samuel O. Folorunsho et.al. |
2402.18084 |
link |
2024-02-27 |
Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation |
Xinyu Yang et.al. |
2402.17891 |
link |
2024-02-27 |
Masked Gamma-SSL: Learning Uncertainty Estimation via Masked Image Modeling |
David S. W. Williams et.al. |
2402.17622 |
null |
2024-02-27 |
A Large-scale Evaluation of Pretraining Paradigms for the Detection of Defects in Electroluminescence Solar Cell Images |
David Torpey et.al. |
2402.17611 |
null |
2024-02-27 |
Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised Semantic Segmentation with Its Class Label |
Xinliang Zhang et.al. |
2402.17555 |
link |
2024-02-26 |
ConSept: Continual Semantic Segmentation via Adapter-based Vision Transformer |
Bowen Dong et.al. |
2402.16674 |
null |
2024-02-26 |
UN-SAM: Universal Prompt-Free Segmentation for Generalized Nuclei Images |
Zhen Chen et.al. |
2402.16663 |
link |
2024-02-26 |
Placing Objects in Context via Inpainting for Out-of-distribution Segmentation |
Pau de Jorge et.al. |
2402.16392 |
link |
2024-02-26 |
BLO-SAM: Bi-level Optimization Based Overfitting-Preventing Finetuning of SAM |
Li Zhang et.al. |
2402.16338 |
link |
2024-02-23 |
Modified CycleGAN for the synthesization of samples for wheat head segmentation |
Jaden Myers et.al. |
2402.15135 |
null |
2024-02-22 |
Semantic Image Synthesis with Unconditional Generator |
Jungwoo Chae et.al. |
2402.14395 |
null |
2024-02-22 |
Think before You Leap: Content-Aware Low-Cost Edge-Assisted Video Semantic Segmentation |
Mingxuan Yan et.al. |
2402.14326 |
null |
2024-02-21 |
Tumor segmentation on whole slide images: training or prompting? |
Huaqian Wu et.al. |
2402.13932 |
null |
2024-02-26 |
BenchCloudVision: A Benchmark Analysis of Deep Learning Approaches for Cloud Detection and Segmentation in Remote Sensing Imagery |
Loddo Fabio et.al. |
2402.13918 |
link |
2024-02-21 |
Zero-BEV: Zero-shot Projection of Any First-Person Modality to BEV Maps |
Gianluca Monaci et.al. |
2402.13848 |
null |
2024-02-21 |
Generalizable Semantic Vision Query Generation for Zero-shot Panoptic and Semantic Segmentation |
Jialei Chen et.al. |
2402.13697 |
null |
2024-02-20 |
Cross-Domain Transfer Learning with CoRTe: Consistent and Reliable Transfer from Black-Box to Lightweight Segmentation Model |
Claudia Cuttano et.al. |
2402.13122 |
null |
2024-02-19 |
LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks |
Truong Thanh Hung Nguyen et.al. |
2402.12525 |
link |
2024-02-19 |
Towards Explainable LiDAR Point Cloud Semantic Segmentation via Gradient Based Target Localization |
Abhishek Kuriyal et.al. |
2402.12098 |
link |
2024-02-19 |
ISCUTE: Instance Segmentation of Cables Using Text Embedding |
Shir Kozlovsky et.al. |
2402.11996 |
null |
2024-02-18 |
Key Patch Proposer: Key Patches Contain Rich Information |
Jing Xu et.al. |
2402.11458 |
link |
2024-02-17 |
ChatEarthNet: A Global-Scale, High-Quality Image-Text Dataset for Remote Sensing |
Zhenghang Yuan et.al. |
2402.11325 |
link |
2024-02-17 |
A Decoding Scheme with Successive Aggregation of Multi-Level Features for Light-Weight Semantic Segmentation |
Jiwon Yoo et.al. |
2402.11201 |
null |
2024-02-16 |
HistoSegCap: Capsules for Weakly-Supervised Semantic Segmentation of Histological Tissue Type in Whole Slide Images |
Mobina Mansoori et.al. |
2402.10851 |
null |
2024-02-16 |
Selective Prediction for Semantic Segmentation using Post-Hoc Confidence Estimation and Its Performance under Distribution Shift |
Bruno Laboissiere Camargos Borges et.al. |
2402.10665 |
null |
2024-02-16 |
Efficient Multi-task Uncertainties for Joint Semantic Segmentation and Monocular Depth Estimation |
Steven Landgraf et.al. |
2402.10580 |
null |
2024-02-15 |
Is Continual Learning Ready for Real-world Challenges? |
Theodora Kontogianni et.al. |
2402.10130 |
null |
2024-02-15 |
Robust semi-automatic vessel tracing in the human retinal image by an instance segmentation neural network |
Siyi Chen et.al. |
2402.10055 |
null |
2024-02-15 |
MM-Point: Multi-View Information-Enhanced Multi-Modal Self-Supervised 3D Point Cloud Understanding |
Hai-Tao Yu et.al. |
2402.10002 |
link |
2024-02-14 |
Automated Plaque Detection and Agatston Score Estimation on Non-Contrast CT Scans: A Multicenter Study |
Andrew M. Nguyen et.al. |
2402.09569 |
null |
2024-02-14 |
Reducing Texture Bias of Deep Neural Networks via Edge Enhancing Diffusion |
Edgar Heinert et.al. |
2402.09530 |
link |
2024-02-13 |
Adaptive Hierarchical Certification for Segmentation using Randomized Smoothing |
Alaa Anani et.al. |
2402.08400 |
link |
2024-02-13 |
Improving Image Coding for Machines through Optimizing Encoder via Auxiliary Loss |
Kei Iino et.al. |
2402.08267 |
null |
2024-02-12 |
Semantic segmentation for recognition of epileptiform patterns recorded via Microelectrode Arrays in vitro |
Gabriel Galeote-Checa et.al. |
2402.08099 |
null |
2024-02-11 |
Data Quality Aware Approaches for Addressing Model Drift of Semantic Segmentation Models |
Samiha Mirza et.al. |
2402.07258 |
null |
2024-02-09 |
More than the Sum of Its Parts: Ensembling Backbone Networks for Few-Shot Segmentation |
Nico Catalano et.al. |
2402.06581 |
null |
2024-02-09 |
Hybridnet for depth estimation and semantic segmentation |
Dalila Sánchez-Escobedo et.al. |
2402.06539 |
null |
2024-02-09 |
Classifying point clouds at the facade-level using geometric features and deep learning networks |
Yue Tan et.al. |
2402.06506 |
link |
2024-02-09 |
ControlUDA: Controllable Diffusion-assisted Unsupervised Domain Adaptation for Cross-Weather Semantic Segmentation |
Fengyi Shen et.al. |
2402.06446 |
null |
2024-02-08 |
Privacy-Preserving Synthetic Continual Semantic Segmentation for Robotic Surgery |
Mengya Xu et.al. |
2402.05860 |
link |
2024-02-08 |
On the Effect of Image Resolution on Semantic Segmentation |
Ritambhara Singh et.al. |
2402.05398 |
null |
2024-02-07 |
Multi-Scale Semantic Segmentation with Modified MBConv Blocks |
Xi Chen et.al. |
2402.04618 |
null |
2024-02-06 |
Energy-based Domain-Adaptive Segmentation with Depth Guidance |
Jinjing Zhu et.al. |
2402.03795 |
null |
2024-02-05 |
SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAM |
Mingrui Li et.al. |
2402.03246 |
link |
2024-02-05 |
RRWNet: Recursive Refinement Network for Effective Retinal Artery/Vein Segmentation and Classification |
José Morano et.al. |
2402.03166 |
link |
2024-02-05 |
Unsupervised semantic segmentation of high-resolution UAV imagery for road scene parsing |
Zihan Ma et.al. |
2402.02985 |
link |
2024-02-04 |
M $^3$ Face: A Unified Multi-Modal Multilingual Framework for Human Face Generation and Editing |
Mohammadreza Mofayezi et.al. |
2402.02369 |
null |
2024-02-04 |
Exploring Intrinsic Properties of Medical Images for Self-Supervised Binary Semantic Segmentation |
Pranav Singh et.al. |
2402.02367 |
null |
2024-02-04 |
Region-Based Representations Revisited |
Michal Shlapentokh-Rothman et.al. |
2402.02352 |
link |
2024-02-03 |
Multi-Level Feature Aggregation and Recursive Alignment Network for Real-Time Semantic Segmentation |
Yanhua Zhang et.al. |
2402.02286 |
link |
2024-02-03 |
Evaluating the Robustness of Off-Road Autonomous Driving Segmentation against Adversarial Attacks: A Dataset-Centric analysis |
Pankaj Deoli et.al. |
2402.02154 |
link |
2024-02-03 |
Decomposition-based and Interference Perception for Infrared and Visible Image Fusion in Complex Scenes |
Xilai Li et.al. |
2402.02096 |
null |
2024-02-03 |
MLIP: Enhancing Medical Visual Representation with Divergence Encoder and Knowledge-guided Contrastive Learning |
Zhe Li et.al. |
2402.02045 |
null |
2024-02-02 |
Convolution kernel adaptation to calibrated fisheye |
Bruno Berenguel-Baeta et.al. |
2402.01456 |
link |
2024-02-02 |
Delving into Decision-based Black-box Attacks on Semantic Segmentation |
Zhaoyu Chen et.al. |
2402.01220 |
null |
2024-02-02 |
Scale Equalization for Multi-Level Feature Fusion |
Bum Jun Kim et.al. |
2402.01149 |
link |
2024-02-06 |
We're Not Using Videos Effectively: An Updated Domain Adaptive Video Segmentation Baseline |
Simar Kareer et.al. |
2402.00868 |
link |
2024-02-01 |
Automatic Segmentation of the Spinal Cord Nerve Rootlets |
Jan Valosek et.al. |
2402.00724 |
link |
2024-02-01 |
A Framework for Building Point Cloud Cleaning, Plane Detection and Semantic Segmentation |
Ilyass Abouelaziz et.al. |
2402.00692 |
null |
2024-01-31 |
Convolution Meets LoRA: Parameter Efficient Finetuning for Segment Anything Model |
Zihan Zhong et.al. |
2401.17868 |
link |
2024-01-31 |
Leveraging Swin Transformer for Local-to-Global Weakly Supervised Semantic Segmentation |
Rozhan Ahmadi et.al. |
2401.17828 |
link |
2024-02-01 |
Tiered approach for rapid damage characterisation of infrastructure enabled by remote sensing and deep learning technologies |
Nadiia Kopiika et.al. |
2401.17759 |
null |
2024-01-31 |
Towards Image Semantics and Syntax Sequence Learning |
Chun Tao et.al. |
2401.17515 |
null |
2024-01-30 |
Evaluation of Out-of-Distribution Detection Performance on Autonomous Driving Datasets |
Jens Henriksson et.al. |
2401.17013 |
null |
2024-01-30 |
CAFCT: Contextual and Attentional Feature Fusions of Convolutional Neural Networks and Transformer for Liver Tumor Segmentation |
Ming Kang et.al. |
2401.16886 |
null |
2024-01-29 |
Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors |
Shiyin Dong et.al. |
2401.16459 |
null |
2024-01-28 |
SERNet-Former: Semantic Segmentation by Efficient Residual Network with Attention-Boosting Gates and Attention-Fusion Networks |
Serdar Erisen et.al. |
2401.15741 |
link |
2024-01-28 |
UP-CrackNet: Unsupervised Pixel-Wise Road Crack Detection via Adversarial Image Restoration |
Nachuan Ma et.al. |
2401.15647 |
null |
2024-01-27 |
Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes |
Diandian Guo et.al. |
2401.15261 |
link |
2024-01-26 |
Biological Valuation Map of Flanders: A Sentinel-2 Imagery Analysis |
Mingshi Li et.al. |
2401.15223 |
null |
2024-01-26 |
Kitchen Food Waste Image Segmentation and Classification for Compost Nutrients Estimation |
Raiyan Rahman et.al. |
2401.15175 |
null |
2024-01-26 |
SSR: SAM is a Strong Regularizer for domain adaptive semantic segmentation |
Yanqi Ge et.al. |
2401.14686 |
null |
2024-01-25 |
CloudTracks: A Dataset for Localizing Ship Tracks in Satellite Images of Clouds |
Muhammad Ahmed Chaudhry et.al. |
2401.14486 |
null |
2024-01-25 |
Unlocking Past Information: Temporal Embeddings in Cooperative Bird's Eye View Prediction |
Dominik Rößle et.al. |
2401.14325 |
null |
2024-01-24 |
Segment Any Cell: A SAM-based Auto-prompting Fine-tuning Framework for Nuclei Segmentation |
Saiyang Na et.al. |
2401.13220 |
null |
2024-01-24 |
Boundary and Relation Distillation for Semantic Segmentation |
Dong Zhang et.al. |
2401.13174 |
null |
2024-01-23 |
DatUS^2: Data-driven Unsupervised Semantic Segmentation with Pre-trained Self-supervised Vision Transformer |
Sonal Kumar et.al. |
2401.12820 |
link |
2024-01-23 |
Self-Supervised Vision Transformers Are Efficient Segmentation Learners for Imperfect Labels |
Seungho Lee et.al. |
2401.12535 |
null |
2024-01-23 |
Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural Calibration |
Yifan Zhang et.al. |
2401.12452 |
link |
2024-01-22 |
Scaling Up Quantization-Aware Neural Architecture Search for Efficient Deep Learning on the Edge |
Yao Lu et.al. |
2401.12350 |
null |
2024-01-22 |
Exploring Simple Open-Vocabulary Semantic Segmentation |
Zihang Lai et.al. |
2401.12217 |
link |
2024-01-22 |
Out-of-Distribution Detection & Applications With Ablated Learned Temperature Energy |
Will LeVine et.al. |
2401.12129 |
link |
2024-01-22 |
HomeRobot Open Vocabulary Mobile Manipulation Challenge 2023 Participant Report (Team KuzHum) |
Volodymyr Kuzma et.al. |
2401.12048 |
null |
2024-01-22 |
SemPLeS: Semantic Prompt Learning for Weakly-Supervised Semantic Segmentation |
Ci-Siang Lin et.al. |
2401.11791 |
null |
2024-01-22 |
EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models |
Koichi Namekata et.al. |
2401.11739 |
null |
2024-01-22 |
MetaSeg: Content-Aware Meta-Net for Omni-Supervised Semantic Segmentation |
Shenwang Jiang et.al. |
2401.11738 |
null |
2024-01-22 |
SFC: Shared Feature Calibration in Weakly Supervised Semantic Segmentation |
Xinqiao Zhao et.al. |
2401.11719 |
link |
2024-01-21 |
A Survey on African Computer Vision Datasets, Topics and Researchers |
Abdul-Hakeem Omotayo et.al. |
2401.11617 |
link |
2024-01-21 |
Embedded Hyperspectral Band Selection with Adaptive Optimization for Image Semantic Segmentation |
Yaniv Zimmer et.al. |
2401.11420 |
null |
2024-01-21 |
S $^3$ M-Net: Joint Learning of Semantic Segmentation and Stereo Matching for Autonomous Driving |
Zhiyuan Wu et.al. |
2401.11414 |
null |
2024-01-21 |
ANNA: A Deep Learning Based Dataset in Heterogeneous Traffic for Autonomous Vehicles |
Mahedi Kamal et.al. |
2401.11358 |
link |
2024-01-20 |
Weakly-Supervised Semantic Segmentation of Circular-Scan, Synthetic-Aperture-Sonar Imagery |
Isaac J. Sledge et.al. |
2401.11313 |
null |
2024-01-20 |
A Novel Benchmark for Few-Shot Semantic Segmentation in the Era of Foundation Models |
Reda Bensaid et.al. |
2401.11311 |
link |
2024-01-20 |
Spatial Structure Constraints for Weakly Supervised Semantic Segmentation |
Tao Chen et.al. |
2401.11122 |
link |
2024-01-19 |
RAD-DINO: Exploring Scalable Medical Image Encoders Beyond Text Supervision |
Fernando Pérez-García et.al. |
2401.10815 |
null |
2024-01-19 |
Exploring Color Invariance through Image-Level Ensemble Learning |
Yunpeng Gong et.al. |
2401.10512 |
link |
2024-01-18 |
RAP-SAM: Towards Real-Time All-Purpose Segment Anything |
Shilin Xu et.al. |
2401.10228 |
link |
2024-01-18 |
Ventricular Segmentation: A Brief Comparison of U-Net Derivatives |
Ketan Suhaas Saichandran et.al. |
2401.09980 |
null |
2024-01-18 |
XAI-Enhanced Semantic Segmentation Models for Visual Quality Inspection |
Tobias Clement et.al. |
2401.09900 |
null |
2024-01-18 |
Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation |
Songhe Deng et.al. |
2401.09883 |
link |
2024-01-18 |
Boosting Few-Shot Semantic Segmentation Via Segment Anything Model |
Chen-Bin Feng et.al. |
2401.09826 |
null |
2024-01-18 |
P2Seg: Pointly-supervised Segmentation via Mutual Distillation |
Zipeng Wang et.al. |
2401.09709 |
null |
2024-01-17 |
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model |
Lianghui Zhu et.al. |
2401.09417 |
link |
2024-01-17 |
POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images |
Antonin Vobecky et.al. |
2401.09413 |
null |
2024-01-17 |
PixelDINO: Semi-Supervised Semantic Segmentation for Detecting Permafrost Disturbances |
Konrad Heidler et.al. |
2401.09271 |
link |
2024-01-17 |
Uncertainty estimates for semantic segmentation: providing enhanced reliability for automated motor claims handling |
Jan Küchler et.al. |
2401.09245 |
null |
2024-01-17 |
Learning to detect cloud and snow in remote sensing images from noisy labels |
Zili Liu et.al. |
2401.08932 |
null |
2024-01-16 |
Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive |
Yumeng Li et.al. |
2401.08815 |
link |
2024-01-16 |
ValUES: A Framework for Systematic Validation of Uncertainty Estimation in Semantic Segmentation |
Kim-Celine Kahl et.al. |
2401.08501 |
link |
2024-01-16 |
Faster ISNet for Background Bias Mitigation on Deep Neural Networks |
Pedro R. A. S. Bassi et.al. |
2401.08409 |
link |
2024-01-17 |
Generative Denoise Distillation: Simple Stochastic Noises Induce Efficient Knowledge Transfer for Dense Prediction |
Zhaoge Liu et.al. |
2401.08332 |
link |
2024-01-16 |
End-to-End Optimized Image Compression with the Frequency-Oriented Transform |
Yuefeng Zhang et.al. |
2401.08194 |
null |
2024-01-16 |
S3M: Semantic Segmentation Sparse Mapping for UAVs with RGB-D Camera |
Thanh Nguyen Canh et.al. |
2401.08134 |
null |
2024-01-16 |
UV-SAM: Adapting Segment Anything Model for Urban Village Identification |
Xin Zhang et.al. |
2401.08083 |
link |
2024-01-15 |
Semantic Scene Segmentation for Robotics |
Juana Valeria Hurtado et.al. |
2401.07589 |
null |
2024-01-15 |
Compositional Oil Spill Detection Based on Object Detector and Adapted Segment Anything Model from SAR Images |
Wenhui Wu et.al. |
2401.07502 |
null |
2024-01-15 |
Semantic Segmentation in Multiple Adverse Weather Conditions with Domain Knowledge Retention |
Xin Yang et.al. |
2401.07459 |
null |
2024-01-13 |
Weak Labeling for Cropland Mapping in Africa |
Gilles Quentin Hacheme et.al. |
2401.07014 |
null |
2024-01-12 |
Seeing the roads through the trees: A benchmark for modeling spatial dependencies with aerial imagery |
Caleb Robinson et.al. |
2401.06762 |
link |
2024-01-12 |
UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding |
Bowen Shi et.al. |
2401.06397 |
link |
2024-01-11 |
Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications |
Yuwen Xiong et.al. |
2401.06197 |
link |
2024-01-09 |
Generic Knowledge Boosted Pre-training For Remote Sensing Images |
Ziyue Huang et.al. |
2401.04614 |
link |
2024-01-08 |
Fully Attentional Networks with Self-emerging Token Labeling |
Bingyin Zhao et.al. |
2401.03844 |
link |
2024-01-07 |
SeTformer is What You Need for Vision and Language |
Pourya Shamsolmoali et.al. |
2401.03540 |
null |
2024-01-06 |
Multi-View 3D Instance Segmentation of Structural Anomalies for Enhanced Structural Inspection of Concrete Bridges |
Christian Benz et.al. |
2401.03298 |
link |
2024-01-02 |
Unsupervised Federated Domain Adaptation for Segmentation of MRI Images |
Navapat Nananukul et.al. |
2401.02941 |
null |
2024-01-04 |
ClassWise-SAM-Adapter: Parameter Efficient Fine-tuning Adapts Segment Anything to SAR Domain for Semantic Segmentation |
Xinyang Pu et.al. |
2401.02326 |
link |
2024-01-03 |
Towards Robust Semantic Segmentation against Patch-based Attack via Attention Refinement |
Zheng Yuan et.al. |
2401.01750 |
null |
2024-01-03 |
S3Net: Innovating Stereo Matching and Semantic Segmentation with a Single-Branch Semantic Stereo Network in Satellite Epipolar Imagery |
Qingyuan Yang et.al. |
2401.01643 |
link |
2024-01-03 |
Context-Aware Interaction Network for RGB-T Semantic Segmentation |
Ying Lv et.al. |
2401.01624 |
link |
2024-01-02 |
Off-Road LiDAR Intensity Based Semantic Segmentation |
Kasi Viswanath et.al. |
2401.01439 |
link |
2024-01-02 |
Integrating Edges into U-Net Models with Explainable Activation Maps for Brain Tumor Segmentation using MR Images |
Subin Sahayam et.al. |
2401.01303 |
null |
2024-01-02 |
Physics-informed Generalizable Wireless Channel Modeling with Segmentation and Deep Learning: Fundamentals, Methodologies, and Challenges |
Ethan Zhu et.al. |
2401.01288 |
null |
2024-01-02 |
GBSS:a global building semantic segmentation dataset for large-scale remote sensing building extraction |
Yuping Hu et.al. |
2401.01178 |
null |
2024-01-02 |
DTBS: Dual-Teacher Bi-directional Self-training for Domain Adaptation in Nighttime Semantic Segmentation |
Fanding Huang et.al. |
2401.01066 |
link |
2024-01-02 |
Online Continual Domain Adaptation for Semantic Image Segmentation Using Internal Representations |
Serban Stan et.al. |
2401.01035 |
link |
2023-12-31 |
Analyzing Local Representations of Self-supervised Vision Transformers |
Ani Vanyan et.al. |
2401.00463 |
null |
2023-12-28 |
Learning Vision from Models Rivals Learning Vision from Data |
Yonglong Tian et.al. |
2312.17742 |
link |
2024-01-04 |
HEAP: Unsupervised Object Discovery and Localization with Contrastive Grouping |
Xin Zhang et.al. |
2312.17492 |
null |
2023-12-28 |
Unsupervised Universal Image Segmentation |
Dantong Niu et.al. |
2312.17243 |
link |
2024-01-03 |
An Improved Baseline for Reasoning Segmentation with Large Language Model |
Senqiao Yang et.al. |
2312.17240 |
null |
2023-12-28 |
SCTNet: Single-Branch CNN with Transformer Semantic Information for Real-Time Segmentation |
Zhengze Xu et.al. |
2312.17071 |
link |
2023-12-28 |
EvPlug: Learn a Plug-and-Play Module for Event and Image Fusion |
Jianping Jiang et.al. |
2312.16933 |
null |
2023-12-29 |
Multi-modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation |
Xiawei Li et.al. |
2312.16578 |
link |
2023-12-27 |
ConstScene: Dataset and Model for Advancing Robust Semantic Segmentation in Construction Environments |
Maghsood Salimi et.al. |
2312.16516 |
link |
2023-12-26 |
VirtualPainting: Addressing Sparsity with Virtual Points and Distance-Aware Data Augmentation for 3D Object Detection |
Sudip Dhakal et.al. |
2312.16141 |
null |
2023-12-26 |
LangSplat: 3D Language Gaussian Splatting |
Minghan Qin et.al. |
2312.16084 |
link |
2023-12-23 |
WildScenes: A Benchmark for 2D and 3D Semantic Segmentation in Large-scale Natural Environments |
Kavisha Vidanapathirana et.al. |
2312.15364 |
link |
2023-12-23 |
Make Me a BNN: A Simple Strategy for Estimating Bayesian Uncertainty from Pre-trained Models |
Gianni Franchi et.al. |
2312.15297 |
null |
2023-12-22 |
Harnessing Diffusion Models for Visual Perception with Meta Prompts |
Qiang Wan et.al. |
2312.14733 |
link |
2023-12-22 |
Variance-insensitive and Target-preserving Mask Refinement for Interactive Image Segmentation |
Chaowei Fang et.al. |
2312.14387 |
null |
2023-12-26 |
TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification |
Qinying Liu et.al. |
2312.14149 |
link |
2023-12-21 |
Dual Attention U-Net with Feature Infusion: Pushing the Boundaries of Multiclass Defect Segmentation |
Rasha Alshawi et.al. |
2312.14053 |
link |
2023-12-21 |
Few Shot Part Segmentation Reveals Compositional Logic for Industrial Anomaly Detection |
Soopil Kim et.al. |
2312.13783 |
link |
2023-12-22 |
Weakly Supervised Semantic Segmentation for Driving Scenes |
Dongseob Kim et.al. |
2312.13646 |
link |
2023-12-20 |
DVIS++: Improved Decoupled Framework for Universal Video Segmentation |
Tao Zhang et.al. |
2312.13305 |
link |
2023-12-20 |
BEVSeg2TP: Surround View Camera Bird's-Eye-View Based Joint Vehicle Segmentation and Ego Vehicle Trajectory Prediction |
Sushil Sharma et.al. |
2312.13081 |
link |
2023-12-20 |
Multi-task Learning To Improve Semantic Segmentation Of CBCT Scans Using Image Reconstruction |
Maximilian Ernst Tschuchnig et.al. |
2312.12990 |
null |
2023-12-20 |
TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP Without Training |
Yuqi Lin et.al. |
2312.12828 |
link |
2023-12-20 |
MetaSegNet: Metadata-collaborative Vision-Language Representation Learning for Semantic Segmentation of Remote Sensing Images |
Libo Wang et.al. |
2312.12735 |
null |
2023-12-20 |
Segment Anything Model Meets Image Harmonization |
Haoxing Chen et.al. |
2312.12729 |
null |
2023-12-19 |
DDOS: The Drone Depth and Obstacle Segmentation Dataset |
Benedikt Kolbeinsson et.al. |
2312.12494 |
null |
2023-12-19 |
SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process |
Mengyu Wang et.al. |
2312.12425 |
link |
2023-12-19 |
CLIP-DINOiser: Teaching CLIP a few DINO tricks |
Monika Wysoczańska et.al. |
2312.12359 |
link |
2023-12-19 |
All for One, and One for All: UrbanSyn Dataset, the third Musketeer of Synthetic Driving Scenes |
Jose L. Gómez et.al. |
2312.12176 |
null |
2023-12-18 |
Detecting the edges of galaxies with deep learning |
Jesús Fernández et.al. |
2312.11654 |
null |
2023-12-18 |
Language-Assisted 3D Scene Understanding |
Yanmin Wu et.al. |
2312.11451 |
link |
2023-12-18 |
Research on Multilingual Natural Scene Text Detection Algorithm |
Tao Wang et.al. |
2312.11153 |
null |
2023-12-18 |
SeeBel: Seeing is Believing |
Sourajit Saha et.al. |
2312.10933 |
link |
2023-12-17 |
Artificial intelligence optical hardware empowers high-resolution hyperspectral video understanding at 1.2 Tb/s |
Maksim Makarenko et.al. |
2312.10639 |
null |
2023-12-16 |
Transformers in Unsupervised Structure-from-Motion |
Hemang Chawla et.al. |
2312.10529 |
link |
2023-12-16 |
Semantic-Aware Autoregressive Image Modeling for Visual Representation Learning |
Kaiyou Song et.al. |
2312.10457 |
link |
2023-12-15 |
Forging Tokens for Improved Storage-efficient Training |
Minhyun Lee et.al. |
2312.10105 |
link |
2023-12-15 |
Collaborating Foundation models for Domain Generalized Semantic Segmentation |
Yasser Benigmim et.al. |
2312.09788 |
link |
2023-12-15 |
Density Matters: Improved Core-set for Active Domain Adaptive Segmentation |
Shizhan Liu et.al. |
2312.09595 |
null |
2023-12-15 |
AEGIS-Net: Attention-guided Multi-Level Feature Aggregation for Indoor Place Recognition |
Yuhang Ming et.al. |
2312.09538 |
link |
2023-12-15 |
WeatherProof: A Paired-Dataset Approach to Semantic Segmentation in Adverse Weather |
Blake Gella et.al. |
2312.09534 |
null |
2023-12-14 |
LIME: Localized Image Editing via Attention Regularization in Diffusion Models |
Enis Simsar et.al. |
2312.09256 |
null |
2023-12-14 |
Reliability in Semantic Segmentation: Can We Use Synthetic Data? |
Thibaut Loiseau et.al. |
2312.09231 |
link |
2023-12-18 |
Progressive Feature Self-reinforcement for Weakly Supervised Semantic Segmentation |
Jingxuan He et.al. |
2312.08916 |
link |
2023-12-14 |
Agent Attention: On the Integration of Softmax and Linear Attention |
Dongchen Han et.al. |
2312.08874 |
link |
2023-12-14 |
Achelous++: Power-Oriented Water-Surface Panoptic Perception Framework on Edge Devices based on Vision-Radar Fusion and Pruning of Heterogeneous Modalities |
Runwei Guan et.al. |
2312.08851 |
link |
2023-12-14 |
Offshore Wind Plant Instance Segmentation Using Sentinel-1 Time Series, GIS, and Semantic Segmentation Models |
Osmar Luiz Ferreira de Carvalho et.al. |
2312.08773 |
null |
2023-12-14 |
Segment Beyond View: Handling Partially Missing Modality for Audio-Visual Semantic Segmentation |
Renjie Wu et.al. |
2312.08673 |
null |
2023-12-14 |
Semi-supervised Semantic Segmentation Meets Masked Modeling:Fine-grained Locality Learning Matters in Consistency Regularization |
Wentao Pan et.al. |
2312.08631 |
null |
2023-12-11 |
DFGET: Displacement-Field Assisted Graph Energy Transmitter for Gland Instance Segmentation |
Caiqing Jian et.al. |
2312.07584 |
null |
2023-12-12 |
X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-modal Knowledge Transfer |
Linglin Jing et.al. |
2312.07378 |
link |
2023-12-12 |
Adversarial Semi-Supervised Domain Adaptation for Semantic Segmentation: A New Role for Labeled Target Samples |
Marwa Kechaou et.al. |
2312.07370 |
null |
2023-12-12 |
Expand-and-Quantize: Unsupervised Semantic Segmentation Using High-Dimensional Space and Product Quantization |
Jiyoung Kim et.al. |
2312.07342 |
null |
2023-12-12 |
Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation |
Yuanbin Wang et.al. |
2312.07221 |
null |
2023-12-12 |
MCFNet: Multi-scale Covariance Feature Fusion Network for Real-time Semantic Segmentation |
Xiaojie Fang et.al. |
2312.07207 |
null |
2023-12-11 |
Densify Your Labels: Unsupervised Clustering with Bipartite Matching for Weakly Supervised Point Cloud Segmentation |
Shaobo Xia et.al. |
2312.06799 |
null |
2023-12-11 |
Deciphering 'What' and 'Where' Visual Pathways from Spectral Clustering of Layer-Distributed Neural Representations |
Xiao Zhang et.al. |
2312.06716 |
link |
2023-12-10 |
AM-RADIO: Agglomerative Model -- Reduce All Domains Into One |
Mike Ranzinger et.al. |
2312.06709 |
link |
2023-12-11 |
Relevant Intrinsic Feature Enhancement Network for Few-Shot Semantic Segmentation |
Xiaoyi Bao et.al. |
2312.06474 |
null |
2023-12-11 |
Semantic Connectivity-Driven Pseudo-labeling for Cross-domain Segmentation |
Dong Zhao et.al. |
2312.06331 |
link |
2023-12-11 |
U-MixFormer: UNet-like Transformer with Mix-Attention for Efficient Semantic Segmentation |
Seul-Ki Yeom et.al. |
2312.06272 |
link |
2023-12-11 |
Adaptive Annotation Distribution for Weakly Supervised Point Cloud Semantic Segmentation |
Zhiyi Pan et.al. |
2312.06259 |
link |
2023-12-10 |
Deep-Learning-Assisted Analysis of Cataract Surgery Videos |
Negin Ghamsarian et.al. |
2312.05900 |
null |
2023-12-09 |
CSL: Class-Agnostic Structure-Constrained Learning for Segmentation Including the Unseen |
Hao Zhang et.al. |
2312.05538 |
null |
2023-12-08 |
Lyrics: Boosting Fine-grained Language-Vision Alignment and Comprehension via Semantic-aware Visual Objects |
Junyu Lu et.al. |
2312.05278 |
null |
2023-12-08 |
Datasets, Models, and Algorithms for Multi-Sensor, Multi-agent Autonomy Using AVstack |
R. Spencer Hallyburton et.al. |
2312.04970 |
null |
2023-12-07 |
Point2CAD: Reverse Engineering CAD Models from 3D Point Clouds |
Yujia Liu et.al. |
2312.04962 |
null |
2023-12-08 |
Segmentation of Kidney Tumors on Non-Contrast CT Images using Protuberance Detection Network |
Taro Hatsutani et.al. |
2312.04796 |
null |
2023-12-07 |
gcDLSeg: Integrating Graph-cut into Deep Learning for Binary Semantic Segmentation |
Hui Xie et.al. |
2312.04713 |
null |
2023-12-07 |
HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image |
Tong Wu et.al. |
2312.04543 |
null |
2023-12-07 |
Self-Guided Open-Vocabulary Semantic Segmentation |
Osman Ülger et.al. |
2312.04539 |
link |
2023-12-07 |
Semi-Supervised Active Learning for Semantic Segmentation in Unknown Environments Using Informative Path Planning |
Julius Rückin et.al. |
2312.04402 |
link |
2023-12-07 |
Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation |
Zhixiang Wei et.al. |
2312.04265 |
link |
2023-12-07 |
Fine-tune vision foundation model for crack segmentation in civil infrastructures |
Kang Ge et.al. |
2312.04233 |
null |
2023-12-07 |
Augmentation-Free Dense Contrastive Knowledge Distillation for Efficient Semantic Segmentation |
Jiawei Fan et.al. |
2312.04168 |
link |
2023-12-07 |
Residual Graph Convolutional Network for Bird's-Eye-View Semantic Segmentation |
Qiuxiao Chen et.al. |
2312.04044 |
null |
2023-12-06 |
Novel class discovery meets foundation models for 3D semantic segmentation |
Luigi Riz et.al. |
2312.03782 |
null |
2023-12-06 |
Foundation Model Assisted Weakly Supervised Semantic Segmentation |
Xiaobo Yang et.al. |
2312.03585 |
link |
2023-12-06 |
ShareCMP: Polarization-Aware RGB-P Semantic Segmentation |
Zhuoyan Liu et.al. |
2312.03430 |
link |
2023-12-06 |
DeepPyramid+: Medical Image Segmentation using Pyramid View Fusion and Deformable Pyramid Reception |
Negin Ghamsarian et.al. |
2312.03409 |
null |
2023-12-06 |
Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields |
Shijie Zhou et.al. |
2312.03203 |
link |
2023-12-05 |
AI-SAM: Automatic and Interactive Segment Anything Model |
Yimu Pan et.al. |
2312.03119 |
link |
2023-12-05 |
DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control |
Yuru Jia et.al. |
2312.03048 |
null |
2023-12-05 |
6D Assembly Pose Estimation by Point Cloud Registration for Robot Manipulation |
K. Samarawickrama et.al. |
2312.02593 |
link |