Skip to content

WuxinrongY/cv-arxiv-daily

 
 

Repository files navigation

Updated on 2025.08.17

Table of Contents
  1. Object Detection
  2. Small Object Detection
  3. Image Matching
  4. Visual Localization
  5. Homogeous Image Transformation
  6. Homogeous Image

Object Detection

Publish Date Title Authors PDF Code
2025-07-23 Perspective-Invariant 3D Object Detection Ao Liang et.al. 2507.17665 null
2025-07-23 Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning Xinyao Liu et.al. 2507.17539 null
2025-07-23 Illicit object detection in X-ray imaging using deep learning techniques: A comparative evaluation Jorgen Cani et.al. 2507.17508 null
2025-07-23 Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Object Detection Yehao Lu et.al. 2507.17436 null
2025-07-23 SFUOD: Source-Free Unknown Object Detection Keon-Hee Park et.al. 2507.17373 null
2025-07-23 Optimizing Delivery Logistics: Enhancing Speed and Safety with Drone Technology Maharshi Shastri et.al. 2507.17253 null
2025-07-23 A Low-Cost Machine Learning Approach for Timber Diameter Estimation Fatemeh Hasanzadeh Fard et.al. 2507.17219 null
2025-07-22 Few-Shot Learning in Video and 3D Object Detection: A Survey Md Meftahul Ferdaus et.al. 2507.17079 null
2025-07-22 Transformer Based Building Boundary Reconstruction using Attraction Field Maps Muhammad Kamran et.al. 2507.17038 null
2025-07-22 Task-Specific Zero-shot Quantization-Aware Training for Object Detection Changhao Li et.al. 2507.16782 null
2025-07-22 Screen2AX: Vision-Based Approach for Automatic macOS Accessibility Generation Viktor Muryn et.al. 2507.16704 null
2025-07-22 Benchmarking pig detection and tracking under diverse and challenging conditions Jonathan Henrich et.al. 2507.16639 null
2025-07-22 A2Mamba: Attention-augmented State Space Models for Visual Recognition Meng Lou et.al. 2507.16624 null
2025-07-22 PlantSAM: An Object Detection-Driven Segmentation Pipeline for Herbarium Specimens Youcef Sklab et.al. 2507.16506 null
2025-07-22 Towards Railway Domain Adaptation for LiDAR-based 3D Detection: Road-to-Rail and Sim-to-Real via SynDRA-BBox Xavier Diaz et.al. 2507.16413 null
2025-07-22 MAN++: Scaling Momentum Auxiliary Network for Supervised Local Learning in Vision Tasks Junhao Su et.al. 2507.16279 null
2025-07-22 Edge-case Synthesis for Fisheye Object Detection: A Data-centric Perspective Seunghyeon Kim et.al. 2507.16254 null
2025-07-22 LDRFusion: A LiDAR-Dominant multimodal refinement framework for 3D object detection Jijun Wang et.al. 2507.16224 null
2025-07-22 Design and Implementation of a Lightweight Object Detection System for Resource-Constrained Edge Environments Jiyue Jiang et.al. 2507.16155 null
2025-07-21 Experimenting active and sequential learning in a medieval music manuscript Sachin Sharma et.al. 2507.15633 null
2025-07-21 Few-Shot Object Detection via Spatial-Channel State Space Model Zhimeng Xin et.al. 2507.15308 null
2025-07-20 Event-based Graph Representation with Spatial and Motion Vectors for Asynchronous Object Detection Aayush Atul Verma et.al. 2507.15150 null
2025-07-20 BleedOrigin: Dynamic Bleeding Source Localization in Endoscopic Submucosal Dissection via Dual-Stage Detection and Tracking Mengya Xu et.al. 2507.15094 null
2025-07-20 InsightX Agent: An LMM-based Agentic Framework with Integrated Tools for Reliable X-ray NDT Analysis Jiale Liu et.al. 2507.14899 null
2025-07-20 An Uncertainty-aware DETR Enhancement Framework for Object Detection Xingshu Chen et.al. 2507.14855 null
2025-07-19 Multispectral State-Space Feature Fusion: Bridging Shared and Cross-Parametric Interactions for Object Detection Jifeng Shen et.al. 2507.14643 null
2025-07-18 C-DOG: Training-Free Multi-View Multi-Object Association in Dense Scenes Without Visual Feature via Connected δ-Overlap Graphs Yung-Hong Sun et.al. 2507.14095 null
2025-07-18 Enhancing LiDAR Point Features with Foundation Model Priors for 3D Object Detection Yujian Mo et.al. 2507.13899 null
2025-07-18 Moving Object Detection from Moving Camera Using Focus of Expansion Likelihood and Segmentation Masahiro Ogawa et.al. 2507.13628 null
2025-07-17 NSF-DOE Vera C. Rubin Observatory Observations of Interstellar Comet 3I/ATLAS (C/2025 N1) Colin Orion Chandler et.al. 2507.13409 null
2025-07-17 A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains Antonio Finocchiaro et.al. 2507.13326 null
2025-07-17 RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images Xiaozheng Jiang et.al. 2507.13120 null
2025-07-17 Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection Riku Inoue et.al. 2507.13085 null
2025-07-17 Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis Saswat Priyadarshi Nayak et.al. 2507.13073 null
2025-07-17 SOD-YOLO: Enhancing YOLO-Based Detection of Small Objects in UAV Imagery Peijun Wang et.al. 2507.12727 null
2025-07-16 Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios Van-Hoang-Anh Phan et.al. 2507.12449 null
2025-07-16 InterpIoU: Rethinking Bounding Box Regression with Interpolation-Based IoU Optimization Haoyuan Liu et.al. 2507.12420 null
2025-07-16 AutoVDC: Automated Vision Data Cleaning Using Vision-Language Models Santosh Vasa et.al. 2507.12414 null
2025-07-18 OD-VIRAT: A Large-Scale Benchmark for Object Detection in Realistic Surveillance Environments Hayat Ullah et.al. 2507.12396 null
2025-07-16 Improving Lightweight Weed Detection via Knowledge Distillation Ahmet Oğuz Saltık et.al. 2507.12344 null
2025-07-16 SS-DC: Spatial-Spectral Decoupling and Coupling Across Visible-Infrared Gap for Domain Adaptive Object Detection Xiwei Zhang et.al. 2507.12017 null
2025-07-16 Frequency-Dynamic Attention Modulation for Dense Prediction Linwei Chen et.al. 2507.12006 null
2025-07-15 Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping Yujie Zhang et.al. 2507.11279 null
2025-07-15 Using Continual Learning for Real-Time Detection of Vulnerable Road Users in Complex Traffic Scenarios Faryal Aurooj Nasir et.al. 2507.11046 null
2025-07-15 Combining Transformers and CNNs for Efficient Object Detection in High-Resolution Satellite Imagery Nicolas Drapier et.al. 2507.11040 null
2025-07-14 A Lightweight and Robust Framework for Real-Time Colorectal Polyp Detection Using LOF-Based Preprocessing and YOLO-v11n Saadat Behzadi et.al. 2507.10864 null
2025-07-14 LLM-Guided Agentic Object Detection for Open-World Understanding Furkan Mumcu et.al. 2507.10844 null
2025-07-14 Versatile and Generalizable Manipulation via Goal-Conditioned Reinforcement Learning with Grounded Object Detection Huiyi Wang et.al. 2507.10814 null
2025-07-14 Fine-Grained Zero-Shot Object Detection Hongxu Ma et.al. 2507.10358 null
2025-07-14 BlueGlass: A Framework for Composite AI Safety Harshal Nandigramwar et.al. 2507.10106 null
2025-07-14 SRG/ART-XC All-Sky X-ray Survey: Sensitivity Assessment Based on Aperture Photometry N. Y. Tyrin et.al. 2507.10060 null
2025-07-14 3DGAA: Realistic and Robust 3D Gaussian-based Adversarial Attack for Autonomous Driving Yixun Zhang et.al. 2507.09993 null
2025-07-14 Measuring the Impact of Rotation Equivariance on Aerial Object Detection Xiuyu Wu et.al. 2507.09896 null
2025-07-13 MLoRQ: Bridging Low-Rank and Quantization for Transformer Compression Ofir Gordon et.al. 2507.09616 null
2025-07-12 Stereo-based 3D Anomaly Object Detection for Autonomous Driving: A New Dataset and Baseline Shiyi Mu et.al. 2507.09214 null
2025-07-12 On the Fragility of Multimodal Perception to Temporal Misalignment in Autonomous Driving Md Hasan Shahriar et.al. 2507.09095 null
2025-07-11 VISTA: A Visual Analytics Framework to Enhance Foundation Model-Generated Data Labels Xiwei Xuan et.al. 2507.09008 null
2025-07-11 RoundaboutHD: High-Resolution Real-World Urban Environment Benchmark for Multi-Camera Vehicle Tracking Yuqiang Lin et.al. 2507.08729 null
2025-07-11 DatasetAgent: A Novel Multi-Agent System for Auto-Constructing Datasets from Real-World Images Haoran Sun et.al. 2507.08648 null
2025-07-11 OnlineBEV: Recurrent Temporal Fusion in Bird's Eye View Representations for Multi-Camera 3D Perception Junho Koh et.al. 2507.08644 null
2025-07-11 Smelly, dense, and spreaded: The Object Detection for Olfactory References (ODOR) dataset Mathias Zinnen et.al. 2507.08384 null
2025-07-11 Spectroscopic Observations of Four Candidates for Blue Large-Amplitude Pulsators. No BLAPs at High Galactic Latitudes P. Pietrukowicz et.al. 2507.08372 null
2025-07-11 Understanding Driving Risks using Large Language Models: Toward Elderly Driver Assessment Yuki Yoshihara et.al. 2507.08367 null
2025-07-10 An Embedded Real-time Object Alert System for Visually Impaired: A Monocular Depth Estimation based Approach through Computer Vision Jareen Anjom et.al. 2507.08165 null
2025-07-10 Rainbow Artifacts from Electromagnetic Signal Injection Attacks on Image Sensors Youqian Zhang et.al. 2507.07773 null
2025-07-09 Automated Video Segmentation Machine Learning Pipeline Johannes Merz et.al. 2507.07242 null
2025-07-09 Aerial Maritime Vessel Detection and Identification Antonella Barisic Kulas et.al. 2507.07153 null
2025-07-09 DenoiseCP-Net: Efficient Collective Perception in Adverse Weather via Joint LiDAR-Based 3D Object Detection and Denoising Sven Teufel et.al. 2507.06976 null
2025-07-09 A multi-modal dataset for insect biodiversity with imagery and DNA at the trap and individual level Johanna Orsholm et.al. 2507.06972 null
2025-07-09 Dataset and Benchmark for Enhancing Critical Retained Foreign Object Detection Yuli Wang et.al. 2507.06937 null
2025-07-09 Unlocking Thermal Aerial Imaging: Synthetic Enhancement of UAV Datasets Antonella Barisic Kulas et.al. 2507.06797 null
2025-07-09 LOVON: Legged Open-Vocabulary Object Navigator Daojie Peng et.al. 2507.06747 null
2025-07-09 EA: An Event Autoencoder for High-Speed Vision Sensing Riadul Islam et.al. 2507.06459 null
2025-07-08 Hierarchical Multi-Stage Transformer Architecture for Context-Aware Temporal Action Localization Hayat Ullah et.al. 2507.06411 null
2025-07-08 ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge Daghash K. Alqahtani et.al. 2507.06011 null
2025-07-08 R-VLM: Region-Aware Vision Language Model for Precise GUI Grounding Joonhyung Park et.al. 2507.05673 null
2025-07-07 From a Different Star: 3I/ATLAS in the context of the Ōtautahi-Oxford interstellar object population model Matthew J. Hopkins et.al. 2507.05318 null
2025-07-07 Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations Xiang Xu et.al. 2507.05260 null
2025-07-07 LERa: Replanning with Visual Feedback in Instruction Following Svyatoslav Pchelintsev et.al. 2507.05135 null
2025-07-07 CVFusion: Cross-View Fusion of 4D Radar and Camera for 3D Object Detection Hanzhi Zhong et.al. 2507.04587 null
2025-07-06 MambaFusion: Height-Fidelity Dense Global Fusion for Multi-modal 3D Object Detection Hanshi Wang et.al. 2507.04369 null
2025-07-06 DMAT: An End-to-End Framework for Joint Atmospheric Turbulence Mitigation and Object Detection Paul Hill et.al. 2507.04323 null
2025-07-06 ZERO: Multi-modal Prompt-based Visual Grounding Sangbum Choi et.al. 2507.04270 null
2025-07-05 Towards Accurate and Efficient 3D Object Detection for Autonomous Driving: A Mixture of Experts Computing System on Edge Linshen Liu et.al. 2507.04123 null
2025-07-04 Zero Memory Overhead Approach for Protecting Vision Transformer Parameters Fereshteh Baradaran et.al. 2507.03816 null
2025-07-04 2.5D Object Detection for Intelligent Roadside Infrastructure Nikolai Polley et.al. 2507.03564 null
2025-07-04 Enhancing Uncertainty Quantification for Runtime Safety Assurance Using Causal Risk Analysis and Operational Design Domain Radouane Bouchekir et.al. 2507.03515 null
2025-07-03 Partial Weakly-Supervised Oriented Object Detection Mingxin Liu et.al. 2507.02751 null
2025-07-03 Automatic Labelling for Low-Light Pedestrian Detection Dimitrios Bouzoulas et.al. 2507.02513 null
2025-07-03 Weakly-supervised Contrastive Learning with Quantity Prompts for Moving Infrared Small Target Detection Weiwei Duan et.al. 2507.02454 null
2025-07-03 A Late Collaborative Perception Framework for 3D Multi-Object and Multi-Source Association and Fusion Maryem Fadili et.al. 2507.02430 null
2025-07-03 PLOT: Pseudo-Labeling via Video Object Tracking for Scalable Monocular 3D Object Detection Seokyeong Lee et.al. 2507.02393 null
2025-07-03 Two-Steps Neural Networks for an Automated Cerebrovascular Landmark Detection Rafic Nader et.al. 2507.02349 null
2025-07-03 Perception Activator: An intuitive and portable framework for brain cognitive exploration Le Xu et.al. 2507.02311 null
2025-07-03 Understanding Trade offs When Conditioning Synthetic Data Brandon Trabucco et.al. 2507.02217 null
2025-07-02 How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks Rahul Ramachandran et.al. 2507.01955 null
2025-07-02 Learning from Random Subspace Exploration: Generalized Test-Time Augmentation with Self-supervised Distillation Andrei Jelea et.al. 2507.01347 null
2025-07-01 Rapid Salient Object Detection with Difference Convolutional Neural Networks Zhuo Su et.al. 2507.01182 null
2025-07-01 Robust Component Detection for Flexible Manufacturing: A Deep Learning Approach to Tray-Free Object Recognition under Variable Lighting Fatemeh Sadat Daneshmand et.al. 2507.00852 null
2025-07-01 UAVD-Mamba: Deformable Token Fusion Vision Mamba for Multimodal UAV Detection Wei Li et.al. 2507.00849 null
2025-07-01 High-Frequency Semantics and Geometric Priors for End-to-End Detection Transformers in Challenging UAV Imagery Hongxing Peng et.al. 2507.00825 null
2025-07-01 Multi-Modal Graph Convolutional Network with Sinusoidal Encoding for Robust Human Action Segmentation Hao Xing et.al. 2507.00752 null
2025-07-01 UPRE: Zero-Shot Domain Adaptation for Object Detection via Unified Prompt and Representation Enhancement Xiao Zhang et.al. 2507.00721 null
2025-07-01 Rectifying Magnitude Neglect in Linear Attention Qihang Fan et.al. 2507.00698 null
2025-07-01 De-Simplifying Pseudo Labels to Enhancing Domain Adaptive Object Detection Zehua Fu et.al. 2507.00608 null
2025-06-30 Continual Adaptation: Environment-Conditional Parameter Generation for Object Detection in Dynamic Scenarios Deng Li et.al. 2506.24063 null
2025-06-30 Visual Textualization for Image Prompted Object Detection Yongjian Wu et.al. 2506.23785 null
2025-06-30 PBCAT: Patch-based composite adversarial training against physically realizable attacks on object detection Xiao Li et.al. 2506.23581 null
2025-06-30 Event-based Tiny Object Detection: A Benchmark Dataset and Baseline Nuo Chen et.al. 2506.23575 null
2025-06-30 OcRFDet: Object-Centric Radiance Fields for Multi-View 3D Object Detection in Autonomous Driving Mingqian Ji et.al. 2506.23565 null
2025-06-30 From Sight to Insight: Unleashing Eye-Tracking in Weakly Supervised Video Salient Object Detection Qi Qin et.al. 2506.23519 null
2025-06-30 Improve Underwater Object Detection through YOLOv12 Architecture and Physics-informed Augmentation Tinh Nguyen et.al. 2506.23505 null
2025-06-29 Detecting What Matters: A Novel Approach for Out-of-Distribution 3D Object Detection in Autonomous Vehicles Menna Taha et.al. 2506.23426 null
2025-06-29 Layer Decomposition and Morphological Reconstruction for Task-Oriented Infrared Image Enhancement Siyuan Chai et.al. 2506.23353 null
2025-06-29 GeoProg3D: Compositional Visual Reasoning for City-Scale 3D Language Fields Shunsuke Yasuki et.al. 2506.23352 null
2025-06-27 Attention-disentangled Uniform Orthogonal Feature Space Optimization for Few-shot Object Detection Taijin Zhao et.al. 2506.22161 null
2025-06-27 Evaluating Pointing Gestures for Target Selection in Human-Robot Collaboration Noora Sassali et.al. 2506.22116 null
2025-06-27 CERBERUS: Crack Evaluation & Recognition Benchmark for Engineering Reliability & Urban Stability Justin Reinman et.al. 2506.21909 null
2025-06-27 Visual Content Detection in Educational Videos with Transfer Learning and Dataset Enrichment Dipayan Biswas et.al. 2506.21903 null
2025-06-27 Embodied Domain Adaptation for Object Detection Xiangyu Shi et.al. 2506.21860 null
2025-06-26 PhotonSplat: 3D Scene Reconstruction and Colorization from SPAD Sensors Sai Sri Teja et.al. 2506.21680 null
2025-06-26 Towards Reliable Detection of Empty Space: Conditional Marked Point Processes for Object Detection Tobias J. Riedlinger et.al. 2506.21486 null
2025-06-26 TITAN: Query-Token based Domain Adaptive Adversarial Learning Tajamul Ashraf et.al. 2506.21484 null
2025-06-26 A Comprehensive Dataset for Underground Miner Detection in Diverse Scenario Cyrus Addy et.al. 2506.21451 null
2025-06-26 DuET: Dual Incremental Object Detection via Exemplar-Free Task Arithmetic Munish Monga et.al. 2506.21260 null
2025-06-26 LASFNet: A Lightweight Attention-Guided Self-Modulation Feature Fusion Network for Multimodal Object Detection Lei Hao et.al. 2506.21018 null
2025-06-26 ThermalDiffusion: Visual-to-Thermal Image-to-Image Translation for Autonomous Navigation Shruti Bansal et.al. 2506.20969 null
2025-06-25 Lightweight Multi-Frame Integration for Robust YOLO Object Detection in Videos Yitong Quan et.al. 2506.20550 null
2025-06-25 Learning-based safety lifting monitoring system for cranes on construction sites Hao Chen et.al. 2506.20475 null
2025-06-25 Feature Hallucination for Self-supervised Action Recognition Lei Wang et.al. 2506.20342 null
2025-06-25 From Codicology to Code: A Comparative Study of Transformer and YOLO-based Detectors for Layout Analysis in Historical Documents Sergio Torres Aguilar et.al. 2506.20326 null
2025-06-25 TDiR: Transformer based Diffusion for Image Restoration Tasks Abbas Anwar et.al. 2506.20302 null
2025-06-25 Integrated optomechanical ultrasonic sensors with nano-Pascal-level sensitivity Xuening Cao et.al. 2506.20219 null
2025-06-24 A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects Shulan Ruan et.al. 2506.19769 null
2025-06-26 Semantic Scene Graph for Ultrasound Image Explanation and Scanning Guidance Xuesong Li et.al. 2506.19683 null
2025-06-24 Probabilistic modelling and safety assurance of an agriculture robot providing light-treatment Mustafa Adam et.al. 2506.19620 null
2025-06-24 USIS16K: High-Quality Dataset for Underwater Salient Instance Segmentation Lin Hong et.al. 2506.19472 null
2025-06-23 SpaNN: Detecting Multiple Adversarial Patches on CNNs by Spanning Saliency Thresholds Mauricio Byrd Victorica et.al. 2506.18591 null
2025-06-23 Improvement on LiDAR-Camera Calibration Using Square Targets Zhongyuan Li et.al. 2506.18294 null
2025-06-23 Learning Approach to Efficient Vision-based Active Tracking of a Flying Target by an Unmanned Aerial Vehicle Jagadeswara PKV Pothuri et.al. 2506.18264 null
2025-06-23 Ground tracking for improved landmine detection in a GPR system Li Tang et.al. 2506.18258 null
2025-06-24 Referring Expression Instance Retrieval and A Strong End-to-End Baseline Xiangzhao Hao et.al. 2506.18246 null
2025-06-24 Unfolding the Past: A Comprehensive Deep Learning Approach to Analyzing Incunabula Pages Klaudia Ropel et.al. 2506.18069 null
2025-06-21 YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception Mengqi Lei et.al. 2506.17733 null
2025-06-21 CSDN: A Context-Gated Self-Adaptive Detection Network for Real-Time Object Detection Wei Haolin et.al. 2506.17679 null
2025-06-21 DRAMA-X: A Fine-grained Intent Prediction and Risk Reasoning Benchmark For Driving Mihir Godbole et.al. 2506.17590 null
2025-06-20 YASMOT: Yet another stereo image multi-object tracker Ketil Malde et.al. 2506.17186 link
2025-06-20 Class Agnostic Instance-level Descriptor for Visual Instance Search Qi-Ying Sun et.al. 2506.16745 null
2025-06-20 Cross-modal Offset-guided Dynamic Alignment and Fusion for Weakly Aligned UAV Object Detection Liu Zongzhen et.al. 2506.16737 null
2025-06-19 How Hard Is Snow? A Paired Domain Adaptation Dataset for Clear and Snowy Weather: CADC+ Mei Qi Tang et.al. 2506.16531 null
2025-06-19 Can AI Dream of Unseen Galaxies? Conditional Diffusion Model for Galaxy Morphology Augmentation Chenrui Ma et.al. 2506.16233 null
2025-06-19 VideoGAN-based Trajectory Proposal for Automated Vehicles Annajoyce Mariani et.al. 2506.16209 null
2025-06-19 BLADE: An Automated Framework for Classifying Light Curves from the Center for Near-Earth Object Studies (CNEOS) Fireball Database Elizabeth A. Silber et.al. 2506.16099 null
2025-06-19 Polyline Path Masked Attention for Vision Transformer Zhongchen Zhao et.al. 2506.15940 null
2025-06-18 BoxFusion: Reconstruction-Free Open-Vocabulary 3D Object Detection via Real-Time Multi-View Box Fusion Yuqing Lan et.al. 2506.15610 null
2025-06-18 Retrospective Memory for Camouflaged Object Detection Chenxi Zhang et.al. 2506.15244 null
2025-06-19 Efficient Retail Video Annotation: A Robust Key Frame Generation Approach for Product and Customer Interaction Analysis Varun Mannam et.al. 2506.14854 null
2025-06-18 YOLOv11-RGBT: Towards a Comprehensive Single-Stage Multispectral Object Detection Framework Dahang Wan et.al. 2506.14696 null
2025-06-17 VisText-Mosquito: A Multimodal Dataset and Benchmark for AI-Based Mosquito Breeding Site Detection and Reasoning Md. Adnanul Islam et.al. 2506.14629 null
2025-06-17 GAMORA: A Gesture Articulated Meta Operative Robotic Arm for Hazardous Material Handling in Containment-Level Environments Farha Abdul Wasay et.al. 2506.14513 null
2025-06-17 Comparison of Two Methods for Stationary Incident Detection Based on Background Image Deepak Ghimire et.al. 2506.14256 null
2025-06-16 A Point Cloud Completion Approach for the Grasping of Partially Occluded Objects and Its Applications in Robotic Strawberry Harvesting Ali Abouzeid et.al. 2506.14066 link
2025-06-16 FindMeIfYouCan: Bringing Open Set metrics to $\textit{near} $, $ \textit{far} $ and $\textit{farther}$ Out-of-Distribution Object Detection Daniel Montoya et.al. 2506.14008 null
2025-06-16 How Real is CARLAs Dynamic Vision Sensor? A Study on the Sim-to-Real Gap in Traffic Object Detection Kaiyuan Tan et.al. 2506.13722 null
2025-06-17 Lecture Video Visual Objects (LVVO) Dataset: A Benchmark for Visual Object Detection in Educational Videos Dipayan Biswas et.al. 2506.13657 link
2025-06-16 UAV Object Detection and Positioning in a Mining Industrial Metaverse with Custom Geo-Referenced Data Vasiliki Balaska et.al. 2506.13505 null
2025-06-16 Sparse Convolutional Recurrent Learning for Efficient Event-based Neuromorphic Object Detection Shenqi Wang et.al. 2506.13440 null
2025-06-16 Cognitive Synergy Architecture: SEGO for Human-Centric Collaborative Robots Jaehong Oh et.al. 2506.13149 null
2025-06-15 MGDFIS: Multi-scale Global-detail Feature Integration Strategy for Small Object Detection Yuxiang Wang et.al. 2506.12697 null
2025-06-14 UniDet-D: A Unified Dynamic Spectral Attention Model for Object Detection under Adverse Weathers Yuantao Wang et.al. 2506.12324 null
2025-06-14 MatchPlant: An Open-Source Pipeline for UAV-Based Single-Plant Detection and Data Extraction Worasit Sangjan et.al. 2506.12295 link
2025-06-13 Vision-based Lifting of 2D Object Detections for Automated Driving Hendrik Königshof et.al. 2506.11839 null
2025-06-13 Teleoperated Driving: a New Challenge for 3D Object Detection in Compressed Point Clouds Filippo Bragato et.al. 2506.11804 null
2025-06-13 GPLQ: A General, Practical, and Lightning QAT Method for Vision Transformers Guang Liang et.al. 2506.11784 null
2025-06-12 Teaching in adverse scenes: a statistically feedback-driven threshold and mask adjustment teacher-student framework for object detection in UAV images under adverse scenes Hongyu Chen et.al. 2506.11175 null
2025-06-12 Discrete Lorenz Attractors in 3D Sinusoidal Maps Sishu Shankar Muni et.al. 2506.10788 null
2025-06-12 Uncertainty-Masked Bernoulli Diffusion for Camouflaged Object Detection Refinement Yuqi Shen et.al. 2506.10712 null
2025-06-12 Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection Xinyuan Liu et.al. 2506.10601 link
2025-06-12 Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration Jun Wang et.al. 2506.10573 null
2025-06-12 FSATFusion: Frequency-Spatial Attention Transformer for Infrared and Visible Image Fusion Tianpei Zhang et.al. 2506.10366 link
2025-06-11 DySS: Dynamic Queries and State-Space Learning for Efficient 3D Object Detection from Multi-Camera Videos Rajeev Yasarla et.al. 2506.10242 null
2025-06-11 CEM-FBGTinyDet: Context-Enhanced Foreground Balance with Gradient Tuning for tiny Objects Tao Liu et.al. 2506.09897 null
2025-06-11 3DGeoDet: General-purpose Geometry-aware Image-based 3D Object Detection Yi Zhang et.al. 2506.09541 null
2025-06-11 MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning Tong Wang et.al. 2506.09327 null
2025-06-10 Efficient Edge Deployment of Quantized YOLOv4-Tiny for Aerial Emergency Object Detection on Raspberry Pi 5 Sindhu Boddu et.al. 2506.09300 null
2025-06-10 Lightweight Object Detection Using Quantized YOLOv4-Tiny for Emergency Response in Aerial Imagery Sindhu Boddu et.al. 2506.09299 null
2025-06-10 WD-DETR: Wavelet Denoising-Enhanced Real-Time Object Detection Transformer for Robot Perception with Event Cameras Yangjie Cui et.al. 2506.09098 null
2025-06-11 Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models Xuanchi Ren et.al. 2506.09042 null
2025-06-10 ADAM: Autonomous Discovery and Annotation Model using LLMs for Context-Aware Annotations Amirreza Rouhi et.al. 2506.08968 null
2025-06-10 Data Augmentation For Small Object using Fast AutoAugment DaeEun Yoon et.al. 2506.08956 null
2025-06-11 Gaussian2Scene: 3D Scene Representation Learning via Self-supervised Learning with 3D Gaussian Splatting Keyi Liu et.al. 2506.08777 null
2025-06-10 ATAS: Any-to-Any Self-Distillation for Enhanced Open-Vocabulary Dense Prediction Juan Yeo et.al. 2506.08678 null
2025-06-10 Hierarchical Neural Collapse Detection Transformer for Class Incremental Object Detection Duc Thanh Pham et.al. 2506.08562 null
2025-06-10 Discovery of Odd Radio Circles and Other Peculiars in the First Year of the EMU Survey using Object Detection Nikhel Gupta et.al. 2506.08439 null
2025-06-09 CrosswalkNet: An Optimized Deep Learning Framework for Pedestrian Crosswalk Detection in Aerial Images with High-Performance Computing Zubin Bhuyan et.al. 2506.07885 null
2025-06-09 SAM2Auto: Auto Annotation Using FLASH Arash Rocky et.al. 2506.07850 null
2025-06-09 Design and Evaluation of Deep Learning-Based Dual-Spectrum Image Fusion Methods Beining Xu et.al. 2506.07779 null
2025-06-09 SpikeSMOKE: Spiking Neural Networks for Monocular 3D Object Detection with Cross-Scale Gated Coding Xuemei Chen et.al. 2506.07737 null
2025-06-09 Domain Randomization for Object Detection in Manufacturing Applications using Synthetic Data: A Comprehensive Study Xiaomeng Zhu et.al. 2506.07539 null
2025-06-09 SpatialLM: Training Large Language Models for Structured Indoor Modeling Yongsen Mao et.al. 2506.07491 null
2025-06-09 Happiness Finder: Exploring the Role of AI in Enhancing Well-Being During Four-Leaf Clover Searches Anna Yokokubo et.al. 2506.07393 null
2025-06-09 Multiple Object Stitching for Unsupervised Representation Learning Chengchao Shen et.al. 2506.07364 link
2025-06-09 CBAM-STN-TPS-YOLO: Enhancing Agricultural Object Detection through Spatially Adaptive Attention Mechanisms Satvik Praveen et.al. 2506.07357 null
2025-06-08 UCOD-DPL: Unsupervised Camouflaged Object Detection via Dynamic Pseudo-label Learning Weiqi Yan et.al. 2506.07087 null
2025-06-06 Domain-RAG: Retrieval-Guided Compositional Image Generation for Cross-Domain Few-Shot Object Detection Yu Li et.al. 2506.05872 null
2025-06-06 Token Transforming: A Unified and Training-Free Token Compression Framework for Vision Transformer Acceleration Fanhu Zeng et.al. 2506.05709 null
2025-06-06 Integer Binary-Range Alignment Neuron for Spiking Neural Networks Binghao Ye et.al. 2506.05679 null
2025-06-05 Synthetic Dataset Generation for Autonomous Mobile Robots Using 3D Gaussian Splatting for Vision Training Aneesh Deogan et.al. 2506.05092 null
2025-06-06 Bridging Annotation Gaps: Transferring Labels to Align Object Detection Datasets Mikhail Kennerley et.al. 2506.04737 null
2025-06-05 Gen-n-Val: Agentic Image Data Generation and Validation Jing-En Huang et.al. 2506.04676 null
2025-06-05 VoxDet: Rethinking 3D Semantic Occupancy Prediction as Dense Object Detection Wuyang Li et.al. 2506.04623 null
2025-06-04 FALO: Fast and Accurate LiDAR 3D Object Detection on Resource-Constrained Devices Shizhong Han et.al. 2506.04499 null
2025-06-04 Neural Object Detection for 4D STEM: High-Throughput Sub-Pixel Electron Diffraction Pattern Recognition Arda Genc et.al. 2506.04477 null
2025-06-04 Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector Boyong He et.al. 2506.04211 link
2025-06-04 FSHNet: Fully Sparse Hybrid Network for 3D Object Detection Shuai Liu et.al. 2506.03714 null
2025-06-04 How PARTs assemble into wholes: Learning the relative composition of images Melika Ayoughi et.al. 2506.03682 null
2025-06-05 MambaNeXt-YOLO: A Hybrid State Space Model for Real-time Object Detection Xiaochun Lei et.al. 2506.03654 null
2025-06-04 DiagNet: Detecting Objects using Diagonal Constraints on Adjacency Matrix of Graph Neural Network Chong Hyun Lee et.al. 2506.03571 null
2025-06-03 SportMamba: Adaptive Non-Linear Multi-Object Tracking with State Space Models for Team Sports Dheeraj Khanna et.al. 2506.03335 null
2025-06-03 Simulate Any Radar: Attribute-Controllable Radar Simulation via Waveform Parameter Embedding Weiqing Xiao et.al. 2506.03134 null
2025-06-03 Towards Auto-Annotation from Annotation Guidelines: A Benchmark through 3D LiDAR Detection Yechi Ma et.al. 2506.02914 null
2025-06-04 Open-PMC-18M: A High-Fidelity Large Scale Medical Dataset for Multimodal Representation Learning Negin Baghbanzadeh et.al. 2506.02738 null
2025-06-03 GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal Shufan Qing et.al. 2506.02736 link
2025-06-03 Sight Guide: A Wearable Assistive Perception and Navigation System for the Vision Assistance Race in the Cybathlon 2024 Patrick Pfreundschuh et.al. 2506.02676 null
2025-06-03 Probabilistic Online Event Downsampling Andreu Girbau-Xalabarder et.al. 2506.02547 null
2025-06-03 Efficient Test-time Adaptive Object Detection via Sensitivity-Guided Pruning Kunyu Wang et.al. 2506.02462 null
2025-06-03 Auto-Labeling Data for Object Detection Brent A. Griffin et.al. 2506.02359 null
2025-06-02 OD3: Optimization-free Dataset Distillation for Object Detection Salwa K. Al Khatib et.al. 2506.01942 null
2025-06-02 A Novel Context-Adaptive Fusion of Shadow and Highlight Regions for Efficient Sonar Image Classification Kamal Basha S et.al. 2506.01445 null
2025-05-30 Deformable Attention Mechanisms Applied to Object Detection, case of Remote Sensing Anasse Boutayeb et.al. 2505.24489 null
2025-05-30 Leadership Assessment in Pediatric Intensive Care Unit Team Training Liangyang Ouyang et.al. 2505.24389 null
2025-05-30 D2AF: A Dual-Driven Annotation and Filtering Framework for Visual Grounding Yichi Zhang et.al. 2505.24372 null
2025-05-29 Conformal Object Detection by Sequential Risk Control Léo Andéol et.al. 2505.24038 null
2025-05-29 Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping Justin Lazarow et.al. 2505.23756 null
2025-05-29 Boosting Domain Incremental Learning: Selecting the Optimal Parameters is All You Need Qiang Wang et.al. 2505.23744 null
2025-05-29 FMG-Det: Foundation Model Guided Robust Object Detection Darryl Hannan et.al. 2505.23726 null
2025-05-29 CF-DETR: Coarse-to-Fine Transformer for Real-Time Object Detection Woojin Shin et.al. 2505.23317 null
2025-05-30 WTEFNet: Real-Time Low-Light Object Detection for Advanced Driver Assistance Systems Hao Wu et.al. 2505.23201 null
2025-05-29 Language-guided Learning for Object Detection Tackling Multiple Variations in Aerial Images Sungjune Park et.al. 2505.23193 null
2025-05-29 DIP-R1: Deep Inspection and Perception with RL Looking Through and Understanding Complex Scenes Sungjune Park et.al. 2505.23179 null
2025-05-29 The Meeseeks Mesh: Spatially Consistent 3D Adversarial Objects for BEV Detector Aixuan Li et.al. 2505.22499 null
2025-05-28 Task-Driven Implicit Representations for Automated Design of LiDAR Systems Nikhil Behari et.al. 2505.22344 null
2025-05-29 YH-MINER: Multimodal Intelligent System for Natural Ecological Reef Metric Extraction Mingzhuang Wang et.al. 2505.22250 null
2025-05-28 S2AFormer: Strip Self-Attention for Efficient Vision Transformer Guoan Xu et.al. 2505.22195 null
2025-05-28 Learning A Robust RGB-Thermal Detector for Extreme Modality Imbalance Chao Tian et.al. 2505.22154 null
2025-05-28 Prototype Embedding Optimization for Human-Object Interaction Detection in Livestreaming Menghui Zhang et.al. 2505.22011 null
2025-05-28 Cross-DINO: Cross the Deep MLP and Transformer for Small Object Detection Guiping Cao et.al. 2505.21868 null
2025-05-27 Object Concepts Emerge from Motion Haoqian Liang et.al. 2505.21635 null
2025-05-27 Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO Muzhi Zhu et.al. 2505.21457 null
2025-05-27 Visual Product Graph: Bridging Visual Products And Composite Images For End-to-End Style Recommendations Yue Li Du et.al. 2505.21454 null
2025-05-27 YOLO-SPCI: Enhancing Remote Sensing Object Detection via Selective-Perspective-Class Integration Xinyuan Wang et.al. 2505.21370 null
2025-05-27 Assured Autonomy with Neuro-Symbolic Perception R. Spencer Hallyburton et.al. 2505.21322 null
2025-05-27 Robust Video-Based Pothole Detection and Area Estimation for Intelligent Vehicles with Depth Map and Kalman Smoothing Dehao Wang et.al. 2505.21049 null
2025-05-27 YOLO-FireAD: Efficient Fire Detection via Attention-Guided Inverted Residual Learning and Dual-Pooling Feature Preservation Weichao Pan et.al. 2505.20884 null
2025-05-27 Open-Det: An Efficient Learning Framework for Open-Ended Detection Guiping Cao et.al. 2505.20639 null
2025-05-27 Roboflow100-VL: A Multi-Domain Object Detection Benchmark for Vision-Language Models Peter Robicheaux et.al. 2505.20612 null
2025-05-26 From Data to Modeling: Fully Open-vocabulary Scene Graph Generation Zuyao Chen et.al. 2505.20106 null
2025-05-26 Underwater Diffusion Attention Network with Contrastive Language-Image Joint Learning for Underwater Image Enhancement Afrah Shaahid et.al. 2505.19895 null
2025-05-26 ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting Wenhua Wu et.al. 2505.19420 null
2025-05-26 Neural nanophotonic object detector with ultra-wide field-of-view Ji Chen et.al. 2505.19379 null
2025-05-25 What do Blind and Low-Vision People Really Want from Assistive Smart Devices? Comparison of the Literature with a Focus Study Bhanuka Gamage et.al. 2505.19325 null
2025-05-25 VL-SAM-V2: Open-World Object Detection with General and Specific Query Fusion Zhiwei Lin et.al. 2505.18986 null
2025-05-24 Mitigating Context Bias in Domain Adaptation for Object Detection using Mask Pooling Hojun Son et.al. 2505.18446 null
2025-05-23 Sampling Strategies for Efficient Training of Deep Learning Object Detection Algorithms Gefei Shen et.al. 2505.18302 null
2025-05-23 One RL to See Them All: Visual Triple Unified Reinforcement Learning Yan Ma et.al. 2505.18129 null
2025-05-23 SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification Shashank Agnihotri et.al. 2505.18015 null
2025-05-23 RQR3D: Reparametrizing the regression targets for BEV-based 3D object detection Ozsel Kilinc et.al. 2505.17732 null
2025-05-23 Adaptive Semantic Token Communication for Transformer-based Edge Inference Alessio Devoto et.al. 2505.17604 null
2025-05-23 OrionBench: A Benchmark for Chart and Human-Recognizable Object Detection in Infographics Jiangning Zhu et.al. 2505.17473 null
2025-05-23 Reflectance Prediction-based Knowledge Distillation for Robust 3D Object Detection in Compressed Point Clouds Hao Jing et.al. 2505.17442 null
2025-05-23 Optimizing YOLOv8 for Parking Space Detection: Comparative Analysis of Custom YOLOv8 Architecture Apar Pokhrel et.al. 2505.17364 null
2025-05-22 Extending Dataset Pruning to Object Detection: A Variance-based Approach Ryota Yagi et.al. 2505.17245 null
2025-05-22 Semi-Supervised State-Space Model with Dynamic Stacking Filter for Real-World Video Deraining Shangquan Sun et.al. 2505.16811 null
2025-05-22 Robust Vision-Based Runway Detection through Conformal Prediction and Conformal mAP Alya Zouzou et.al. 2505.16740 link
2025-05-22 CodeMerge: Codebook-Guided Model Merging for Robust Test-Time Adaptation in Autonomous Driving Huitong Yang et.al. 2505.16524 null
2025-05-22 MAFE R-CNN: Selecting More Samples to Learn Category-aware Features for Small Object Detection Yichen Li et.al. 2505.16442 null
2025-05-22 AdvReal: Adversarial Patch Generation Framework with Application to Adversarial Safety Evaluation of Object Detection Systems Yuanhao Huang et.al. 2505.16402 link
2025-05-22 Self-Classification Enhancement and Correction for Weakly Supervised Object Detection Yufei Yin et.al. 2505.16294 null
2025-05-21 SNAP: A Benchmark for Testing the Effects of Capture Conditions on Fundamental Vision Tasks Iuliia Kotseruba et.al. 2505.15628 link
2025-05-21 Detection of Underwater Multi-Targets Based on Self-Supervised Learning and Deformable Path Aggregation Feature Pyramid Network Chang Liu et.al. 2505.15518 null
2025-05-21 RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation Naman Patel et.al. 2505.15373 null
2025-05-21 Multispectral Detection Transformer with Infrared-Centric Sensor Fusion Seongmin Hwang et.al. 2505.15137 null
2025-05-20 SCAN: Semantic Document Layout Analysis for Textual and Visual Retrieval-Augmented Generation Yuyang Dong et.al. 2505.14381 null
2025-05-20 Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation Bin-Bin Gao et.al. 2505.14239 null
2025-05-20 Intra-class Patch Swap for Self-Distillation Hongjun Choi et.al. 2505.14124 link
2025-05-20 Scaling Vision Mamba Across Resolutions via Fractal Traversal Bo Li et.al. 2505.14062 null
2025-05-20 Automated Quality Evaluation of Cervical Cytopathology Whole Slide Images Based on Content Analysis Lanlan Kang et.al. 2505.13875 null
2025-05-20 Safety2Drive: Safety-Critical Scenario Benchmark for the Evaluation of Autonomous Driving Jingzheng Li et.al. 2505.13872 null
2025-05-20 A Challenge to Build Neuro-Symbolic Video Agents Sahil Shah et.al. 2505.13851 link
2025-05-20 InstanceBEV: Unifying Instance and BEV Representation for Global Modeling Feng Li et.al. 2505.13817 null
2025-05-19 Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object Detection Xiao Wang et.al. 2505.12908 link
2025-05-19 Rethinking Features-Fused-Pyramid-Neck for Object Detection Hulin Li et.al. 2505.12820 link
2025-05-19 Enhancing Transformers Through Conditioned Embedded Tokens Hemanth Saratchandran et.al. 2505.12789 null
2025-05-19 LiDAR MOT-DETR: A LiDAR-based Two-Stage Transformer for 3D Multiple Object Tracking Martha Teiko Teye et.al. 2505.12753 null
2025-05-19 VLC Fusion: Vision-Language Conditioned Sensor Fusion for Robust Object Detection Aditya Taparia et.al. 2505.12715 null
2025-05-17 EarthSynth: Generating Informative Earth Observation with Diffusion Models Jiancheng Pan et.al. 2505.12108 null
2025-05-17 Experimental Study on Automatically Assembling Custom Catering Packages With a 3-DOF Delta Robot Using Deep Learning Methods Reihaneh Yourdkhani et.al. 2505.11879 null
2025-05-16 Improving Object Detection Performance through YOLOv8: A Comprehensive Training and Evaluation Study Rana Poureskandar et.al. 2505.11424 null
2025-05-16 MTevent: A Multi-Task Event Camera Dataset for 6D Pose Estimation and Moving Object Detection Shrutarv Awasthi et.al. 2505.11282 null
2025-05-16 M4-SAR: A Multi-Resolution, Multi-Polarization, Multi-Scene, Multi-Source Dataset and Benchmark for Optical-SAR Fusion Object Detection Chao Wang et.al. 2505.10931 null
2025-05-16 A High-Performance Thermal Infrared Object Detection Framework with Centralized Regulation Jinke Li et.al. 2505.10825 null
2025-05-15 StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation Daniel A. P. Oliveira et.al. 2505.10292 link
2025-05-15 Defect Detection in Photolithographic Patterns Using Deep Learning Models Trained on Synthetic Data Prashant P. Shinde et.al. 2505.10192 null
2025-05-15 Application of YOLOv8 in monocular downward multiple Car Target detection Shijie Lyu et.al. 2505.10016 null
2025-05-14 EdgeAI Drone for Autonomous Construction Site Demonstrator Emre Girgin et.al. 2505.09837 link
2025-05-14 WhatsAI: Transforming Meta Ray-Bans into an Extensible Generative AI Platform for Accessibility Nasif Zaman et.al. 2505.09823 null
2025-05-14 MoRAL: Motion-aware Multi-Frame 4D Radar and LiDAR Fusion for Robust 3D Object Detection Xiangyuan Peng et.al. 2505.09422 null
2025-05-14 A drone that learns to efficiently find objects in agricultural fields: from simulation to the real world Rick van Essen et.al. 2505.09278 null
2025-05-14 DRRNet: Macro-Micro Feature Fusion and Dual Reverse Refinement for Camouflaged Object Detection Jianlin Sun et.al. 2505.09168 link
2025-05-14 Beyond General Prompts: Automated Prompt Refinement using Contrastive Class Alignment Scores for Disambiguating Objects in Vision-Language Models Lucas Choi et.al. 2505.09139 null
2025-05-14 Promoting SAM for Camouflaged Object Detection via Selective Key Point-based Guidance Guoying Liang et.al. 2505.09123 null
2025-05-13 Robustness Analysis against Adversarial Patch Attacks in Fully Unmanned Stores Hyunsik Na et.al. 2505.08835 null
2025-05-13 Augmented Reality for RObots (ARRO): Pointing Visuomotor Policies Towards Visual Robustness Reihaneh Mirjalili et.al. 2505.08627 null
2025-05-14 Thermal Detection of People with Mobility Restrictions for Barrier Reduction at Traffic Lights Controlled Intersections Xiao Ni et.al. 2505.08568 link
2025-05-13 MDF: Multi-Modal Data Fusion with CNN-Based Object Detection for Enhanced Indoor Localization Using LiDAR-SLAM Saqi Hussain Kalan et.al. 2505.08388 null
2025-05-13 HMPNet: A Feature Aggregation Architecture for Maritime Object Detection from a Shipborne Perspective Yu Zhang et.al. 2505.08231 link
2025-05-13 Object detection in adverse weather conditions for autonomous vehicles using Instruct Pix2Pix Unai Gurbindo et.al. 2505.08228 null
2025-05-13 MoKD: Multi-Task Optimization for Knowledge Distillation Zeeshan Hayder et.al. 2505.08170 null
2025-05-12 Hybrid Spiking Vision Transformer for Object Detection with Event Cameras Qi Xu et.al. 2505.07715 null
2025-05-12 Self-Supervised Event Representations: Towards Accurate, Real-Time Perception on SoC FPGAs Kamil Jeziorek et.al. 2505.07556 null
2025-05-12 DepthFusion: Depth-Aware Hybrid Feature Fusion for LiDAR-Camera 3D Object Detection Mingqian Ji et.al. 2505.07398 null
2025-05-12 Language-Driven Dual Style Mixing for Single-Domain Generalized Object Detection Hongda Qin et.al. 2505.07219 link
2025-05-11 Differentiable NMS via Sinkhorn Matching for End-to-End Fabric Defect Detection Zhengyang Lu et.al. 2505.07040 null
2025-05-11 VALISENS: A Validated Innovative Multi-Sensor System for Cooperative Automated Driving Lei Wan et.al. 2505.06980 null
2025-05-10 M3CAD: Towards Generic Cooperative Autonomous Driving Benchmark Morui Zhu et.al. 2505.06746 null
2025-05-10 Underwater object detection in sonar imagery with detection transformer and Zero-shot neural architecture search XiaoTong Gu et.al. 2505.06694 null
2025-05-10 METOR: A Unified Framework for Mutual Enhancement of Objects and Relationships in Open-vocabulary Video Visual Relationship Detection Yongqi Wang et.al. 2505.06663 link
2025-05-09 Camera-Only Bird's Eye View Perception: A Neural Approach to LiDAR-Free Environmental Mapping for Autonomous Vehicles Anupkumar Bochare et.al. 2505.06113 null
2025-05-09 Artificial intelligence pioneers the double-strangeness factory Yan He et.al. 2505.05802 null
2025-05-09 Dome-DETR: DETR with Density-Oriented Feature-Query Manipulation for Efficient Tiny Object Detection Zhangchi Hu et.al. 2505.05741 null
2025-05-09 DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer Ho-Joong Kim et.al. 2505.05711 link
2025-05-08 PillarMamba: Learning Local-Global Context for Roadside Point Cloud via Hybrid State Space Model Zhang Zhang et.al. 2505.05397 null
2025-05-08 PaniCar: Securing the Perception of Advanced Driving Assistance Systems Against Emergency Vehicle Lighting Elad Feldman et.al. 2505.05183 null
2025-05-08 FG-CLIP: Fine-Grained Visual and Textual Alignment Chunyu Xie et.al. 2505.05071 null
2025-05-08 A Simple Detector with Frame Dynamics is a Strong Tracker Chenxu Peng et.al. 2505.04917 null
2025-05-08 Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model Navin Ranjan et.al. 2505.04861 null
2025-05-07 Lightweight RGB-D Salient Object Detection from a Speed-Accuracy Tradeoff Perspective Songsong Duan et.al. 2505.04758 null
2025-05-07 Hyb-KAN ViT: Hybrid Kolmogorov-Arnold Networks Augmented Vision Transformer Sainath Dey et.al. 2505.04740 null
2025-05-08 MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection Zhihao Zhang et.al. 2505.04594 null
2025-05-07 DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception Junjie Wang et.al. 2505.04410 link
2025-05-06 LogisticsVLN: Vision-Language Navigation For Low-Altitude Terminal Delivery Based on Agentic UAVs Xinyuan Zhang et.al. 2505.03460 null
2025-05-06 From Word to Sentence: A Large-Scale Multi-Instance Dataset for Open-Set Aerial Detection Guoting Wei et.al. 2505.03334 null
2025-05-06 VISLIX: An XAI Framework for Validating Vision Models with Slice Discovery and Analysis Xinyuan Yan et.al. 2505.03132 null
2025-05-05 Sim2Real Transfer for Vision-Based Grasp Verification Pau Amargant et.al. 2505.03046 link
2025-05-05 DPNet: Dynamic Pooling Network for Tiny Object Detection Luqi Gong et.al. 2505.02797 null
2025-05-05 RGBX-DiffusionDet: A Framework for Multi-Modal RGB-X Object Detection Using DiffusionDet Eliraz Orfaig et.al. 2505.02586 null
2025-05-05 Point Cloud Recombination: Systematic Real Data Augmentation Using Robotic Targets for LiDAR Perception Validation Hubert Padusinski et.al. 2505.02476 null
2025-05-03 DriveNetBench: An Affordable and Configurable Single-Camera Benchmarking System for Autonomous Driving Networks Ali Al-Bustami et.al. 2505.01893 link
2025-05-03 OODTE: A Differential Testing Engine for the ONNX Optimizer Nikolaos Louloudakis et.al. 2505.01892 null
2025-05-03 CMAWRNet: Multiple Adverse Weather Removal via a Unified Quaternion Neural Architecture Vladimir Frants et.al. 2505.01882 null
2025-05-03 DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion Haoteng Li et.al. 2505.01857 null
2025-05-03 Toward Onboard AI-Enabled Solutions to Space Object Detection for Space Sustainability Wenxuan Zhang et.al. 2505.01650 null
2025-05-02 CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion Boyuan Meng et.al. 2505.00938 link
2025-05-01 Efficient On-Chip Implementation of 4D Radar-Based 3D Object Detection on Hailo-8L Woong-Chan Byun et.al. 2505.00757 null
2025-05-03 Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook Muyi Bao et.al. 2505.00630 null
2025-05-01 Visual Trajectory Prediction of Vessels for Inland Navigation Alexander Puzicha et.al. 2505.00599 null
2025-05-01 Synthesizing and Identifying Noise Levels in Autonomous Vehicle Camera Radar Datasets Mathis Morales et.al. 2505.00584 null
2025-05-01 X-ray illicit object detection using hybrid CNN-transformer neural network architectures Jorgen Cani et.al. 2505.00564 null
2025-05-01 A Robust Deep Networks based Multi-Object MultiCamera Tracking System for City Scale Traffic Muhammad Imran Zaman et.al. 2505.00534 null
2025-05-01 Inconsistency-based Active Learning for LiDAR Object Detection Esteban Rivera et.al. 2505.00511 null
2025-05-01 HeAL3D: Heuristical-enhanced Active Learning for 3D Object Detection Esteban Rivera et.al. 2505.00507 null
2025-05-05 Quaternion Wavelet-Conditioned Diffusion Models for Image Super-Resolution Luigi Sigillo et.al. 2505.00334 null
2025-04-30 V3LMA: Visual 3D-enhanced Language Model for Autonomous Driving Jannik Lübberstedt et.al. 2505.00156 null
2025-04-30 LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics Marc Glocker et.al. 2504.21716 null
2025-04-29 T2ID-CAS: Diffusion Model and Class Aware Sampling to Mitigate Class Imbalance in Neck Ultrasound Anatomical Landmark Detection Manikanta Varaganti et.al. 2504.21231 null
2025-04-29 FLIM-based Salient Object Detection Networks with Adaptive Decoders Gilson Junior Soares et.al. 2504.20872 null
2025-04-29 A Survey on Event-based Optical Marker Systems Nafiseh Jabbari Tofighi et.al. 2504.20736 null
2025-04-29 Purifying, Labeling, and Utilizing: A High-Quality Pipeline for Small Object Detection Siwei Wang et.al. 2504.20602 null
2025-04-29 Style-Adaptive Detection Transformer for Single-Source Domain Generalized Object Detection Jianhong Han et.al. 2504.20498 null
2025-04-28 More Clear, More Flexible, More Precise: A Comprehensive Oriented Object Detection benchmark for UAV Kai Ye et.al. 2504.20032 null
2025-04-28 Lossy Source Coding with Focal Loss Alex Dytso et.al. 2504.19913 null
2025-04-28 Neural network task specialization via domain constraining Roman Malashin et.al. 2504.19592 null
2025-04-28 GMAR: Gradient-Driven Multi-Head Attention Rollout for Vision Transformer Interpretability Sehyeong Jo et.al. 2504.19414 null
2025-04-27 Improving Small Drone Detection Through Multi-Scale Processing and Data Augmentation Rayson Laroca et.al. 2504.19347 null
2025-04-27 ODExAI: A Comprehensive Object Detection Explainable AI Evaluation Loc Phuc Truong Nguyen et.al. 2504.19249 null
2025-04-27 Boosting Single-domain Generalized Object Detection via Vision-Language Knowledge Interaction Xiaoran Xu et.al. 2504.19086 null
2025-04-26 Federated Learning-based Semantic Segmentation for Lane and Object Detection in Autonomous Driving Gharbi Khamis Alshammari et.al. 2504.18939 null
2025-04-25 Dream-Box: Object-wise Outlier Generation for Out-of-Distribution Detection Brian K. S. Isaac-Medina et.al. 2504.18746 null
2025-04-25 A Review of 3D Object Detection with Vision-Language Models Ranjan Sapkota et.al. 2504.18738 null
2025-04-25 Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models Patrick Müller et.al. 2504.18510 null
2025-04-25 Iterative Event-based Motion Segmentation by Variational Contrast Maximization Ryo Yamaki et.al. 2504.18447 null
2025-04-25 A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection Carlo Sgaravatti et.al. 2504.18419 null
2025-04-25 A comprehensive review of classifier probability calibration metrics Richard Oliver Lane et.al. 2504.18278 null
2025-04-25 LiDAR-Guided Monocular 3D Object Detection for Long-Range Railway Monitoring Raul David Dominguez Sanchez et.al. 2504.18203 null
2025-04-25 Multi-Grained Compositional Visual Clue Learning for Image Intent Recognition Yin Tang et.al. 2504.18201 null
2025-04-25 E-InMeMo: Enhanced Prompting for Visual In-Context Learning Jiahao Zhang et.al. 2504.18158 null
2025-04-25 MASF-YOLO: An Improved YOLOv11 Network for Small Object Detection on Drone View Liugang Lu et.al. 2504.18136 null
2025-04-25 Opportunistic Collaborative Planning with Large Vision Model Guided Control and Joint Query-Service Optimization Jiayi Chen et.al. 2504.18057 null
2025-04-25 Direct sampling method to retrieve small objects from two-dimensional limited-aperture scattered field data Won-Kwang Park et.al. 2504.18036 null
2025-04-24 DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks Yinqi Li et.al. 2504.17253 link
2025-04-24 Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation Phillip Y. Lee et.al. 2504.17207 null
2025-04-24 AUTHENTICATION: Identifying Rare Failure Modes in Autonomous Vehicle Perception Systems using Adversarially Guided Diffusion Models Mohammad Zarei et.al. 2504.17179 null
2025-04-23 Scene-Aware Location Modeling for Data Augmentation in Automotive Object Detection Jens Petersen et.al. 2504.17076 null
2025-04-23 Gaussian Splatting is an Effective Data Generator for 3D Object Detection Farhad G. Zanjani et.al. 2504.16740 null
2025-04-23 EHGCN: Hierarchical Euclidean-Hyperbolic Fusion via Motion-Aware GCN for Hybrid Event Stream Perception Haosheng Chen et.al. 2504.16616 null
2025-04-23 Beyond Anonymization: Object Scrubbing for Privacy-Preserving 2D and 3D Vision Tasks Murat Bilgehan Ertan et.al. 2504.16557 null
2025-04-23 Assessing the Feasibility of Internet-Sourced Video for Automatic Cattle Lameness Detection Md Fahimuzzman Sohan et.al. 2504.16404 null
2025-04-23 Revisiting Radar Camera Alignment by Contrastive Learning for 3D Object Detection Linhua Kong et.al. 2504.16368 null
2025-04-22 Vision Controlled Orthotic Hand Exoskeleton Connor Blais et.al. 2504.16319 null
2025-04-22 $π_{0.5}$ : a Vision-Language-Action Model with Open-World Generalization Physical Intelligence et.al. 2504.16054 null
2025-04-22 SAGA: Semantic-Aware Gray color Augmentation for Visible-to-Thermal Domain Adaptation across Multi-View Drone and Ground-Based Vision Systems Manjunath D et.al. 2504.15728 null
2025-04-22 You Sense Only Once Beneath: Ultra-Light Real-Time Underwater Object Detection Jun Dong et.al. 2504.15694 null
2025-04-22 A Vision-Enabled Prosthetic Hand for Children with Upper Limb Disabilities Md Abdul Baset Sarker et.al. 2504.15654 null
2025-04-21 Context Aware Grounded Teacher for Source Free Object Detection Tajamul Ashraf et.al. 2504.15404 null
2025-04-21 SuoiAI: Building a Dataset for Aquatic Invertebrates in Vietnam Tue Vo et.al. 2504.15252 null
2025-04-21 An Efficient Aerial Image Detection with Variable Receptive Fields Liu Wenbin et.al. 2504.15165 null
2025-04-19 Balancing Privacy and Action Performance: A Penalty-Driven Approach to Image Anonymization Nazia Aslam et.al. 2504.14301 null
2025-04-19 Visual Consensus Prompting for Co-Salient Object Detection Jie Wang et.al. 2504.14254 link
2025-04-18 Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models Junjie Yang et.al. 2504.13825 null
2025-04-18 Lightweight LiDAR-Camera 3D Dynamic Object Detection and Multi-Class Trajectory Prediction Yushen He et.al. 2504.13647 link
2025-04-18 DenSe-AdViT: A novel Vision Transformer for Dense SAR Object Detection Yang Zhang et.al. 2504.13638 null
2025-04-18 HMPE:HeatMap Embedding for Efficient Transformer-Based Small Object Detection YangChen Zeng et.al. 2504.13469 null
2025-04-18 Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving Safety Shashank Shriram et.al. 2504.13399 link
2025-04-17 VLLFL: A Vision-Language Model Based Lightweight Federated Learning Framework for Smart Agriculture Long Li et.al. 2504.13365 null
2025-04-17 SAR Object Detection with Self-Supervised Pretraining and Curriculum-Aware Sampling Yasin Almalioglu et.al. 2504.13310 null
2025-04-17 Weak Cube R-CNN: Weakly Supervised 3D Detection using only 2D Bounding Boxes Andreas Lau Hansen et.al. 2504.13297 null
2025-04-17 RF-DETR Object Detection vs YOLOv12 : A Study of Transformer-based and CNN-based Architectures for Single-Class and Multi-Class Greenfruit Detection in Complex Orchard Environments Under Label Ambiguity Ranjan Sapkota et.al. 2504.13099 null
2025-04-17 Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving Shumin Wang et.al. 2504.12709 null
2025-04-18 RoPETR: Improving Temporal Camera-Only 3D Detection by Integrating Enhanced Rotary Position Embedding Hang Ji et.al. 2504.12643 null
2025-04-16 Towards a General-Purpose Zero-Shot Synthetic Low-Light Image and Video Pipeline Joanne Lin et.al. 2504.12169 null
2025-04-16 RADLER: Radar Object Detection Leveraging Semantic 3D City Models and Self-Supervised Radar-Image Learning Yuan Luo et.al. 2504.12167 null
2025-04-16 pix2pockets: Shot Suggestions in 8-Ball Pool from a Single Image in the Wild Jonas Myhre Schiøtt et.al. 2504.12045 null
2025-04-16 A Review of YOLOv12: Attention-Based Enhancements vs. Previous Versions Rahima Khanam et.al. 2504.11995 null
2025-04-16 Multimodal Spatio-temporal Graph Learning for Alignment-free RGBT Video Object Detection Qishun Wang et.al. 2504.11779 null
2025-04-15 Multi-level Cellular Automata for FLIM networks Felipe Crispim Salvagnini et.al. 2504.11406 null
2025-04-15 CFIS-YOLO: A Lightweight Multi-Scale Fusion Network for Edge-Deployable Wood Defect Detection Jincheng Kang et.al. 2504.11305 null
2025-04-15 Flyweight FLIM Networks for Salient Object Detection in Biomedical Images Leonardo M. Joao et.al. 2504.11112 null
2025-04-15 S $^2$ Teacher: Step-by-step Teacher for Sparsely Annotated Oriented Object Detection Yu Lin et.al. 2504.11111 null
2025-04-15 DRIFT open dataset: A drone-derived intelligence for traffic analysis in urban environmen Hyejin Lee et.al. 2504.11019 null
2025-04-16 GATE3D: Generalized Attention-based Task-synergized Estimation in 3D* Eunsoo Im et.al. 2504.11014 null
2025-04-15 CDUPatch: Color-Driven Universal Adversarial Patch Attack for Dual-Modal Visible-Infrared Detectors Jiahuan Long et.al. 2504.10888 null
2025-04-15 Safe-Construct: Redefining Construction Safety Violation Recognition as 3D Multi-View Engagement Task Aviral Chharia et.al. 2504.10880 null
2025-04-15 Weather-Aware Object Detection Transformer for Domain Adaptation Soheil Gharatappeh et.al. 2504.10877 null
2025-04-15 ATLASv2: LLM-Guided Adaptive Landmark Acquisition and Navigation on the Edge Mikolaj Walczak et.al. 2504.10784 null
2025-04-14 DiffMOD: Progressive Diffusion Point Denoising for Moving Object Detection in Remote Sensing Jinyue Zhang et.al. 2504.10278 null
2025-04-14 Balancing Stability and Plasticity in Pretrained Detector: A Dual-Path Framework for Incremental Object Detection Songze Li et.al. 2504.10214 null
2025-04-15 WildLive: Near Real-time Visual Wildlife Tracking onboard UAVs Nguyen Ngoc Dat et.al. 2504.10165 null
2025-04-14 COUNTS: Benchmarking Object Detectors and Multimodal Large Language Models under Distribution Shifts Jiansheng Li et.al. 2504.10158 null
2025-04-14 Small Object Detection with YOLO: A Performance Analysis Across Model Versions and Hardware Muhammad Fasih Tariq et.al. 2504.09900 null
2025-04-14 Density-based Object Detection in Crowded Scenes Chenyang Zhao et.al. 2504.09819 null
2025-04-13 Uncertainty Guided Refinement for Fine-Grained Salient Object Detection Yao Yuan et.al. 2504.09666 link
2025-04-13 Pillar-Voxel Fusion Network for 3D Object Detection in Airborne Hyperspectral Point Clouds Yanze Jiang et.al. 2504.09506 null
2025-04-13 Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation Yongchao Feng et.al. 2504.09480 null
2025-04-13 InfoBound: A Provable Information-Bounds Inspired Framework for Both OoD Generalization and OoD Detection Lin Zhu et.al. 2504.09448 null
2025-04-11 TinyCenterSpeed: Efficient Center-Based Object Detection for Autonomous Racing Neil Reichlin et.al. 2504.08655 null
2025-04-11 Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization Jialu Li et.al. 2504.08641 null
2025-04-10 Enhanced Cooperative Perception Through Asynchronous Vehicle to Infrastructure Framework with Delay Mitigation for Connected and Automated Vehicles Nithish Kumar Saravanan et.al. 2504.08172 null
2025-04-10 Multi-Task Learning with Multi-Annotation Triplet Loss for Improved Object Detection Meilun Zhou et.al. 2504.08054 null
2025-04-10 Detect Anything 3D in the Wild Hanxue Zhang et.al. 2504.07958 null
2025-04-11 Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks Erin Carson et.al. 2504.07835 null
2025-04-10 P2Object: Single Point Supervised Object Detection and Instance Segmentation Pengfei Chen et.al. 2504.07813 null
2025-04-10 Nonlocal Retinex-Based Variational Model and its Deep Unfolding Twin for Low-Light Image Enhancement Daniel Torres et.al. 2504.07810 null
2025-04-10 Adaptive Detection of Fast Moving Celestial Objects Using a Mixture of Experts and Physical-Inspired Neural Network Peng Jia et.al. 2504.07777 null
2025-04-10 VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model Haozhan Shen et.al. 2504.07615 link
2025-04-10 RASMD: RGB And SWIR Multispectral Driving Dataset for Robust Perception in Adverse Conditions Youngwan Jin et.al. 2504.07603 null
2025-04-10 WS-DETR: Robust Water Surface Object Detection through Vision-Radar Fusion with Detection Transformer Huilin Yin et.al. 2504.07441 null
2025-04-09 Few-Shot Adaptation of Grounding DINO for Agricultural Domain Rajhans Singh et.al. 2504.07252 null
2025-04-09 Multi-Object Tracking for Collision Avoidance Using Multiple Cameras in Open RAN Networks Jordi Serra et.al. 2504.07163 null
2025-04-09 Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection Ruoyu Chen et.al. 2504.07060 null
2025-04-09 UAV Position Estimation using a LiDAR-based 3D Object Detection Method Uthman Olawoye et.al. 2504.07028 null
2025-04-09 Towards Efficient Roadside LiDAR Deployment: A Fast Surrogate Metric Based on Entropy-Guided Visibility Yuze Jiang et.al. 2504.06772 null
2025-04-09 Domain-Conditioned Scene Graphs for State-Grounded Task Planning Jonas Herzog et.al. 2504.06661 null
2025-04-09 Visually Similar Pair Alignment for Robust Cross-Domain Object Detection Onkar Krishna et.al. 2504.06607 null
2025-04-08 From Broadcast to Minimap: Achieving State-of-the-Art SoccerNet Game State Reconstruction Vladimir Golovkin et.al. 2504.06357 null
2025-04-08 Analyzing the Impact of Low-Rank Adaptation for Cross-Domain Few-Shot Object Detection in Aerial Images Hicham Talaoubrid et.al. 2504.06330 null
2025-04-08 Balancing long- and short-term dynamics for the modeling of saliency in videos Theodor Wulff et.al. 2504.05913 null
2025-04-08 PRIMEDrive-CoT: A Precognitive Chain-of-Thought Framework for Uncertainty-Aware Object Interaction in Driving Scene Scenario Sriram Mandalika et.al. 2504.05908 null
2025-04-08 Intrinsic Saliency Guided Trunk-Collateral Network for Unsupervised Video Object Segmentation Xiangyu Zheng et.al. 2504.05904 null
2025-04-08 KAN-SAM: Kolmogorov-Arnold Network Guided Segment Anything Model for RGB-T Salient Object Detection Xingyuan Li et.al. 2504.05878 null
2025-04-08 DefMamba: Deformable Visual State Space Model Leiye Liu et.al. 2504.05794 null
2025-04-08 Event-based Civil Infrastructure Visual Defect Detection: ev-CIVIL Dataset and Benchmark Udayanga G. W. K. N. Gamage et.al. 2504.05679 null
2025-04-08 POD: Predictive Object Detection with Single-Frame FMCW LiDAR Point Cloud Yining Shi et.al. 2504.05649 null
2025-04-08 AD-Det: Boosting Object Detection in UAV Images with Focused Small Objects and Balanced Tail Classes Zhenteng Li et.al. 2504.05601 null
2025-04-07 SSLFusion: Scale & Space Aligned Latent Fusion Model for Multimodal 3D Object Detection Bonan Ding et.al. 2504.05170 null
2025-04-07 Inland Waterway Object Detection in Multi-environment: Dataset and Approach Shanshan Wang et.al. 2504.04835 null
2025-04-07 Playing Non-Embedded Card-Based Games with Reinforcement Learning Tianyang Wu et.al. 2504.04783 null
2025-04-07 Feedback-Enhanced Hallucination-Resistant Vision-Language Model for Real-Time Scene Understanding Zahir Alsulaimawi et.al. 2504.04772 null
2025-04-07 Inverse++: Vision-Centric 3D Semantic Occupancy Prediction Assisted with 3D Object Detection Zhenxing Ming et.al. 2504.04732 null
2025-04-06 Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection Jiancheng Pan et.al. 2504.04517 link
2025-04-06 eKalibr-Stereo: Continuous-Time Spatiotemporal Calibration for Event-Based Stereo Visual Systems Shuolong Chen et.al. 2504.04451 link
2025-04-05 Autoregressive High-Order Finite Difference Modulo Imaging: High-Dynamic Range for Computer Vision Applications Brayan Monroy et.al. 2504.04228 null
2025-04-05 An Optimized Density-Based Lane Keeping System for A Cost-Efficient Autonomous Vehicle Platform: AurigaBot V1 Farbod Younesi et.al. 2504.04217 null
2025-04-05 Learning about the Physical World through Analytic Concepts Jianhua Sun et.al. 2504.04170 null
2025-04-04 PF3Det: A Prompted Foundation Feature Assisted Visual LiDAR 3D Detector Kaidong Li et.al. 2504.03563 null
2025-04-04 ZFusion: An Effective Fuser of Camera and 4D Radar for 3D Object Perception in Autonomous Driving Sheng Yang et.al. 2504.03438 null
2025-04-04 Infrared bubble recognition in the Milky Way and beyond using deep learning Shimpei Nishimoto et.al. 2504.03367 null
2025-04-04 Real-Time Roadway Obstacle Detection for Electric Scooters Using Deep Learning and Multi-Sensor Fusion Zeyang Zheng et.al. 2504.03171 null
2025-04-04 Finding the Reflection Point: Unpadding Images to Remove Data Augmentation Artifacts in Large Open Source Image Datasets for Machine Learning Lucas Choi et.al. 2504.03168 null
2025-04-03 LiDAR-based Object Detection with Real-time Voice Specifications Anurag Kulkarni et.al. 2504.02920 null
2025-04-03 BOP Challenge 2024 on Model-Based and Model-Free 6D Object Pose Estimation Van Nguyen Nguyen et.al. 2504.02812 null
2025-04-03 Rip Current Segmentation: A Novel Benchmark and YOLOv8 Baseline Results Andrei Dumitriu et.al. 2504.02558 null
2025-04-03 Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision Xiaofeng Han et.al. 2504.02477 null
2025-04-03 CornerPoint3D: Look at the Nearest Corner Instead of the Center Ruixiao Zhang et.al. 2504.02464 null
2025-04-03 Hyperspectral Remote Sensing Images Salient Object Detection: The First Benchmark Dataset and Baseline Peifu Liu et.al. 2504.02416 null
2025-04-03 SemiISP/SemiIE: Semi-Supervised Image Signal Processor and Image Enhancement Leveraging One-to-Many Mapping sRGB-to-RAW Masakazu Yoshimura et.al. 2504.02345 null
2025-04-03 LLM-Guided Evolution: An Autonomous Model Optimization for Object Detection YiMing Yu et.al. 2504.02280 null
2025-04-02 Cat-Eye Inspired Active-Passive-Composite Aperture-Shared Sub-Terahertz Meta-Imager for Non-Interactive Concealed Object Detection Mingshuang Hu et.al. 2504.01473 null
2025-04-02 CFMD: Dynamic Cross-layer Feature Fusion for Salient Object Detection Jin Lian et.al. 2504.01326 null
2025-04-01 Enabling Efficient Processing of Spiking Neural Networks with On-Chip Learning on Commodity Neuromorphic Processors for Edge AI Systems Rachmad Vidya Wicaksana Putra et.al. 2504.00957 null
2025-04-01 NeuRadar: Neural Radiance Fields for Automotive Radar Point Clouds Mahan Rafidashti et.al. 2504.00859 null
2025-04-01 AttentiveGRU: Recurrent Spatio-Temporal Modeling for Advanced Radar-Based BEV Object Detection Loveneet Saini et.al. 2504.00559 null
2025-04-01 High-Quality Pseudo-Label Generation Based on Visual Prompt Assisted Cloud Model Update Xinrun Xu et.al. 2504.00526 null
2025-04-01 Intrinsic-feature-guided 3D Object Detection Wanjing Zhang et.al. 2504.00382 null
2025-04-01 CamoSAM2: Motion-Appearance Induced Auto-Refining Prompts for Video Camouflaged Object Detection Xin Zhang et.al. 2504.00375 null
2025-03-31 Towards Precise Action Spotting: Addressing Temporal Misalignment in Labels with Dynamic Label Assignment Masato Tamura et.al. 2504.00149 null
2025-03-31 SU-YOLO: Spiking Neural Network for Efficient Underwater Object Detection Chenyang Li et.al. 2503.24389 link
2025-03-31 MB-ORES: A Multi-Branch Object Reasoner for Visual Grounding in Remote Sensing Karim Radouane et.al. 2503.24219 link
2025-03-31 Spectral-Adaptive Modulation Networks for Visual Perception Guhnoo Yun et.al. 2503.23947 null
2025-03-31 Expanding-and-Shrinking Binary Neural Networks Xulong Shi et.al. 2503.23709 link
2025-03-30 Re-Aligning Language to Visual Objects with an Agentic Workflow Yuming Chen et.al. 2503.23508 null
2025-03-30 EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing Hongxiang Jiang et.al. 2503.23330 null
2025-03-29 Context in object detection: a systematic literature review Mahtab Jamali et.al. 2503.23249 null
2025-03-29 Large Self-Supervised Models Bridge the Gap in Domain Adaptive Object Detection Marc-Antoine Lavoie et.al. 2503.23220 null
2025-03-29 A GAN-Enhanced Deep Learning Framework for Rooftop Detection from Historical Aerial Imagery Pengyu Chen et.al. 2503.23200 null
2025-03-29 Intelligent Bear Prevention System Based on Computer Vision: An Approach to Reduce Human-Bear Conflicts in the Tibetan Plateau Area, China Pengyu Chen et.al. 2503.23178 null
2025-03-28 AnnoPage Dataset: Dataset of Non-Textual Elements in Documents with Fine-Grained Categorization Martin Kišš et.al. 2503.22526 null
2025-03-28 Data Quality Matters: Quantifying Image Quality Impact on Machine Learning Performance Christian Steinhauser et.al. 2503.22375 null
2025-03-28 ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection Nandakishor M et.al. 2503.22363 null
2025-03-28 Knowledge Rectification for Camouflaged Object Detection: Unlocking Insights from Low-Quality Data Juwei Guan et.al. 2503.22180 null
2025-03-28 A Survey on Remote Sensing Foundation Models: From Vision to Multimodality Ziyue Huang et.al. 2503.22081 null
2025-03-27 AGILE: A Diffusion-Based Attention-Guided Image and Label Translation for Efficient Cross-Domain Plant Trait Identification Earl Ranario et.al. 2503.22019 null
2025-03-27 FACETS: Efficient Once-for-all Object Detection via Constrained Iterative Search Tony Tran et.al. 2503.21999 null
2025-03-27 Exponentially Weighted Instance-Aware Repeat Factor Sampling for Long-Tailed Object Detection Model Training in Unmanned Aerial Vehicles Surveillance Scenarios Taufiq Ahmed et.al. 2503.21893 null
2025-03-27 Learning Class Prototypes for Unified Sparse Supervised 3D Object Detection Yun Zhu et.al. 2503.21099 link
2025-03-26 SaViD: Spectravista Aesthetic Vision Integration for Robust and Discerning 3D Object Detection in Challenging Environments Tanmoy Dam et.al. 2503.20614 link
2025-03-26 Small Object Detection: A Comprehensive Survey on Challenges, Techniques and Real-World Applications Mahya Nikouei et.al. 2503.20516 null
2025-03-25 Gemini Robotics: Bringing AI into the Physical World Gemini Robotics Team et.al. 2503.20020 null
2025-03-25 Hyperdimensional Uncertainty Quantification for Multimodal Uncertainty Fusion in Autonomous Vehicles Perception Luke Chen et.al. 2503.20011 null
2025-03-25 Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models Ilias Stogiannidis et.al. 2503.19707 null
2025-03-25 BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata Extraction Jan Kohút et.al. 2503.19658 null
2025-03-25 Single Shot AI-assisted quantification of KI-67 proliferation index in breast cancer Deepti Madurai Muthu et.al. 2503.19606 null
2025-03-25 MATT-GS: Masked Attention-based 3DGS for Robot Perception and Object Detection Jee Won Lee et.al. 2503.19330 null
2025-03-25 Multiscale Feature Importance-based Bit Allocation for End-to-End Feature Coding for Machines Junle Liu et.al. 2503.19278 null
2025-03-24 Benchmarking Object Detectors under Real-World Distribution Shifts in Satellite Imagery Sara Al-Emadi et.al. 2503.19202 null
2025-03-24 Pitch Contour Exploration Across Audio Domains: A Vision-Based Transfer Learning Approach Jakob Abeßer et.al. 2503.19161 null
2025-03-24 Cooperative Control of Multi-Quadrotors for Transporting Cable-Suspended Payloads: Obstacle-Aware Planning and Event-Based Nonlinear Model Predictive Control Tohid Kargar Tasooji et.al. 2503.19135 null
2025-03-24 Building Blocks for Robust and Effective Semi-Supervised Real-World Object Detection Moussa Kassem Sbeyti et.al. 2503.18903 null
2025-03-24 LGI-DETR: Local-Global Interaction for UAV Object Detection Zifa Chen et.al. 2503.18785 null
2025-03-25 Frequency Dynamic Convolution for Dense Image Prediction Linwei Chen et.al. 2503.18783 null
2025-03-25 CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection Zhichao Sun et.al. 2503.18430 null
2025-03-24 Vision-Guided Loco-Manipulation with a Snake Robot Adarsh Salagame et.al. 2503.18308 null
2025-03-22 MAMAT: 3D Mamba-Based Atmospheric Turbulence Removal and its Object Detection Capability Paul Hill et.al. 2503.17700 null
2025-03-22 Sense4FL: Vehicular Crowdsensing Enhanced Federated Learning for Autonomous Driving Yanan Ma et.al. 2503.17697 null
2025-03-21 Should we pre-train a decoder in contrastive learning for dense prediction tasks? Sébastien Quetin et.al. 2503.17526 null
2025-03-21 Event-Based Crossing Dataset (EBCD) Joey Mulé et.al. 2503.17499 null
2025-03-21 You Only Look Once at Anytime (AnytimeYOLO): Analysis and Optimization of Early-Exits for Object-Detection Daniel Kuhse et.al. 2503.17497 null
2025-03-21 An Iterative Feedback Mechanism for Improving Natural Language Class Descriptions in Open-Vocabulary Object Detection Louis Y. Kim et.al. 2503.17285 null
2025-03-21 Which2comm: An Efficient Collaborative Perception Framework for 3D Object Detection Duanrui Yu et.al. 2503.17175 null
2025-03-21 Hi-ALPS -- An Experimental Robustness Quantification of Six LiDAR-based Object Detection Systems for Autonomous Driving Alexandra Arzberger et.al. 2503.17168 null
2025-03-21 R-LiViT: A LiDAR-Visual-Thermal Dataset Enabling Vulnerable Road User Focused Roadside Perception Jonas Mirlach et.al. 2503.17122 null
2025-03-21 Exploring Few-Shot Object Detection on Blood Smear Images: A Case Study of Leukocytes and Schistocytes Davide Antonio Mura et.al. 2503.17107 null
2025-03-21 R2LDM: An Efficient 4D Radar Super-Resolution Framework Leveraging Diffusion Model Boyuan Zheng et.al. 2503.17097 null
2025-03-21 Superpowering Open-Vocabulary Object Detectors for X-ray Vision Pablo Garcia-Fernandez et.al. 2503.17071 null
2025-03-21 Scoring, Remember, and Reference: Catching Camouflaged Objects in Videos Yuang Feng et.al. 2503.17050 null
2025-03-21 Salient Object Detection in Traffic Scene through the TSOD10K Dataset Yu Qiu et.al. 2503.16910 null
2025-03-21 Seg2Box: 3D Object Detection by Point-Wise Semantics Supervision Maoji Zheng et.al. 2503.16811 null
2025-03-20 RESFL: An Uncertainty-Aware Framework for Responsible Federated Learning by Balancing Privacy, Fairness and Utility in Autonomous Vehicles Dawood Wasif et.al. 2503.16251 null
2025-03-20 MapGlue: Multimodal Remote Sensing Image Matching Peihao Wu et.al. 2503.16185 null
2025-03-20 Uncertainty Meets Diversity: A Comprehensive Active Learning Framework for Indoor 3D Object Detection Jiangyi Wang et.al. 2503.16125 null
2025-03-20 Semantic-Guided Global-Local Collaborative Networks for Lightweight Image Super-Resolution Wanshu Fan et.al. 2503.16056 null
2025-03-19 DCA: Dividing and Conquering Amnesia in Incremental Object Detection Aoting Zhang et.al. 2503.15295 null
2025-03-19 Test-Time Backdoor Detection for Object Detection Models Hangtao Zhang et.al. 2503.15293 null
2025-03-19 GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector Zechuan Li et.al. 2503.15211 null
2025-03-19 UltraFlwr -- An Efficient Federated Medical and Surgical Object Detection Framework Yang Li et.al. 2503.15161 null
2025-03-19 An Investigation of Beam Density on LiDAR Object Detection Performance Christoph Griesbacher et.al. 2503.15087 null
2025-03-20 Fine-Grained Open-Vocabulary Object Detection with Fined-Grained Prompts: Task, Dataset and Benchmark Ying Liu et.al. 2503.14862 null
2025-03-19 State Space Model Meets Transformer: A New Paradigm for 3D Object Detection Chuxin Wang et.al. 2503.14493 null
2025-03-18 A Revisit to the Decoder for Camouflaged Object Detection Seung Woo Ko et.al. 2503.14035 null
2025-03-18 Shift, Scale and Rotation Invariant Multiple Object Detection using Balanced Joint Transform Correlator Xi Shen et.al. 2503.14034 null
2025-03-18 LEGNet: Lightweight Edge-Gaussian Driven Network for Low-Quality Remote Sensing Image Object Detection Wei Lu et.al. 2503.14012 link
2025-03-18 FrustumFusionNets: A Three-Dimensional Object Detection Network Based on Tractor Road Scene Lili Yang et.al. 2503.13951 null
2025-03-18 Is Discretization Fusion All You Need for Collaborative Perception? Kang Yang et.al. 2503.13946 null
2025-03-18 PSA-SSL: Pose and Size-aware Self-Supervised Learning on LiDAR Point Clouds Barza Nisar et.al. 2503.13914 null
2025-03-18 HSOD-BIT-V2: A New Challenging Benchmarkfor Hyperspectral Salient Object Detection Yuhao Qiu et.al. 2503.13906 null
2025-03-18 TGBFormer: Transformer-GraphFormer Blender Network for Video Object Detection Qiang Qi et.al. 2503.13903 null
2025-03-18 YOLO-LLTS: Real-Time Low-Light Traffic Sign Detection via Prior-Guided Enhancement and Multi-Branch Feature Interaction Ziyu Lin et.al. 2503.13883 null
2025-03-17 Beyond RGB: Adaptive Parallel Processing for RAW Object Detection Shani Gamrian et.al. 2503.13163 null
2025-03-17 SparseAlign: A Fully Sparse Framework for Cooperative Object Detection Yunshuang Yuan et.al. 2503.12982 null
2025-03-17 Efficient Multimodal 3D Object Detector via Instance-Level Contrastive Distillation Zhuoqun Su et.al. 2503.12914 null
2025-03-16 Point Cloud Based Scene Segmentation: A Survey Dan Halperin et.al. 2503.12595 null
2025-03-16 GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing Zilun Zhang et.al. 2503.12490 null
2025-03-15 An Efficient Deep Learning-Based Approach to Automating Invoice Document Validation Aziz Amari et.al. 2503.12267 null
2025-03-15 Minuscule Cell Detection in AS-OCT Images with Progressive Field-of-View Focusing Boyu Chen et.al. 2503.12249 null
2025-03-15 SFMNet: Sparse Focal Modulation for 3D Object Detection Oren Shrout et.al. 2503.12093 null
2025-03-18 UniMamba: Unified Spatial-Channel Representation Learning with Group-Efficient Mamba for LiDAR-based 3D Object Detection Xin Jin et.al. 2503.12009 null
2025-03-14 Rethinking Multi-modal Object Detection from the Perspective of Mono-Modality Feature Learning Tianyi Zhao et.al. 2503.11780 null
2025-03-14 FLASHμ: Fast Localizing And Sizing of Holographic Microparticles Ayush Paliwal et.al. 2503.11538 null
2025-03-14 Falcon: A Remote Sensing Vision-Language Foundation Model Kelu Yao et.al. 2503.11070 null
2025-03-14 FMNet: Frequency-Assisted Mamba-Like Linear Attention Network for Camouflaged Object Detection Ming Deng et.al. 2503.11030 null
2025-03-17 Comparative Analysis of Advanced AI-based Object Detection Models for Pavement Marking Quality Assessment during Daytime Gian Antariksa et.al. 2503.11008 null
2025-03-14 Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection Chuhan Zhang et.al. 2503.11005 null
2025-03-13 The Power of One: A Single Example is All it Takes for Segmentation in VLMs Mir Rayat Imtiaz Hossain et.al. 2503.10779 null
2025-03-13 HeightFormer: Learning Height Prediction in Voxel Features for Roadside Vision Centric 3D Object Detection via Transformer Zhang Zhang et.al. 2503.10777 null
2025-03-15 Semantic-Supervised Spatial-Temporal Fusion for LiDAR-based 3D Object Detection Chaoqun Wang et.al. 2503.10579 null
2025-03-13 RoCo-Sim: Enhancing Roadside Collaborative Perception through Foreground Simulation Yuwen Du et.al. 2503.10410 link
2025-03-13 RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing Fengxiang Wang et.al. 2503.10392 link
2025-03-13 Object detection characteristics in a learning factory environment using YOLOv8 Toni Schneidereit et.al. 2503.10356 null
2025-03-13 TARS: Traffic-Aware Radar Scene Flow Estimation Jialong Wu et.al. 2503.10210 null
2025-03-13 A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection Shenghao Fu et.al. 2503.10152 link
2025-03-13 Deep Learning-Based Direct Leaf Area Estimation using Two RGBD Datasets for Model Development Namal Jayasuriya et.al. 2503.10129 null
2025-03-13 Style Evolving along Chain-of-Thought for Unknown-Domain Object Detection Zihao Zhang et.al. 2503.09968 null
2025-03-12 CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation Hariprasath Govindarajan et.al. 2503.09878 null
2025-03-12 How good are deep learning methods for automated road safety analysis using video data? An experimental study Qingwu Liu et.al. 2503.09807 null
2025-03-12 Deep Learning for Climate Action: Computer Vision Analysis of Visual Narratives on X Katharina Prasse et.al. 2503.09361 null
2025-03-12 Fully-Synthetic Training for Visual Quality Inspection in Automotive Production Christoph Huber et.al. 2503.09354 null
2025-03-12 DitHub: A Modular Framework for Incremental Open-Vocabulary Object Detection Chiara Cappellino et.al. 2503.09271 null
2025-03-12 Polygonizing Roof Segments from High-Resolution Aerial Images Using Yolov8-Based Edge Detection Qipeng Mei et.al. 2503.09187 null
2025-03-12 Dual-Domain Homogeneous Fusion with Cross-Modal Mamba and Progressive Decoder for 3D Object Detection Xuzhong Hu et.al. 2503.08992 null
2025-03-11 GBlobs: Explicit Local Structure via Gaussian Blobs for Improved Cross-Domain LiDAR-based 3D Object Detection Dušan Malić et.al. 2503.08639 null
2025-03-11 Referring to Any Person Qing Jiang et.al. 2503.08507 null
2025-03-11 SuperCap: Multi-resolution Superpixel-based Image Captioning Henry Senior et.al. 2503.08496 null
2025-03-13 Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels Qiming Xia et.al. 2503.08421 null
2025-03-11 Embodied Crowd Counting Runling Long et.al. 2503.08367 null
2025-03-11 Physics-based AI methodology for Material Parameter Extraction from Optical Data M. Koumans et.al. 2503.08183 null
2025-03-11 Bring Remote Sensing Object Detect Into Nature Language Model: Using SFT Method Fei Wang et.al. 2503.08144 null
2025-03-12 Accelerate 3D Object Detection Models via Zero-Shot Attention Key Pruning Lizhen Xu et.al. 2503.08101 link
2025-03-11 SparseVoxFormer: Sparse Voxel-based Transformer for Multi-modal 3D Object Detection Hyeongseok Son et.al. 2503.08092 null
2025-03-11 Simulating Automotive Radar with Lidar and Camera Inputs Peili Song et.al. 2503.08068 null
2025-03-10 YOLOE: Real-Time Seeing Anything Ao Wang et.al. 2503.07465 link
2025-03-10 HGO-YOLO: Advancing Anomaly Behavior Detection with Hierarchical Features and Lightweight Optimized Detection Qizhi Zheng et.al. 2503.07371 null
2025-03-10 Mitigating Hallucinations in YOLO-based Object Detection Models: A Revisit to Out-of-Distribution Detection Weicheng He et.al. 2503.07330 null
2025-03-10 Semantic Communications with Computer Vision Sensing for Edge Video Transmission Yubo Peng et.al. 2503.07252 null
2025-03-10 MIRAM: Masked Image Reconstruction Across Multiple Scales for Breast Lesion Risk Prediction Hung Q. Vo et.al. 2503.07157 null
2025-03-10 A Light Perspective for 3D Object Detection Marcelo Eduardo Pederiva et.al. 2503.07133 null
2025-03-10 SimROD: A Simple Baseline for Raw Object Detection with Global and Local Enhancements Haiyang Xie et.al. 2503.07101 null
2025-03-10 RS2V-L: Vehicle-Mounted LiDAR Data Generation from Roadside Sensor Observations Ruidan Xing et.al. 2503.07085 null
2025-03-10 Availability-aware Sensor Fusion via Unified Canonical Space for 4D Radar, LiDAR, and Camera Dong-Hee Paek et.al. 2503.07029 null
2025-03-10 Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection Wentao Wu et.al. 2503.06948 null
2025-03-06 Fine-Tuning Florence2 for Enhanced Object Detection in Un-constructed Environments: Vision-Language Model Approach Soumyadeep Ro et.al. 2503.04918 null
2025-03-06 Floxels: Fast Unsupervised Voxel Based Scene Flow Estimation David T. Hoffmann et.al. 2503.04718 null
2025-03-06 DEAL-YOLO: Drone-based Efficient Animal Localization using YOLO Aditya Prashant Naidu et.al. 2503.04698 null
2025-03-06 Teach YOLO to Remember: A Self-Distillation Approach for Continual Object Detection Riccardo De Monte et.al. 2503.04688 null
2025-03-09 ReynoldsFlow: Exquisite Flow Estimation via Reynolds Transport Theorem Yu-Hsi Chen et.al. 2503.04500 null
2025-03-06 A lightweight model FDM-YOLO for small target improvement based on YOLOv8 Xuerui Zhang et.al. 2503.04452 null
2025-03-06 Shaken, Not Stirred: A Novel Dataset for Visual Understanding of Glasses in Human-Robot Bartending Tasks Lukáš Gajdošech et.al. 2503.04308 null
2025-03-06 CA-W3D: Leveraging Context-Aware Knowledge for Weakly Supervised Monocular 3D Detection Chupeng Liu et.al. 2503.04154 null
2025-03-06 Robust Computer-Vision based Construction Site Detection for Assistive-Technology Applications Junchi Feng et.al. 2503.04139 null
2025-03-06 Fractional Correspondence Framework in Detection Transformer Masoumeh Zareapoor et.al. 2503.04107 null
2025-03-05 DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance Zhao Yang et.al. 2503.03689 null
2025-03-05 4D Radar Ground Truth Augmentation with LiDAR-to-4D Radar Data Synthesis Woo-Jin Jung et.al. 2503.03637 null
2025-03-05 Simulation-Based Performance Evaluation of 3D Object Detection Methods with Deep Learning for a LiDAR Point Cloud Dataset in a SOTIF-related Use Case Milin Patel et.al. 2503.03548 link
2025-03-05 AI-Driven Multi-Stage Computer Vision System for Defect Detection in Laser-Engraved Industrial Nameplates Adhish Anitha Vilasan et.al. 2503.03395 null
2025-03-05 MIAdapt: Source-free Few-shot Domain Adaptive Object Detection for Microscopic Images Nimra Dilawar et.al. 2503.03370 null
2025-03-05 BEVMOSNet: Multimodal Fusion for BEV Moving Object Segmentation Hiep Truong Cong et.al. 2503.03280 null
2025-03-04 Class-Aware PillarMix: Can Mixed Sample Data Augmentation Enhance 3D Object Detection with Radar Point Clouds? Miao Zhang et.al. 2503.02687 null
2025-03-04 Exploring Model Quantization in GenAI-based Image Inpainting and Detection of Arable Plants Sourav Modak et.al. 2503.02420 null
2025-03-04 Robust detection of overlapping bioacoustic sound events Louis Mahon et.al. 2503.02389 null
2025-03-04 YOLO-PRO: Enhancing Instance-Specific Object Detection with Full-Channel Global Self-Attention Lin Huang et.al. 2503.02348 null
2025-03-04 SSNet: Saliency Prior and State Space Model-based Network for Salient Object Detection in RGB-D Images Gargi Panda et.al. 2503.02270 null
2025-03-03 Generalized Diffusion Detector: Mining Robust Features from Diffusion Models for Domain-Generalized Detection Boyong He et.al. 2503.02101 null
2025-03-03 Uncertainty Representation in a SOTIF-Related Use Case with Dempster-Shafer Theory for LiDAR Sensor-Based Object Detection Milin Patel et.al. 2503.02087 link
2025-03-03 Visual-RFT: Visual Reinforcement Fine-Tuning Ziyu Liu et.al. 2503.01785 link
2025-03-03 Enhancing Object Detection Accuracy in Underwater Sonar Images through Deep Learning-based Denoising Ziyu Wang et.al. 2503.01655 null
2025-03-03 Evaluating Stenosis Detection with Grounding DINO, YOLO, and DINO-DETR Muhammad Musab Ansari et.al. 2503.01601 null
2025-02-28 The Common Objects Underwater (COU) Dataset for Robust Underwater Object Detection Rishi Mukherjee et.al. 2502.20651 null
2025-02-28 RTGen: Real-Time Generative Detection Transformer Chi Ruan et.al. 2502.20622 null
2025-02-28 LV-DOT: LiDAR-visual dynamic obstacle detection and tracking for autonomous robot navigation Zhefan Xu et.al. 2502.20607 null
2025-02-27 Multi-Scale Neighborhood Occupancy Masked Autoencoder for Self-Supervised Learning in LiDAR Point Clouds Mohamed Abdelsamad et.al. 2502.20316 null
2025-02-27 OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels Meng Lou et.al. 2502.20087 link
2025-02-27 Night-Voyager: Consistent and Efficient Nocturnal Vision-Aided State Estimation in Object Maps Tianxiao Gao et.al. 2502.20054 null
2025-02-27 Learning Mask Invariant Mutual Information for Masked Image Modeling Tao Huang et.al. 2502.19718 null
2025-02-27 BEVDiffuser: Plug-and-Play Diffusion Model for BEV Denoising with Ground-Truth Guidance Xin Ye et.al. 2502.19694 null
2025-02-26 Ev-3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event Cameras Hoonhee Cho et.al. 2502.19630 null
2025-02-23 Rewards-based image analysis in microscopy Kamyar Barakati et.al. 2502.18522 null
2025-02-25 Multi-Perspective Data Augmentation for Few-shot Object Detection Anh-Khoa Nguyen Vu et.al. 2502.18195 null
2025-02-25 Progressive Local Alignment for Medical Multimodal Pre-training Huimin Yan et.al. 2502.18047 null
2025-02-25 Automatic Vehicle Detection using DETR: A Transformer-Based Approach for Navigating Treacherous Roads Istiaq Ahmed Fahad et.al. 2502.17843 null
2025-02-24 Semi-Supervised Weed Detection in Vegetable Fields: In-domain and Cross-domain Experiments Boyang Deng et.al. 2502.17673 null
2025-02-24 Experimental validation of UAV search and detection system in real wilderness environment Stella Dumenčić et.al. 2502.17372 null
2025-02-24 LCV2I: Communication-Efficient and High-Performance Collaborative Perception Framework with Low-Resolution LiDAR Xinxin Feng et.al. 2502.17039 null
2025-02-23 Geometry-Aware 3D Salient Object Detection Network Chen Wang et.al. 2502.16488 null
2025-02-26 MQADet: A Plug-and-Play Paradigm for Enhancing Open-Vocabulary Object Detection via Multimodal Question Answering Caixiong Li et.al. 2502.16486 null
2025-02-23 Cross-domain Few-shot Object Detection with Multi-modal Textual Enrichment Zeyu Shangguan et.al. 2502.16469 null
2025-02-23 Deep learning approaches to surgical video segmentation and object detection: A Scoping Review Devanish N. Kamtam et.al. 2502.16459 null
2025-02-22 FeatSharp: Your Vision Model Features, Sharper Mike Ranzinger et.al. 2502.16025 null
2025-02-21 Generative AI Framework for 3D Object Generation in Augmented Reality Majid Behravan et.al. 2502.15869 null
2025-02-21 Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection Yue Sun et.al. 2502.15516 null
2025-02-21 Q-PETR: Quant-aware Position Embedding Transformation for Multi-View 3D Object Detection Jiangyong Yu et.al. 2502.15488 null
2025-02-20 Synth It Like KITTI: Synthetic Data Generation for Object Detection in Driving Scenarios Richard Marcus et.al. 2502.15076 null
2025-02-20 YOLOv12: A Breakdown of the Key Architectural Features Mujadded Al Rabbani Alif et.al. 2502.14740 null
2025-02-20 LXLv2: Enhanced LiDAR Excluded Lean 3D Object Detection with Fusion of 4D Radar and Camera Weiyi Xiong et.al. 2502.14503 null
2025-02-20 ODVerse33: Is the New YOLO Version Always Better? A Multi Domain benchmark from YOLO v5 to v11 Tianyou Jiang et.al. 2502.14314 null
2025-02-19 Image compositing is all you need for data augmentation Ang Jia Ning Shermaine et.al. 2502.13936 null
2025-02-19 MSVCOD:A Large-Scale Multi-Scene Dataset for Video Camouflage Object Detection Shuyong Gao et.al. 2502.13859 null
2025-02-19 An Overall Real-Time Mechanism for Classification and Quality Evaluation of Rice Wanke Xia et.al. 2502.13764 null
2025-02-18 Multiple Distribution Shift -- Aerial (MDS-A): A Dataset for Test-Time Error Detection and Model Adaptation Noel Ngu et.al. 2502.13289 null
2025-02-18 RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection Jingtong Yue et.al. 2502.13071 null
2025-02-18 Task-Oriented Semantic Communication for Stereo-Vision 3D Object Detection Zijian Cao et.al. 2502.12735 null
2025-02-18 DAMamba: Vision State Space Model with Dynamic Adaptive Scan Tanzhe Li et.al. 2502.12627 null
2025-02-18 Gaseous Object Detection Kailai Zhou et.al. 2502.12415 null
2025-02-17 Enhancing Transparent Object Pose Estimation: A Fusion of GDR-Net and Edge Detection Tessa Pulli et.al. 2502.12027 null
2025-02-16 DAViMNet: SSMs-Based Domain Adaptive Object Detection A. Enes Doruk et.al. 2502.11178 null
2025-02-15 CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs Qizhen Lan et.al. 2502.10683 null
2025-02-14 Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding Wenxuan Guo et.al. 2502.10392 null
2025-02-14 Object Detection and Tracking Md Pranto et.al. 2502.10310 null
2025-02-14 Artificial Intelligence to Assess Dental Findings from Panoramic Radiographs -- A Multinational Study Yin-Chih Chelsea Wang et.al. 2502.10277 null
2025-02-13 Instance Segmentation of Scene Sketches Using Natural Image Priors Mia Tang et.al. 2502.09608 null
2025-02-13 Wholly-WOOD: Wholly Leveraging Diversified-quality Labels for Weakly-supervised Oriented Object Detection Yi Yu et.al. 2502.09471 link
2025-02-13 Mitigating the Impact of Prominent Position Shift in Drone-based RGBT Object Detection Yan Zhang et.al. 2502.09311 null
2025-02-12 Uncertainty Aware Human-machine Collaboration in Camouflaged Object Detection Ziyue Yang et.al. 2502.08373 link
2025-02-12 Plantation Monitoring Using Drone Images: A Dataset and Performance Review Yashwanth Karumanchi et.al. 2502.08233 null
2025-02-12 Take What You Need: Flexible Multi-Task Semantic Communications with Channel Adaptation Xiang Chen et.al. 2502.08221 null
2025-02-13 SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image Interpretation Zhiming Ma et.al. 2502.08168 link
2025-02-12 Knowledge Swapping via Learning and Unlearning Mingyu Xing et.al. 2502.08075 null
2025-02-13 Visual-based spatial audio generation system for multi-speaker environments Xiaojing Liu et.al. 2502.07538 null
2025-02-11 Quantitative Analysis of Objects in Prisoner Artworks Thea Christoffersen et.al. 2502.07440 null
2025-02-11 Fast-COS: A Fast One-Stage Object Detector Based on Reparameterized Attention Vision Transformer for Autonomous Driving Novendra Setyawan et.al. 2502.07417 null
2025-02-11 Multi-Task-oriented Nighttime Haze Imaging Enhancer for Vision-driven Measurement Systems Ai Chen et.al. 2502.07351 link
2025-02-11 SparseFormer: Detecting Objects in HRW Shots via Sparse Vision Transformer Wenxi Li et.al. 2502.07216 null
2025-02-11 Dense Object Detection Based on De-homogenized Queries Yueming Huang et.al. 2502.07194 null
2025-02-11 Foreign-Object Detection in High-Voltage Transmission Line Based on Improved YOLOv8m Zhenyue Wang et.al. 2502.07175 null
2025-02-11 A Survey on Mamba Architecture for Vision Applications Fady Ibrahim et.al. 2502.07161 null
2025-02-10 Multimodal Search on a Line Jared Coleman et.al. 2502.07000 null
2025-02-10 AgilePilot: DRL-Based Drone Agent for Real-Time Motion Planning in Dynamic Environments by Leveraging Object Detection Roohan Ahmed Khan et.al. 2502.06725 null
2025-02-10 EdgeMLBalancer: A Self-Adaptive Approach for Dynamic Model Switching on Resource-Constrained Edge Devices Akhila Matathammal et.al. 2502.06493 null
2025-02-10 Enhancing Document Key Information Localization Through Data Augmentation Yue Dai et.al. 2502.06132 null
2025-02-10 Improved YOLOv5s model for key components detection of power transmission lines Chen Chen et.al. 2502.06127 null
2025-02-10 A Novel Multi-Teacher Knowledge Distillation for Real-Time Object Detection using 4D Radar Seung-Hyun Song et.al. 2502.06114 null
2025-02-09 Training-free Anomaly Event Detection via LLM-guided Symbolic Pattern Discovery Yuhui Zeng et.al. 2502.05843 null
2025-02-08 Demystifying Catastrophic Forgetting in Two-Stage Incremental Object Detector Qirui Wu et.al. 2502.05540 null
2025-02-07 LP-DETR: Layer-wise Progressive Relations for Object Detection Zhengjian Kang et.al. 2502.05147 null
2025-02-07 Counting Fish with Temporal Representations of Sonar Video Kai Van Brunt et.al. 2502.05129 null
2025-02-07 DetVPCC: RoI-based Point Cloud Sequence Compression for 3D Object Detection Mingxuan Yan et.al. 2502.04804 null
2025-02-07 MHAF-YOLO: Multi-Branch Heterogeneous Auxiliary Fusion YOLO for accurate object detection Zhiqiang Yang et.al. 2502.04656 null
2025-02-07 AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers Runqing Jiang et.al. 2502.04628 null
2025-02-06 An Optimized YOLOv5 Based Approach For Real-time Vehicle Detection At Road Intersections Using Fisheye Cameras Md. Jahin Alam et.al. 2502.04566 null
2025-02-06 OneTrack-M: A multitask approach to transformer-based MOT models Luiz C. S. de Araujo et.al. 2502.04478 null
2025-02-07 Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances Yi Yu et.al. 2502.04268 null
2025-02-06 An object detection approach for lane change and overtake detection from motion profiles Andrea Benericetti et.al. 2502.04244 null
2025-02-06 YOLOv4: A Breakthrough in Real-Time Object Detection Athulya Sundaresan Geetha et.al. 2502.04161 null
2025-02-06 Advanced Object Detection and Pose Estimation with Hybrid Task Cascade and High-Resolution Networks Yuhui Jin et.al. 2502.03877 null
2025-02-06 Pursuing Better Decision Boundaries for Long-Tailed Object Detection via Category Information Amount Yanbiao Ma et.al. 2502.03852 null
2025-02-06 Single-Domain Generalized Object Detection by Balancing Domain Diversity and Invariance Zhenwei He et.al. 2502.03835 null
2025-02-06 UAV Cognitive Semantic Communications Enabled by Knowledge Graph for Robust Object Detection Xi Song et.al. 2502.03761 null
2025-02-06 RAMOTS: A Real-Time System for Aerial Multi-Object Tracking based on Deep Learning and Big Data Technology Nhat-Tan Do et.al. 2502.03760 null
2025-02-05 An Empirical Study of Methods for Small Object Detection from Satellite Imagery Xiaohui Yuan et.al. 2502.03674 null
2025-02-05 Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics Indrashis Das et.al. 2502.03654 null
2025-02-05 RoboGrasp: A Universal Grasping Policy for Robust Robotic Control Yiqi Huang et.al. 2502.03072 null
2025-02-05 Enhancing Quantum-ready QUBO-based Suppression for Object Detection with Appearance and Confidence Features Keiichiro Yamamura et.al. 2502.02895 null
2025-02-05 RS-YOLOX: A High Precision Detector for Object Detection in Satellite Remote Sensing Images Lei Yang et.al. 2502.02850 null
2025-02-04 Learning the RoPEs: Better 2D and 3D Position Encodings with STRING Connor Schenck et.al. 2502.02562 null
2025-02-04 Uncertainty Quantification for Collaborative Object Detection Under Adversarial Attacks Huiqun Huang et.al. 2502.02537 null
2025-02-04 Improving Generalization Ability for 3D Object Detection by Learning Sparsity-invariant Features Hsin-Cheng Lu et.al. 2502.02322 null
2025-02-05 From Fog to Failure: How Dehazing Can Harm Clear Image Object Detection Ashutosh Kumar et.al. 2502.02027 null
2025-02-04 Memory Efficient Transformer Adapter for Dense Predictions Dong Zhang et.al. 2502.01962 null
2025-02-04 INTACT: Inducing Noise Tolerance through Adversarial Curriculum Training for LiDAR-based Safety-Critical Perception and Autonomy Nastaran Darabi et.al. 2502.01896 null
2025-02-04 SimBEV: A Synthetic Multi-Task Multi-Sensor Driving Data Generation Tool and Dataset Goodarz Mehr et.al. 2502.01894 null
2025-02-03 Reliability-Driven LiDAR-Camera Fusion for Robust 3D Object Detection Reza Sadeghian et.al. 2502.01856 null
2025-02-03 GauCho: Gaussian Distributions with Cholesky Decomposition for Oriented Object Detection Jeffri Murrugarra-LLerena et.al. 2502.01565 null
2025-02-03 Human Body Restoration with One-Step Diffusion Model and A New Benchmark Jue Gong et.al. 2502.01411 null
2025-01-31 Let Human Sketches Help: Empowering Challenging Image Segmentation Task with Freehand Sketches Ying Zang et.al. 2501.19329 null
2025-01-31 GO: The Great Outdoors Multimodal Dataset Peng Jiang et.al. 2501.19274 null
2025-01-31 Early Diagnosis and Severity Assessment of Weligama Coconut Leaf Wilt Disease and Coconut Caterpillar Infestation using Deep Learning-based Image Processing Techniques Samitha Vidhanaarachchi et.al. 2501.18835 null
2025-01-30 Tuning Event Camera Biases Heuristic for Object Detection Applications in Staring Scenarios David El-Chai Ben-Ezra et.al. 2501.18788 null
2025-01-30 Adaptive Object Detection for Indoor Navigation Assistance: A Performance Evaluation of Real-Time Algorithms Abhinav Pratap et.al. 2501.18444 null
2025-01-29 Real Time Scheduling Framework for Multi Object Detection via Spiking Neural Networks Donghwa Kang et.al. 2501.18412 null
2025-01-30 IROAM: Improving Roadside Monocular 3D Object Detection Learning from Autonomous Vehicle Data Domain Zhe Wang et.al. 2501.18162 null
2025-02-03 Efficient Feature Fusion for UAV Object Detection Xudong Wang et.al. 2501.17983 null
2025-01-29 TransRAD: Retentive Vision Transformer for Enhanced Radar Object Detection Lei Cheng et.al. 2501.17977 link
2025-01-28 Object Detection with Deep Learning for Rare Event Search in the GADGET II TPC Tyler Wheeler et.al. 2501.17892 null
2025-01-29 Detection of Oscillation-like Patterns in Eclipsing Binary Light Curves using Neural Network-based Object Detection Algorithms Burak Ulaş et.al. 2501.17538 link
2025-01-30 Assessing the Capability of YOLO- and Transformer-based Object Detectors for Real-time Weed Detection Alicia Allmendinger et.al. 2501.17387 null
2025-01-28 DINOSTAR: Deep Iterative Neural Object Detector Self-Supervised Training for Roadside LiDAR Applications Muhammad Shahbaz et.al. 2501.17076 null
2025-01-28 Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding Akash Kumar et.al. 2501.17053 null
2025-01-28 Approach Towards Semi-Automated Certification for Low Criticality ML-Enabled Airborne Applications Chandrasekar Sridhar et.al. 2501.17028 null
2025-01-28 Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection Xiangyu Gao et.al. 2501.16981 null
2025-01-28 SSF-PAN: Semantic Scene Flow-Based Perception for Autonomous Navigation in Traffic Scenarios Yinqi Chen et.al. 2501.16754 null
2025-01-28 DebugAgent: Efficient and Interpretable Error Slice Discovery for Comprehensive Model Debugging Muxi Chen et.al. 2501.16751 null
2025-01-27 Efficient Object Detection of Marine Debris using Pruned YOLO Model Abi Aryaza et.al. 2501.16571 null
2025-01-27 Object Detection for Medical Image Analysis: Insights from the RT-DETR Model Weijie He et.al. 2501.16469 null
2025-01-27 The Linear Attention Resurrection in Vision Transformer Chuanyang Zheng et.al. 2501.16182 null
2025-01-27 Real-Time Brain Tumor Detection in Intraoperative Ultrasound Using YOLO11: From Model Training to Deployment in the Operating Room Santiago Cepeda et.al. 2501.15994 null
2025-01-26 Breaking the SSL-AL Barrier: A Synergistic Semi-Supervised Active Learning Framework for 3D Object Detection Zengran Wang et.al. 2501.15449 null
2025-01-26 FAVbot: An Autonomous Target Tracking Micro-Robot with Frequency Actuation Control Zhijian Hao et.al. 2501.15426 null
2025-01-26 Doracamom: Joint 3D Detection and Occupancy Prediction with Multi-view 4D Radars and Cameras for Omnidirectional Perception Lianqing Zheng et.al. 2501.15394 null
2025-01-26 iFormer: Integrating ConvNet and Transformer for Mobile Application Chuanyang Zheng et.al. 2501.15369 link
2025-01-25 Explainable YOLO-Based Dyslexia Detection in Synthetic Handwriting Data Nora Fink et.al. 2501.15263 null
2025-01-28 SpikSSD: Better Extraction and Fusion for Object Detection with Spiking Neuron Networks Yimeng Fan et.al. 2501.15151 link
2025-01-25 Comprehensive Evaluation of Cloaking Backdoor Attacks on Object Detector in Real-World Hua Ma et.al. 2501.15101 null
2025-01-24 TD-RD: A Top-Down Benchmark with Real-Time Framework for Road Damage Detection Xi Xiao et.al. 2501.14302 null
2025-01-23 Efficient Precision Control in Object Detection Models for Enhanced and Reliable Ovarian Follicle Counting Vincent Blot et.al. 2501.14036 null
2025-01-23 Enhanced PEC-YOLO for Detecting Improper Safety Gear Wearing Among Power Line Workers Chen Zuguo et.al. 2501.13981 null
2025-01-23 PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection Peiyuan Zhang et.al. 2501.13898 link
2025-01-23 First Lessons Learned of an Artificial Intelligence Robotic System for Autonomous Coarse Waste Recycling Using Multispectral Imaging-Based Methods Timo Lange et.al. 2501.13855 null
2025-01-23 Integrating Causality with Neurochaos Learning: Proposed Approach and Research Agenda Nanjangud C. Narendra et.al. 2501.13763 null
2025-01-23 You Only Crash Once v2: Perceptually Consistent Strong Features for One-Stage Domain Adaptive Detection of Space Terrain Timothy Chase Jr et.al. 2501.13725 null
2025-01-23 YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID Iñaki Erregue et.al. 2501.13710 link
2025-01-24 Multi-aspect Knowledge Distillation with Large Language Model Taegyeong Lee et.al. 2501.13341 null
2025-01-22 MONA: Moving Object Detection from Videos Shot by Dynamic Camera Boxun Hu et.al. 2501.13183 null
2025-01-21 Large-image Object Detection for Fine-grained Recognition of Punches Patterns in Medieval Panel Painting Josh Bruegger et.al. 2501.12489 link
2025-01-21 TOFFE -- Temporally-binned Object Flow from Events for High-speed and Energy-Efficient Object Detection and Tracking Adarsh Kumar Kosta et.al. 2501.12482 null
2025-01-21 Benchmarking Image Perturbations for Testing Automated Driving Assistance Systems Stefano Carlo Lambertenghi et.al. 2501.12269 null
2025-01-21 DLEN: Dual Branch of Transformer for Low-Light Image Enhancement in Dual Domains Junyu Xia et.al. 2501.12235 null
2025-01-21 SVGS-DSGAT: An IoT-Enabled Innovation in Underwater Robotic Object Detection Technology Dongli Wu et.al. 2501.12169 null
2025-01-21 Co-Paced Learning Strategy Based on Confidence for Flying Bird Object Detection Model Training Zi-Wei Sun et.al. 2501.12071 null
2025-01-21 SMamba: Sparse Mamba for Event-based Object Detection Nan Yang et.al. 2501.11971 null
2025-01-20 Enhancing SAR Object Detection with Self-Supervised Pre-training on Masked Auto-Encoders Xinyang Pu et.al. 2501.11249 null
2025-01-19 LiFT: Lightweight, FPGA-tailored 3D object detection based on LiDAR data Konrad Lis et.al. 2501.11159 link
2025-01-19 Advanced technology in railway track monitoring using the GPR Technique: A Review Farhad Kooban et.al. 2501.11132 null
2025-01-19 Green Video Camouflaged Object Detection Xinyu Wang et.al. 2501.10914 null
2025-01-18 ClusterViG: Efficient Globally Aware Vision GNNs via Image Partitioning Dhruv Parikh et.al. 2501.10640 null
2025-01-17 MutualForce: Mutual-Aware Enhancement for 4D Radar-LiDAR 3D Object Detection Xiangyuan Peng et.al. 2501.10266 null
2025-01-17 Leveraging Confident Image Regions for Source-Free Domain-Adaptive Object Detection Mohamed Lamine Mekhalfi et.al. 2501.10081 null
2025-01-17 One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression Keita Miwa et.al. 2501.10064 null
2025-01-17 LWGANet: A Lightweight Group Attention Backbone for Remote Sensing Visual Tasks Wei Lu et.al. 2501.10040 link
2025-01-17 FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis Zhe Chen et.al. 2501.09887 null
2025-01-16 A Simple Aerial Detection Baseline of Multimodal Language Models Qingyun Li et.al. 2501.09720 link
2025-01-16 Practical Continual Forgetting for Pre-trained Vision Models Hongbo Zhao et.al. 2501.09705 link
2025-01-16 Multi-task deep-learning for sleep event detection and stage classification Adriana Anido-Alonso et.al. 2501.09519 link
2025-01-16 The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning Wonjun Jo et.al. 2501.09485 null
2025-01-16 MonoSOWA: Scalable monocular 3D Object detector Without human Annotations Jan Skvrna et.al. 2501.09481 null
2025-01-16 RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection Jianrui Shi et.al. 2501.09465 null
2025-01-16 On the Relation between Optical Aperture and Automotive Object Detection Ofer Bar-Shalom et.al. 2501.09456 null
2025-01-16 SoccerSynth-Detection: A Synthetic Dataset for Soccer Player Detection Haobin Qin et.al. 2501.09281 null
2025-01-15 Polyp detection in colonoscopy images using YOLOv11 Alok Ranjan Sahoo et.al. 2501.09051 null
2025-01-15 PACF: Prototype Augmented Compact Features for Improving Domain Adaptive Object Detection Chenguang Liu et.al. 2501.08605 null
2025-01-14 Predicting Performance of Object Detection Models in Electron Microscopy Using Random Forests Ni Li et.al. 2501.08465 link
2025-01-14 Bootstrapping Corner Cases: High-Resolution Inpainting for Safety Critical Detect and Avoid for Automated Flying Jonathan Lyhs et.al. 2501.08142 null
2025-01-14 Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation Yunzhi Zhuge et.al. 2501.07806 link
2025-01-14 Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding Zhaokai Wang et.al. 2501.07783 link
2025-01-13 SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing Varun Biyyala et.al. 2501.07554 link
2025-01-13 ML Mule: Mobile-Driven Context-Aware Collaborative Learning Haoxiang Yu et.al. 2501.07536 null
2025-01-13 TimberVision: A Multi-Task Dataset and Framework for Log-Component Segmentation and Tracking in Autonomous Forestry Operations Daniel Steininger et.al. 2501.07360 null
2025-01-13 Toward Realistic Camouflaged Object Detection: Benchmarks and Method Zhimeng Xin et.al. 2501.07297 link
2025-01-13 Dual Scale-aware Adaptive Masked Knowledge Distillation for Object Detection ZhouRui Zhang et.al. 2501.07101 null
2025-01-11 CoreNet: Conflict Resolution Network for Point-Pixel Misalignment and Sub-Task Suppression of 3D LiDAR-Camera Object Detection Yiheng Li et.al. 2501.06550 link
2025-01-11 CPDR: Towards Highly-Efficient Salient Object Detection via Crossed Post-decoder Refinement Yijie Li et.al. 2501.06441 null
2025-01-11 FocusDD: Real-World Scene Infusion for Robust Dataset Distillation Youbing Hu et.al. 2501.06405 null
2025-01-10 A Holistically Point-guided Text Framework for Weakly-Supervised Camouflaged Object Detection Tsui Qin Mok et.al. 2501.06038 null
2025-01-10 Minimizing Occlusion Effect on Multi-View Camera Perception in BEV with Multi-Sensor Fusion Sanjay Kumar et.al. 2501.05997 null
2025-01-10 EDNet: Edge-Optimized Small Target Detection in UAV Imagery -- Faster Context Attention, Better Feature Fusion, and Hardware Acceleration Zhifan Song et.al. 2501.05885 null
2025-01-10 Automatic detection of single-electron regime of quantum dots and definition of virtual gates using U-Net and clustering Yui Muto et.al. 2501.05878 null
2025-01-10 Zero-shot Shark Tracking and Biometrics from Aerial Imagery Chinmay K Lalgudi et.al. 2501.05717 null
2025-01-10 Dark Energy Survey Year 6 Results: Synthetic-source Injection Across the Full Survey Using Balrog D. Anbajagane et.al. 2501.05683 null
2025-01-09 Approximate Supervised Object Distance Estimation on Unmanned Surface Vehicles Benjamin Kiefer et.al. 2501.05567 null
2025-01-09 Performance of YOLOv7 in Kitchen Safety While Handling Knife Athulya Sundaresan Geetha et.al. 2501.05399 null
2025-01-09 A Systematic Literature Review on Deep Learning-based Depth Estimation in Computer Vision Ali Rohan et.al. 2501.05147 null
2025-01-09 CorrDiff: Adaptive Delay-aware Detector with Temporal Cue Inputs for Real-time Object Detection Xiang Zhang et.al. 2501.05132 null
2025-01-09 AD-L-JEPA: Self-Supervised Spatial World Models with Joint Embedding Predictive Architecture for Autonomous Driving with LiDAR Data Haoran Zhu et.al. 2501.04969 link
2025-01-09 Online Continual Learning: A Systematic Literature Review of Approaches, Challenges, and Benchmarks Seyed Amir Bidaki et.al. 2501.04897 link
2025-01-08 Video Summarisation with Incident and Context Information using Generative AI Ulindu De Silva et.al. 2501.04764 null
2025-01-08 Boosting Salient Object Detection with Knowledge Distillated from Large Foundation Models Miaoyang He et.al. 2501.04582 null
2025-01-08 RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark Xin Zhang et.al. 2501.04440 link
2025-01-08 FGU3R: Fine-Grained Fusion via Unified 3D Representation for Multimodal 3D Object Detection Guoxin Zhang et.al. 2501.04373 null
2025-01-08 H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving Siran Chen et.al. 2501.04302 null
2025-01-08 UPAQ: A Framework for Real-Time and Energy-Efficient 3D Object Detection in Autonomous Vehicles Abhishek Balasubramaniam et.al. 2501.04213 null
2025-01-07 LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving Lingdong Kong et.al. 2501.04005 null
2025-01-07 Visual question answering: from early developments to recent advances -- a survey Ngoc Dung Huynh et.al. 2501.03939 null
2025-01-07 SCC-YOLO: An Improved Object Detector for Assisting in Brain Tumor Diagnosis Runci Bai et.al. 2501.03836 null
2025-01-08 Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection Xinbin Yuan et.al. 2501.03775 link
2025-01-07 AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features Ruochen Zhang et.al. 2501.03700 null
2025-01-07 Anomaly Triplet-Net: Progress Recognition Model Using Deep Metric Learning Considering Occlusion for Manual Assembly Work Takumi Kitsukawa et.al. 2501.03533 null
2025-01-05 Multispectral Pedestrian Detection with Sparsely Annotated Label Chan Lee et.al. 2501.02640 null
2025-01-05 Generalization-Enhanced Few-Shot Object Detection in Remote Sensing Hui Lin et.al. 2501.02474 link
2025-01-04 V2X-DGPE: Addressing Domain Gaps and Pose Errors for Robust Collaborative 3D Object Detection Sichao Wang et.al. 2501.02363 null
2025-01-04 Accurate Crop Yield Estimation of Blueberries using Deep Learning and Smart Drones Hieu D. Nguyen et.al. 2501.02344 null
2025-01-04 RadarNeXt: Real-Time and Reliable 3D Object Detector Based On 4D mmWave Imaging Radar Liye Jia et.al. 2501.02314 null
2025-01-03 A Separable Self-attention Inspired by the State Space Model for Computer Vision Juntao Zhang et.al. 2501.02040 link
2025-01-03 UAV-DETR: Efficient End-to-End Object Detection for Unmanned Aerial Vehicle Imagery Huaxiang Zhang et.al. 2501.01855 null
2025-01-03 Dual Mutual Learning Network with Global-local Awareness for RGB-D Salient Object Detection Kang Yi et.al. 2501.01648 null
2025-01-02 A Multi-task Supervised Compression Model for Split Computing Yoshitomo Matsubara et.al. 2501.01420 link
2025-01-02 MSC-Bench: Benchmarking and Analyzing Multi-Sensor Corruption for Driving Perception Xiaoshuai Hao et.al. 2501.01037 null
2025-01-01 A Novel Approach using CapsNet and Deep Belief Network for Detection and Identification of Oral Leukopenia Hirthik Mathesh GV et.al. 2501.00876 null
2025-01-01 NMM-HRI: Natural Multi-modal Human-Robot Interaction with Voice and Deictic Posture via Large Language Model Yuzhi Lai et.al. 2501.00785 null
2024-12-31 Gaussian Building Mesh (GBM): Extract a Building's 3D Mesh with Google Earth and Gaussian Splatting Kyle Gao et.al. 2501.00625 null
2024-12-31 B2Net: Camouflaged Object Detection via Boundary Aware and Boundary Fusion Junmin Cai et.al. 2501.00426 null
2024-12-30 TiGDistill-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning Distillation Shaoqing Xu et.al. 2412.20911 link
2024-12-30 Humanoid Robot RHP Friends: Seamless Combination of Autonomous and Teleoperated Tasks in a Nursing Context Mehdi Benallegue et.al. 2412.20770 null
2024-12-30 Solar Filaments Detection using Active Contours Without Edges Sanmoy Bandyopadhyay et.al. 2412.20749 null
2024-12-30 Open-Set Object Detection By Aligning Known Class Representations Hiran Sarkar et.al. 2412.20701 null
2024-12-30 SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection Yuxuan Li et.al. 2412.20665 link
2024-12-30 YOLO-UniOW: Efficient Universal Open-World Object Detection Lihao Liu et.al. 2412.20645 link
2024-12-29 A Novel FPGA-based CNN Hardware Accelerator: Optimization for Convolutional Layers using Karatsuba Ofman Multiplier Amit Sarkar et.al. 2412.20393 null
2024-12-29 Differential Evolution Integrated Hybrid Deep Learning Model for Object Detection in Pre-made Dishes Lujia Lv et.al. 2412.20370 null
2024-12-28 Plastic Waste Classification Using Deep Learning: Insights from the WaDaBa Dataset Suman Kunwar et.al. 2412.20232 null
2024-12-28 SimLTD: Simple Supervised and Semi-Supervised Long-Tailed Object Detection Phi Vu Tran et.al. 2412.20047 null
2024-12-27 Chimera: A Block-Based Neural Architecture Search Framework for Event-Based Object Detection Diego A. Silva et.al. 2412.19646 null
2024-12-27 Optimizing Helmet Detection with Hybrid YOLO Pipelines: A Detailed Analysis Vaikunth M et.al. 2412.19467 null
2024-12-26 Revisiting Monocular 3D Object Detection from Scene-Level Depth Retargeting to Instance-Level Spatial Refinement Qiude Zhang et.al. 2412.19165 null
2024-12-26 From Coin to Data: The Impact of Object Detection on Digital Numismatics Rafael Cabral et.al. 2412.19091 null
2024-12-26 Assessing Pre-trained Models for Transfer Learning through Distribution of Spectral Components Tengxue Zhang et.al. 2412.19085 null
2024-12-25 CGCOD: Class-Guided Camouflaged Object Detection Chenxi Zhang et.al. 2412.18977 null
2024-12-25 HV-BEV: Decoupling Horizontal and Vertical Feature Sampling for Multi-View 3D Object Detection Di Wu et.al. 2412.18884 null
2024-12-25 TSceneJAL: Joint Active Learning of Traffic Scenes for 3D Object Detection Chenyang Lei et.al. 2412.18870 null
2024-12-25 Distortion-Aware Adversarial Attacks on Bounding Boxes of Object Detectors Pham Phuc et.al. 2412.18815 link
2024-12-25 Unified Local and Global Attention Interaction Modeling for Vision Transformers Tan Nguyen et.al. 2412.18778 null
2024-12-24 Sampling Bag of Views for Open-Vocabulary Object Detection Hojun Choi et.al. 2412.18273 null
2024-12-24 Efficient Detection Framework Adaptation for Edge Computing: A Plug-and-play Neural Network Toolbox Enabling Edge Deployment Jiaqi Wu et.al. 2412.18230 null
2024-12-24 Spectrum-oriented Point-supervised Saliency Detector for Hyperspectral Images Peifu Liu et.al. 2412.18112 link
2024-12-24 Multi-Point Positional Insertion Tuning for Small Object Detection Kanoko Goto et.al. 2412.18090 null
2024-12-24 COMO: Cross-Mamba Interaction and Offset-Guided Fusion for Multimodal Object Detection Chang Liu et.al. 2412.18076 null
2024-12-23 Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection Yitong Chen et.al. 2412.17800 link
2024-12-23 Enhanced Temporal Processing in Spiking Neural Networks for Static Object Detection Using 3D Convolutions Huaxu He et.al. 2412.17654 null
2024-12-23 Impact of Evidence Theory Uncertainty on Training Object Detection Models M. Tahasanul Ibrahim et.al. 2412.17405 null
2024-12-23 Feature Based Methods Domain Adaptation for Object Detection: A Review Paper Helia Mohamadi et.al. 2412.17325 null
2024-12-23 Towards Unsupervised Model Selection for Domain Adaptive Object Detection Hengfu Yu et.al. 2412.17284 link
2024-12-22 NumbOD: A Spatial-Frequency Fusion Attack Against Object Detectors Ziqi Zhou et.al. 2412.16955 link
2024-12-22 Separating Drone Point Clouds From Complex Backgrounds by Cluster Filter -- Technical Report for CVPR 2024 UG2 Challenge Hanfang Liang et.al. 2412.16947 null
2024-12-22 Seamless Detection: Unifying Salient Object Detection and Camouflaged Object Detection Yi Liu et.al. 2412.16840 link
2024-12-24 Human-Guided Image Generation for Expanding Small-Scale Training Image Datasets Changjian Chen et.al. 2412.16839 null
2024-12-21 IV-tuning: Parameter-Efficient Transfer Learning for Infrared-Visible Tasks Yaming Zhang et.al. 2412.16654 link
2024-12-20 NeRF-To-Real Tester: Neural Radiance Fields as Test Image Generators for Vision of Autonomous Systems Laura Weihl et.al. 2412.16141 null
2024-12-20 MR-GDINO: Efficient Open-World Continual Object Detection Bowen Dong et.al. 2412.15979 link
2024-12-20 Mask-RadarNet: Enhancing Transformer With Spatial-Temporal Semantic Context for Radar Object Detection in Autonomous Driving Yuzhi Wu et.al. 2412.15595 null
2024-12-19 Exploring Machine Learning Engineering for Object Detection and Tracking by Unmanned Aerial Vehicle (UAV) Aneesha Guna et.al. 2412.15347 null
2024-12-19 Leveraging Color Channel Independence for Improved Unsupervised Object Detection Bastian Jäckl et.al. 2412.15150 null
2024-12-19 A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space Yonghao He et.al. 2412.14680 link
2024-12-19 Progressive Fine-to-Coarse Reconstruction for Accurate Low-Bit Post-Training Quantization in Vision Transformers Rui Ding et.al. 2412.14633 null
2024-12-19 Alignment-Free RGB-T Salient Object Detection: A Large-scale Dataset and Progressive Correlation Network Kunpeng Wang et.al. 2412.14576 link
2024-12-19 SCKD: Semi-Supervised Cross-Modality Knowledge Distillation for 4D Radar Object Detection Ruoyu Xu et.al. 2412.14571 null
2024-12-18 HA-RDet: Hybrid Anchor Rotation Detector for Oriented Object Detection Phuc D. A. Nguyen et.al. 2412.14379 link
2024-12-18 Joint Perception and Prediction for Autonomous Driving: A Survey Lucas Dal'Col et.al. 2412.14088 link
2024-12-18 Object Style Diffusion for Generalized Object Detection in Urban Scene Hao Li et.al. 2412.13815 null
2024-12-18 MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing Chuang Yang et.al. 2412.13684 null
2024-12-18 Comparative Analysis of YOLOv9, YOLOv10 and RT-DETR for Real-Time Weed Detection Ahmet Oğuz Saltık et.al. 2412.13490 null
2024-12-17 Continuous Patient Monitoring with AI: Real-Time Analysis of Video in Hospital Care Settings Paolo Gabriel et.al. 2412.13152 null
2024-12-17 A New Adversarial Perspective for LiDAR-based 3D Object Detection Shijun Zheng et.al. 2412.13017 null
2024-12-17 What is YOLOv6? A Deep Insight into the Object Detection Model Athulya Sundaresan Geetha et.al. 2412.13006 null
2024-12-17 Differential Alignment for Domain Adaptive Object Detection Xinyu He et.al. 2412.12830 null
2024-12-17 RCTrans: Radar-Camera Transformer via Radar Densifier and Sequential Decoder for 3D Object Detection Yiheng Li et.al. 2412.12799 link
2024-12-17 RaCFormer: Towards High-Quality 3D Object Detection via Query-based Radar-Camera Fusion Xiaomeng Chu et.al. 2412.12725 null
2024-12-17 Efficient Oriented Object Detection with Enhanced Small Object Recognition in Aerial Images Zhifei Shi et.al. 2412.12562 null
2024-12-17 CREST: An Efficient Conjointly-trained Spike-driven Framework for Event-based Object Detection Exploiting Spatiotemporal Dynamics Ruixin Mao et.al. 2412.12525 link
2024-12-17 PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts Kun Guo et.al. 2412.12460 link
2024-12-16 Domain Generalization in Autonomous Driving: Evaluating YOLOv8s, RT-DETR, and YOLO-NAS with the ROAD-Almaty Dataset Madiyar Alimov et.al. 2412.12349 null
2024-12-16 Coconut Palm Tree Counting on Drone Images with Deep Object Detection and Synthetic Training Data Tobias Rohe et.al. 2412.11949 null
2024-12-16 Sonar-based Deep Learning in Underwater Robotics: Overview, Robustness and Challenges Martin Aubard et.al. 2412.11840 null
2024-12-16 CLDA-YOLO: Visual Contrastive Learning Based Domain Adaptive YOLO Detector Tianheng Qiu et.al. 2412.11812 null
2024-12-16 PhysAug: A Physical-guided and Frequency-based Data Augmentation for Single-Domain Generalized Object Detection Xiaoran Xu et.al. 2412.11807 link
2024-12-16 Learning UAV-based path planning for efficient localization of objects using prior knowledge Rick van Essen et.al. 2412.11717 null
2024-12-16 Oriented Tiny Object Detection: A Dataset, Benchmark, and Dynamic Unbiased Learning Chang Xu et.al. 2412.11582 null
2024-12-16 HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object Detection Zijian Gu et.al. 2412.11489 link
2024-12-16 Universal Domain Adaptive Object Detection via Dual Probabilistic Alignment Yuanfan Zheng et.al. 2412.11443 link
2024-12-16 V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations Jin-Cheng Jhang et.al. 2412.11412 null
2024-12-15 From Simple to Professional: A Combinatorial Controllable Image Captioning Agent Xinran Wang et.al. 2412.11025 link
2024-12-13 A dual contrastive framework Yuan Sun et.al. 2412.10348 null
2024-12-13 MVQ:Towards Efficient DNN Compression and Acceleration with Masked Vector Quantization Shuaiting Li et.al. 2412.10261 null
2024-12-13 Copy-Move Detection in Optical Microscopy: A Segmentation Network and A Dataset Hao-Chiang Shao et.al. 2412.10258 null
2024-12-13 UN-DETR: Promoting Objectness Learning via Joint Supervision for Unknown Object Detection Haomiao Liu et.al. 2412.10176 link
2024-12-13 HS-FPN: High Frequency and Spatial Perception FPN for Tiny Object Detection Zican Shi et.al. 2412.10116 null
2024-12-13 RemDet: Rethinking Efficient Model Design for UAV Object Detection Chen Li et.al. 2412.10040 link
2024-12-13 Timealign: A multi-modal object detection method for time misalignment fusing in autonomous driving Zhihang Song et.al. 2412.10033 null
2024-12-13 Object-Focused Data Selection for Dense Prediction Tasks Niclas Popp et.al. 2412.10032 null
2024-12-13 CP-DETR: Concept Prompt Guide DETR Toward Stronger Universal Object Detection Qibo Chen et.al. 2412.09799 null
2024-12-12 FD2-Net: Frequency-Driven Feature Decomposition Network for Infrared-Visible Object Detection Ke Li et.al. 2412.09258 null
2024-12-12 UADet: A Remarkably Simple Yet Effective Uncertainty-Aware Open-Set Object Detection Framework Silin Cheng et.al. 2412.09229 null
2024-12-12 ContextHOI: Spatial Context Learning for Human-Object Interaction Detection Mingda Jia et.al. 2412.09050 null
2024-12-12 STEAM: Squeeze and Transform Enhanced Attention Module Rishabh Sabharwal et.al. 2412.09023 null
2024-12-12 Sensing for Space Safety and Sustainability: A Deep Learning Approach with Vision Transformers Wenxuan Zhang et.al. 2412.08913 null
2024-12-11 DALI: Domain Adaptive LiDAR Object Detection via Distribution-level and Instance-level Pseudo Label Denoising Xiaohu Lu et.al. 2412.08806 link
2024-12-11 Utilizing Multi-step Loss for Single Image Reflection Removal Abdelrahman Elnenaey et.al. 2412.08582 link
2024-12-11 PointCFormer: a Relation-based Progressive Feature Extraction Network for Point Cloud Completion Yi Zhong et.al. 2412.08421 null
2024-12-13 Physical Informed Driving World Model Zhuoran Yang et.al. 2412.08410 null
2024-12-11 Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation Jiaming Lv et.al. 2412.08139 null
2024-12-11 DTAA: A Detect, Track and Avoid Architecture for navigation in spaces with Multiple Velocity Objects Samuel Nordström et.al. 2412.08121 null
2024-12-11 THUD++: Large-Scale Dynamic Indoor Scene Dataset and Benchmark for Mobile Robots Zeshun Li et.al. 2412.08096 null
2024-12-11 MAGIC: Mastering Physical Adversarial Generation in Context through Collaborative LLM Agents Yun Xing et.al. 2412.08014 null
2024-12-10 Low-Latency Scalable Streaming for Event-Based Vision Andrew Hamara et.al. 2412.07889 null
2024-12-10 Multimodal Contextualized Support for Enhancing Video Retrieval System Quoc-Bao Nguyen-Le et.al. 2412.07584 null
2024-12-10 Making the Flow Glow -- Robot Perception under Severe Lighting Conditions using Normalizing Flow Gradients Simon Kristoffersson Lind et.al. 2412.07565 link
2024-12-10 Enhancing 3D Object Detection in Autonomous Vehicles Based on Synthetic Virtual Environment Analysis Vladislav Li et.al. 2412.07509 null
2024-12-10 DSFEC: Efficient and Deployable Deep Radar Object Detection Gayathri Dandugula et.al. 2412.07411 null
2024-12-10 Benchmarking Vision-Based Object Tracking for USVs in Complex Maritime Environments Muhayy Ud Din et.al. 2412.07392 null
2024-12-09 FlexEvent: Event Camera Object Detection at Arbitrary Frequencies Dongyue Lu et.al. 2412.06708 null
2024-12-09 EMOv2: Pushing 5M Vision Model Frontier Jiangning Zhang et.al. 2412.06674 link
2024-12-09 Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark Dataset Xiao Wang et.al. 2412.06647 null
2024-12-09 Self-Paced Learning Strategy with Easy Sample Prior Based on Confidence for the Flying Bird Object Detection Model Training Zi-Wei Sun et.al. 2412.06306 null
2024-12-09 No Annotations for Object Detection in Art through Stable Diffusion Patrick Ramos et.al. 2412.06286 link
2024-12-09 DenseVLM: A Retrieval and Decoupled Alignment Framework for Open-Vocabulary Dense Prediction Yunheng Li et.al. 2412.06244 null
2024-12-09 A Real-Time Defense Against Object Vanishing Adversarial Patch Attacks for Object Detection in Autonomous Vehicles Jaden Mu et.al. 2412.06215 null
2024-12-09 PoLaRIS Dataset: A Maritime Object Detection and Tracking Dataset in Pohang Canal Jiwon Choi et.al. 2412.06192 null
2024-12-08 Tiny Object Detection with Single Point Supervision Haoran Zhu et.al. 2412.05837 null
2024-12-07 Rethinking Annotation for Object Detection: Is Annotating Small-size Instances Worth Its Cost? Yusuke Hosoya et.al. 2412.05611 null
2024-12-06 From classical techniques to convolution-based models: A review of object detection algorithms Fnu Neha et.al. 2412.05252 null
2024-12-06 Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection Chaoda Zheng et.al. 2412.05154 link
2024-12-06 DEYOLO: Dual-Feature-Enhancement YOLO for Cross-Modality Object Detection Yishuo Chen et.al. 2412.04931 link
2024-12-06 Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection Khurram Azeem Hashmi et.al. 2412.04915 null
2024-12-05 Cubify Anything: Scaling Indoor 3D Object Detection Justin Lazarow et.al. 2412.04458 null
2024-12-05 Reflective Teacher: Semi-Supervised Multimodal 3D Object Detection in Bird's-Eye-View via Uncertainty Measure Saheli Hazra et.al. 2412.04337 null
2024-12-05 YOLO-CCA: A Context-Based Approach for Traffic Sign Detection Linfeng Jiang et.al. 2412.04289 link
2024-12-05 DEIM: DETR with Improved Matching for Fast Convergence Shihua Huang et.al. 2412.04234 link
2024-12-05 Frequency-Adaptive Low-Latency Object Detection Using Events and Frames Haitian Zhang et.al. 2412.04149 null
2024-12-05 Thermal and RGB Images Work Better Together in Wind Turbine Damage Detection Serhii Svystun et.al. 2412.04114 null
2024-12-05 SoRA: Singular Value Decomposed Low-Rank Adaptation for Domain Generalizable Representation Learning Seokju Yun et.al. 2412.04077 null
2024-12-05 Space to Policy: Scalable Brick Kiln Detection and Automatic Compliance Monitoring with Geospatial Data Zeel B Patel et.al. 2412.04065 null
2024-12-05 UNCOVER: Unknown Class Object Detection for Autonomous Vehicles in Real-time Lars Schmarje et.al. 2412.03986 null
2024-12-05 MT3DNet: Multi-Task learning Network for 3D Surgical Scene Reconstruction Mithun Parab et.al. 2412.03928 null
2024-12-04 Perception Tokens Enhance Visual Reasoning in Multimodal Language Models Mahtab Bigverdi et.al. 2412.03548 null
2024-12-04 Data Fusion of Semantic and Depth Information in the Context of Object Detection Md Abu Yusuf et.al. 2412.03490 null
2024-12-04 Task-driven Image Fusion with Learnable Fusion Loss Haowen Bai et.al. 2412.03240 null
2024-12-04 ObjectFinder: Open-Vocabulary Assistive System for Interactive Object Search by Blind People Ruiping Liu et.al. 2412.03118 null
2024-12-04 TREND: Unsupervised 3D Representation Learning via Temporal Forecasting for LiDAR Perception Runjian Chen et.al. 2412.03054 null
2024-12-04 Assessing the performance of CT image denoisers using Laguerre-Gauss Channelized Hotelling Observer for lesion detection Prabhat Kc et.al. 2412.02920 null
2024-12-03 EvRT-DETR: The Surprising Effectiveness of DETR-based Detection for Event Cameras Dmitrii Torbunov et.al. 2412.02890 null
2024-12-03 Optimized CNNs for Rapid 3D Point Cloud Object Recognition Tianyi Lyu et.al. 2412.02855 null
2024-12-03 Gaussian Splatting Under Attack: Investigating Adversarial Noise in 3D Objects Abdurrahman Zeybey et.al. 2412.02803 null
2024-12-03 SJTU:Spatial judgments in multimodal models towards unified segmentation through coordinate detection Joongwon Chae et.al. 2412.02565 null
2024-12-03 Underload: Defending against Latency Attacks for Object Detectors on Edge Devices Tianyi Wang et.al. 2412.02171 null
2024-12-03 Redundant Queries in DETR-Based 3D Detection Methods: Unnecessary and Prunable Lizhen Xu et.al. 2412.02054 null
2024-12-02 Smart Parking with Pixel-Wise ROI Selection for Vehicle Detection Using YOLOv8, YOLOv9, YOLOv10, and YOLOv11 Gustavo P. C. P. da Luz et.al. 2412.01983 null
2024-12-02 HPRM: High-Performance Robotic Middleware for Intelligent Autonomous Systems Jacky Kwok et.al. 2412.01799 null
2024-12-02 Identifying Reliable Predictions in Detection Transformers Young-Jin Park et.al. 2412.01782 null
2024-12-02 FEVER-OOD: Free Energy Vulnerability Elimination for Robust Out-of-Distribution Detection Brian K. S. Isaac-Medina et.al. 2412.01596 null
2024-12-02 Divide-and-Conquer: Confluent Triple-Flow Network for RGB-T Salient Object Detection Hao Tang et.al. 2412.01556 null
2024-12-03 GFreeDet: Exploiting Gaussian Splatting and Foundation Models for Model-free Unseen Object Detection in the BOP Challenge 2024 Xingyu Liu et.al. 2412.01552 null
2024-12-02 Improving Object Detection by Modifying Synthetic Data with Explainable AI Nitish Mital et.al. 2412.01477 null
2024-11-29 SpaRC: Sparse Radar-Camera Fusion for 3D Object Detection Philipp Wolters et.al. 2411.19860 null
2024-11-29 Feedback-driven object detection and iterative model improvement Sönke Tenckhoff et.al. 2411.19835 link
2024-11-29 Real-Time Anomaly Detection in Video Streams Fabien Poirier et.al. 2411.19731 null
2024-11-29 LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention Zewen Du et.al. 2411.19585 link
2024-11-29 Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding Wenbo Zhang et.al. 2411.19551 null
2024-11-28 Automatic Prompt Generation and Grounding Object Detection for Zero-Shot Image Anomaly Detection Tsun-Hin Cheung et.al. 2411.19220 null
2024-11-28 Co-Learning: Towards Semi-Supervised Object Detection with Road-side Cameras Jicheng Yuan et.al. 2411.19143 null
2024-11-28 On Moving Object Segmentation from Monocular Video with Transformers Christian Homeyer et.al. 2411.19141 null
2024-11-28 Dynamic Attention and Bi-directional Fusion for Safety Helmet Wearing Detection Junwei Feng et.al. 2411.19071 null
2024-11-28 MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers Jongseong Bae et.al. 2411.18995 null
2024-11-27 Efficient Dynamic LiDAR Odometry for Mobile Robots with Structured Point Clouds Jonathan Lichtenfeld et.al. 2411.18443 link
2024-11-27 Deep Fourier-embedded Network for Bi-modal Salient Object Detection Pengfei Lyu et.al. 2411.18409 link
2024-11-27 Optimizing Multispectral Object Detection: A Bag of Tricks and Comprehensive Benchmarks Chen Zhou et.al. 2411.18288 link
2024-11-27 From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects Zizhao Li et.al. 2411.18207 link
2024-11-27 RPEE-HEADS: A Novel Benchmark for Pedestrian Head Detection in Crowd Videos Mohamad Abubaker et.al. 2411.18164 null
2024-11-27 ROICtrl: Boosting Instance Control for Visual Generation Yuchao Gu et.al. 2411.17949 null
2024-11-26 Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning Hoàng-Ân Lê et.al. 2411.17536 link
2024-11-26 TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba Xiaowen Ma et.al. 2411.17473 link
2024-11-26 Communication-Efficient Cooperative SLAMMOT via Determining the Number of Collaboration Vehicles Susu Fang et.al. 2411.17432 null
2024-11-26 DGNN-YOLO: Dynamic Graph Neural Networks with YOLO11 for Small Object Detection and Tracking in Traffic Surveillance Shahriar Soudeep et.al. 2411.17251 null
2024-11-26 Event-based Spiking Neural Networks for Object Detection: A Review of Datasets, Architectures, Learning Rules, and Implementation Craig Iaboni et.al. 2411.17006 link
2024-11-25 Online Episodic Memory Visual Query Localization with Egocentric Streaming Object Memory Zaira Manigrasso et.al. 2411.16934 null
2024-11-25 Open Vocabulary Monocular 3D Object Detection Jin Yao et.al. 2411.16833 link
2024-11-25 Imperceptible Adversarial Examples in the Physical World Weilin Xu et.al. 2411.16622 null
2024-11-25 STDWeb: Simple Transient Detection pipeline for the Web Sergey Karpov et.al. 2411.16470 null
2024-11-25 Machine Learning for the Digital Typhoon Dataset: Extensions to Multiple Basins and New Developments in Representations and Tasks Asanobu Kitamoto et.al. 2411.16421 link
2024-11-26 CutS3D: Cutting Semantics in 3D for 2D Unsupervised Instance Segmentation Leon Sick et.al. 2411.16319 null
2024-11-25 Diagnosis of diabetic retinopathy using machine learning & deep learning technique Eric Shah et.al. 2411.16250 null
2024-11-25 Interpreting Object-level Foundation Models via Visual Precision Search Ruoyu Chen et.al. 2411.16198 null
2024-11-25 Learn from Foundation Model: Fruit Detection Model without Manual Annotation Yanan Wang et.al. 2411.16196 null
2024-11-25 CIA: Controllable Image Augmentation Framework Based on Stable Diffusion Mohamed Benkedadra et.al. 2411.16128 null
2024-11-25 You only thermoelastically deform once: Point Absorber Detection in LIGO Test Masses with YOLO Simon R. Goode et.al. 2411.16104 null
2024-11-25 Leverage Task Context for Object Affordance Ranking Haojie Huang et.al. 2411.16082 null
2024-11-22 A Real-Time DETR Approach to Bangladesh Road Object Detection for Autonomous Vehicles Irfan Nafiz Shahan et.al. 2411.15110 null
2024-11-22 MSSF: A 4D Radar and Camera Fusion Framework With Multi-Stage Sampling for 3D Object Detection in Autonomous Driving Hongsi Liu et.al. 2411.15016 null
2024-11-22 VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving Haiming Zhang et.al. 2411.14716 null
2024-11-21 Unveiling the Hidden: A Comprehensive Evaluation of Underwater Image Enhancement and Its Impact on Object Detection Ali Awad et.al. 2411.14626 null
2024-11-21 DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding Tianhe Ren et.al. 2411.14347 link
2024-11-21 AnywhereDoor: Multi-Target Backdoor Attacks on Object Detection Jialin Lu et.al. 2411.14243 null
2024-11-21 Transforming Static Images Using Generative Models for Video Salient Object Detection Suhwan Cho et.al. 2411.13975 link
2024-11-21 Multitask Learning for SAR Ship Detection with Gaussian-Mask Joint Segmentation Ming Zhao et.al. 2411.13847 null
2024-11-20 MambaDETR: Query-based Temporal Modeling using State Space Model for Multi-View 3D Object Detection Tong Ning et.al. 2411.13628 null
2024-11-20 DIS-Mine: Instance Segmentation for Disaster-Awareness in Poor-Light Condition in Underground Mines Mizanur Rahman Jewel et.al. 2411.13544 null
2024-11-20 A Resource Efficient Fusion Network for Object Detection in Bird's-Eye View using Camera and Raw Radar Data Kavin Chandrasekaran et.al. 2411.13311 link
2024-11-20 VADet: Multi-frame LiDAR 3D Object Detection using Variable Aggregation Chengjie Huang et.al. 2411.13186 null
2024-11-20 RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image Generation Christoph Reinders et.al. 2411.13150 link
2024-11-20 YCB-LUMA: YCB Object Dataset with Luminance Keying for Object Localization Thomas Pöllabauer et.al. 2411.13149 link
2024-11-20 Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension Yongdong Luo et.al. 2411.13093 link
2024-11-20 Bounding-box Watermarking: Defense against Model Extraction Attacks on Object Detectors Satoru Koda et.al. 2411.13047 null
2024-11-20 Collaborative Feature-Logits Contrastive Learning for Open-Set Semi-Supervised Object Detection Xinhao Zhong et.al. 2411.13001 null
2024-11-19 Maps from Motion (MfM): Generating 2D Semantic Maps from Sparse Multi-view Images Matteo Toso et.al. 2411.12620 null
2024-11-19 GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving Shaoqing Xu et.al. 2411.12452 null
2024-11-19 Physics-Guided Detector for SAR Airplanes Zhongling Huang et.al. 2411.12301 link
2024-11-18 Scaling Deep Learning Research with Kubernetes on the NRP Nautilus HyperCluster J. Alex Hurt et.al. 2411.12038 null
2024-11-18 LightFFDNets: Lightweight Convolutional Neural Networks for Rapid Facial Forgery Detection Günel Jabbarlı et.al. 2411.11826 null
2024-11-18 WoodYOLO: A Novel Object Detector for Wood Species Detection in Microscopic Images Lars Nieradzik et.al. 2411.11738 null
2024-11-18 Exploring Emerging Trends and Research Opportunities in Visual Place Recognition Antonios Gasteratos et.al. 2411.11481 null
2024-11-18 SL-YOLO: A Stronger and Lighter Drone Target Detection Model Defan Chen et.al. 2411.11477 null
2024-11-19 EVT: Efficient View Transformation for Multi-Modal 3D Object Detection Yongjin Lee et.al. 2411.10715 null
2024-11-15 Vision Eagle Attention: A New Lens for Advancing Image Classification Mahmudul Hasan et.al. 2411.10564 link
2024-11-15 Interactive Image-Based Aphid Counting in Yellow Water Traps under Stirring Actions Xumin Gao et.al. 2411.10357 null
2024-11-15 RETR: Multi-View Radar Detection Transformer for Indoor Perception Ryoma Yataka et.al. 2411.10293 null
2024-11-15 Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning Jingru Yang et.al. 2411.10252 null
2024-11-15 Real-Time AI-Driven People Tracking and Counting Using Overhead Cameras Ishrath Ahamed et.al. 2411.10072 null
2024-11-15 Diachronic Document Dataset for Semantic Layout Analysis Thibault Clérice et.al. 2411.10068 null
2024-11-14 Adversarial Attacks Using Differentiable Rendering: A Survey Matthew Hull et.al. 2411.09749 null
2024-11-14 Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration Yifan Shao et.al. 2411.09604 link
2024-11-14 Long-Tailed Object Detection Pre-training: Dynamic Rebalancing Contrastive Learning with Dual Reconstruction Chen-Long Duan et.al. 2411.09453 null
2024-11-14 Instruction-Driven Fusion of Infrared-Visible Images: Tailoring for Diverse Downstream Tasks Zengyi Yang et.al. 2411.09387 null
2024-11-14 DT-JRD: Deep Transformer based Just Recognizable Difference Prediction Model for Video Coding for Machines Junqi Liu et.al. 2411.09308 null
2024-11-14 Cross-Modal Consistency in Multimodal Large Language Models Xiang Zhang et.al. 2411.09273 null
2024-11-14 LEAP:D -- A Novel Prompt-based Approach for Domain-Generalized Aerial Object Detection Chanyeong Park et.al. 2411.09180 null
2024-11-13 Multimodal Object Detection using Depth and Image Data for Manufacturing Parts Nazanin Mahjourian et.al. 2411.09062 null
2024-11-13 DART-LLM: Dependency-Aware Multi-Robot Task Decomposition and Execution using Large Language Models Yongdong Wang et.al. 2411.09022 null
2024-11-13 UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation Chengyuan Zhang et.al. 2411.08569 null
2024-11-13 Methodology for a Statistical Analysis of Influencing Factors on 3D Object Detection Performance Anton Kuznietsov et.al. 2411.08482 null
2024-11-13 V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising Diffusion Xun Huang et.al. 2411.08402 link
2024-11-12 Large-scale Remote Sensing Image Target Recognition and Automatic Annotation Wuzheng Dong et.al. 2411.07802 link
2024-11-12 Efficient 3D Perception on Multi-Sweep Point Cloud with Gumbel Spatial Pruning Jianhao Li et.al. 2411.07742 null
2024-11-12 Depthwise Separable Convolutions with Deep Residual Convolutions Md Arid Hasan et.al. 2411.07544 null
2024-11-11 Transformers for Charged Particle Track Reconstruction in High Energy Physics Samuel Van Stroud et.al. 2411.07149 null
2024-11-11 Multi-scale Frequency Enhancement Network for Blind Image Deblurring Yawen Xiang et.al. 2411.06893 null
2024-11-11 Fast and Efficient Transformer-based Method for Bird's Eye View Instance Prediction Miguel Antunes-García et.al. 2411.06851 link
2024-11-11 United Domain Cognition Network for Salient Object Detection in Optical Remote Sensing Images Yanguang Sun et.al. 2411.06703 link
2024-11-11 Track Any Peppers: Weakly Supervised Sweet Pepper Tracking Using VLMs Jia Syuen Lim et.al. 2411.06702 null
2024-11-11 LFSamba: Marry SAM with Mamba for Light Field Salient Object Detection Zhengyi Liu et.al. 2411.06652 null
2024-11-09 LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation Weijie Ma et.al. 2411.06173 link
2024-11-09 AI-Compass: A Comprehensive and Effective Multi-module Testing Tool for AI Systems Zhiyu Zhu et.al. 2411.06146 null
2024-11-09 Pattern Integration and Enhancement Vision Transformer for Self-Supervised Learning in Remote Sensing Kaixuan Lu et.al. 2411.06091 null
2024-11-09 An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal Models Fatemeh Shiri et.al. 2411.06048 link
2024-11-08 Open-set object detection: towards unified problem formulation and benchmarking Hejer Ammar et.al. 2411.05564 null
2024-11-08 ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving Tao Ma et.al. 2411.05311 null
2024-11-08 SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection Yun Zhao et.al. 2411.05292 null
2024-11-07 On the Inherent Robustness of One-Stage Object Detection against Out-of-Distribution Data Aitor Martinez-Seras et.al. 2411.04586 null
2024-11-07 l0-Regularized Sparse Coding-based Interpretable Network for Multi-Modal Image Fusion Gargi Panda et.al. 2411.04519 null
2024-11-07 Pose2Trajectory: Using Transformers on Body Pose to Predict Tennis Player's Trajectory Ali K. AlShami et.al. 2411.04501 null
2024-11-08 SuperQ-GRASP: Superquadrics-based Grasp Pose Estimation on Larger Objects for Mobile-Manipulation Xun Tu et.al. 2411.04386 null
2024-11-07 UEVAVD: A Dataset for Developing UAV's Eye View Active Object Detection Xinhua Jiang et.al. 2411.04348 null
2024-11-07 GazeGen: Gaze-Driven User Interaction for Visual Content Generation He-Yen Hsieh et.al. 2411.04335 null
2024-11-06 Efficient Fourier Filtering Network with Contrastive Learning for UAV-based Unaligned Bi-modal Salient Object Detection Pengfei Lyu et.al. 2411.03728 link
2024-11-06 Estimation of Psychosocial Work Environment Exposures Through Video Object Detection. Proof of Concept Using CCTV Footage Claus D. Hansen et.al. 2411.03724 null
2024-11-05 An Application-Agnostic Automatic Target Recognition System Using Vision Language Models Anthony Palladino et.al. 2411.03491 null
2024-11-05 Self-supervised cross-modality learning for uncertainty-aware object detection and recognition in applications which lack pre-labelled training data Irum Mehboob et.al. 2411.03082 null
2024-11-05 CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object Detection Jisong Kim et.al. 2411.03013 null
2024-11-05 Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery Bowei Du et.al. 2411.02861 null
2024-11-05 Correlation of Object Detection Performance with Visual Saliency and Depth Estimation Matthias Bartolo et.al. 2411.02844 link
2024-11-05 ERUP-YOLO: Enhancing Object Detection Robustness for Adverse Weather Condition by Unified Image-Adaptive Processing Yuka Ogino et.al. 2411.02799 null
2024-11-05 Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection Yifan Wang et.al. 2411.02747 null
2024-11-05 Analysis of Multi-epoch JWST Images of $\sim 300$ Little Red Dots: Tentative Detection of Variability in a Minority of Sources Zijian Zhang et.al. 2411.02729 null
2024-11-04 Intelligent Video Recording Optimization using Activity Detection for Surveillance Systems Youssef Elmir et.al. 2411.02632 null
2024-11-04 SIRA: Scalable Inter-frame Relation and Association for Radar Perception Ryoma Yataka et.al. 2411.02220 null
2024-11-04 Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation Yan Li et.al. 2411.02057 link
2024-11-04 V-CAS: A Realtime Vehicle Anti Collision System Using Vision Transformer on Multi-Camera Streams Muhammad Waqas Ashraf et.al. 2411.01963 null
2024-11-04 Exploiting Contextual Uncertainty of Visual Data for Efficient Training of Deep Models Sharat Agarwal et.al. 2411.01925 null
2024-11-04 LiDAttack: Robust Black-box Attack on LiDAR-based Object Detection Jinyin Chen et.al. 2411.01889 link
2024-11-03 ROAD-Waymo: Action Awareness at Scale for Autonomous Driving Salman Khan et.al. 2411.01683 null
2024-11-03 OSAD: Open-Set Aircraft Detection in SAR Images Xiayang Xiao et.al. 2411.01597 null
2024-11-03 One for All: Multi-Domain Joint Training for Point Cloud Based 3D Object Detection Zhenyu Wang et.al. 2411.01584 null
2024-11-03 A Visual Question Answering Method for SAR Ship: Breaking the Requirement for Multimodal Dataset Construction and Model Fine-Tuning Fei Wang et.al. 2411.01445 null
2024-11-03 Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision Xiangzhong Luo et.al. 2411.01431 null
2024-10-31 ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D Images Timing Yang et.al. 2410.24001 link
2024-10-31 Localization, balance and affinity: a stronger multifaceted collaborative salient object detector in remote sensing images Yakun Xie et.al. 2410.23991 null
2024-10-31 Uncertainty Estimation for 3D Object Detection via Evidential Learning Nikita Durasov et.al. 2410.23910 null
2024-10-31 From Web Data to Real Fields: Low-Cost Unsupervised Domain Adaptation for Agricultural Robots Vasileios Tzouras et.al. 2410.23906 null
2024-10-31 Open-Set 3D object detection in LiDAR data as an Out-of-Distribution problem Louis Soum-Fontez et.al. 2410.23767 null
2024-10-31 Context-Aware Token Selection and Packing for Enhanced Vision Transformer Tianyi Zhang et.al. 2410.23608 null
2024-10-30 EMMA: End-to-End Multimodal Model for Autonomous Driving Jyh-Jing Hwang et.al. 2410.23262 null
2024-10-30 S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving Maciej K. Wozniak et.al. 2410.23085 null
2024-10-30 First Place Solution to the ECCV 2024 ROAD++ Challenge @ ROAD++ Spatiotemporal Agent Detection 2024 Tengfei Zhang et.al. 2410.23077 null
2024-10-30 AdaptiveISP: Learning an Adaptive Image Signal Processor for Object Detection Yujin Wang et.al. 2410.22939 null
2024-10-29 Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection Gyusam Chang et.al. 2410.22461 null
2024-10-29 Lighten CARAFE: Dynamic Lightweight Upsampling with Guided Reassemble Kernels Ruigang Fu et.al. 2410.22139 link
2024-10-29 Data Generation for Hardware-Friendly Post-Training Quantization Lior Dikstein et.al. 2410.22110 null
2024-10-29 Cognitive Semantic Augmentation LEO Satellite Networks for Earth Observation Hong-fu Chou et.al. 2410.21916 null
2024-10-29 PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices Ming Kang et.al. 2410.21822 link
2024-10-28 MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps Yating Xu et.al. 2410.21566 link
2024-10-28 TACO: Adversarial Camouflage Optimization on Trucks to Fool Object Detectors Adonisz Dimitriu et.al. 2410.21443 null
2024-10-28 Synthetica: Large Scale Synthetic Data for Robot Perception Ritvik Singh et.al. 2410.21153 null
2024-10-28 IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks Manjunath D et.al. 2410.20953 null
2024-10-28 SparseTem: Boosting the Efficiency of CNN-Based Video Encoders by Exploiting Temporal Continuity Kunyun Wang et.al. 2410.20790 null
2024-10-27 Sebica: Lightweight Spatial and Efficient Bidirectional Channel Attention Super Resolution Network Chongxiao Liu et.al. 2410.20546 link
2024-10-27 Guidance Disentanglement Network for Optics-Guided Thermal UAV Image Super-Resolution Zhicheng Zhao et.al. 2410.20466 link
2024-10-27 Open-Vocabulary Object Detection via Language Hierarchy Jiaxing Huang et.al. 2410.20371 null
2024-10-27 Historical Test-time Prompt Tuning for Vision Foundation Models Jingyi Zhang et.al. 2410.20346 null
2024-10-25 OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery Philipe Dias et.al. 2410.19965 null
2024-10-25 MetaTrading: An Immersion-Aware Model Trading Framework for Vehicular Metaverse Services Hongjia Wu et.al. 2410.19665 null
2024-10-25 Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models Shenghao Fu et.al. 2410.19635 null
2024-10-25 MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors Fanqi Pu et.al. 2410.19590 null
2024-10-25 DECADE: Towards Designing Efficient-yet-Accurate Distance Estimation Modules for Collision Avoidance in Mobile Advanced Driver Assistance Systems Muhammad Zaeem Shahzad et.al. 2410.19336 null
2024-10-25 In-Simulation Testing of Deep Learning Vision Models in Autonomous Robotic Manipulators Dmytro Humeniuk et.al. 2410.19277 null
2024-10-24 HUE Dataset: High-Resolution Event and Frame Sequences for Low-Light Vision Burak Ercan et.al. 2410.19164 null
2024-10-24 Optimizing Edge Offloading Decisions for Object Detection Jiaming Qiu et.al. 2410.18919 link
2024-10-24 You Only Look Around: Learning Illumination Invariant Feature for Low-light Object Detection Mingbo Hong et.al. 2410.18398 null
2024-10-24 Thermal Chameleon: Task-Adaptive Tone-mapping for Radiometric Thermal-Infrared images Dong-Guw Lee et.al. 2410.18340 link
2024-10-23 Automated Defect Detection and Grading of Piarom Dates Using Deep Learning Nasrin Azimi et.al. 2410.18208 null
2024-10-23 DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection Qingpeng Li et.al. 2410.17822 link
2024-10-23 YOLO-Vehicle-Pro: A Cloud-Edge Collaborative Framework for Object Detection in Autonomous Driving under Adverse Weather Conditions Xiguang Li et.al. 2410.17734 null
2024-10-23 YOLOv11: An Overview of the Key Architectural Enhancements Rahima Khanam et.al. 2410.17725 null
2024-10-23 PlantCamo: Plant Camouflage Detection Jinyu Yang et.al. 2410.17598 link
2024-10-23 OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking Haiji Liang et.al. 2410.17534 link
2024-10-22 EPContrast: Effective Point-level Contrastive Learning for Large-scale Point Cloud Understanding Zhiyi Pan et.al. 2410.17207 null
2024-10-22 YOLO-TS: Real-Time Traffic Sign Detection with Enhanced Accuracy Using Optimized Receptive Fields and Anchor-Free Fusion Junzhou Chen et.al. 2410.17144 null
2024-10-22 FlightAR: AR Flight Assistance Interface with Multiple Video Streams and Object Detection Aimed at Immersive Drone Control Oleg Sautenkov et.al. 2410.16943 null
2024-10-22 AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot Nuclei Detection via Visual-Language Pre-trained Models Yongjian Wu et.al. 2410.16820 link
2024-10-22 DSORT-MCU: Detecting Small Objects in Real-Time on Microcontroller Units Liam Boyle et.al. 2410.16769 null
2024-10-22 DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model Zhixiong Nan et.al. 2410.16707 null
2024-10-22 Fire and Smoke Detection with Burning Intensity Representation Xiaoyi Han et.al. 2410.16642 link
2024-10-21 Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models Yufei Zhan et.al. 2410.16163 link
2024-10-21 Multi-Sensor Fusion for UAV Classification Based on Feature Maps of Image and Radar Data Nikos Sakellariou et.al. 2410.16089 null
2024-10-21 Few-shot target-driven instance detection based on open-vocabulary object detection models Ben Crulis et.al. 2410.16028 null
2024-10-21 How Important are Data Augmentations to Close the Domain Gap for Object Detection in Orbit? Maximilian Ulmer et.al. 2410.15766 null
2024-10-21 P-YOLOv8: Efficient and Accurate Real-Time Detection of Distracted Driving Mohamed R. Elshamy et.al. 2410.15602 null
2024-10-21 Deep Learning and Machine Learning -- Object Detection and Semantic Segmentation: From Theory to Applications Jintao Ren et.al. 2410.15584 null
2024-10-21 Online Pseudo-Label Unified Object Detection for Multiple Datasets Training XiaoJun Tang et.al. 2410.15569 null
2024-10-20 TrackMe:A Simple and Effective Multiple Object Tracking Annotation Tool Thinh Phan et.al. 2410.15518 null
2024-10-20 YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-Dictionary Hao-Tang Tsui et.al. 2410.15346 null
2024-10-20 Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability Yusuke Hosoya et.al. 2410.15315 null
2024-10-18 MultiOrg: A Multi-rater Organoid-detection Dataset Christina Bukas et.al. 2410.14612 null
2024-10-18 Multi-Source Spatial Knowledge Understanding for Immersive Visual Text-to-Speech Shuwei He et.al. 2410.14101 link
2024-10-18 Enhancing In-vehicle Multiple Object Tracking Systems with Embeddable Ising Machines Kosuke Tatsumura et.al. 2410.14093 null
2024-10-17 Spatiotemporal Object Detection for Improved Aerial Vehicle Detection in Traffic Monitoring Kristina Telegraph et.al. 2410.13616 null
2024-10-17 RemoteDet-Mamba: A Hybrid Mamba-CNN Network for Multi-modal Object Detection in Remote Sensing Images Kejun Ren et.al. 2410.13532 null
2024-10-16 Syn2Real Domain Generalization for Underwater Mine-like Object Detection Using Side-Scan Sonar Aayush Agrawal et.al. 2410.12953 null
2024-10-16 MambaBEV: An efficient 3D detection model with Mamba2 Zihan You et.al. 2410.12673 null
2024-10-16 Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion Minkyoung Cho et.al. 2410.12592 null
2024-10-16 Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look Yong Zhang et.al. 2410.12396 null
2024-10-16 Real-time Stereo-based 3D Object Detection for Streaming Perception Changcai Li et.al. 2410.12394 link
2024-10-16 Context-Infused Visual Grounding for Art Selina Khan et.al. 2410.12369 link
2024-10-16 Fusion from Decomposition: A Self-Supervised Approach for Image Fusion and Beyond Pengwei Liang et.al. 2410.12274 null
2024-10-16 Optimizing YOLOv5s Object Detection through Knowledge Distillation algorithm Guanming Huang et.al. 2410.12259 null
2024-10-17 SAM-Guided Masked Token Prediction for 3D Scene Understanding Zhimin Chen et.al. 2410.12158 null
2024-10-16 Unveiling the Limits of Alignment: Multi-modal Dynamic Local Fusion Network and A Benchmark for Unaligned RGBT Video Object Detection Qishun Wang et.al. 2410.12143 null
2024-10-17 Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation Zhijie Yan et.al. 2410.11989 null
2024-10-15 Fractal Calibration for long-tailed object detection Konstantinos Panagiotis Alexandridis et.al. 2410.11774 null
2024-10-15 POLO -- Point-based, multi-class animal detection Giacomo May et.al. 2410.11741 null
2024-10-15 YOLO-ELA: Efficient Local Attention Modeling for High-Performance Real-Time Insulator Defect Detection Olalekan Akindele et.al. 2410.11727 null
2024-10-15 SeaDATE: Remedy Dual-Attention Transformer with Semantic Alignment via Contrast Learning for Multimodal Object Detection Shuhan Dong et.al. 2410.11358 null
2024-10-15 Open World Object Detection: A Survey Yiming Li et.al. 2410.11301 null
2024-10-15 Representation Similarity: A Better Guidance of DNN Layer Sharing for Edge Computing without Training Bryan Bo Cao et.al. 2410.11233 null
2024-10-15 TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement Zhiwei Lin et.al. 2410.11228 null
2024-10-16 CVCP-Fusion: On Implicit Depth Estimation for 3D Bounding Box Prediction Pranav Gupta et.al. 2410.11211 link
2024-10-15 Multiview Scene Graph Juexiao Zhang et.al. 2410.11187 null
2024-10-14 UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles Hui Ye et.al. 2410.11125 null
2024-10-14 ROSAR: An Adversarial Re-Training Framework for Robust Side-Scan Sonar Object Detection Martin Aubard et.al. 2410.10554 link
2024-10-14 Learning to Ground VLMs without Forgetting Aritra Bhowmik et.al. 2410.10491 null
2024-10-14 SMART-TRACK: A Novel Kalman Filter-Guided Sensor Fusion For Robust UAV Object Tracking in Dynamic Environments Khaled Gabr et.al. 2410.10409 null
2024-10-14 V2M: Visual 2-Dimensional Mamba for Image Representation Learning Chengkun Wang et.al. 2410.10382 link
2024-10-14 GlobalMamba: Global Image Serialization for Vision Mamba Chengkun Wang et.al. 2410.10316 link
2024-10-14 ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object Jiwei Chen et.al. 2410.10298 null
2024-10-14 Out-of-Bounding-Box Triggers: A Stealthy Approach to Cheat Object Detectors Tao Lin et.al. 2410.10091 link
2024-10-15 Optimizing Waste Management with Advanced Object Detection for Garbage Classification Everest Z. Kuang et.al. 2410.09975 null
2024-10-13 EITNet: An IoT-Enhanced Framework for Real-Time Basketball Action Recognition Jingyu Liu et.al. 2410.09954 null
2024-10-13 LoLI-Street: Benchmarking Low-Light Image Enhancement and Beyond Md Tanvir Islam et.al. 2410.09831 link
2024-10-11 DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection Haochen Li et.al. 2410.09004 null
2024-10-11 LIME-Eval: Rethinking Low-light Image Enhancement Evaluation via Object Detection Mingjia Li et.al. 2410.08810 null
2024-10-11 Hespi: A pipeline for automatically detecting information from hebarium specimen sheets Robert Turnbull et.al. 2410.08740 null
2024-10-11 MMLF: Multi-modal Multi-class Late Fusion for Object Detection with Uncertainty Estimation Qihang Yang et.al. 2410.08739 null
2024-10-11 Boosting Open-Vocabulary Object Detection by Handling Background Samples Ruizhe Zeng et.al. 2410.08645 null
2024-10-11 DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention Nguyen Huu Bao Long et.al. 2410.08582 link
2024-10-11 VOVTrack: Exploring the Potentiality in Videos for Open-Vocabulary Object Tracking Zekun Qian et.al. 2410.08529 null
2024-10-10 Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? Samir Abou Haidar et.al. 2410.08365 null
2024-10-10 PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection Botao Ren et.al. 2410.08210 null
2024-10-10 Dynamic Object Catching with Quadruped Robot Front Legs André Schakkal et.al. 2410.08065 null
2024-10-10 HeightFormer: A Semantic Alignment Monocular 3D Object Detection Method from Roadside Perspective Pei Liu et.al. 2410.07758 null
2024-10-10 O1O: Grouping of Known Classes to Identify Unknown Objects as Odd-One-Out Mısra Yavuz et.al. 2410.07514 null
2024-10-09 Progressive Multi-Modal Fusion for Robust 3D Object Detection Rohit Mohan et.al. 2410.07475 null
2024-10-11 Self-Supervised Learning for Real-World Object Detection: a Survey Alina Ciocarlan et.al. 2410.07442 null
2024-10-09 Robust infrared small target detection using self-supervised and a contrario paradigms Alina Ciocarlan et.al. 2410.07437 null
2024-10-09 SurANet: Surrounding-Aware Network for Concealed Object Detection via Highly-Efficient Interactive Contrastive Learning Strategy Yuhan Kang et.al. 2410.06842 link
2024-10-09 Rethinking the Evaluation of Visible and Infrared Image Fusion Dayan Guan et.al. 2410.06811 link
2024-10-10 QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model Fei Xie et.al. 2410.06806 link
2024-10-09 QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird's-Eye-View Representation Yuxin Li et.al. 2410.06516 null
2024-10-08 Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions Mateus Karvat et.al. 2410.06380 null
2024-10-08 Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach Sha Guo et.al. 2410.06149 null
2024-10-08 Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts Zhiwei Lin et.al. 2410.05963 null
2024-10-08 Learning Gaussian Data Augmentation in Feature Space for One-shot Object Detection in Manga Takara Taniguchi et.al. 2410.05935 null
2024-10-08 Unobserved Object Detection using Generative Models Subhransu S. Bhattacharjee et.al. 2410.05869 null
2024-10-08 CASA: Class-Agnostic Shared Attributes in Vision-Language Models for Efficient Incremental Object Detection Mingyi Guo et.al. 2410.05804 null
2024-10-07 Real-Time Truly-Coupled Lidar-Inertial Motion Correction and Spatiotemporal Dynamic Object Detection Cedric Le Gentil et.al. 2410.05152 null
2024-10-07 Human-in-the-loop Reasoning For Traffic Sign Detection: Collaborative Approach Yolo With Video-llava Mehdi Azarafza et.al. 2410.05096 null
2024-10-07 Improving Object Detection via Local-global Contrastive Learning Danai Triantafyllidou et.al. 2410.05058 null
2024-10-07 Improved detection of discarded fish species through BoxAL active learning Maria Sokolova et.al. 2410.04880 link
2024-10-06 Learning De-Biased Representations for Remote-Sensing Imagery Zichen Tian et.al. 2410.04546 link
2024-10-05 ETHcavation: A Dataset and Pipeline for Panoptic Scene Understanding and Object Tracking in Dynamic Construction Environments Lorenzo Terenzi et.al. 2410.04250 null
2024-10-05 Fast Object Detection with a Machine Learning Edge Device Richard C. Rodriguez et.al. 2410.04173 null
2024-10-05 Robust Task-Oriented Communication Framework for Real-Time Collaborative Vision Perception Zhengru Fang et.al. 2410.04168 null
2024-10-05 Cross Resolution Encoding-Decoding For Detection Transformers Ashish Kumar et.al. 2410.04088 link
2024-10-05 Mamba Capsule Routing Towards Part-Whole Relational Camouflaged Object Detection Dingwen Zhang et.al. 2410.03987 null
2024-10-04 DRAFTS: A Deep Learning-Based Radio Fast Transient Search Pipeline Yong-Kun Zhang et.al. 2410.03200 null
2024-10-04 Learning 3D Perception from Others' Predictions Jinsu Yoo et.al. 2410.02646 null
2024-10-02 Enhancing Screen Time Identification in Children with a Multi-View Vision Language Model and Screen Time Tracker Xinlong Hou et.al. 2410.01966 null
2024-10-02 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection Yang Cao et.al. 2410.01647 link
2024-10-02 Gaussian-Det: Learning Closed-Surface Gaussians for 3D Object Detection Hongru Yan et.al. 2410.01404 null
2024-10-02 Finetuning Pre-trained Model with Limited Data for LiDAR-based 3D Object Detection by Bridging Domain Gaps Jiyun Jang et.al. 2410.01319 null
2024-10-02 Panopticus: Omnidirectional 3D Object Detection on Resource-constrained Edge Devices Jeho Lee et.al. 2410.01270 null
2024-10-02 High and Low Resolution Tradeoffs in Roadside Multimodal Sensing Shaozu Ding et.al. 2410.01250 null
2024-10-07 Perceptual Piercing: Human Visual Cue-based Object Detection in Low Visibility Conditions Ashutosh Kumar et.al. 2410.01225 link
2024-10-02 A versatile machine learning workflow for high-throughput analysis of supported metal catalyst particles Arda Genc et.al. 2410.01213 link

(back to top)

Small Object Detection

Publish Date Title Authors PDF Code
2025-07-17 SOD-YOLO: Enhancing YOLO-Based Detection of Small Objects in UAV Imagery Peijun Wang et.al. 2507.12727 null
2025-07-16 InterpIoU: Rethinking Bounding Box Regression with Interpolation-Based IoU Optimization Haoyuan Liu et.al. 2507.12420 null
2025-07-08 High-Frequency Semantics and Geometric Priors for End-to-End Detection Transformers in Challenging UAV Imagery Hongxing Peng et.al. 2507.00825 null
2025-06-30 Event-based Tiny Object Detection: A Benchmark Dataset and Baseline Nuo Chen et.al. 2506.23575 null
2025-06-15 MGDFIS: Multi-scale Global-detail Feature Integration Strategy for Small Object Detection Yuxiang Wang et.al. 2506.12697 null
2025-05-28 Cross-DINO: Cross the Deep MLP and Transformer for Small Object Detection Guiping Cao et.al. 2505.21868 null
2025-05-27 Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO Muzhi Zhu et.al. 2505.21457 null
2025-05-27 Robust Video-Based Pothole Detection and Area Estimation for Intelligent Vehicles with Depth Map and Kalman Smoothing Dehao Wang et.al. 2505.21049 null
2025-05-22 MAFE R-CNN: Selecting More Samples to Learn Category-aware Features for Small Object Detection Yichen Li et.al. 2505.16442 null
2025-07-23 Application of YOLOv8 in monocular downward multiple Car Target detection Shijie Lyu et.al. 2505.10016 null
2025-04-30 Learning to Borrow Features for Improved Detection of Small Objects in Single-Shot Detectors Richard Schmit et.al. 2505.00044 null
2025-04-29 Purifying, Labeling, and Utilizing: A High-Quality Pipeline for Small Object Detection Siwei Wang et.al. 2504.20602 null
2025-04-25 MASF-YOLO: An Improved YOLOv11 Network for Small Object Detection on Drone View Liugang Lu et.al. 2504.18136 null
2025-04-18 HMPE:HeatMap Embedding for Efficient Transformer-Based Small Object Detection YangChen Zeng et.al. 2504.13469 null
2025-04-11 SO-DETR: Leveraging Dual-Domain Features and Knowledge Distillation for Small Object Detection Huaxiang Zhang et.al. 2504.11470 null
2025-04-14 Small Object Detection with YOLO: A Performance Analysis Across Model Versions and Hardware Muhammad Fasih Tariq et.al. 2504.09900 null
2025-03-29 Context in object detection: a systematic literature review Mahtab Jamali et.al. 2503.23249 null
2025-03-26 Small Object Detection: A Comprehensive Survey on Challenges, Techniques and Real-World Applications Mahya Nikouei et.al. 2503.20516 null
2025-03-24 LGI-DETR: Local-Global Interaction for UAV Object Detection Zifa Chen et.al. 2503.18785 null
2025-03-30 YOLO-LLTS: Real-Time Low-Light Traffic Sign Detection via Prior-Guided Enhancement and Multi-Branch Feature Interaction Ziyu Lin et.al. 2503.13883 null
2025-03-06 DEAL-YOLO: Drone-based Efficient Animal Localization using YOLO Aditya Prashant Naidu et.al. 2503.04698 null
2025-02-05 An Empirical Study of Methods for Small Object Detection from Satellite Imagery Xiaohui Yuan et.al. 2502.03674 null
2025-01-30 Tuning Event Camera Biases Heuristic for Object Detection Applications in Staring Scenarios David El-Chai Ben-Ezra et.al. 2501.18788 null
2024-12-24 Multi-Point Positional Insertion Tuning for Small Object Detection Kanoko Goto et.al. 2412.18090 null
2024-12-13 PanSR: An Object-Centric Mask Transformer for Panoptic Segmentation Lojze Žust et.al. 2412.10589 link
2024-12-12 Analysis of Object Detection Models for Tiny Object in Satellite Imagery: A Dataset-Centric Approach Kailas PS et.al. 2412.10453 null
2024-12-16 RemDet: Rethinking Efficient Model Design for UAV Object Detection Chen Li et.al. 2412.10040 link
2025-01-08 YOLOv5-Based Object Detection for Emergency Response in Aerial Imagery Sindhu Boddu et.al. 2412.05394 null
2024-11-28 Dynamic Attention and Bi-directional Fusion for Safety Helmet Wearing Detection Junwei Feng et.al. 2411.19071 null
2024-12-27 DGNN-YOLO: Interpretable Dynamic Graph Neural Networks with YOLO11 for Small Object Detection and Tracking in Traffic Surveillance Shahriar Soudeep et.al. 2411.17251 null
2025-01-13 SL-YOLO: A Stronger and Lighter Drone Target Detection Model Defan Chen et.al. 2411.11477 null
2024-11-15 Interactive Image-Based Aphid Counting in Yellow Water Traps under Stirring Actions Xumin Gao et.al. 2411.10357 null
2024-11-14 Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration Yifan Shao et.al. 2411.09604 link
2024-11-01 LAM-YOLO: Drones-based Small Object Detection on Lighting-Occlusion Attention Mechanism YOLO Yuchen Zheng et.al. 2411.00485 null
2024-10-29 PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices Ming Kang et.al. 2410.21822 link
2024-10-11 Self-Supervised Learning for Real-World Object Detection: a Survey Alina Ciocarlan et.al. 2410.07442 null
2024-10-09 Robust infrared small target detection using self-supervised and a contrario paradigms Alina Ciocarlan et.al. 2410.07437 null
2024-08-28 Small Object Detection for Indoor Assistance to the Blind using YOLO NAS Small and Super Gradients Rashmi BN et.al. 2409.07469 null
2024-09-07 Unleashing the Power of Generic Segmentation Models: A Simple Baseline for Infrared Small Target Detection Mingjin Zhang et.al. 2409.04714 null
2024-09-06 BFA-YOLO: Balanced multiscale object detection network for multi-view building facade attachments detection Yangguang Chen et.al. 2409.04025 null
2024-08-16 Enhancing Object Detection with Hybrid dataset in Manufacturing Environments: Comparing Federated Learning to Conventional Techniques Vinit Hegiste et.al. 2408.08974 null
2024-08-14 Infra-YOLO: Efficient Neural Network Structure with Model Compression for Real-Time Infrared Small Object Detection Zhonglin Chen et.al. 2408.07455 null
2024-08-08 SOD-YOLOv8 -- Enhancing YOLOv8 for Small Object Detection in Traffic Scenes Boshra Khalili et.al. 2408.04786 null
2024-07-29 Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images Zewen Du et.al. 2407.19696 link
2024-07-25 XS-VID: An Extremely Small Video Object Detection Dataset Jiahao Guo et.al. 2407.18137 null
2024-07-23 ESOD: Efficient Small Object Detection on High-Resolution Images Kai Liu et.al. 2407.16424 null
2024-06-20 Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines Xinyi Ying et.al. 2406.14482 link

(back to top)

Image Matching

Publish Date Title Authors PDF Code
2025-07-22 A Single-step Accurate Fingerprint Registration Method Based on Local Feature Matching Yuwei Jia et.al. 2507.16201 null
2025-07-09 Dual-Granularity Cross-Modal Identity Association for Weakly-Supervised Text-to-Person Image Matching Yafei Zhang et.al. 2507.06744 null
2025-07-05 From Query to Explanation: Uni-RAG for Multi-Modal Retrieval-Augmented Learning in STEM Xinyi Wu et.al. 2507.03868 null
2025-07-02 What does really matter in image goal navigation? Gianluca Monaci et.al. 2507.01667 null
2025-06-30 Efficient and Accurate Image Provenance Analysis: A Scalable Pipeline for Large-scale Images Jiewei Lai et.al. 2506.23707 null
2025-06-29 Dynamic Contrastive Learning for Hierarchical Retrieval: A Case Study of Distance-Aware Cross-View Geo-Localization Suofei Zhang et.al. 2506.23077 null
2025-06-27 MatChA: Cross-Algorithm Matching with Feature Augmentation Paula Carbó Cubero et.al. 2506.22336 null
2025-07-22 Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs Shaojie Zhang et.al. 2506.22139 null
2025-06-27 ZeroReg3D: A Zero-shot Registration Pipeline for 3D Consecutive Histopathology Image Reconstruction Juming Xiong et.al. 2506.21923 null
2025-06-25 Fast entropy-regularized SDP relaxations for permutation synchronization Michael Lindsey et.al. 2506.20191 null
2025-06-18 ReSeDis: A Dataset for Referring-based Object Search across Large-Scale Image Collections Ziling Huang et.al. 2506.15180 null
2025-06-16 EmbodiedPlace: Learning Mixture-of-Features with Embodied Constraints for Visual Place Recognition Bingxi Liu et.al. 2506.13133 null
2025-06-12 RealKeyMorph: Keypoints in Real-world Coordinates for Resolution-agnostic Image Registration Mina C. Moghadam et.al. 2506.10344 null
2025-06-11 Hierarchical Image Matching for UAV Absolute Visual Localization via Semantic and Structural Constraints Xiangkai Zhang et.al. 2506.09748 null
2025-06-11 ScaleLSD: Scalable Deep Line Segment Detection Streamlined Zeran Ke et.al. 2506.09369 link
2025-05-21 Anti-interrupted sampling repeater jamming via linear canonical Wigner distribution lightweight LFM detection Jia-Mian Li et.al. 2506.06302 null
2025-06-05 Vanishing arcs for isolated plane curve singularities Hanwool Bae et.al. 2506.04917 null
2025-06-05 Deep Learning Reforms Image Matching: A Survey and Outlook Shihua Zhang et.al. 2506.04619 null
2025-06-20 SR3D: Unleashing Single-view 3D Reconstruction for Transparent and Specular Object Grasping Mingxu Zhang et.al. 2505.24305 null
2025-06-05 Universal Domain Adaptation for Semantic Segmentation Seun-An Choe et.al. 2505.22458 null
2025-05-23 To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models Simone Gaisbauer et.al. 2505.17973 link
2025-05-16 Multi-view dense image matching with similarity learning and geometry priors Mohamed Ali Chebbi et.al. 2505.11264 null
2025-05-12 Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Cloud Anomaly Detection Yuqi Cheng et.al. 2505.07375 link
2025-05-04 OBD-Finder: Explainable Coarse-to-Fine Text-Centric Oracle Bone Duplicates Discovery Chongsheng Zhang et.al. 2505.03836 link
2025-05-06 LiftFeat: 3D Geometry-Aware Local Feature Matching Yepeng Liu et.al. 2505.03422 link
2025-05-04 Focus What Matters: Matchability-Based Reweighting for Local Feature Matching Dongyue Li et.al. 2505.02161 null
2025-05-15 Mitigating Modality Bias in Multi-modal Entity Alignment from a Causal Perspective Taoyu Su et.al. 2504.19458 link
2025-04-28 Dynamic Arthroscopic Navigation System for Anterior Cruciate Ligament Reconstruction Based on Multi-level Memory Architecture Shuo Wang et.al. 2504.19398 null
2025-04-23 Road Similarity-Based BEV-Satellite Image Matching for UGV Localization Zhenping Sun et.al. 2504.16346 null
2025-04-18 Outlier-Robust Multi-Model Fitting on Quantum Annealers Saurabh Pandey et.al. 2504.13836 null
2025-04-11 Geometric Consistency Refinement for Single Image Novel View Synthesis via Test-Time Adaptation of Diffusion Models Josef Bengtson et.al. 2504.08348 null
2025-04-10 Image registration of 2D optical thin sections in a 3D porous medium: Application to a Berea sandstone digital rock image Jaehong Chung et.al. 2504.06604 link
2025-04-22 To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition Davide Sferrazza et.al. 2504.06116 link
2025-04-10 Learning Affine Correspondences by Integrating Geometric Constraints Pengju Sun et.al. 2504.04834 link
2025-04-01 Scaling Prompt Instructed Zero Shot Composed Image Retrieval with Image-Only Data Yiqun Duan et.al. 2504.00812 null
2025-03-31 CoMatch: Dynamic Covisibility-Aware Transformer for Bilateral Subpixel-Level Semi-Dense Image Matching Zizhuo Li et.al. 2503.23925 null
2025-03-28 Pairwise Matching of Intermediate Representations for Fine-grained Explainability Lauren Shrack et.al. 2503.22881 link
2025-03-26 Multimodal Image Matching based on Frequency-domain Information of Local Energy Response Meng Yang et.al. 2503.20827 null
2025-03-22 Normalized Matching Transformer Abtin Pourhadi et.al. 2503.17715 link
2025-03-20 Loop Closure from Two Views: Revisiting PGO for Scalable Trajectory Estimation through Monocular Priors Tian Yi Lim et.al. 2503.16275 null
2025-03-20 MapGlue: Multimodal Remote Sensing Image Matching Peihao Wu et.al. 2503.16185 link
2025-03-19 PAPI-Reg: Patch-to-Pixel Solution for Efficient Cross-Modal Registration between LiDAR Point Cloud and Camera Image Yuanchao Yue et.al. 2503.15285 null
2025-04-07 Less Biased Noise Scale Estimation for Threshold-Robust RANSAC Johan Edstedt et.al. 2503.13433 null
2025-03-17 SatDepth: A Novel Dataset for Satellite Image Matching Rahul Deshmukh et.al. 2503.12706 link
2025-03-14 Refining Image Edge Detection via Linear Canonical Riesz Transforms Shuhui Yang et.al. 2503.11148 null
2025-03-13 Speedy MASt3R Jingxing Li et.al. 2503.10017 null
2025-03-11 Keypoint Detection and Description for Raw Bayer Images Jiakai Lin et.al. 2503.08673 null
2025-03-06 Learning 3D Medical Image Models From Brain Functional Connectivity Network Supervision For Mental Disorder Diagnosis Xingcan Hu et.al. 2503.04205 null
2025-03-07 Diff-Reg v2: Diffusion-Based Matching Matrix Estimation for Image Matching and 3D Registration Qianliang Wu et.al. 2503.04127 null
2025-03-05 JamMa: Ultra-lightweight Local Feature Matching with Joint Mamba Xiaoyong Lu et.al. 2503.03437 null
2025-02-28 CNSv2: Probabilistic Correspondence Encoded Neural Image Servo Anzhe Chen et.al. 2503.00132 null
2025-02-27 A2-GNN: Angle-Annular GNN for Visual Descriptor-free Camera Relocalization Yejun Zhang et.al. 2502.20036 link
2025-02-27 RUBIK: A Structured Benchmark for Image Matching across Geometric Challenges Thibaut Loiseau et.al. 2502.19955 null
2025-02-26 BEV-LIO(LC): BEV Image Assisted LiDAR-Inertial Odometry with Loop Closure Haoxin Cai et.al. 2502.19242 link
2025-02-25 PromptMID: Modal Invariant Descriptors Based on Diffusion and Vision Foundation Models for Optical-SAR Image Matching Han Nie et.al. 2502.18104 link
2025-02-25 Improving Transformer Based Line Segment Detection with Matched Predicting and Re-ranking Xin Tong et.al. 2502.17766 null
2025-03-04 Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model Yaxuan Huang et.al. 2502.16779 null
2025-02-16 FeaKM: Robust Collaborative Perception under Noisy Pose Conditions Jiuwu Hao et.al. 2502.11003 link
2025-02-24 Enhancing Ground-to-Aerial Image Matching for Visual Misinformation Detection Using Semantic Segmentation Emanuele Mule et.al. 2502.06288 link
2025-02-04 Muographic Image Upsampling with Machine Learning for Built Infrastructure Applications William O'Donnell et.al. 2502.02624 null
2025-02-01 MambaGlue: Fast and Robust Local Feature Matching With Mamba Kihwan Ryoo et.al. 2502.00462 link
2025-01-24 Dense-SfM: Structure from Motion with Dense Consistent Matching JongMin Lee et.al. 2501.14277 null
2025-01-20 MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching Yepeng Liu et.al. 2501.11299 null
2025-01-13 MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training Xingyi He et.al. 2501.07556 null
2025-01-13 Matching Free Depth Recovery from Structured Light Zhuohang Yu et.al. 2501.07113 null
2025-01-02 Sparis: Neural Implicit Surface Reconstruction of Indoor Scenes from Sparse Views Yulun Wu et.al. 2501.01196 null
2024-12-31 Towards Real-Time 2D Mapping: Harnessing Drones, AI, and Computer Vision for Advanced Insights Bharath Kumar Agnur et.al. 2412.20210 null
2024-12-27 MINIMA: Modality Invariant Image Matching Xingyu Jiang et.al. 2412.19412 link
2024-12-24 GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network Xianfeng Song et.al. 2412.18221 link
2024-12-17 Bringing Multimodality to Amazon Visual Search System Xinliang Zhu et.al. 2412.13364 null
2024-12-04 Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis Siyoon Jin et.al. 2412.03150 null
2024-11-20 DT-LSD: Deformable Transformer-based Line Segment Detection Sebastian Janampa et.al. 2411.13005 link
2024-11-15 Image Matching Filtering and Refinement by Planes and Beyond Fabio Bellavia et.al. 2411.09484 link
2024-11-11 XPoint: A Self-Supervised Visual-State-Space based Architecture for Multispectral Image Registration Ismail Can Yagmur et.al. 2411.07430 link
2024-11-07 The Impact of Semi-Supervised Learning on Line Segment Detection Johanna Engman et.al. 2411.04596 link
2024-11-04 Silver medal Solution for Image Matching Challenge 2024 Yian Wang et.al. 2411.01851 null
2024-10-30 Variable Resolution Sampling and Deep Learning Image Recovery for Accelerated Multi-Spectral MRI Near Metal Implants Azadeh Sharafi et.al. 2410.23329 null
2024-11-05 RelationBooth: Towards Relation-Aware Customized Object Generation Qingyu Shi et.al. 2410.23280 null
2024-10-31 ETO:Efficient Transformer-based Local Feature Matching by Organizing Multiple Homography Hypotheses Junjie Ni et.al. 2410.22733 null
2024-10-30 LoFLAT: Local Feature Matching using Focused Linear Attention Transformer Naijian Cao et.al. 2410.22710 null
2024-10-26 Generative Adversarial Patches for Physical Attacks on Cross-Modal Pedestrian Re-Identification Yue Su et.al. 2410.20097 null
2024-10-01 A Robust Multisource Remote Sensing Image Matching Method Utilizing Attention and Feature Enhancement Against Noise Interference Yuan Li et.al. 2410.11848 null
2024-10-15 LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images Yuzhou Cheng et.al. 2410.11505 null
2024-10-12 Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence Felipe Cadar et.al. 2410.09533 link
2024-09-27 Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras Yipeng Lu et.al. 2409.18673 null
2024-09-25 Game4Loc: A UAV Geo-Localization Benchmark from Game Data Yuxiang Ji et.al. 2409.16925 link
2024-09-24 Automatic Registration of SHG and H&E Images with Feature-based Initial Alignment and Intensity-based Instance Optimization: Contribution to the COMULIS Challenge Marek Wodzinski et.al. 2409.15931 null
2024-09-10 Weakly-supervised Camera Localization by Ground-to-satellite Image Registration Yujiao Shi et.al. 2409.06471 link
2024-09-05 Enabling Practical and Privacy-Preserving Image Processing Chao Wang et.al. 2409.03568 null
2024-09-20 A General Albedo Recovery Approach for Aerial Photogrammetric Images through Inverse Rendering Shuang Song et.al. 2409.03032 link
2024-08-29 Super-Resolution works for coastal simulations Zhi-Song Liu et.al. 2408.16553 null
2024-09-15 Mismatched: Evaluating the Limits of Image Matching Approaches and Benchmarks Sierra Bonilla et.al. 2408.16445 link
2024-08-26 Affine steerers for structured keypoint description Georg Bökman et.al. 2408.14186 link
2024-08-25 TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers Chuanrui Zhang et.al. 2408.13770 null

(back to top)

Visual Localization

Publish Date Title Authors PDF Code
2024-10-16 Development of Image Collection Method Using YOLO and Siamese Network Chan Young Shin et.al. 2410.12561 null
2024-10-16 LoD-Loc: Aerial Visual Localization using LoD 3D Map with Neural Wireframe Alignment Juelin Zhu et.al. 2410.12269 null
2024-10-16 Leveraging Spatial Attention and Edge Context for Optimized Feature Selection in Visual Localization Nanda Febri Istighfarin et.al. 2410.12240 null
2024-10-15 LoGS: Visual Localization via Gaussian Splatting with Fewer Training Images Yuzhou Cheng et.al. 2410.11505 null
2024-10-15 Multiview Scene Graph Juexiao Zhang et.al. 2410.11187 null
2024-10-12 Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence Felipe Cadar et.al. 2410.09533 link
2024-10-11 Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System Zheng Liu et.al. 2410.08935 link
2024-10-16 Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP Eunji Kim et.al. 2410.08469 null
2024-10-11 A Unified Deep Semantic Expansion Framework for Domain-Generalized Person Re-identification Eugene P. W. Ang et.al. 2410.08456 null
2024-10-10 A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks Hoin Jung et.al. 2410.07593 null
2024-10-09 Exploiting Distribution Constraints for Scalable and Efficient Image Retrieval Mohammad Omama et.al. 2410.07022 null
2024-10-09 Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers Stephen Hausler et.al. 2410.06614 null
2024-10-09 MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging Noel C. F. Codella et.al. 2410.06542 null
2024-10-08 Temporal Image Caption Retrieval Competition -- Description and Results Jakub Pokrywka et.al. 2410.06314 null
2024-10-08 Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching Gongxin Yao et.al. 2410.06285 null
2024-10-08 GSLoc: Visual Localization with 3D Gaussian Splatting Kazii Botashev et.al. 2410.06165 null
2024-10-08 Beyond Captioning: Task-Specific Prompting for Improved VLM Performance in Mathematical Reasoning Ayush Singh et.al. 2410.05928 null
2024-10-08 RNR-Nav: A Real-World Visual Navigation System Using Renderable Neural Radiance Maps Minsoo Kim et.al. 2410.05621 null
2024-10-11 LoTLIP: Improving Language-Image Pre-training for Long Text Understanding Wei Wu et.al. 2410.05249 null
2024-10-06 LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation Jianhao Jiao et.al. 2410.04419 null
2024-10-02 Boosting Weakly-Supervised Referring Image Segmentation via Progressive Comprehension Zaiquan Yang et.al. 2410.01544 null
2024-10-03 EUFCC-CIR: a Composed Image Retrieval Dataset for GLAM Collections Francesc Net et.al. 2410.01536 link
2024-10-04 CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment Safouane El Ghazouali et.al. 2410.01411 link
2024-09-30 Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation Aleyna Kütük et.al. 2410.00266 null
2024-09-29 CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation Yifan Duan et.al. 2409.19597 null
2024-09-28 VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition Ahmad Khaliq et.al. 2409.19293 link
2024-09-27 MASt3R-SfM: a Fully-Integrated Solution for Unconstrained Structure-from-Motion Bardienus Duisterhof et.al. 2409.19152 null
2024-09-26 Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval Mankeerat Sidhu et.al. 2409.18733 null
2024-09-26 Revisit Anything: Visual Place Recognition via Image Segment Retrieval Kartik Garg et.al. 2409.18049 link
2024-09-24 GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization Gennady Sidorov et.al. 2409.16502 link

(back to top)

Homogeous Image Transformation

Publish Date Title Authors PDF Code
2024-10-15 RS-MOCO: A deep learning-based topology-preserving image registration method for cardiac T1 mapping Chiyi Huang et.al. 2410.11651 null
2024-10-14 MoonMetaSync: Lunar Image Registration Analysis Ashutosh Kumar et.al. 2410.11118 link
2024-10-14 Stationary Velocity Fields on Matrix Groups for Deformable Image Registration Johannes Bostelmann et.al. 2410.10997 null
2024-10-14 A Counterexample in Image Registration Serap A. Savari et.al. 2410.10725 null
2024-10-12 FiRework: Field Refinement Framework for Efficient Enhancement of Deformable Registration Haiqiao Wang et.al. 2410.09595 link
2024-10-12 Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence Felipe Cadar et.al. 2410.09533 link
2024-10-11 Hierarchical uncertainty estimation for learning-based registration in neuroimaging Xiaoling Hu et.al. 2410.09299 link
2024-10-07 DiffuseReg: Denoising Diffusion Model for Obtaining Deformation Fields in Unsupervised Deformable Image Registration Yongtai Zhuo et.al. 2410.05234 link
2024-10-07 Variable Resolution Pixel Quantization for Low Power Machine Vision Application on Edge Senorita Deb et.al. 2410.05189 null
2024-10-04 DiffKillR: Killing and Recreating Diffeomorphisms for Cell Annotation in Dense Microscopy Images Chen Liu et.al. 2410.03058 link
2024-10-03 Deep Regression 2D-3D Ultrasound Registration for Liver Motion Correction in Focal Tumor Thermal Ablation Shuwei Xing et.al. 2410.02579 link
2024-10-07 NestedMorph: Enhancing Deformable Medical Image Registration with Nested Attention Mechanisms Gurucharan Marthi Krishna Kumar et.al. 2410.02550 null
2024-10-03 CTARR: A fast and robust method for identifying anatomical regions on CT images via atlas registration Thomas Buddenkotte et.al. 2410.02316 link
2024-09-30 Shuffled Linear Regression via Spectral Matching Hang Liu et.al. 2410.00078 null
2024-09-30 Erase, then Redraw: A Novel Data Augmentation Approach for Free Space Detection Using Diffusion Model Fulong Ma et.al. 2409.20164 null
2024-09-29 Dual-Attention Frequency Fusion at Multi-Scale for Joint Segmentation and Deformable Medical Image Registration Hongchao Zhou et.al. 2409.19658 null
2024-09-28 Trigger-Based Fragile Model Watermarking for Image Transformation Networks Preston K. Robinette et.al. 2409.19442 null
2024-09-27 ADEPT: A Noninvasive Method for Determining Elastic Properties of Valve Tissue Wensi Wu et.al. 2409.19081 null
2024-09-26 Ophthalmic Biomarker Detection with Parallel Prediction of Transformer and Convolutional Architecture Md. Touhidul Islam et.al. 2409.17788 null

(back to top)

Homogeous Image

Publish Date Title Authors PDF Code
2025-07-23 Mapping ground-based coronagraphic images to Helioprojective-Cartesian coordinate system by image registration Feiyang Sha et.al. 2507.17670 null
2025-07-22 Combined Image Data Augmentations diminish the benefits of Adaptive Label Smoothing Georg Siedel et.al. 2507.16427 null
2025-07-21 Compress-Align-Detect: onboard change detection from unregistered images Gabriele Inzerillo et.al. 2507.15578 null
2025-07-17 fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting Alicia Durrer et.al. 2507.13146 null
2025-07-17 cIDIR: Conditioned Implicit Neural Representation for Regularized Deformable Image Registration Sidaty El Hadramy et.al. 2507.12953 null
2025-07-16 Pathology-Guided Virtual Staining Metric for Evaluation and Training Qiankai Wang et.al. 2507.12624 null
2025-07-15 Are Vision Foundation Models Ready for Out-of-the-Box Medical Image Registration? Hanxue Gu et.al. 2507.11569 null
2025-07-14 Well-posedness of an optical flow based optimal control formulation for image registration Johannes Haubner et.al. 2507.10188 null
2025-07-09 Segmentation Regularized Training for Multi-Domain Deep Learning Registration applied to MR-Guided Prostate Cancer Radiotherapy Sudharsan Madhavan et.al. 2507.06966 null
2025-07-08 Exploring Partial Multi-Label Learning via Integrating Semantic Co-occurrence Knowledge Xin Wu et.al. 2507.05992 null
2025-07-08 From Motion to Meaning: Biomechanics-Informed Neural Network for Explainable Cardiovascular Disease Identification Comte Valentin et.al. 2507.05783 null
2025-07-06 Grid-Reg: Grid-Based SAR and Optical Image Registration Across Platforms Xiaochen Wei et.al. 2507.04233 null
2025-06-29 Multimodal image registration for effective thermographic fever screening C. Y. N. Dwith et.al. 2507.02955 null
2025-07-09 Modality-agnostic, patient-specific digital twins modeling temporally varying digestive motion Jorge Tapias Gomez et.al. 2507.01909 null
2025-07-01 On the Intensity-based Inversion Method for Quantitative Quasi-Static Elastography Ekaterina Sherina et.al. 2507.01207 null
2025-07-08 Bridging Classical and Learning-based Iterative Registration through Deep Equilibrium Models Yi Zhang et.al. 2507.00582 null
2025-06-30 Puzzles: Unbounded Video-Depth Augmentation for Scalable End-to-End 3D Reconstruction Jiahao Ma et.al. 2506.23863 null
2025-06-27 Cardiovascular disease classification using radiomics and geometric features from cardiac CT Ajay Mittal et.al. 2506.22226 null
2025-06-27 Robust and Accurate Multi-view 2D/3D Image Registration with Differentiable X-ray Rendering and Dual Cross-view Constraints Yuxin Cui et.al. 2506.22191 null
2025-06-25 Real-Time 3D Guidewire Reconstruction from Intraoperative DSA Images for Robot-Assisted Endovascular Interventions Tianliang Yao et.al. 2506.21631 null
2025-06-25 Photon Absorption Remote Sensing (PARS): Comprehensive Absorption Imaging Enabling Label-Free Biomolecule Characterization and Mapping Benjamin R. Ecclestone et.al. 2506.20069 null
2025-06-24 VoxelOpt: Voxel-Adaptive Message Passing for Discrete Optimization in Deformable Abdominal CT Registration Hang Zhang et.al. 2506.19975 null
2025-06-24 Deformable Medical Image Registration with Effective Anatomical Structure Representation and Divide-and-Conquer Network Xinke Ma et.al. 2506.19222 null
2025-06-23 A Deep Learning Based Method for Fast Registration of Cardiac Magnetic Resonance Images Benjamin Graham et.al. 2506.19167 null
2025-06-19 Watermarking Autoregressive Image Generation Nikola Jovanović et.al. 2506.16349 link
2025-06-18 Tree-based adaptive finite element methods for deformable image registration Nicolás A. Barnafi et.al. 2506.15876 null
2025-06-30 Mono-Modalizing Extremely Heterogeneous Multi-Modal Medical Image Registration Kyobin Choo et.al. 2506.15596 null
2025-06-17 A Digital Twin Framework for Adaptive Treatment Planning in Radiotherapy Chih-Wei Chang et.al. 2506.14701 null
2025-06-16 PF-LHM: 3D Animatable Avatar Reconstruction from Pose-free Articulated Human Images Lingteng Qiu et.al. 2506.13766 null
2025-06-12 Unsupervised Deformable Image Registration with Structural Nonparametric Smoothing Hang Zhang et.al. 2506.10813 null
2025-06-12 RealKeyMorph: Keypoints in Real-world Coordinates for Resolution-agnostic Image Registration Mina C. Moghadam et.al. 2506.10344 null
2025-06-11 CINeMA: Conditional Implicit Neural Multi-Modal Atlas for a Spatio-Temporal Representation of the Perinatal Brain Maik Dannecker et.al. 2506.09668 link
2025-06-11 Geometry Reduced Order Modeling (GROM) with application to modeling of glymphatic function Andreas Solheim et.al. 2506.09442 link
2025-06-07 Exploring Image Transforms derived from Eye Gaze Variables for Progressive Autism Diagnosis Abigail Copiaco et.al. 2506.09065 null
2025-06-04 Personalized MR-Informed Diffusion Models for 3D PET Image Reconstruction George Webber et.al. 2506.03804 null
2025-06-03 FuseLIP: Multimodal Embeddings via Early Fusion of Discrete Tokens Christian Schlarmann et.al. 2506.03096 null
2025-06-03 Guiding Registration with Emergent Similarity from Pre-Trained Diffusion Models Nurislam Tursynbek et.al. 2506.02419 null
2025-06-02 Implicit Deformable Medical Image Registration with Learnable Kernels Stefano Fogarollo et.al. 2506.02150 null
2025-06-01 Modality Translation and Registration of MR and Ultrasound Images Using Diffusion Models Xudong Ma et.al. 2506.01025 null
2025-05-30 MRDust: Wireless Implant Data Uplink & Localization via Magnetic Resonance Image Modulation Biqi Rebekah Zhao et.al. 2506.00143 null
2025-05-30 A Novel Coronary Artery Registration Method Based on Super-pixel Particle Swarm Optimization Peng Qi et.al. 2505.24351 null
2025-05-30 Fourier ptychographic microscopy aided with transport of intensity equation for robust full phase spectrum reconstruction Mikołaj Rogalski et.al. 2505.24322 null
2025-05-30 Pretraining Deformable Image Registration Networks with Random Images Junyu Chen et.al. 2505.24167 link
2025-05-30 Beyond the LUMIR challenge: The pathway to foundational registration models Junyu Chen et.al. 2505.24160 null
2025-05-28 Collaborative Learning for Unsupervised Multimodal Remote Sensing Image Registration: Integrating Self-Supervision and MIM-Guided Diffusion-Based Image Translation Xiaochen Wei et.al. 2505.22000 null
2025-05-27 Moment kernels: a simple and scalable approach for equivariance to rotations and reflections in deep convolutional networks Zachary Schlamowitz et.al. 2505.21736 null
2025-05-23 To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models Simone Gaisbauer et.al. 2505.17973 null
2025-06-01 4D-CTA Image and geometry dataset for kinematic analysis of abdominal aortic aneurysms Mostafa Jamshidian et.al. 2505.17647 null
2025-05-22 Temporal Differential Fields for 4D Motion Modeling via Image-to-Video Synthesis Xin You et.al. 2505.17333 null
2025-05-22 Deep mineralogical segmentation of thin section images based on QEMSCAN maps Jean Pablo Vieira de Mello et.al. 2505.17008 link
2025-05-23 Tracking the Flight: Exploring a Computational Framework for Analyzing Escape Responses in Plains Zebra (Equus quagga) Isla Duporge et.al. 2505.16882 link
2025-05-18 Kornia-rs: A Low-Level 3D Computer Vision Library In Rust Edgar Riba et.al. 2505.12425 null
2025-05-15 IMITATE: Image Registration with Context for unknown time frame recovery Ziad Kheil et.al. 2505.10124 link
2025-05-15 Non-Registration Change Detection: A Novel Change Detection Task and Benchmark Dataset Zhe Shan et.al. 2505.09939 link
2025-05-11 AugMixCloak: A Defense against Membership Inference Attacks via Image Transformation Heqing Ren et.al. 2505.07149 null
2025-05-11 Federated Learning with LoRA Optimized DeiT and Multiscale Patch Embedding for Secure Eye Disease Recognition Md. Naimur Asif Borno et.al. 2505.06982 null
2025-05-11 Bi-directional Self-Registration for Misaligned Infrared-Visible Image Fusion Timing Li et.al. 2505.06920 null
2025-05-10 Improving Generalization of Medical Image Registration Foundation Model Jing Hu et.al. 2505.06527 link
2025-05-09 FF-PNet: A Pyramid Network Based on Feature and Field for Brain Image Registration Ying Zhang et.al. 2505.04938 null
2025-05-07 Tetrahedron-Net for Medical Image Registration Jinhai Xiang et.al. 2505.04380 null
2025-05-05 Unsupervised training of keypoint-agnostic descriptors for flexible retinal image registration David Rivas-Villar et.al. 2505.02787 null
2025-05-05 Unsupervised Deep Learning-based Keypoint Localization Estimating Descriptor Matching Performance David Rivas-Villar et.al. 2505.02779 null
2025-04-30 MagicCraft: Natural Language-Driven Generation of Dynamic and Interactive 3D Objects for Commercial Metaverse Platforms Ryutaro Kurai et.al. 2504.21332 null
2025-04-24 Spectral Bias Correction in PINNs for Myocardial Image Registration of Pathological Data Bastien C. Baluyot et.al. 2504.17945 null
2025-04-22 Towards prediction of morphological heart age from computed tomography angiography Johan Öfverstedt et.al. 2504.15783 null
2025-04-19 Exploring Generalizable Pre-training for Real-world Change Detection via Geometric Estimation Yitao Zhao et.al. 2504.14306 null
2025-04-17 SC3EF: A Joint Self-Correlation and Cross-Correspondence Estimation Framework for Visible and Thermal Image Registration Xi Tong et.al. 2504.12869 null
2025-04-17 Computer-Aided Design of Personalized Occlusal Positioning Splints Using Multimodal 3D Data Agnieszka Anna Tomaka et.al. 2504.12868 null
2025-04-16 Correlation Ratio for Unsupervised Learning of Multi-modal Deformable Registration Xiaojian Chen et.al. 2504.12265 link
2025-04-16 A Category-Fragment Segmentation Framework for Pelvic Fracture Segmentation in X-ray Images Daiqi Liu et.al. 2504.11872 null
2025-04-13 Imaging Transformer for MRI Denoising: a Scalable Model Architecture that enables SNR << 1 Imaging Hui Xue et.al. 2504.10534 null
2025-04-13 Capturing Longitudinal Changes in Brain Morphology Using Temporally Parameterized Neural Displacement Fields Aisha L. Shuaibu et.al. 2504.09514 null
2025-04-09 IGG: Image Generation Informed by Geodesic Dynamics in Deformation Spaces Nian Wu et.al. 2504.07999 link
2025-04-10 Geometric and Dosimetric Validation of Deformable Image Registration for Prostate MR-guided Adaptive Radiotherapy Victor N. Malkov et.al. 2504.07933 null
2025-04-09 OmniCaptioner: One Captioner to Rule Them All Yiting Lu et.al. 2504.07089 link
2025-04-10 nnLandmark: A Self-Configuring Method for 3D Medical Landmark Detection Alexandra Ertl et.al. 2504.06742 null
2025-04-09 Large Scale Supervised Pretraining For Traumatic Brain Injury Segmentation Constantin Ulrich et.al. 2504.06741 null
2025-04-09 EDIT: Enhancing Vision Transformers by Mitigating Attention Sink through an Encoder-Decoder Architecture Wenfeng Feng et.al. 2504.06738 null
2025-04-10 Image registration of 2D optical thin sections in a 3D porous medium: Application to a Berea sandstone digital rock image Jaehong Chung et.al. 2504.06604 link
2025-04-08 OSDM-MReg: Multimodal Image Registration based One Step Diffusion Model Xiaochen Wei et.al. 2504.06027 null
2025-04-07 Biomechanical Constraints Assimilation in Deep-Learning Image Registration: Application to sliding and locally rigid deformations Ziad Kheil et.al. 2504.05444 null
2025-04-07 Solving the fully nonlinear Monge-Ampère equation using the Legendre-Kolmogorov-Arnold Network method Bingcheng Hu et.al. 2504.05022 null
2025-04-03 IMPACT: A Generic Semantic Loss for Multimodal Medical Image Registration Valentin Boussot et.al. 2503.24121 link
2025-04-01 OncoReg: Medical Image Registration for Oncological Challenges Wiebke Heyer et.al. 2503.23179 link
2025-03-28 Divide to Conquer: A Field Decomposition Approach for Multi-Organ Whole-Body CT Image Registration Xuan Loc Pham et.al. 2503.22281 null
2025-03-26 UWarp: A Whole Slide Image Registration Pipeline to Characterize Scanner-Induced Local Domain Shift Antoine Schieb et.al. 2503.20653 null
2025-03-26 Robust Flower Cluster Matching Using The Unscented Transform Andy Chu et.al. 2503.20631 null
2025-03-26 Vision-Amplified Semantic Entropy for Hallucination Detection in Medical Visual Question Answering Zehui Liao et.al. 2503.20504 null
2025-03-25 SACB-Net: Spatial-awareness Convolutions for Medical Image Registration Xinxing Cheng et.al. 2503.19592 link
2025-03-21 Model reduction of convection-dominated viscous conservation laws using implicit feature tracking and landmark image registration Victor Zucatti et.al. 2503.17463 null
2025-03-21 Halton Scheduler For Masked Generative Image Transformer Victor Besnier et.al. 2503.17076 link
2025-03-21 Downstream Analysis of Foundational Medical Vision Models for Disease Progression Basar Demir et.al. 2503.16842 null
2025-03-18 Weakly Supervised Spatial Implicit Neural Representation Learning for 3D MRI-Ultrasound Deformable Image Registration in HDR Prostate Brachytherapy Jing Wang et.al. 2503.14395 null
2025-03-18 Text-Guided Image Invariant Feature Learning for Robust Image Watermarking Muhammad Ahtesham et.al. 2503.13805 null
2025-03-17 UniReg: Foundation Model for Controllable Medical Image Registration Zi Li et.al. 2503.12868 null
2025-03-15 Meta-operators for all-optical image processing Linzhi Yu et.al. 2503.12252 null
2025-03-14 Multi-Stage Generative Upscaler: Reconstructing Football Broadcast Images via Diffusion Models Luca Martini et.al. 2503.11181 null
2025-03-13 How Should We Evaluate Uncertainty in Accelerated MRI Reconstruction? Luca Trautmann et.al. 2503.10527 null
2025-03-14 On the Limitations of Vision-Language Models in Understanding Image Transforms Ahmad Mustafa Anis et.al. 2503.09837 null
2025-03-10 NimbleReg: A light-weight deep-learning framework for diffeomorphic image registration Antoine Legouhy et.al. 2503.07768 null
2025-03-10 Evaluation of Alignment-Regularity Characteristics in Deformable Image Registration Vasiliki Sideri-Lampretsa et.al. 2503.07185 null
2025-03-07 New multimodal similarity measure for image registration via modeling local functional dependence with linear combination of learned basis functions Joel Honkamaa et.al. 2503.05335 link
2025-03-07 Diff-Reg v2: Diffusion-Based Matching Matrix Estimation for Image Matching and 3D Registration Qianliang Wu et.al. 2503.04127 null
2025-03-02 Cross Modality Medical Image Synthesis for Improving Liver Segmentation Muhammad Rafiq et.al. 2503.00945 null
2025-03-02 Personalizing the meshed SPL/NAC Brain Atlas for patient-specific scientific computing using SynthMorph Andy Huynh et.al. 2503.00931 null
2025-03-02 NCF: Neural Correspondence Field for Medical Image Registration Lei Zhou et.al. 2503.00760 null
2025-02-26 Deep learning and classical computer vision techniques in medical image analysis: Case studies on brain MRI tissue segmentation, lung CT COPD registration, and skin lesion classification Anyimadu Daniel Tweneboah et.al. 2502.19258 null
2025-02-26 From Traditional to Deep Learning Approaches in Whole Slide Image Registration: A Methodological Review Behnaz Elhaminia et.al. 2502.19123 null
2025-02-24 SynthRAD2025 Grand Challenge dataset: generating synthetic CTs for radiotherapy Adrian Thummerer et.al. 2502.17609 null
2025-02-22 Good Representation, Better Explanation: Role of Convolutional Neural Networks in Transformer-Based Remote Sensing Image Captioning Swadhin Das et.al. 2502.16095 null
2025-02-23 Triad: Vision Foundation Model for 3D Magnetic Resonance Imaging Shansong Wang et.al. 2502.14064 link
2025-02-17 On the Logic Elements Associated with Round-Off Errors and Gaussian Blur in Image Registration: A Simple Case of Commingling Serap A. Savari et.al. 2502.11992 null
2025-02-17 Medical Image Registration Meets Vision Foundation Model: Prototype Learning and Contour Awareness Hao Xu et.al. 2502.11440 link
2025-02-15 Super Resolution image reconstructs via total variation-based image deconvolution: a majorization-minimization approach Mouhamad Chehaitly et.al. 2502.10876 null
2025-02-15 Hybrid Deepfake Image Detection: A Comprehensive Dataset-Driven Approach Integrating Convolutional and Attention Mechanisms with Frequency Domain Features Kafi Anan et.al. 2502.10682 null
2025-02-14 PromptArtisan: Multi-instruction Image Editing in Single Pass with Complete Attention Control Kunal Swami et.al. 2502.10258 null
2025-02-13 Vision-based Geo-Localization of Future Mars Rotorcraft in Challenging Illumination Conditions Dario Pisanti et.al. 2502.09795 null
2025-02-12 MRUCT: Mixed Reality Assistance for Acupuncture Guided by Ultrasonic Computed Tomography Yue Yang et.al. 2502.08786 null
2025-02-07 Investigating the impact of kernel harmonization and deformable registration on inspiratory and expiratory chest CT images for people with COPD Aravind R. Krishnan et.al. 2502.05119 null
2025-02-06 Expanding Training Data for Endoscopic Phenotyping of Eosinophilic Esophagitis Juming Xiong et.al. 2502.04199 null
2025-02-05 REALEDIT: Reddit Edits As a Large-scale Empirical Dataset for Image Transformations Peter Sushko et.al. 2502.03629 null
2025-02-05 A Unified Framework for Semi-Supervised Image Segmentation and Registration Ruizhe Li et.al. 2502.03229 null
2025-02-05 Tell2Reg: Establishing spatial correspondence between images by the same language prompts Wen Yan et.al. 2502.03118 link
2025-02-05 PoleStack: Robust Pole Estimation of Irregular Objects from Silhouette Stacking Jacopo Villa et.al. 2502.02907 null
2025-02-04 Test Time Training for 4D Medical Image Interpolation Qikang Zhang et.al. 2502.02341 link
2025-02-04 MORPH-LER: Log-Euclidean Regularization for Population-Aware Image Registration Mokshagna Sai Teja Karanam et.al. 2502.02029 null
2025-02-03 Label Correction for Road Segmentation Using Road-side Cameras Henrik Toikka et.al. 2502.01281 null
2025-02-03 Multi-Resolution SAR and Optical Remote Sensing Image Registration Methods: A Review, Datasets, and Future Perspectives Wenfei Zhang et.al. 2502.01002 null
2025-01-31 Transformation trees -- documentation of multimodal image registration Agnieszka Anna Tomaka et.al. 2501.19140 null
2025-01-31 An Adversarial Approach to Register Extreme Resolution Tissue Cleared 3D Brain Images Abdullah Naziba et.al. 2501.18815 link
2025-01-27 Multi-Objective Deep-Learning-based Biomechanical Deformable Image Registration with MOREA Georgios Andreadis et.al. 2501.16525 null
2025-01-23 Variational U-Net with Local Alignment for Joint Tumor Extraction and Registration (VALOR-Net) of Breast MRI Data Acquired at Two Different Field Strengths Muhammad Shahkar Khan et.al. 2501.13690 null
2025-01-22 Learning accurate rigid registration for longitudinal brain MRI from synthetic data Jingru Fu et.al. 2501.13010 null
2025-01-22 LiT: Delving into a Simplified Linear Diffusion Transformer for Image Generation Jiahao Wang et.al. 2501.12976 null
2025-01-21 Regressor-Guided Image Editing Regulates Emotional Response to Reduce Online Engagement Christoph Gebhardt et.al. 2501.12289 null
2025-01-18 Deformable Image Registration of Dark-Field Chest Radiographs for Local Lung Signal Change Assessment Fabian Drexel et.al. 2501.10757 null
2025-01-18 Quasi-linear maps and image transformations S. V. Butler et.al. 2501.10635 null
2025-01-15 A Vessel Bifurcation Landmark Pair Dataset for Abdominal CT Deformable Image Registration (DIR) Validation Edward R Criscuolo et.al. 2501.09162 link
2025-01-15 TimeFlow: Longitudinal Brain Image Registration and Aging Progression Analysis Bailiang Jian et.al. 2501.08667 null
2025-01-13 MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training Xingyi He et.al. 2501.07556 null
2025-01-13 Implicit Neural Representations for Registration of Left Ventricle Myocardium During a Cardiac Cycle Mathias Micheelsen Lowes et.al. 2501.07248 link
2025-01-19 Improved joint modelling of breast cancer radiomics features and hazard by image registration aided longitudinal CT data Subrata Mukherjee et.al. 2501.06814 null
2025-01-06 COph100: A comprehensive fundus image registration dataset from infants constituting the "RIDIRP" database Yan Hu et.al. 2501.02800 null
2025-01-02 Rephotography in the Digital Era: Mass Rephotography and re.photos, the Web Portal for Rephotography Axel Schaffland et.al. 2501.02017 null
2024-12-31 Estimation of 3T MR images from 1.5T images regularized with Physics based Constraint Prabhjot Kaur et.al. 2501.01464 null
2024-12-29 Motion Transfer-Driven intra-class data augmentation for Finger Vein Recognition Xiu-Feng Huang et.al. 2412.20327 link
2024-12-27 Structural Similarity in Deep Features: Image Quality Assessment Robust to Geometrically Disparate Reference Keke Zhang et.al. 2412.19553 null
2024-12-24 Advancing Deformable Medical Image Registration with Multi-axis Cross-covariance Attention Mingyuan Meng et.al. 2412.18545 null
2024-12-23 Unsupervised learning of spatially varying regularization for diffeomorphic image registration Junyu Chen et.al. 2412.17982 null
2024-12-22 Classifier-guided registration of coronary CT angiography and intravascular ultrasound R. L. M. van Herten et.al. 2412.17100 null
2024-12-20 LEDA: Log-Euclidean Diffeomorphic Autoencoder for Efficient Statistical Analysis of Diffeomorphism Krithika Iyer et.al. 2412.16129 null
2024-12-20 From Model Based to Learned Regularization in Medical Image Registration: A Comprehensive Review Anna Reithmeir et.al. 2412.15740 null
2024-12-19 MUSTER: Longitudinal Deformable Registration by Composition of Consecutive Deformations Edvard O. S. Grødem et.al. 2412.14671 link
2024-12-19 E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling Zhihang Yuan et.al. 2412.14170 null
2024-12-17 Image registration is a geometric deep learning task Vasiliki Sideri-Lampretsa et.al. 2412.13294 null
2024-12-17 Prompt Augmentation for Self-supervised Text-guided Image Manipulation Rumeysa Bodur et.al. 2412.13081 null
2024-12-17 Identifying Bias in Deep Neural Networks Using Image Transforms Sai Teja Erukude et.al. 2412.13079 link
2024-12-16 IDProtector: An Adversarial Noise Encoder to Protect Against ID-Preserving Image Generation Yiren Song et.al. 2412.11638 null
2024-12-13 RAID-Database: human Responses to Affine Image Distortions Paula Daudén-Oliver et.al. 2412.10211 null
2024-12-12 On Round-Off Errors and Gaussian Blur in Superresolution and in Image Registration Serap A. Savari et.al. 2412.09741 null
2024-12-10 AmCLR: Unified Augmented Learning for Cross-Modal Representations Ajay Jagannath et.al. 2412.07979 link
2024-12-09 Table2Image: Interpretable Tabular data Classification with Realistic Image Transformations Seungeun Lee et.al. 2412.06265 link
2024-12-05 Blind Underwater Image Restoration using Co-Operational Regressor Networks Ozer Can Devecioglu et.al. 2412.03995 null
2024-12-04 MRNet: Multifaceted Resilient Networks for Medical Image-to-Image Translation Hyojeong Lee et.al. 2412.03039 null
2024-12-02 CTRL-D: Controllable Dynamic 3D Scene Editing with Personalized 2D Diffusion Kai He et.al. 2412.01792 null
2024-12-03 Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation Bolin Lai et.al. 2412.01027 null
2024-11-28 FAN-Unet: Enhancing Unet with vision Fourier Analysis Block for Biomedical Image Segmentation Jiashu Xu et.al. 2411.18975 null
2024-11-27 Neural Image Unfolding: Flattening Sparse Anatomical Structures using Neural Fields Leonhard Rist et.al. 2411.18415 null
2024-11-26 CAMLD: Contrast-Agnostic Medical Landmark Detection with Consistency-Based Regularization Soorena Salari et.al. 2411.17845 null
2024-11-25 Improving Deformable Image Registration Accuracy through a Hybrid Similarity Metric and CycleGAN Based Auto-Segmentation Keyur D. Shah et.al. 2411.16992 null
2024-11-25 Oriented histogram-based vector field embedding for characterizing 4D CT data sets in radiotherapy Frederic Madesta et.al. 2411.16314 null
2024-11-28 Can Encrypted Images Still Train Neural Networks? Investigating Image Information and Random Vortex Transformation XiaoKai Cao et.al. 2411.16207 link
2024-11-24 Making Images from Images: Interleaving Denoising and Transformation Shumeet Baluja et.al. 2411.15925 null
2024-11-24 ZeroGS: Training 3D Gaussian Splatting from Unposed Images Yu Chen et.al. 2411.15779 null
2024-11-23 LDM-Morph: Latent diffusion model guided deformable image registration Jiong Wu et.al. 2411.15426 link
2024-11-26 Exploiting Watermark-Based Defense Mechanisms in Text-to-Image Diffusion Models for Unauthorized Data Usage Soumil Datta et.al. 2411.15367 null
2024-11-21 Automatic brain tumor segmentation in 2D intra-operative ultrasound images using MRI tumor annotations Mathilde Faanes et.al. 2411.14017 link
2024-11-20 Virtual Staining of Label-Free Tissue in Imaging Mass Spectrometry Yijie Zhang et.al. 2411.13120 null
2024-11-13 A generalized software framework for consolidation of radiotherapy planning and delivery data from diverse data sources Yasin Abdulkadir et.al. 2411.08876 null
2024-11-12 Atmospheric turbulence restoration by diffeomorphic image registration and blind deconvolution Jerome Gilles et.al. 2411.07578 null
2024-11-12 Uncertainty-Aware Test-Time Adaptation for Inverse Consistent Diffeomorphic Lung Image Registration Muhammad F. A. Chaudhary et.al. 2411.07567 null
2024-11-11 XPoint: A Self-Supervised Visual-State-Space based Architecture for Multispectral Image Registration Ismail Can Yagmur et.al. 2411.07430 link
2024-11-10 Graph Neural Networks for modelling breast biomechanical compression Hadeel Awwad et.al. 2411.06596 link
2024-11-09 NeuReg: Domain-invariant 3D Image Registration on Human and Mouse Brains Taha Razzaq et.al. 2411.06315 null
2024-11-11 Relationships between the degrees of freedom in the affine Gaussian derivative model for visual receptive fields and 2-D affine image transformations, with application to covariance properties of simple cells in the primary visual cortex Tony Lindeberg et.al. 2411.05673 null
2024-11-05 A Symmetric Dynamic Learning Framework for Diffeomorphic Medical Image Registration Jinqiu Deng et.al. 2411.02888 null
2024-11-05 Applications of Automatic Differentiation in Image Registration Warin Watson et.al. 2411.02806 link
2024-11-04 Multi-modal deformable image registration using untrained neural networks Quang Luong Nhat Nguyen et.al. 2411.02672 null
2024-11-04 Advanced computer vision for extracting georeferenced vehicle trajectories from drone imagery Robert Fonod et.al. 2411.02136 null
2024-11-03 FactorizePhys: Matrix Factorization for Multidimensional Attention in Remote Physiological Sensing Jitesh Joshi et.al. 2411.01542 link
2024-11-03 MambaReg: Mamba-Based Disentangled Convolutional Sparse Coding for Unsupervised Deformable Multi-Modal Image Registration Kaiang Wen et.al. 2411.01399 null
2024-11-02 RLE: A Unified Perspective of Data Augmentation for Cross-Spectral Re-identification Lei Tan et.al. 2411.01225 link
2024-10-29 NCA-Morph: Medical Image Registration with Neural Cellular Automata Amin Ranem et.al. 2410.22265 link
2024-10-27 Unsupervised Panoptic Interpretation of Latent Spaces in GANs Using Space-Filling Vector Quantization Mohammad Hassan Vali et.al. 2410.20573 link
2024-10-27 UTSRMorph: A Unified Transformer and Superresolution Network for Unsupervised Medical Image Registration Runshi Zhang et.al. 2410.20348 link
2024-10-26 Cross-Survey Image Transformation: Enhancing SDSS and DECaLS Images to Near-HSC Quality for Advanced Astronomical Analysis Zhijian Luo et.al. 2410.20025 null
2024-10-25 Utilizing Image Transforms and Diffusion Models for Generative Modeling of Short and Long Time Series Ilan Naiman et.al. 2410.19538 null
2024-10-24 A Counterexample in Cross-Correlation Template Matching Serap A. Savari et.al. 2410.19085 null
2024-10-24 Python workflow for segmenting multiphase flow in porous rocks Catherine Spurin et.al. 2410.18937 link
2024-10-23 MsMorph: An Unsupervised pyramid learning network for brain image registration Jiaofen Nan et.al. 2410.18228 link
2024-10-23 Improving Instance Optimization in Deformable Image Registration with Gradient Projection Yi Zhang et.al. 2410.15767 null
2024-10-18 GESH-Net: Graph-Enhanced Spherical Harmonic Convolutional Networks for Cortical Surface Registration Ruoyu Zhang et.al. 2410.14805 null
2024-10-18 2D-3D Deformable Image Registration of Histology Slide and Micro-CT with ML-based Initialization Junan Chen et.al. 2410.14343 null
2024-10-17 SAMReg: SAM-enabled Image Registration with ROI-based Correspondence Shiqi Huang et.al. 2410.14083 link
2024-10-13 S $^4$ ST: A Strong, Self-transferable, faSt, and Simple Scale Transformation for Transferable Targeted Attack Yongxiang Liu et.al. 2410.13891 null
2024-10-15 RS-MOCO: A deep learning-based topology-preserving image registration method for cardiac T1 mapping Chiyi Huang et.al. 2410.11651 null
2024-10-14 MoonMetaSync: Lunar Image Registration Analysis Ashutosh Kumar et.al. 2410.11118 link
2024-10-14 Stationary Velocity Fields on Matrix Groups for Deformable Image Registration Johannes Bostelmann et.al. 2410.10997 null
2024-10-14 A Counterexample in Image Registration Serap A. Savari et.al. 2410.10725 null
2024-10-12 FiRework: Field Refinement Framework for Efficient Enhancement of Deformable Registration Haiqiao Wang et.al. 2410.09595 link
2024-10-12 Leveraging Semantic Cues from Foundation Vision Models for Enhanced Local Feature Correspondence Felipe Cadar et.al. 2410.09533 link
2024-10-11 Hierarchical uncertainty estimation for learning-based registration in neuroimaging Xiaoling Hu et.al. 2410.09299 link

(back to top)

About

🎓Automatically Update CV Papers Daily using Github Actions (Update Every 12th hours)

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%