Multi-View Intelligent Perception Research Group
Beijing Jiaotong University | School of Computer Science & Technology | Institute of Network Science and Intelligent Systems
Our group primarily focuses on Multi-View Intelligent Perception in Real-World Degraded Scenarios. The core objective is to bridge the gap between "seeing clearly" (high-fidelity image restoration) and "understanding accurately" (precise 3D perception) in complex, uncontrolled environments. By leveraging Light Field (LF) imaging, Epipolar Geometry Consistency, and Multi-Plane Image (MPI) representations, our work seeks to break through the physical limitations and geometric ambiguities inherent in traditional single-view vision.
Research Interests
Our research is organized into two main tracks:
Pixel-level Vision Tasks
This track addresses the challenge of restoring high-fidelity visual information when data is corrupted by environmental degradation (e.g., low light, rain, reflections, and sensor noise).
Low-Light Enhancement & Denoising: Exploring view-consistency priors and multi-stream progressive networks to achieve scene-adaptive illumination and noise reduction.
Interference Removal (Reflection & Raindrops): Innovating hierarchical multi-plane image (MPI) construction and multi-layer interaction mechanisms to achieve pixel-level decoupling of background and interference layers.
Light Field Super-Resolution: Combining flexible hybrid lenses and structure-aware neural rendering to reconstruct high-resolution, high-quality light field images.
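As a generic illustration of the multi-plane image (MPI) representation named in this track, and not the group's published method, the sketch below composites the planes of a single pixel from back to front with the standard "over" operator. The function name `composite_mpi`, the two-plane setup, and the intensity values are assumptions made for this example.

```python
def composite_mpi(planes):
    """Composite one pixel's MPI planes with the 'over' operator.

    `planes` is a list of (intensity, alpha) pairs ordered back to front.
    """
    color = 0.0
    for plane_color, alpha in planes:
        # The nearer plane covers the accumulated result in proportion to its alpha.
        color = plane_color * alpha + color * (1.0 - alpha)
    return color


# Hypothetical pixel: an opaque background plane behind a half-transparent
# interference plane (for example a reflection or a raindrop).
background = (0.8, 1.0)
interference = (0.2, 0.5)

observed = composite_mpi([background, interference])
background_only = composite_mpi([background])
```

Because each plane keeps its own intensity and alpha, dropping the interference plane before compositing returns the background value unchanged. This is the sense in which a layered representation lets the two be decoupled in this toy setting.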
Image Super-Resolution
Generating high-resolution images from multiple low-resolution views using deep learning algorithms to surpass hardware limitations.
Occlusion Removal
Analyzing the spatial relationship between occluders and background using multi-view information to intelligently remove foreground occlusions.
Reflection Separation
Separating reflection and transmission layers by leveraging multi-view consistency constraints for images affected by glass reflections.
Image Deblurring
Restoring image clarity from motion blur or defocus blur through the design of specialized deep neural networks.
Denoising / Deraining
Enhancing visual quality in noisy or adverse weather conditions using multi-view information fusion and noise modeling.
Low-light Enhancement
Improving brightness and visual quality in low-light scenes while maintaining view consistency by utilizing complementary scene details.
3D Reconstruction Tasks
This track tackles the challenges of estimating geometry and understanding semantics in the presence of severe occlusions, weak textures, and complex materials.
Unsupervised & Occlusion-Robust Depth Estimation: Proposing epipolar consistent attention aggregation networks to overcome physical occlusion bottlenecks and achieve highly accurate, unsupervised light field disparity estimation.
Geometry Perception for Complex Materials: Solving depth ambiguity issues for transparent and reflective surfaces through dual-layer depth estimation and decoupling strategies.
Semantic Understanding & Occlusion Removal: Integrating structural priors into LF semantic segmentation and advancing flexible 3D occlusion mask learning within Neural Radiance Fields (NeRF).
Depth Estimation
Accurately estimating scene depth distribution using epipolar plane image analysis and deep learning algorithms from multi-view inputs.
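The idea behind epipolar plane image (EPI) analysis can be sketched with a toy example: a scene point traces a line across the views of an EPI, and the slope of that line is its disparity. The code below is a textbook-style illustration under simplifying assumptions (a single row, integer disparities, wrap-around borders, no occlusion) and is not the group's method. The names `make_epi` and `estimate_disparity` are hypothetical.

```python
def make_epi(row, disparity, num_views):
    """Build a synthetic EPI: view u sees `row` shifted by disparity * (u - center)."""
    width = len(row)
    center = num_views // 2
    return [
        [row[(x + disparity * (u - center)) % width] for x in range(width)]
        for u in range(num_views)
    ]


def estimate_disparity(epi, x, candidates):
    """Return the candidate slope with the lowest intensity variance along its EPI line."""
    num_views, width = len(epi), len(epi[0])
    center = num_views // 2
    best, best_cost = None, float("inf")
    for d in candidates:
        # Sample the line through (center view, x) whose slope is the candidate d.
        samples = [epi[u][(x - d * (u - center)) % width] for u in range(num_views)]
        mean = sum(samples) / num_views
        cost = sum((s - mean) ** 2 for s in samples)
        if cost < best_cost:
            best, best_cost = d, cost
    return best


# Textured reference row with 16 distinct values, shifted by 1 pixel per view.
row = [(5 * i) % 16 for i in range(16)]
epi = make_epi(row, disparity=1, num_views=5)
estimates = [estimate_disparity(epi, x, range(-2, 3)) for x in range(16)]
```

Along the correct slope every view sees the same scene point, so the variance drops to zero. Real light fields break this assumption at occlusions, on weakly textured regions, and on reflective or transparent surfaces, which is where the challenges listed for this track arise.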
View Synthesis / Novel View Synthesis
Reconstructing 3D scene models from sparse view inputs and synthesizing realistic novel views using techniques like NeRF.
Salient Object Detection & Semantic Segmentation
Detecting salient objects and performing pixel-level semantic segmentation by combining depth information with appearance features.
Publications
Representative papers published in top-tier international journals and conferences
Pixel-level Vision Tasks
Hierarchical Interactive Multi-Plane Image Construction for Light Field Background and Reflection Separation
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2026
Depth-aware Intra & Inter Aggregation for Light Field Raindrop Removal
Neurocomputing, 2026
Progressive Multi-Plane Images Construction for Light Field Occlusion Removal
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2025
Structure-Aware Pre-Selected Neural Rendering for Light Field Reconstruction
IEEE Transactions on Multimedia (TMM), 2025
Exploring View Consistency for Scene-Adaptive Low-Light Light Field Image Enhancement
International Conference on Computer Vision (ICCV), 2025
Learning Light Field Denoising with Symmetrical Refocusing Strategy
IEEE Transactions on Computational Imaging (TCI), 2024
Multi-3D Occlusion Mask Learning for Flexible Occlusion Removal in Neural Radiance Fields
The 7th Chinese Conference on Pattern Recognition and Computer Vision (PRCV), 2024
Light Field Reflection and Background Separation Network Based on Adaptive Focus Selection
IEEE Transactions on Computational Imaging (TCI), 2023
Multi-Stream Progressive Restoration for Low-Light Light Field Enhancement and Denoising
IEEE Transactions on Computational Imaging (TCI), 2023
Light Field Reconstruction using Efficient Pseudo 4D Epipolar-Aware Structure
IEEE Transactions on Computational Imaging (TCI), 2022
Flexible Hybrid Lenses Light Field Super-Resolution using Layered Refinement
The 30th ACM International Conference on Multimedia (ACM MM), 2022
End-to-End Light Field Spatial Super-Resolution Network using Multiple Epipolar Geometry
IEEE Transactions on Image Processing (TIP), 2021
Micro-lens Image Upsampling for Densely-Sampled Light Field Reconstruction
IEEE Transactions on Computational Imaging (TCI), 2021
Removing Foreground Occlusions in Light Field using Micro-lens Dynamic Filter
The 30th International Joint Conference on Artificial Intelligence (IJCAI), 2021
Residual Network for Light Field Super-Resolution
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019
3D Reconstruction Tasks
A Unified Occlusion-free Framework for Unsupervised Light Field Depth Estimation
Pattern Recognition (PR), 2026
LF-BVN: Blind-View Network for Self-Supervised Light Field Denoising
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2026
Decoupling and Aggregating: Dual-layer Light Field Depth Estimation with Reflective and Transparent Surfaces
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2025
Epipolar Consistent Attention Aggregation Network for Unsupervised Light Field Disparity Estimation
International Conference on Computer Vision (ICCV), 2025
Epipolar Consistency-based Network for Structure-Aware LF Semantic Segmentation
The 33rd ACM International Conference on Multimedia (ACM MM), 2025
Hierarchical Edge Refinement Network for Guided Depth Map Super-Resolution
IEEE Transactions on Computational Imaging (TCI), 2024
Enhanced Spinning Parallelogram Operator Combining Color Constraint and Histogram Integration for Robust Light Field Depth Estimation
IEEE Signal Processing Letters (SPL), 2021
Occlusion-aware Bi-directional Guided Network for Light Field Salient Object Detection
The 29th ACM International Conference on Multimedia (ACM MM), 2021
Attention-based Multi-Level Fusion Network for Light Field Depth Estimation
The 35th AAAI Conference on Artificial Intelligence (AAAI), 2021
Micro-lens-based Matching for Scene Recovery in Lenslet Cameras
IEEE Transactions on Image Processing (TIP), 2018
Geometric Occlusion Analysis in Depth Estimation using Integral Guided Filter for Light-Field Image
IEEE Transactions on Image Processing (TIP), 2017
Occlusion-Aware Depth Estimation for Light Field Using Multi-Orientation EPIs
Pattern Recognition (PR), 2017
Relative Location for Light Field Saliency Detection
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016
Saliency Analysis based on Depth Contrast Increased
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016
Robust Depth Estimation for Light Field via Spinning Parallelogram Operator
Computer Vision and Image Understanding (CVIU), 2016
Guided Integral Filter Design for Light Field Stereo Matching
IEEE International Conference on Image Processing (ICIP), 2015
Collaborative Research
CrossHypergraph: Consistent High-order Semantic Network for Few-shot Image Classification
IEEE Transactions on Multimedia (TMM), 2025
Spatial-Aware Metric Network via Patchwise Feature Alignment for Few-Shot Learning
IEEE Transactions on Instrumentation and Measurement (TIM), 2025
Unlocking the Potential of Reverse Distillation for Anomaly Detection
The 39th Annual AAAI Conference on Artificial Intelligence (AAAI), 2025
Dual-Modeling Decouple Distillation for Unsupervised Anomaly Detection
The 32nd ACM International Conference on Multimedia (ACM MM), 2024