Multi-View Intelligent Perception多视角智能感知研究组

Beijing Jiaotong University | School of Computer Sciense & Technology | INSTITUTE OF NETWORK SCIENCE AND INTELLIGENT SYSTEMS

Our group primarily focuses on Multi-View Intelligent Perception in Real-World Degraded Scenarios. The core objective is to bridge the gap between "seeing clearly" (high-fidelity image restoration) and "understanding accurately" (precise 3D perception) in complex, uncontrolled environments. By leveraging Light Field (LF) imaging, Epipolar Geometry Consistency, and Multi-Plane Image (MPI) representations, our work seeks to break through the physical limitations and geometric ambiguities inherent in traditional single-view vision.

Email Google Scholar INSIS

Research Interests

This overarching theme is driven by two main pillars of research:

Pixel-level Vision Tasks

This track addresses the challenge of restoring high-fidelity visual information when data is corrupted by environmental degradation (e.g., low light, rain, reflections, and sensor noise). Low-Light Enhancement & Denoising: Exploring view-consistency priors and multi-stream progressive networks to achieve scene-adaptive illumination and noise reduction. Interference Removal (Reflection & Raindrops): Innovating hierarchical multi-plane image (MPI) construction and multi-layer interaction mechanisms to achieve pixel-level decoupling of background and interference layers. Light Field Super-Resolution: Combining flexible hybrid lenses and structure-aware neural rendering to reconstruct high-resolution, high-quality light field images.

Image Super-Resolution

Generating high-resolution images from multiple low-resolution views using deep learning algorithms to surpass hardware limitations.

Occlusion Removal

Analyzing the spatial relationship between occluders and background using multi-view information to intelligently remove foreground occlusions.

Reflection Separation

Separating reflection and transmission layers by leveraging multi-view consistency constraints for images affected by glass reflections.

Image Deblurring

Restoring image clarity from motion blur or defocus blur through the design of specialized deep neural networks.

Denoising / Deraining

Enhancing visual quality in noisy or adverse weather conditions using multi-view information fusion and noise modeling.

Low-light Enhancement

Improving brightness and visual quality in low-light scenes while maintaining view consistency by utilizing complementary scene details.

3D Reconstruction Tasks

This track tackles the challenges of estimating geometry and understanding semantics in the presence of severe occlusions, weak textures, and complex materials. Unsupervised & Occlusion-Robust Depth Estimation: Proposing epipolar consistent attention aggregation networks to overcome physical occlusion bottlenecks and achieve highly accurate, unsupervised light field disparity estimation. Geometry Perception for Complex Materials: Solving depth ambiguity issues for transparent and reflective surfaces through dual-layer depth estimation and decoupling strategies. Semantic Understanding & Occlusion Removal: Integrating structural priors into LF semantic segmentation and advancing flexible 3D occlusion mask learning within Neural Radiance Fields (NeRF).

Depth Estimation

Accurately estimating scene depth distribution using epipolar plane image analysis and deep learning algorithms from multi-view inputs.

View Synthesis / Novel View Synthesis

Reconstructing 3D scene models from sparse view inputs and synthesizing realistic novel views using techniques like NeRF.

Salient Object Detection & Semantic Segmentation

Detecting salient objects and performing pixel-level semantic segmentation by combining depth information with appearance features.

Salient Object Detection Semantic Segmentation Scene Understanding

Publications

Representative papers published in top-tier international journals and conferences

Pixel-level Vision Tasks

CCF B TCSVT 2026

Hierarchical Interactive Multi-Plane Image Construction for Light Field Background and Reflection Separation

Jiajun Chen, Shuo Zhang*, Yichang Lv, Chen Gao, Youfang Lin

IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2026

CCF C NeuCom 2026

Depth-aware Intra & Inter Aggregation for Light Field Raindrop Removal

Qihua Chen, Shuo Zhang*, Chen Gao, Youfang Lin

Neurocomputing, 2026

CCF A TVCG 2025

Progressive Multi-Plane Images Construction for Light Field Occlusion Removal

Shuo Zhang, Song Chang*, Youfang Lin

IEEE Transactions on Visualization and Computer Graphics (TVCG), 2025

CCF B TMM 2025

Structure-Aware Pre-Selected Neural Rendering for Light Field Reconstruction

Song Chang, Youfang Lin, Shuo Zhang*

IEEE Transactions on Multimedia (TMM), 2025

CCF A ICCV 2025

Exploring View Consistency for Scene-Adaptive Low-Light Light Field Image Enhancement

Shuo Zhang, Chen Gao, Youfang Lin*

International Conference on Computer Vision (ICCV), 2025

TCI 2024

Learning Light Field Denoising with Symmetrical Refocusing Strategy

Song Chang, Youfang Lin, Wenqi Wang, Da An, Shuo Zhang*

IEEE Transactions on Computational Imaging (TCI), 2024

CCF C PRCV 2024

Multi-3D Occlusion Mask Learning for Flexible Occlusion Removal in Neural Radiance Fields

Zhuoyu Shi, Shuo Zhang, Song Chang, Youfang Lin

The 7th Chinese Conference on Pattern Recognition and Computer Vision (PRCV), 2024

TCI 2023

Light Field Reflection and Background Separation Network Based on Adaptive Focus Selection

Zeqi Shen, Shuo Zhang*, Youfang Lin

IEEE Transactions on Computational Imaging (TCI), 2023

TCI 2023

Multi-Stream Progressive Restoration for Low-Light Light Field Enhancement and Denoising

Xianglang Wang, Youfang Lin, Shuo Zhang*

IEEE Transactions on Computational Imaging (TCI), 2023

TCI 2022

Light Field Reconstruction using Efficient Pseudo 4D Epipolar-Aware Structure

Yangling Chen, Shuo Zhang*, Song Chang, Youfang Lin

IEEE Transactions on Computational Imaging (TCI), 2022

CCF A ACM MM 2022

Flexible Hybrid Lenses Light Field Super-Resolution using Layered Refinement

Song Chang, Youfang Lin, Shuo Zhang*

The 30th ACM International Conference on Multimedia (ACM MM), 2022

CCF A TIP 2021

End-to-End Light Field Spatial Super-Resolution Network using Multiple Epipolar Geometry

Shuo Zhang, Song Chang, Youfang Lin*

IEEE Transactions on Image Processing (TIP), 2021

TCI 2021

Micro-lens Image Upsampling for Densely-Sampled Light Field Reconstruction

Shuo Zhang, Song Chang, Zeqi Shen, Youfang Lin*

IEEE Transactions on Computational Imaging (TCI), 2021

CCF A IJCAI 2021

Removing Foreground Occlusions in Light Field using Micro-lens Dynamic Filter

Shuo Zhang, Zeqi Shen, Youfang Lin*

The 30th International Joint Conference on Artificial Intelligence (IJCAI), 2021

CCF A CVPR 2019

Residual Network for Light Field Super-Resolution

Shuo Zhang, Youfang Lin, Hao Sheng

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019

3D Reconstruction Tasks

CCF B PR 2026

A Unified Occlusion-free Framework for Unsupervised Light Field Depth Estimation

Longzhao Guo, Shuo Zhang*, Youfang Lin

Pattern Recognition (PR), 2026

CCF A CVPR 2026

LF-BVN: Blind-View Network for Self-Supervised Light Field Denoising

Guolong Zhao, Shuo Zhang*, Chen Gao, Qian Tian, Youfang Lin

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2026

CCF B TCSVT 2025

Decoupling and Aggregating: Dual-layer Light Field Depth Estimation with Reflective and Transparent Surfaces

Shuo Zhang, Yanlin Xie, Jiaxin Chen, Youfang Lin*

IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2025

CCF A ICCV 2025

Epipolar Consistent Attention Aggregation Network for Unsupervised Light Field Disparity Estimation

Chen Gao, Youfang Lin, Shuo Zhang*

International Conference on Computer Vision (ICCV), 2025

CCF A ACM MM 2025

Epipolar Consistency-based Network for Structure-Aware LF Semantic Segmentation

Chen Gao, Youfang Lin, Wenbin Wang, Shuo Zhang*

The 33th ACM International Conference on Multimedia (ACM MM), 2025

TCI 2024

Hierarchical Edge Refinement Network for Guided Depth Map Super-Resolution

Shuo Zhang*, Zexu Pan, Yichang Lv, Youfang Lin

IEEE Transactions on Computational Imaging (TCI), 2024

CCF C SPL 2021

Enhanced Spinning Parallelogram Operator Combining Color Constraint and Histogram Integration for Robust Light Field Depth Estimation

Weikun Wang, Youfang Lin, Shuo Zhang*

IEEE Signal Processing Letters (SPL), 2021

CCF A ACM MM 2021

Occlusion-aware Bi-directional Guided Network for Light Field Salient Object Detection

Dong Jing, Shuo Zhang*, Runmin Cong, Youfang Lin

The 29th ACM International Conference on Multimedia (ACM MM), 2021

CCF A AAAI 2021

Attention-based Multi-Level Fusion Network for Light Field Depth Estimation

Jiaxin Chen, Shuo Zhang*, Youfang Lin

The 35th AAAI Conference on Artificial Intelligence (AAAI), 2021

CCF A TIP 2018

Micro-lens-based Matching for Scene Recovery in Lenslet Cameras

Shuo Zhang, Hao Sheng*, Jun Zhang, Zhang Xiong

IEEE Transactions on Image Processing (TIP), 2018

CCF A TIP 2017

Geometric Occlusion Analysis in Depth Estimation using Integral Guided Filter for Light-Field Image

Hao Sheng, Shuo Zhang*, Xiaochun Cao, Yajun Fang, Zhang Xiong

IEEE Transactions on Image Processing (TIP), 2017

CCF B PR 2017

Occlusion-Aware Depth Estimation for Light Field Using Multi-Orientation EPIs

Hao Sheng, Pan Zhao, Shuo Zhang*, Jun Zhang, Da Yang

Pattern Recognition (PR), 2017

CCF B ICASSP 2016

Relative Location for Light Field Saliency Detection

Hao Sheng, Shuo Zhang, Xiaoyu Liu, Zhang Xiong

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2016

CCF B ICASSP 2016

Saliency Analysis based on Depth Contrast Increased

Hao Sheng, Xiaoyu Liu, Shuo Zhang

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016

CCF B CVIU 2016

Robust Depth Estimation for Light Field via Spinning Parallelogram Operator

Shuo Zhang, Hao Sheng*, Chao Li, Zhang Xiong

Computer Vision and Image Understanding (CVIU), 2016

CCF C ICIP 2015

Guided Integral Filter Design for Light Field Stereo Matching

Hao Sheng, Shuo Zhang, Gengliang Zhu, Zhang Xiong

IEEE International Conference on Image Processing (ICIP), 2015

Collaborative Research

CCF B TMM 2025

CrossHypergraph: Consistent High-order Semantic Network for Few-shot Image Classification

Yucheng Zhang, Hao Wang, Shuo Zhang*, Biao Leng

IEEE Transactions on Multimedia (TMM), 2025

TIM 2025

Spatial-Aware Metric Network via Patchwise Feature Alignment for Few-Shot Learning

Yucheng Zhang, Shuo Zhang*, Biao Leng

IEEE Transactions on Instrumentation and Measurement (TIM), 2025

CCF A AAAI 2025

Unlocking the Potential of Reverse Distillation for Anomaly Detection

Xinyue Liu, Jianyuan Wang*, Biao Leng, Shuo Zhang

The 39th Annual AAAI Conference on Artificial Intelligence (AAAI), 2025

CCF A ACM MM 2024

Dual-Modeling Decouple Distillation for Unsupervised Anomaly Detection

Xinyue Liu, Jianyuan Wang*, Biao Leng, Shuo Zhang

The 32th ACM International Conference on Multimedia (ACM MM), 2024