Vision

Latest Face Recognition Research Papers

The newest Face Recognition papers from across the field — arXiv, NeurIPS, CVPR, Nature, and more — refreshed daily and ranked by relevance. Distill AI tracks Face Recognition so you don’t have to: get the standout work delivered to your inbox every morning, with 2-sentence summaries and the option to chat with any paper.

Get the latest Face Recognition papers in your inbox — free →

Recent papers

Defying the Catholic Secondary School Enrollment Decline: A Case Study Exploring Strategic Enrollment Management Practices in an All-Boys Catholic High School
Traci A Koval · Seton Hall University eRepo... · May 15, 2027
Catholic secondary schools in the United States continue to face persistent enrollment decline, closures, and consolidations, creating an urgent need for sustainable and mission-centered strategies. This qualitative case study examined Hill…
Exploring the Relationships Between Emotional Intelligence, Perceived Organizational Support, Job Satisfaction, and Intent to Stay Among Minnesota School Nurses
Kristin K Coudron · RED - a Repository of Digit... · May 6, 2027
School nurses face escalating job demands driven by increasing student health complexity, chronic condition management, and expanding public health responsibilities. Often serving as the sole healthcare professional within a school or distr…
Synthetic data generation framework for quality control automation in gravure printing
Korota Arsène Coulibaly, Mohamed Hamlich, Khalid Hmali, Andrea Trombin · arXiv · Jul 23, 2026
Quality control in printing, particularly in rotogravure printing, still depends on slow, costly, and subjective manual inspection. Automated surface defect detection is critical for maintaining high-quality standards in rotogravure printin…
Future Rendering $\neq$ Future Surface: A Benchmark and Dataset for Dynamic Surface Reconstruction Beyond the Observed Window
Yukun Shi, Minglun Gong · arXiv · Jul 23, 2026
Dynamic-scene reconstruction is almost always evaluated inside the observed time window, yet deployment settings such as AR overlays, robot interaction, and anticipatory planning need the future surface: the geometry at times beyond those c…
SPDCN: Strip-based Deformable Convolutional Network for Steel Surface Defect Segmentation
Zhongming Liu, Bingbing Jiang, Guangxin Wan, Xiang Zou · arXiv · Jul 23, 2026
Steel surface defect segmentation is critical for industrial quality inspection, yet existing methods struggle with elongated, anisotropic defects such as cracks and scratches due to the isotropic receptive fields of standard convolutions a…
Adaptive Identity Anchoring: Closed-Loop Keyframe Placement for Synthetic Paired Supervision in Video Face Swapping
Logan Robbins · arXiv · Jul 23, 2026
Video face swapping has no natural paired supervision: no real footage exists of one person's face performing another person's video. The strongest current answer, DreamID-V's SyncID-Pipe, mints pairs by replacing the identity in exactly tw…
PerceptDrive: Perception Prior World-Action Modeling with Adaptive Expert Routing for End-to-End Autonomous Driving
Yushan Liu, Tianxiong Lv, Bohua Wang, Hangqi Fan et al. · arXiv · Jul 22, 2026
Frozen perception foundation models encode rich geometric, semantic, and dynamic knowledge. Yet narrow conditioning interfaces may attenuate task-relevant cues, while static fusion cannot adjust expert contributions to each scene. We cast t…
Real-Time EEG Cap Electrode Detection for Guided Point-of-Care Placement
William Lehn-Schiøler, Mads Sverker Nilsson, Nicki Skafte Detlefsen · arXiv · Jul 22, 2026
We present a two-stage vision system that detects EEG cap electrodes in a live webcam stream and validates their anatomical placement in real time. A single-class YOLO detector localises electrodes; a geometric stage assigns each detection …
Online Neural Space Time Memory for Dynamic Novel View Synthesis
Baback Elmieh, Lynn Tsai, Zeman Li, Srinivas Kaza et al. · arXiv · Jul 16, 2026
Online novel view synthesis from multi-view streaming videos faces a fundamental trade-off: maintaining a persistent, long-horizon memory to reconstruct temporarily occluded regions while operating under strict real-time constraints. While …
Quantifying Training Membership Information in the Hyperspherical Embedding Geometry of Face Recognition Models
Ünsal Öztürk, Sébastien Marcel · arXiv · Jul 16, 2026
Face recognition models represent each face as an embedding vector on the unit hypersphere by clustering embeddings of the same identity while pushing different identities apart through angular-margin losses. Because these losses act only o…
RoGS: Adaptive Meshgrid Gaussian for Large-Scale Road Surface Mapping
Tianchen Deng, Zhiheng Feng, Wenhua Wu, Ziming Li et al. · arXiv · Jul 16, 2026
Road surface mapping plays a crucial role in autonomous driving, supporting high-definition map generation, lane-level perception, and automatic road annotation. Recent mesh-based road surface reconstruction methods have shown promising res…
JADE-GS: Joint Alternating Deblurring Guided by Events in 3D Gaussian Splatting
Haoyu Fu, Jiafeng Huang, Yuchen Wang, Shengjie Zhao · arXiv · Jul 16, 2026
When a camera moves fast during exposure, blur destroys the intra-exposure motion a 3D model needs to recover the sharp scene, while event cameras capture exactly this signal at microsecond resolution. Turning them into reliable 3D supervis…
DermDepth: Toward Monocular Metric Scale 3D Reconstruction Models for Dermatology
Héctor Carrión, Narges Norouzi · arXiv · Jul 14, 2026
Dermatological practice routinely involves measuring and tracking lesion size, morphology and texture, as critical components of wound or skin cancer screening, monitoring and diagnosis. To accomplish this task, practitioners often image th…
Rank-1 Identity Consensus Predicts Gallery Enrollment in 1:N Face Matching More Accurately than Score Thresholding
Gabriella Pangelinan, Aman Bhatta, Michael C. King, Kevin W. Bowyer · arXiv · Jul 14, 2026
In operational 1:N face identification, a crucial question arises for each probe: is this person enrolled in the gallery or not? The stakes are high and asymmetric. Rejecting a mate-present (MP) probe loses a valid lead; accepting a mate-ab…
Latent-Identity Tuning in Text-to-Image Personalization Models
Daniel Garibi, Ronen Kamenetsky, Hadar Averbuch-Elor, Daniel Cohen-Or et al. · arXiv · Jul 13, 2026
Generating and editing a person's face demands high precision, as even minor modifications can significantly alter a subject's perceived identity. Current personalization and editing methods built on general-purpose text-to-image models, ho…
DiffEEG: A Self-Supervised Denoising Diffusion Model for Learning EEG Generic Representations
Abdulkader Helwan, Lina Abou-Abbas, Hussein El Amouri, Belkacem Chikhaoui et al. · arXiv · Jul 13, 2026
Deep learning for EEG-based seizure detection faces critical challenges: severe annotation scarcity and extreme class imbalance, where ictal events comprise less than 10\% of clinical recordings. We present DiffEEG, a 9.6M-parameter self-su…
What VGGT Knows About Overlap: Probing Geometric Foundation Models for Co-Visibility
Filippo Ziliotto, Luciano Serafini, Lamberto Ballan, Tommaso Campari · arXiv · Jul 10, 2026
A fundamental challenge in 3D reconstruction and robotic localization is co-visibility: determining which image pairs share overlapping visible surfaces, particularly in scenarios with minimal overlap. We demonstrate that VGGT implicitly en…
HumanForge: A Human-Centric Deepfake Video Benchmark with Multi-Agent Forgery Rationales
Wenbo Xu, Zhimin Chen, Xiaojie Liang, Hengrui Liu et al. · arXiv · Jul 9, 2026
Rapid advancements in video diffusion models and temporal editing tools have enabled the generation of highly realistic human-centric videos, posing unprecedented challenges to digital content forensics. Existing benchmarks primarily focus …
Face-trace: Open-Set Attribution and Progressive Discovery of Synthetic Face Generators
Alessia Infantino, Claudio Schiavella, Irene Amerini · arXiv · Jul 8, 2026
Recent advances in generative Artificial Intelligence have made synthetic face images increasingly realistic, creating new challenges for multimedia forensics. Source attribution methods should not only identify the generator of an image wh…
SonoRank: Towards Calibration-Free Real-Time Finger Flexion Detection from Forearm Ultrasound Sequences
Dean Zadok, Alon Wolf, Alex M. Bronstein, Oren Salzman · arXiv · Jul 8, 2026
Powered prosthetic hands are frequently abandoned, largely due to the limited functionality of current devices that rely on surface electromyography (sEMG). Sonomyography (ultrasound) has emerged as a promising alternative, owing to its abi…
Discovering Geometric Biases in 3D Face Reconstruction: A Curvature-Aware Spectral Framework for Fairness Evaluation
Veronika Shilova, Emmanuel Malherbe, Giovanni Palma, Panagiotis-Alexandros Bokaris et al. · arXiv · Jul 8, 2026
3D Morphable Models (3DMMs) remain the standard parametric shape priors for many state-of-the-art 3D face reconstruction algorithms. However, as these models are derived from a finite number of 3D face samples, they inherit the morphologica…
Two-Stage Multi-Modal Fusion with Adaptive Alignment for Action Quality Assessment
Kanglei Zhou, Ruizhi Cai, Xinning Wang, Yijian Zheng et al. · arXiv · Jul 8, 2026
Action Quality Assessment (AQA) aims to evaluate how well a person performs a movement, which is essential in applications such as sports scoring, skill assessment, and healthcare. However, unimodal approaches often struggle to capture subt…
ProxyPose: 6-DoF Pose Tracking via Video-to-Video Translation
Ruihang Zhang, Felix Taubner, Pooja Ravi, Kiriakos N. Kutulakos et al. · arXiv · Jul 7, 2026
Tracking the six-degree-of-freedom (6-DoF) pose of objects and surfaces from monocular video is a long-standing problem in computer vision. To tackle this problem, existing methods require inputs beyond the video itself-such as 3D models, d…
Ink3D: Sculpting 3D Assets with Extremely Complex Textures via Video Generative Models
Yue Han, Chong Li, Zhening Liu, Cong Huang et al. · arXiv · Jul 1, 2026
Recent 3D generative models can synthesize high-quality geometry but often struggle to reproduce intricate textures from reference images, largely due to the scarcity of large-scale 3D training data with rich surface appearance. In contrast…
Linkify: Learning from Interface-Augmented Assembly Graphs
Anushrut Jignasu, Daniele Grandi · arXiv · Jul 1, 2026
We present Linkify, a framework for learning from interface-augmented assembly graphs to enable context-aware part retrieval in mechanical assemblies. While recent generative AI methods for CAD have focused largely on isolated parts or mono…
FaceMoE: Mixture of Experts for Low-Resolution Face Recognition
Kartik Narayan, Vishal M. Patel · arXiv · Jun 30, 2026
Low-resolution face recognition (LR-FR) remains a challenging task due to poor feature extraction and aggregation, as probe images often contain limited identity information resulting from extreme degradations such as blur, occlusion, and l…
Planar-SfM: Camera Pose Estimation via Homography Graph Embeddings
Gabi Pragier, Matan Karklinsky, David Ungarish, Avi Ben-Cohen · arXiv · Jun 30, 2026
Structure from Motion (SfM) systems traditionally struggle with planar scenes, where standard epipolar geometry-based methods become degenerate. Rather than viewing planar surfaces as a limitation, we propose a unified framework that levera…
GROW$^2$: Grounding Which and Where for Robot Tool Use
Yuhong Deng, Yuyao Liu, David Hsu · arXiv · Jun 29, 2026
Can the robot use a plate to cut a cake if no knife is available? Tool use greatly expands robot capabilities, but to use tools creatively beyond their intended functions, the robot faces the challenge of $\textit{open-world affordance grou…
Reweighting Framewise Attention in Video Transformers for Facial Expression Understanding
Seongro Yoon, Donghyeon Cho, Jinsun Park, François Brémond · arXiv · Jun 29, 2026
Understanding facial expressions in videos requires modeling subtle and localized facial dynamics under unconstrained conditions. Although recent Vision Transformer~(ViT)-based video models have shown strong performance through large-scale …
Orca: The World is in Your Mind
Yihao Wang, Yuheng Ji, Mingyu Cao, Yanqing Shen et al. · arXiv · Jun 29, 2026
We introduce Orca, an initial instantiation of a general world foundation model. Orca learns a unified world latent space from multimodal world signals and exposes it through multimodal readout interfaces. Rather than optimizing isolated ne…

Track Face Recognition on Distill AI — start free →

Latest Face Recognition Research Papers

Recent papers

Related topics