9
Federated Learning for Face Recognition via Intra-subject Self-supervised Learning
Hansol Kim (Kookmin University), Hoyeol choi (Kakaobank), Youngjun Kwak (KAKAOBANK)
PDF Poster Video (Right click to download) 

12
CLIP Adaptation by Intra-Modal Overlap Reduction
Alexey Kravets (University of Bath), Vinay P Namboodiri (University of Bath)
PDF Poster Video (Right click to download) 

14
Efficiency-preserving Scene-adaptive Object Detection
Zekun Zhang (State University of New York, Stony Brook), Vu Quang Truong (VinAI Research), Minh Hoai (University of Adelaide)
PDF Poster Video (Right click to download) 

15
Sequential Amodal Segmentation via Cumulative Occlusion Learning
Jiayang Ao (University of Melbourne), Qiuhong Ke (Monash University), Krista A. Ehinger (The University of Melbourne)
PDF Poster Video (Right click to download) 

16
Region-based Entropy Separation for One-shot Test-Time Adaptation
Kodai Kawamura (Korea University), Shunya Yamagami (Tokyo University of Science), Go Irie (Tokyo University of Science)
PDF Poster Video (Right click to download) 

18
MeTTA: Single-View to 3D Textured Mesh Reconstruction with Test-Time Adaptation
Kim Yu-Ji (Pohang University of Science and Technology), Hyunwoo Ha (Pohang University of Science and Technology), Kim Youwang (Pohang University of Science and Technology), Jaeheung Surh (Bucketplace), Hyowon Ha (Bucketplace), Tae-Hyun Oh (POSTECH)
PDF Poster Video (Right click to download) 

19
Few-shot Multispectral Segmentation with Representations Generated by Reinforcement Learning
Dilith Jayakody (University of Moratuwa), Thanuja Ambegoda (University of Moratuwa)
PDF Poster Video (Right click to download) 

22
HDRSplat: Gaussian Splatting for High Dynmaic Range 3D Scene Reconstruction from Raw Images
Shreyas Singh (Fractal Analytics ), Aryan Garg (Department of Computer Science, University of Wisconsin - Madison), Kaushik Mitra (Indian Institute of Technology, Madras)
PDF Poster Video (Right click to download) 

23
Alignment-aware Patch-level Routing for Dynamic Video Frame Interpolation
Ban Chen (Samsung Electronics (China) R&D Center), Xin Jin (Samsung R&D Institute China-Nanjing (SRC-N)), LONG HAI WU (University of Science and Technology of China), Jie Chen (SRCN), Ilhyun Cho (Samsung), Cheul-hee Hahm (Samsung)
PDF Poster Video (Right click to download) 

25
AR-TTA: A Simple Method for Real-World Continual Test-Time Adaptation
Damian Sójka (Technical University of Poznan), Bartłomiej Twardowski (IDEAS NCBR), Tomasz Trzcinski (Warsaw University of Technology), Sebastian Cygert (IDEAS NCBR)
PDF Poster Video (Right click to download) 

26
Improving Depth Gradient Continuity in Transformers: A Comparative Study on Monocular Depth Estimation with CNN
Jiawei Yao (University of Washington), Tong Wu (Amazon), Xiaofeng Zhang (Shanghai Jiaotong University)
PDF Poster 

28
SciPostLayout: A Dataset for Layout Analysis and Layout Generation of Scientific Posters
Shohei Tanaka (OMRON SINIC X), Hao Wang (Waseda University), Yoshitaka Ushiku (OMRON SINIC X)
PDF Poster Video (Right click to download) 

31
COSMo: CLIP Talks on Open-Set Multi-Target Domain Adaptation
Munish Monga (Indian Institute of Technology, Bombay), Sachin Kumar Giroh (Indian Institute of Technology, Bombay), Ankit Jha (The LNM Institute of Information Technology), Mainak Singha (Indian Institute of Technology, Bombay), Biplab Banerjee (Indian Institute of Technology, Bombay, Dhirubhai Ambani Institute Of Information and Communication Technology), Jocelyn Chanussot (INRIA)
PDF Poster Video (Right click to download) 

32
No Captions, No Problem: Captionless 3D-CLIP Alignment with Hard Negatives via CLIP Knowledge and LLMs
Cristian Sbrolli (Polytechnic Institute of Milan), Matteo Matteucci (Politecnico di Milano)
PDF Poster Video (Right click to download) 

33
Self-Supervised Real-World Denoising by Jointly Learning Visible and Invisible Noise
Shaoyu Wang (Dalian Martime University), Changze Zhou (Dalian Maritime University), Bolin Song (Dalian Martime University), Yiyang Wang (Dalian Martime University)
PDF Poster Video (Right click to download) 

34
TalkLoRA: Low-Rank Adaptation for Speech-Driven Animation
Jack Saunders (University of Bath), Vinay P Namboodiri (University of Bath)
PDF Poster 

37
DRAFT: Direct Radiance Fields Editing with Composable Operations
Zhihan Cai (Tsinghua University, Tsinghua University), Kailu Wu (Tsinghua University, Tsinghua University), Dapeng Cao (Xi'an Jiaotong University), Feng Chen (University of Hong Kong), Kaisheng Ma (Institute for Interdisciplinary Information Sciences (IIIS), Tsinghua University)
PDF Poster Video (Right click to download) 

38
Linear Calibration Approach to Knowledge-free Group Robust Classification
Ryota Ishizaki (Tokyo University of Science), Shunya Yamagami (Tokyo University of Science), Yuta Goto (Tokyo University of Science), Go Irie (Tokyo University of Science)
PDF Poster Video (Right click to download) 

39
HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene Reconstruction
Haoyu Zhao (Wuhan University), Xingyue Zhao (Xi'an Jiaotong University), Lingting Zhu (The University of Hong Kong), Weixi Zheng (Wuhan University), Yongchao Xu (Wuhan University)
PDF Poster Video (Right click to download) 

41
Local Implicit Wavelet Transformer for Arbitrary-Scale Super-Resolution
Minghong Duan (Fudan University), Linhao Qu (Fudan University), Shaolei Liu (Shanghai Institute of Microsystem and Information Technology), Manning Wang (Fudan University)
PDF Poster Video (Right click to download) 

42
Spatial-Temporal NAS for Fast Surgical Segmentation
Matthew Lee (Medtronic), Felix John Samuel Bragman (Medtronic), Ricardo Sanchez-Matilla (Medtronic), Imanol Luengo (Medtronic), Danail Stoyanov (University College London)
PDF Poster 

43
Learning to Segment Publicly Accessible Green Spaces with Visual and Semantic Data
Jian Gao (Queen's University Belfast), Niall McLaughlin (The Queen's University Belfast), Joanna Sara Valson (The Queen's University Belfast), Neil Anderson (The Queen's University Belfast), Ruth Hunter (The Queen's University Belfast)
PDF Poster Video (Right click to download) 

45
D³Nav: Data-Driven Driving Agents for Autonomous Vehicles in Unstructured Traffic
Aditya Nalgunda Ganesh (Purdue University), Gowri Srinivasa (PES University, Bengaluru, India)
PDF Poster Video (Right click to download) 

46
FFR-UNet: Feature Filter-Refinement UNet for Medical Image Segmentation
Weixin Xu (Beihang University)
PDF Poster Video (Right click to download) 

47
Group Activity Recognition via Spatio-Temporal Reasoning of Key Instances
Haoting He (Xi'an Jiaotong University), Yaochen Li (Xi'an Jiaotong University), Yutong Wang (Xi'an Jiaotong University), Gaojie Li (Xi'an Jiaotong University), Wei Guo (Xi'an Jiaotong University), Runlin Zou (Xi'an Jiaotong University)
PDF Poster Video (Right click to download) 

53
NCA-Morph: Medical Image Registration with Neural Cellular Automata
Amin Ranem (TU Darmstadt), John Kalkhof (TU Darmstadt), Anirban Mukhopadhyay (TU Darmstadt)
PDF Poster Video (Right click to download) 

54
InterroGate: Learning to Share, Specialize, and Prune Representations for Multi-task Learning
Babak Ehteshami Bejnordi (QualComm), Gaurav Kumar (QualComm), Amelie Royer (Kyutai), Christos Louizos (QualComm), Tijmen Blankevoort (Facebook), Mohsen Ghafoorian (Qualcomm)
PDF Poster Video (Right click to download) 

60
Advancing Medical Image Segmentation: Morphology-Driven Learning with Diffusion Transformer
Sungmin Kang (Dongguk University), Jaeha Song (Dongguk University), Jihie Kim (Dongguk University)
PDF Poster Video (Right click to download) 

64
Multi-Modal Information Bottleneck Attribution with Cross-Attention Guidance
Pauline Bourigault (Imperial College London), Emmanuelle Bourigault (University of Oxford), Danilo Mandic (Imperial College London)
PDF Poster Video (Right click to download) 

66
Noise-Tolerant Few-Shot Unsupervised Adapter for Vision-Language Models
Eman Ali (Mohamed bin Zayed University of Artificial Intelligence), Muhammad Haris Khan (Mohamed Bin Zayed University of Artificial Intelligence)
PDF Poster Video (Right click to download) 

70
Advancing Anomaly Detection: The IDW dataset and MC algorithm
Alexander D. J. Taylor (University of Bath), Jonathan James Morrison (Rolls-Royce Defence Aerospace), Phillip Tregidgo (University of Bristol), Neill D. F. Campbell (University of Bath)
PDF Poster 

74
ControlDreamer: Blending Geometry and Style in Text-to-3D
Yeongtak Oh (Seoul National University), Jooyoung Choi (Seoul National University), Yongsung Kim (Seoul National University), Minjun Park (Seoul National University), Chaehun Shin (Seoul National University), Sungroh Yoon (Seoul National University)
PDF Poster Video (Right click to download) 

76
SagaGAN: Style Applied using Gram matrix Attribution based on StarGAN v2
Yongseon Yoo (Hanyang University), Seonggyu Kim (Hanyang University), Jong-Min Lee (Hanyang University)
PDF Poster Video (Right click to download) 

77
PT43D: A Probabilistic Transformer for Generating 3D Shapes from Single Highly-Ambiguous RGB Images
Yiheng Xiong (Technical University of Munichn), Angela Dai (Technical University of Munich)
PDF Poster Video (Right click to download) 

85
Textual Attention RPN for Open-Vocabulary Object Detection
Tae-Min Choi (Korea Institute of Science and Technology), Inug Yoon (Korea Advanced Institute of Science & Technology), Jong-Hwan Kim (Korea Advanced Institute of Science and Technology), Juyoun Park (Korea Institute of Science and Technology (KIST) )
PDF Poster Video (Right click to download) 

100
Painterly Image Harmonization via Bi-Transformation with Dynamic Kernels
Zhangliang Sun (Tsinghua University, Tsinghua University), Hui Zhang (Tsinghua University)
PDF Poster Video (Right click to download) 

101
Interactive Image Segmentation with Temporal Information Augmented
Qiaoqiao Wei (School of Software, Tsinghua University), Hui Zhang (Tsinghua University), Jun-Hai Yong (Tsinghua University, Tsinghua University)
PDF Poster Video (Right click to download) 

102
Distribution-Aware Calibration for Object Detection with Noisy Bounding Boxes
Donghao Zhou (The Chinese University of Hong Kong), Jialin Li (Tencent YouTu Lab), Jinpeng Li (The Chinese University of Hong Kong), Jiancheng Huang (Chinese Academy of Sciences), Qiang Nie (The Hong Kong University of Science and Technology), Yong Liu (Tencent Youtu Lab), Bin-Bin Gao (Tencent), Qiong Wang (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Chinese Academy of Sciences), Pheng-Ann Heng (The Chinese University of Hong Kong), Guangyong Chen (Zhejiang Lab)
PDF Poster 

103
Prompting Diffusion Representations for Cross-Domain Semantic Segmentation
Rui Gong (Amazon), Martin Danelljan (ETH Zurich), Han Sun (EPFL - EPF Lausanne), Julio Delgado Mangas (Meta, Reality labs), Nikolay Marin (Amazon), Luc Van Gool (INSAIT - Sofia Un.)
PDF Poster Video (Right click to download) 

104
MMPrune4U: Regularizing Multimodal Feature Distortion in Weight Pruning for Deep Neural Network Compression
Sudip Das (Valeo), Kaixin Xu (I2R, A*STAR), Nushrat Hussain (Indian Statistical Institute), Ziyuan Zhao (I2R, A*STAR), Arindam Das (Valeo), Weisi Lin (Nanyang Technological University), Ujjwal Bhattacharya (Indian Statistical Institute, Dhirubhai Ambani Institute Of Information and Communication Technology)
PDF Poster Video (Right click to download) 

108
MoManifold: Learning to Measure 3D Human Motion via Decoupled Joint Acceleration Manifolds
Ziqiang Dang (Alibaba Group), Tianxing Fan (Zhejiang University), Boming Zhao (Zhejiang University), Xujie Shen (Zhejiang University), 王 磊 (Guangdong OPPO Mobile Telecommunications Corp.,Ltd.), Guofeng Zhang (Zhejiang University), Zhaopeng Cui (Zhejiang University)
PDF Poster Video (Right click to download) 

111
Projected Stochastic Gradient Descent with Quantum Annealed Binary Gradients
Maximilian Krahn (Aalto University), Michele Sasdelli (The University of Adelaide), Frances Fengyi Yang (University of Adelaide), Vladislav Golyanik (Saarland Informatics Campus, Max-Planck Institute for Informatics), Juho Kannala (Aalto University), Tat-Jun Chin (The University of Adelaide), Tolga Birdal (Imperial College London)
PDF Poster Video (Right click to download) 

113
Text Removal In E-Commerce Images: A Comparison Of Inpainting Methods
Hiya Roy (Rakuten Institute of Technology, The University of Tokyo), Bjorn Stenger (Rakuten Group Inc.)
PDF Poster 

114
Key-point Guided Deformable Image Manipulation Using Diffusion Model
Seok-Hwan Oh (Korea Advanced Institute of Science & Technology), Guil Jung (KAIST), Myeong-Gee Kim (Barreleye, inc.), Sang-yun Kim (KAIST), Young-Min Kim (KAIST), hyeonjik lee (KAIST), Hyuksool Kwon (Seoul National University), Hyeonmin Bae (Korea Advanced Institute of Science and Technology)
PDF Poster Video (Right click to download) 

115
Multi-modal Crowd Counting via Modal Emulation
Chenhao Wang (Harbin Institute of Technology), Xiaopeng Hong (Harbin Institute of Technology), Zhiheng Ma (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Chinese Academy of Sciences), Yupeng Wei (Harbin Institute of Technology), Yabin Wang (Xi'an Jiaotong University), Xiaopeng Fan (Harbin Institute of Technology)
PDF Poster Video (Right click to download) 

133
MonoGS++: Fast and Accurate Monocular RGB Gaussian SLAM
Ren-Wu Li (AMD), Wenjing Ke (AMD), Dong Li (AMD), Lu Tian (AMD), Emad Barsoum (AMD)
PDF Poster 

135
Acoustic-based 3D Human Pose Estimation Robust to Human Position
Yusuke Oumi (Keio University), Yuto Shibata (Keio University), Go Irie (Tokyo University of Science), Akisato Kimura (NTT Corporation), Yoshimitsu Aoki (Keio University), Mariko Isogawa (Keio University)
PDF Poster Video (Right click to download) 

136
PhysFlow: Skin tone transfer for remote heart rate estimation through conditional normalizing flows
Joaquim Comas Martinez (Universitat Pompeu Fabra), Antonia Alomar (Universitat Pompeu Fabra), Adria Ruiz (CSIC-UPC), Federico Sukno (Pompeu Fabra University)
PDF Poster Video (Right click to download) 

137
InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular Depth
Cho-Ying Wu (Bosch), Quankai Gao (University of Southern California), Chin-Cheng Hsu (Resemble AI), Te-Lin Wu (Character.AI), Jing-Wen Chen (University of Southern California), Ulrich Neumann (University of Southern California)
PDF Poster Video (Right click to download) 

140
Scalable Frame Sampling for Video Classification: A Semi-Optimal Policy Approach with Reduced Search Space
Junho Lee (Seoul National University), Jeongwoo Shin (Seoul National University), Seung Woo Ko (LG AI Research), Seongsu Ha (Twelve Labs), Joonseok Lee (Seoul National University)
PDF Poster Video (Right click to download) 

142
Recovering Global Data Distribution Locally in Federated Learning
Ziyu Yao (Peking University)
PDF Poster Video (Right click to download) 

145
Privacy-preserving datasets by capturing feature distributions with Conditional VAEs
Francesco Di Salvo (University of Bamberg), David Tafler (University of Bamberg), Sebastian Doerrich (University of Bamberg), Christian Ledig (University of Bamberg)
PDF Poster Video (Right click to download) 

147
MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion
Angel Villar-Corrales (University of Bonn), Moritz Austermann (Rheinische Friedrich-Wilhelms Universität Bonn), Sven Behnke (University of Bonn)
PDF Poster Video (Right click to download) 

150
AISE: Adaptive Input Sampling for Explanation of Black-box Models
Evgeny Tsykunov (Intel Corporation), Wonju Lee (Intel Corporation), Minje Park (Intel)
PDF Poster 

152
Retinex-Inspired Cooperative Game Through Multi-Level Feature Fusion for Robust, Universal Image Enhancement
Ruiqi Mao (Northwest Polytechnical University Xi'an), Rongxin Cui (Northwestern Polytechnical University Xi'an)
PDF Poster Video (Right click to download) 

164
Synthetic-to-Real Domain Generalized Semantic Segmentation for 3D Indoor Point Clouds
Yuyang Zhao (National University of Singapore), Na Zhao (Singapore University of Technology and Design), Gim Hee Lee (National University of Singapore)
PDF Poster 

165
Learning Object Placement via Convolution Scoring Attention
Yibin Wang (Fudan University), Yuchao Feng (Westlake University), Jianwei Zheng (Zhejiang University of Technology)
PDF 

166
Syn-to-Real Unsupervised Domain Adaptation for Indoor 3D Object Detection
Yunsong Wang (National University of Singapore), Na Zhao (Singapore University of Technology and Design), Gim Hee Lee (National University of Singapore)
PDF Poster Video (Right click to download) 

168
Topology-preserving Adversarial Training for Alleviating Natural Accuracy Degradation
Xiaoyue Mi (University of the Chinese Academy of Sciences), Fan Tang (Institute of Computing Technology, CAS), Yepeng Weng (Lenovo Group Limited), Danding Wang (Institute of Computing Technology, Chinese Academy of Sciences), Juan Cao (Institute of Computing Technology, Chinese Academy of Sciences), Sheng Tang (Institute of Computing Technology, Chinese Academy of Sciences), Peng Li (Tsinghua University), Yang Liu (Tsinghua University)
PDF Poster 

180
JEAN: Joint Expression and Audio-guided NeRF-based Talking Face Generation
Sai Tanmay Reddy Chakkera (State University of New York at Stony Brook), Aggelina Chatziagapi (Stony Brook University), Dimitris Samaras (Stony Brook University)
PDF Poster Video (Right click to download) 

183
Hierarchical Prompt Learning for Scene Graph Generation
Xuhan Zhu (University of Chinese Academy of Sciences), Yifei Xing (Chinese Academy of Sciences), Ruiping Wang (Institute of Computing Technology, Chinese Academy of Sciences), Yaowei Wang (Harbin Institute of Technology, Shenzhen), Xiangyuan Lan (Peng Cheng Laboratory)
PDF Poster Video (Right click to download) 

184
Reclaiming Residual Knowledge: A Novel Paradigm to Low-Bit Quantization
Roisin Luo (University of Galway), Alexandru Drimbarean (FotoNation), James McDermott (University of Galway), Colm O'Riordan (University of Galway)
PDF Poster Video (Right click to download) 

185
Motion Avatar: Generate Human and Animal Avatars with Arbitrary Motion
Zeyu Zhang (The Australian National University), Yiran Wang (University of Sydney, University of Sydney), Biao Wu (University of Technology Sydney), Shuo Chen (Monash University), Zhiyuan Zhang (University of Adelaide), SHIYA HUANG (University of Adelaide), Wenbo Zhang (University of Adelaide), Meng Fang (University of Liverpool), Ling Chen (University of Technology Sydney), Yang Zhao (La Trobe University)
PDF Poster Video (Right click to download) 

188
A self-supervised and adversarial approach to hyperspectral demosaicking and RGB reconstruction in surgical imaging
Peichao Li (King's College London), Oscar MacCormac (King's College London), Jonathan Shapey (King's College London), Tom Vercauteren (King's College London)
PDF Poster Video (Right click to download) 

199
A Revisit to the Decoder for Camouflaged Object Detection
Seung Woo Ko (LG AI Research), Joopyo Hong (Seoul National University), Suyoung Kim (Seoul National University), Seungjai Bang (Seoul National University), Sungzoon Cho (Seoul National University), Nojun Kwak (Seoul National University), Hyung-Sin Kim (Seoul National University), Joonseok Lee (Seoul National University)
PDF Poster Video (Right click to download) 

200
Towards Generative Class Prompt Learning for Fine-grained Visual Recognition
Soumitri Chattopadhyay (University of North Carolina at Chapel Hill), Sanket Biswas (Computer Vision Center, Universitat Autonoma de Barcelona), Emanuele Vivoli (Universidad Autonoma de Barcelona ), Josep Llados (Computer Vision Center, Universitat Autonoma de Barcelona)
PDF Poster Video (Right click to download) 

201
Infrared and Visible Image Fusion Using Multi-level Adaptive Fractional Differential
Kang Zhang (Nanjing University of Science and Technology), Xinnian Guo (Suqian University)
PDF Poster Video (Right click to download) 

203
S³-Match: Common-View Aligned Image Matching via Self-Supervised Keypoint Selection
Shizhen Li (Xi'an Jiaotong University), Jingcheng Liu (Xi'an Jiaotong University), Jianwu Fang (Xi'an Jiaotong University), DeZheng Gao (Xi'an Jiaotong University), Jianru Xue (Xi'an Jiaotong University)
PDF Poster Video (Right click to download) 

205
From Black-box to Label-only: a Plug-and-Play Attack Network for Model Inversion
Huan Bao (Jinan University), Kaimin Wei (Jinan University), Yao Chen (Jinan University), Hanting Hou (Jinan University), Jinpeng Chen (Beijing University of Post and Telecommunication), Yongdong WU (Jinan University)
PDF Poster Video (Right click to download) 

207
Feature Splatting for Better Novel View Synthesis with Low Overlap
Tomas Berriel Martins (Universidad de Zaragoza), Javier Civera (Universidad de Zaragoza)
PDF Poster Video (Right click to download) 

210
BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation
Kieran Ryan Saunders (Aston University), Luis J Manso (Aston University), George Vogiatzis (Aston University)
PDF Poster Video (Right click to download) 

211
Align-DETR: Enhancing End-to-end Object Detection with Aligned Loss
Zhi Cai (Beijing University of Aeronautics and Astronautics), Songtao Liu (Megvii Technology Inc.), Guodong Wang (Beijing University of Aeronautics and Astronautics), Zeming Li (BYTEDANCE), Zheng Ge (Megvii Technology Inc.), Xiangyu Zhang (MEGVII Technology), Di Huang (Beihang University)
PDF Poster Video (Right click to download) 

212
Mixstyle-Entropy: Whole Process Domain Generalization with Causal Intervention and Perturbation
Luyao Tang (Xiamen University), Yuxuan Yuan (Xiamen University), Chaoqi Chen (The University of Hong Kong), Xinghao Ding (Xiamen University), Yue Huang (Xiamen University)
PDF Poster Video (Right click to download) 

213
Enabling Local Editing in Diffusion Models by Joint and Individual Component Analysis
Theodoros Kouzelis (National Technical University of Athens), Emmanouil Plitsis (University of Athens), Mihalis Nicolaou (The Cyprus Institute), Yannis Panagakis (National and Kapodistrian University of Athens)
PDF Poster Video (Right click to download) 

215
AttEntropy: On the Generalization Ability of Supervised Semantic Segmentation Transformers to New Objects in New Domains
Krzysztof Baron-Lis (Waabi), Matthias Rottmann (University of Wuppertal), Annika Mütze (Bergische Universität Wuppertal), Sina Honari (Samsung), Pascal Fua (EPFL - EPF Lausanne), Mathieu Salzmann (Swiss Data Science Center)
PDF Poster Video (Right click to download) 

216
Erasing Concepts from Text-to-Image Diffusion Models with Few-shot Unlearning
Masane Fuchi (Meiji University), Tomohiro Takagi (Meiji University)
PDF Poster Video (Right click to download) 

217
GeoFormer: A Multi-Polygon Segmentation Transformer
Maxim Khomiakov (Technical University of Denmark), Michael Riis Andersen (Technical University of Denmark), Jes Frellsen (Technical University of Denmark)
PDF Poster 

218
RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance
Avideep Mukherjee (Indian Institute of Technology Kanpur), Soumya Banerjee (IIT Kanpur, IIT Kanpur), Piyush Rai (IIT Kanpur, IIT Kanpur), Vinay P Namboodiri (University of Bath)
PDF Poster Video (Right click to download) 

223
AUPIMO: Redefining Anomaly Localization Benchmarks with High Speed and Low Tolerance
João P. C. Bertoldo (PSL University), Dick Ameln (Intel), Ashwin Vaidya (Intel), Samet Akcay (Intel)
PDF Poster Video (Right click to download) 

227
Cost-Sensitive Learning for Long-Tailed Temporal Action Segmentation
Zhanzhong Pang (National University of Singapore), Fadime Sener (Meta), Shrinivas Ramasubramanian (Fujitsu Research and Development Center), Angela Yao (National University of Singapore)
PDF Poster Video (Right click to download) 

228
Learning Scene-Goal-Aware Motion Representation for Trajectory Prediction
Ziyang Ren (Xi'an Jiaotong University), Ping Wei (Xi'an Jiaotong University), Haowen Tang (Xi'an Jiaotong University), Huan Li (Xi'an Jiaotong University), Jin Yang (Xi'an Jiaotong University)
PDF Poster Video (Right click to download) 

240
SAM Helps SSL: Mask-guided Attention Bias for Self-supervised Learning
Kensuke Taguchi (Kyocera Corporation), Takehiko Kawai (Kyocera Corporation), Wataru Imaeda (Kyocera Corporation), Hironobu Fujiyoshi (DENSO CORPORATION)
PDF Poster Video (Right click to download) 

245
Enhancing 3D Hand Pose Estimation via Dense Ordinal Regression Network
Yamin Mao (Samsung), Zhihua Liu (Samsung Research Center, Beijing), Weiming Li (Samsung), SoonYong Cho (Samsung), Qiang Wang (Samsung), Xiaoshuai Hao (Beijing Academy of Artificial Intelligence(BAAl) )
PDF Poster Video (Right click to download) 

249
Transferable Learned Image Compression-Resistant Adversarial Perturbations
Yang Sui (Rice University), Zhuohang Li (Vanderbilt University), Ding Ding (Tencent Media Lab), Xiang Pan (Tencent), Xiaozhong Xu (Tencent Media Lab), Shan Liu (Tencent Media Lab), Zhenzhong Chen (Wuhan University)
PDF Poster Video (Right click to download) 

250
Deep Unfolding Network with Spatial-spectral Perception Enhanced for Pan-sharpening
Mengjiao Zhao (Zhejiang University), Mengting Ma (Zhejiang University), Xiangdong Li (Zhejiang University), Ao Gao (Zhejiang University), Siyang Song (University of Exeter), Wei Zhang (Zhejiang University)
PDF Poster Video (Right click to download) 

256
IncreLM: Incremental 3D Line Mapping
Xulong Bai (Institute of automation, Chinese academy of science, Chinese Academy of Sciences), Hainan Cui (Chinese Academy of Sciences), Shuhan Shen (Institute of automation, Chinese academy of science)
PDF Poster Video (Right click to download) 

257
Motion Tracking with Rotated Bounding Boxes on Overhead Fisheye Imagery
Jordan Lam (Zhejiang University)
PDF Poster 

262
Toward Highly Efficient Semantic-Guided Machine Vision for Low-Light Object Detection
Xin Feng (Chongqing University of Technology), Junxian Zeng (Chongqing University of Technology), Siping Wang (Chongqing University of Technology), Zhenwei He (Chongqing University of Technology)
PDF Poster Video (Right click to download) 

263
Improving Object Detection via Local-global Contrastive Learning
Danai Triantafyllidou (Huawei Technologies Ltd.), Sarah Parisot (Huawei Technologies Ltd.), Ales Leonardis (University of Birmingham), Steven McDonagh (University of Edinburgh)
PDF Poster Video (Right click to download) 

267
Depth-Guided Privacy-Preserving Visual Localization Using 3D Sphere Clouds
Heejoon Moon (Hanyang University), Jongwoo Lee (Hanyang University), Jeonggon Kim (Hanyang University), Je Hyeong Hong (Hanyang University)
PDF Poster Video (Right click to download) 

287
A Super-pixel-based Approach to the Stable Interpretation of Neural Networks
Shizhan Gong (the Chinese University of Hong Kong), Jingwei Zhang (The Chinese University of Hong Kong), Qi Dou (The Chinese University of Hong Kong), Farzan Farnia (The Chinese University of Hong Kong)
PDF Poster Video (Right click to download) 

288
PawFACS: Leveraging Semi-Supervised Learning for Pet Facial Action Recognition
Anandavardhan Hegde (Samsung), Sudha Velusamy (Samsung), Narayan Kothari (Samsung), Aman Bahuguna (Samsung), Apnesh Rawat (National Institute of Technology Delhi), Hema Sathiamurthy (Indian Institute of Technology, Madras, Dhirubhai Ambani Institute Of Information and Communication Technology), Ankit Raja (Galgotias University )
PDF Poster Video (Right click to download) 

290
Are Sparse Neural Networks Better Hard Sample Learners?
Qiao Xiao (Eindhoven University of Technology), Boqian Wu (University of Twente), Lu Yin (University of Surrey), Christopher Neil Gadzinski (University of Luxemburg), Tianjin Huang (University of Exeter), Mykola Pechenizkiy (Eindhoven University of Technology), Decebal Constantin Mocanu (University of Luxemburg)
PDF Poster Video (Right click to download) 

295
MxT: Mamba x Transformer for Image Inpainting
Shuang Chen (Durham University), Amir Atapour-Abarghouei (Durham University), Haozheng Zhang (Durham University), Hubert P. H. Shum (Durham University)
PDF Poster Video (Right click to download) 

297
Generalizing Teacher Networks for Effective Knowledge Distillation Across Student Architectures
Kuluhan Binici (National University of Singapore), Weiming Wu (National University of Singapore), Tulika Mitra (National University of Singapore)
PDF Poster Video (Right click to download) 

299
RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields
Mihnea-Bogdan Jurca (Vrije Universiteit Brussel), Remco Royen (Vrije Universiteit Brussel), Ion Giosan (Technical University of Cluj-Napoca), Adrian Munteanu (Vrije Universiteit Brussel)
PDF Poster 

303
MixMask: Revisiting Masking Strategy for Siamese ConvNets
Kirill Vishniakov (M42), Eric Xing (Mohamed bin Zayed Univeristy of AI), Zhiqiang Shen (Mohamed bin Zayed University of Artificial Intelligence)
PDF Poster Video (Right click to download) 

304
Interpretable Representation Learning from Videos using Nonlinear Priors
Marian Longa (University of Oxford), Joao F. Henriques (University of Oxford)
PDF Poster Video (Right click to download) 

305
PEEKABOO: Hiding Parts of an Image for Unsupervised Object Localization
Hasib Zunair (Concordia University), Abdessamad Ben Hamza (Concordia University)
PDF Poster Video (Right click to download) 

307
Discovering an Image-Adaptive Coordinate System for Photography Processing
Ziteng Cui (The University of Tokyo), Lin Gu (RIKEN), Tatsuya Harada (RIKEN)
PDF 

308
Effective Message Hiding with Order-Preserving Mechanisms
Gao Yu (University of Queensland), Xuchong QIU (Bosch), Zihan Ye (Xi'an Jiaotong-Liverpool University)
PDF Poster Video (Right click to download) 

317
EIANet: A Novel Domain Adaptation Approach to Maximize Class Distinction with Neural Collapse Principles
Zicheng Pan (Griffith University), Xiaohan Yu (Macquarie University), Yongsheng Gao (Griffith University)
PDF Poster Video (Right click to download) 

318
Mumpy: Multilateral Temporal-view Pyramid Transformer for Video Inpainting Detection
Ying Zhang (Ocean University of China), Yuezun Li (Ocean University of China), Bo Peng (Institute of automation, Chinese academy of science, Chinese Academy of Sciences), Jiaran Zhou (Ocean University of China), Huiyu Zhou (University of Leicester), Junyu Dong (Ocean University of China)
PDF Poster Video (Right click to download) 

319
Annotation by Clicks: A Point-Supervised Contrastive Variance Method for Medical Semantic Segmentation
Qing En (Carleton University), Yuhong Guo (Carleton University)
PDF Poster Video (Right click to download) 

323
Complete the Feature Space: Diffusion-Based Fictional ID Generation for Face Recognition
Myeong-Yeon Yi (Seoul National University), DongJae Lee (KAIST), Naeun Ko (Naver corporation), Yonghyun Jeong (NAVER), Sang-goo Lee (Seoul National University), Seunggyu Chang (NAVER Cloud)
PDF Poster Video (Right click to download) 

328
DisCoM-KD: Cross-Modal Knowledge Distillation via Disentanglement Representation and Adversarial Learning
Dino Ienco (National Institute for Agriculture, Environment and Food), Cassio Fraga Dantas (INRAE)
PDF Poster 

329
Uni-Mlip: Unified Self-Supervision for Medical Vision Language Pre-training
Ameera Ali Bawazir (Technology Innovation Institute ), Kebin Wu (Technology Innovation Institute), Wenbin LI (Technology Innovation Institute)
PDF Poster Video (Right click to download) 

330
Towards Better Zero-Shot Anomaly Detection under Distribution Shift with CLIP
Jiyao Gao (Sichuan University), Chengxin He (Sichuan University), Lei Duan (Sichuan University), Jie Zuo (Sichuan University)
PDF Poster Video (Right click to download) 

335
SignVTCL: Multi-Modal Continuous Sign Language Recognition Enhanced by Visual-Textual Contrastive Learning
Hao Chen (Department of Computer Science and Engineering, The Chinese University of Hong Kong), Jiaze Wang (The Chinese University of Hong Kong), Ziyu Guo (Department of Computer Science and Engineering, The Chinese University of Hong Kong), Jinpeng Li (The Chinese University of Hong Kong), Donghao Zhou (The Chinese University of Hong Kong), Bian Wu (Zhejiang University), Chenyong Guan (Gudsen Technology Co. Ltd), Guangyong Chen (Zhejiang Lab), Pheng-Ann Heng (The Chinese University of Hong Kong)
PDF Poster Video (Right click to download) 

339
FastForensics: Efficient Two-Stream Design for Real-Time Image Manipulation Detection
zhangyangxiang (Ocean University of China), Yuezun Li (Ocean University of China), Ao Luo (Southwest Jiaotong University), Jiaran Zhou (Ocean University of China), Junyu Dong (Ocean University of China)
PDF Poster Video (Right click to download) 

342
Unsupervised Domain Adaptation for Tubular Structure Segmentation Across Different Anatomical Sources
Yuxiang An (University of Sydney), Dongnan Liu (University of Sydney), Weidong Cai (University of Sydney)
PDF Poster Video (Right click to download) 

346
Backdoor Defense through Self-Supervised and Generative Learning
Ivan Sabolic (University of Zagreb), Ivan Grubišić (University of Zagreb), Siniša Šegvić (University of Zagreb)
PDF Poster Video (Right click to download) 

352
DiffusedWrinkles: A Diffusion-Based Model for Data-Driven Garment Animation
Raquel Vidaurre (Universidad Rey Juan Carlos), Elena Garces (Adobe Systems), Dan Casas (Universidad Rey Juan Carlos)
PDF Poster Video (Right click to download) 

358
Trimming the Fat: Efficient Compression of 3D Gaussian Splats through Pruning
Muhammad Salman Ali (Kyung Hee University), Maryam Qamar (Kyung Hee University), Sung-Ho Bae (Kyung Hee University), Enzo Tartaglione (Institut Polytechnique de Paris)
PDF Poster Video (Right click to download) 

361
Seg-HGNN: Unsupervised and Light-Weight Image Segmentation with Hyperbolic Graph Neural Networks
Debjyoti Mondal (Samsung), Rahul Mishra (Samsung), Chandan Kumar Pandey (Samsung)
PDF Poster Video (Right click to download) 

362
Into the Fog: Evaluating Robustness of Multiple Object Tracking
Nadezda Kirillova (Technische Universität Graz), Muhammad Jehanzeb Mirza (Massachusetts Institute of Technology), Horst Bischof (Graz University of Technology), Horst Possegger (Graz University of Technology)
PDF Poster Video (Right click to download) 

365
Anchor-Based Masked Generative Distillation for Pixel-Level Prediction Tasks
Xie Yu (Beijing University of Aeronautics and Astronautics), Wentao Zhang (Beijing University of Aeronautics and Astronautics)
PDF Poster Video (Right click to download) 

369
Benchmarking and Optimizing Federated Learning with Hardware-related Metrics
Kai Pan (Institute of Computing Technology, Chinese Academy of Sciences), Yapeng Tian (University of Texas at Dallas), Yinhe Han (Institute of Computing Technology, Chinese Academy of Sciences), Yiming Gan (Institute of Computing Technology, Chinese Academy of Sciences)
PDF Poster Video (Right click to download) 

374
Text-Guided Mixup Towards Long-Tailed Image Categorization
Richard Franklin (University of Washington), Jiawei Yao (University of Washington), Deyang Zhong (University of Washington), Qi Qian (Zoom), Juhua Hu (University of Washington)
PDF Poster Video (Right click to download) 

375
A Novel Divide and Merge Approach for Improved Classification of Functional Data
wei zhao (University of Manchester), Xiao-Jun Zeng (University of Manchester), Chengdong shi (University of Manchester), Ching-Hsun Tseng (University of Manchester), Yue Chang (University of Manchester)
PDF Poster Video (Right click to download) 

384
Few-Shot Classification of Interactive Activities of Daily Living (InteractADL)
Zane Durante (Stanford University), Robathan Harries (Stanford University), Edward Vendrow (Massachusetts Institute of Technology), Zelun Luo (Stanford University), Yuta Kyuragi (Panasonic R&D Company of America), Kazuki Kozuka (Panasonic Corporation), Li Fei-Fei (Stanford University), Ehsan Adeli (Stanford University)
PDF Poster 

388
ACIL: Active Class Incremental Learning for Image Classification
Aditya Bhattacharya (Florida State University), Debanjan Goswami (Florida State University), Shayok Chakraborty (Florida State University)
PDF Poster Video (Right click to download) 

391
PatchRot: Self-Supervised Training of Vision Transformers by Rotation Prediction
Sachin Chhabra (Arizona State University), Hemanth Venkateswara (Georgia State University), Baoxin Li (Arizona State University)
PDF Poster Video (Right click to download) 

392
Label Smoothing++: Enhanced Label Regularization for Training Neural Networks
Sachin Chhabra (Arizona State University), Hemanth Venkateswara (Georgia State University), Baoxin Li (Arizona State University)
PDF Poster Video (Right click to download) 

401
Decoupling Forgery Semantics for Generalizable Deepfake Detection
Wei Ye (Nanchang University), Xinan He (Nanchang University), Feng Ding (Nanchang University)
PDF Poster Video (Right click to download) 

406
When Text and Images Don't Mix: Bias-Correcting Language-Image Similarity Scores for Anomaly Detection
Adam Goodge (A*STAR), Bryan Hooi (National University of Singapore), Wee Siong Ng (Institute for Infocomm Research, A*STAR)
PDF Poster Video (Right click to download) 

414
NSSR-DIL: Null-Shot Image Super-Resolution Using Deep Identity Learning
Sree Rama Vamsidhar S (Indian Institute of Technology Tirupati), Gorthi Rama Krishna Sai Subrahmanyam (Indian Institute of Technology, Tirupati, INDIA)
PDF Poster Video (Right click to download) 

416
Taming the Tail: Leveraging Asymmetric Loss and Padé Approximation to Overcome Long-Tailed Class Imbalance
Pankhi Kashyap (Google), Pavni Tandon (Indian Institute of Technology, Bombay), Sunny Gupta (Indian Institute of Technology, Bombay), Abhishek Tiwari (Indian Institute of Technology, Bombay, Dhirubhai Ambani Institute Of Information and Communication Technology), Ritwik Kulkarni (Oraicle Biosciences LTD), Kshitij Sharad Jadhav (Indian Institute of Technology, Bombay)
PDF Poster Video (Right click to download) 

417
Kernel Representation for Dynamic Networks
Yichen Zhou (Sea Group), Teck Khim Ng (National University of Singapore)
PDF Poster 

420
Layout Free Scene Graph to Image Generation
RAMESHWAR MISHRA (Indraprastha Institute of Information Technology, Delhi), A. Subramanyam (Indraprastha Institute of Information Technology, Delhi)
PDF Poster Video (Right click to download) 

421
Rethinking Domain Adaptive Optic Disc and Cup Segmentation in Fundus Image through Dynamic Diffusion Flow
Canran Li (University of Sydney), Dongnan Liu (University of Sydney), Weidong Cai (The University of Sydney)
PDF Poster Video (Right click to download) 

424
RETRO: Reusing teacher projection head for efficient embedding distillation on Lightweight Models via Self-supervised Learning
Khanh-Binh Nguyen (Deakin University), Chae Jung Park (National Cancer Center)
PDF 

425
GLCM-Adapter: Global-Local Content Matching for Few-shot CLIP Adaptation
Shuo Wang (University of Science and Technology of China), Xieenlong (University of Science and Technology of China), Jinda Lu (University of Science and Technology of China), Jinghan Li (University of Science and Technology of China), Yanbin Hao (University of Science and Technology of China)
PDF Poster Video (Right click to download) 

426
Unified Compositional Query Machine with Multimodal Consistency for Video-based Human Activity Recognition
Tuyen Tran (Deakin University), Thao Minh Le (Deakin University), Duy Hung Tran (Deakin University), Truyen Tran (Deakin University)
PDF Poster Video (Right click to download) 

427
Lightweight Human Pose Estimation with Enhanced Knowledge Review
Hao Xu (Nanjing University of Information Science and Technology), Shengye Yan (Nanjing University of Information Science and Technology), Wei Zheng (MINIEYE)
PDF Poster 

432
Channel-Partitioned Windowed Attention And Frequency Learning for Single Image Super-Resolution
Dinh Phu Tran (Korea Advanced Institute of Science & Technology), Dao Duy Hung (Korea Advanced Institute of Science & Technology), Daeyoung Kim (Korea Advanced Institute of Science and Technology)
PDF Poster Video (Right click to download) 

433
Separated and Independent Contrastive Learning on Labeled and Unlabeled Samples: Boosting Performance on Long-tail Semi-supervised Learning
Dongyoung Kim (Hallym University), Jeong-Gun Lee (Hallym University), WonSook Lee (University of Ottawa)
PDF Poster Video (Right click to download) 

437
Difflare: Removing Image Lens Flare with Latent Diffusion Models
Tianwen Zhou (Beijing Normal University), Qihao Duan (University of the Chinese Academy of Sciences), Zitong YU (Great Bay University)
PDF Poster Video (Right click to download) 

440
Explaining Multi-modal Large Language Models by Analyzing their Vision Perception
Loris Giulivi (Polytechnic Institute of Milan), Giacomo Boracchi (Polytechnic Institute of Milan)
PDF Poster 

448
Learning to Project for Cross-Task Knowledge Distillation
Dylan Auty (Imperial College London), Roy Miles (Huawei Technologies Ltd.), Benedikt Kolbeinsson (Imperial College London), Krystian Mikolajczyk (Imperial College London)
PDF Poster 

452
Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty
Saining Zhang (Nanyang Technological University), Baijun Ye (Tsinghua University), Xiaoxue Chen (Tsinghua University, Tsinghua University), Yuantao Chen (The Chinese University of Hong Kong,Shenzhen), Zongzheng Zhang (Tsinghua University), Cheng Peng (Beijing Institute of Technology), Yongliang Shi (Tsinghua University, Tsinghua University), Hao Zhao (Tsinghua University, Tsinghua University)
PDF Poster Video (Right click to download) 

457
LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps
Andrey Palaev (Innopolis University), Adil Khan (University of Hull), Syed M Ahsan Kazmi (University of the West of England, Bristol)
PDF Poster Video (Right click to download) 

472
SAM-EG: Segment Anything Model with Egde Guidance framework for efficient Polyp Segmentation
Quoc-Huy Trinh (Aalto University), Hai-Dang Nguyen (Ho Chi Minh city University of Science, Vietnam National University), Nguyen Ngoc Bao Tram (Ho Chi Minh city University of Science, Vietnam National University), Debesh Jha (Northwestern University), Ulas Bagci (Northwestern University), Minh-Triet Tran (Ho Chi Minh city University of Science, Vietnam National University)
PDF Poster 

480
Disparity Estimation Using a Quad-Pixel Sensor
Zhuofeng Wu (Tokyo Institute of Technology, Tokyo Institute of Technology), Doehyung Lee (Tokyo Institute of Technology, Tokyo Institute of Technology), Zihua Liu (Tokyo Institute of Technology, Tokyo Institute of Technology), Kazunori Yoshizaki (Olympus Medical Systems Corporation), Yusuke Monno (Institute of Science Tokyo), Masatoshi Okutomi (Tokyo Institute of Technology)
PDF Poster Video (Right click to download) 

482
Unsupervised Hashing Network with Hyper Quantization Tree
Sungeun Kim (Ajou University), Jongbin Ryu (Ajou University)
PDF Poster Video (Right click to download) 

486
DAVINCI: A Single-Stage Architecture for Constrained CAD Sketch Inference
Ahmet Serdar Karadeniz (University of Luxemburg), Dimitrios Mallis (University of Luxemburg), Nesryne Mejri (University of Luxembourg), Kseniya Cherenkova (University of Luxemburg), Anis Kacem (University of Luxemburg), Djamila Aouada (University of Luxemburg)
PDF Poster Video (Right click to download) 

492
Multimodal base distributions in conditional flow matching generative models
Shane Josias (University of Stellenbosch), Willie Brink (Stellenbosch University)
PDF Poster Video (Right click to download) 

493
Spike-SLR: An Energy-efficient Parallel Spiking Transformer for Event-based Sign Language Recognition
Xinxu Lin (Sichuan University), Mingxuan Liu (Tsinghua University, Tsinghua University), Kezhuo Liu (Tsinghua University, Tsinghua University), Hong Chen (Tsinghua University, Tsinghua University)
PDF Poster Video (Right click to download) 

499
MotionMAE: Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders
Haosen Yang (University of Surrey), Deng Huang (Meituan), Bin Wen (Beijing University of Aeronautics and Astronautics), Jiannan Wu (University of Hong Kong), Hongxun Yao (Harbin Institute of Technology), Yi Jiang (Bytedance), Xiatian Zhu (University of Surrey), Zehuan Yuan (ByteDance Inc.)
PDF Poster Video (Right click to download) 

500
Future Does Matter: Boosting 3D Object Detection with Temporal Motion Estimation in Point Cloud Sequences
Rui Yu (East China University of Science and Technology), Runkai Zhao (University of Sydney, University of Sydney), Cong Nie (Tongji University), Heng Wang (Sony R&D), Siyu Li (East China University of Science and Technology), Songhao Zhu (East China University of Science and Technology)
PDF Poster Video (Right click to download) 

505
FLARE up your data: Diffusion-based Augmentation Method in Astronomical Imaging
Mohammed Talha Alam (Mohamed bin Zayed University of Artificial Intelligence), Raza Imam (Mohamed bin Zayed University of Artificial Intelligence), Mohsen Guizani (Mohamed bin Zayed University of Artificial Intelligence), Fakhri Karray (University of Waterloo)
PDF Poster Video (Right click to download) 

508
Semantic Image Synthesis of Anime Characters Based on Conditional Generative Adversarial Networks
Xuhui Zhu (Chongqing University), feng jiang (Chongqing University), Jing Wen (Chongqing University), yi wang (Chongqing University), qiang gao (Chongqing University)
PDF Poster Video (Right click to download) 

510
ML-2SN: A Hybrid Two-Stream System for Sitting Posture Detection
Kehang Jia (Suzhou University), Gaorui Zhang (Suzhou University), Yixuan Yang (Suzhou University), Guangwei Huang (Suzhou University), Penghuan Wang (Suzhou University), Cheng Cheng (Suzhou University)
PDF Poster 

517
Interpretable Long-term Action Quality Assessment
Xu Dong (University of Surrey), Xinran Liu (University of Surrey), Wanqing Li (University of Wollongong), Anthony Adeyemi-Ejeye (University of Surrey), Andrew Gilbert (University of Surrey)
PDF Poster Video (Right click to download) 

524
A self-supervised cyclic neural-analytic approach for novel view synthesis and 3D reconstruction
Dragos Costea (University Politehnica of Bucharest), Alina Marcu (Institute of Mathematics of the Romanian Academy), Marius Leordeanu (Norwegian Research Center (NORCE))
PDF Poster Video (Right click to download) 

528
SOFI: Multi-Scale Deformable Transformer for Camera Calibration with Enhanced Line Queries
Sebastian Janampa (University of New Mexico), Marios Pattichis (University of New Mexico)
PDF Poster Video (Right click to download) 

532
Prompt Generation Networks for Input-Space Adaptation of Frozen Vision Transformers
Jochem Loedeman (University of Amsterdam), Maarten C. Stol (BrainCreators ), Tengda Han (Google DeepMind), Yuki M Asano (University of Technology Nuremberg)
PDF Poster 

533
TraIL-Det: Transformation-Invariant Local Feature Networks for 3D LiDAR Object Detection with Unsupervised Pre-Training
Li Li (King's College London, University of London), Tanqiu Qiao (Durham University), Hubert P. H. Shum (Durham University), Toby P. Breckon (Durham University)
PDF Poster 

534
Enhancing Cardiovascular Disease Prediction through Multi-Modal Self-Supervised Learning
Francesco Girlanda (ETH Zurich), Olga V. Demler (ETH Zurich), Bjoern Menze (University of Zurich), Neda Davoudi (ETH Zurich)
PDF Poster Video (Right click to download) 

537
Out-Of-Distribution Detection for Audio-visual Generalized Zero-Shot Learning: A General Framework
Liuyuan Wen (University of Science and Technology of China)
PDF Poster Video (Right click to download) 

545
Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection
Christian Fruhwirth-Reisinger (Graz University of Technology), Wei Lin (Johannes Kepler University Linz), Dušan Malić (Graz University of Technology), Horst Bischof (Graz University of Technology), Horst Possegger (Graz University of Technology)
PDF Poster Video (Right click to download) 

546
Balancing Calibration and Performance: Stochastic Depth in Segmentation BNNs
Linghong Yao (InstaDeep), Denis Hadjivelichkov (University College London), Andromachi Maria Delfaki (University College London), Yuanchang Liu (), Brooks Paige (University College London), Dimitrios Kanoulas (University College London)
PDF Poster Video (Right click to download) 

557
Hybrid-CSR: Coupling Explicit and Implicit Reconstruction of Cortical Surface
shanlin sun (University of California, Irvine), Tung Le (University of California, Irvine), Pooya Khosravi (University of California, Irvine), Chenyu You (State University of New York at Stony Brook), Kun Han (University of California, Irvine), Haoyu Ma (Meta Platforms, Inc), Deying Kong (University of California, Irvine), Xiangyi Yan (University of California, Irvine), Xiaohui Xie (University of California, Irvine)
PDF Poster 

563
As Firm As Their Foundations: Creating Transferable Adversarial Examples Across Downstream Tasks with CLIP
Anjun Hu (University of Oxford), Jindong Gu (University of Oxford), Francesco Pinto (University of Chicago), Konstantinos Kamnitsas (University of Oxford), Philip Torr (University of Oxford)
PDF Poster Video (Right click to download) 

566
SuperLoRA: Parameter-Efficient Unified Adaptation of Large Foundation Models
Xiangyu Chen (University of Kansas), Jing Liu (Mitsubishi Electric Research Labs), Ye Wang (Mitsubishi Electric Research Labs), Pu Perry Wang (Mitsubishi Electric Research Labs), Matthew Brand (Yale University), Guanghui Wang (Toronto Metropolitan University), Toshiaki Koike-Akino (Mitsubishi Electric Research Labs)
PDF Poster Video (Right click to download) 

568
Beyond Static and Dynamic Quantization - Hybrid Quantization of Vision Transformers
Piotr Kluska (International Business Machines), Florian Scheidegger (International Business Machines), A. Cristiano I. Malossi (International Business Machines), Enrique S. Quintana-Orti (Universidad Politecnica de Valencia)
PDF Poster 

572
Multi-Scope Representation Learning for Causal Relation Discovery with new Challenging Datasets
Jiageng Zhu (University of Southern California), Hanchen Xie (Bosch), Jianhua Wu (University of Southern California), Mohamed E. Hussein (USC/ISI), Mahyar Khayatkhoei (USC/ISI), Jiazhi Li (Futurewei Technologies Inc.), Wael AbdAlmageed (Clemson University)
PDF Poster Video (Right click to download) 

577
AtomGS: Atomizing Gaussian Splatting for High-Fidelity Radiance Field
Rong Liu (University of Southern California), Rui Xu (USC Institute for Creative Technologies, University of Southern California), Yue Hu (University of Southern California), Meida Chen (University of Southern California), Andrew Feng (Institute for Creative Technologies, University of Southern California)
PDF Poster Video (Right click to download) 

579
Neural Collapse Inspired Contrastive Continual Learning
Antoine Montmaur (ENSEA), Nicolas Larue (ENSEA), Ngoc-Son Vu (ENSEA)
PDF Poster Video (Right click to download) 

584
ATLANTIS: A Framework for Automated Targeted Language-guided Augmentation Training for Robust Image Search
Inderjeet Singh (Fujitsu Research of Europe Limited), Roman Vainshtein (Fujitsu Research and Development Center Co. Ltm.), Alon Zolfi (Ben Gurion University of the Negev), Asaf Shabtai (Ben-Gurion University of the Negev), Tu Bui (Fujitsu Research and Development Center Co. Ltm.), Jonathan Brokman (Technion - Israel Institute of Technology, Technion - Israel Institute of Technology), Omer Hofman (Fujitsu Research and Development Center Co. Ltm.), Fumiyoshi Kasahara (Fujitsu Research and Development Center Co. Ltm.), Kentaro Tsuji (Fujitsu Research and Development Center Co. Ltm.), Hisashi Kojima (Fujitsu Research and Development Center Co. Ltm.)
PDF Poster Video (Right click to download) 

595
A Prototype Unit for Image De-raining using Time-Lapse Data
Jaehoon Cho (Hyundai Motor Company), Minjung Yoo (Korea Aerospace University), Jini Yang (Korea Aerospace University), Sunok Kim (Korea Aerospace University)
PDF Poster Video (Right click to download) 

597
FADE: Few-shot/zero-shot Anomaly Detection Engine using Large Vision-Language Model
Yuanwei Li (Onfido), Elizaveta Ivanova (Onfido), Martins Bruveris (Onfido)
PDF Poster Video (Right click to download) 

599
VLAVAD: Vision-Language Models Assisted Unsupervised Video Anomaly Detection
Changkang Li (Beijing University of Aeronautics and Astronautics), Yalong Jiang (Beihang University)
PDF Poster Video (Right click to download) 

601
Training-Free Zero-Shot Semantic Segmentation with LLM Refinement
Yuantian Huang (CyberAgent, Inc.), Satoshi Iizuka (University of Tsukuba, Tsukuba University), Kazuhiro Fukui (University of Tsukuba)
PDF Poster Video (Right click to download) 

606
VEMIC: View-aware Entropy model for Multi-view Image Compression
Susmija Jabbireddy (University of Maryland, College Park), Davit Soselia (University of Maryland, College Park), Max Ehrlich (University of Maryland, College Park), Christopher Metzler (University of Maryland, College Park), Amitabh Varshney (University of Maryland, College Park)
PDF Poster Video (Right click to download) 

609
Guidance-base Diffusion Models for Improving Photoacoustic Image Quality
Tatsuhiro Eguchi (Kyushu University, Tokyo Institute of Technology), Shumpei Takezaki (Kyushu University), Mihoko Shimano (National Institute of Informatics), Takayuki Yagi (Tokyo Institute of Technology, Tokyo Institute of Technology), Ryoma Bise (Kyushu University, Faculty of Information Science and Electrical Engineering)
PDF Poster Video (Right click to download) 

611
STPose: 6D object pose estimation network based on sparse attention and cross-layer connection
Shihao Chen (Wuhan University), Xiaobing Li (Guangxi University), Keduo Yan (Guangxi University), Yong Li (Guangxi University), Dongxu Gao (University of Portsmouth)
PDF Poster Video (Right click to download) 

615
Measuring Physical Plausibility of 3D Human Poses Using Physics Simulation
Nathan Louis (University of Michigan - Ann Arbor), Mahzad Khoshlessan (University of Michigan - Ann Arbor), Jason J Corso (University of Michigan)
PDF Poster Video (Right click to download) 

619
Prompt-guided Multi-modal contrastive learning for Cross-compression-rate Deepfake Detection
Ching-Yi Lai (National Tsinghua University), Chiou-ting Hsu (National Tsing Hua University), Chih-Chung Hsu (National Yang Ming Chiao Tung University), Chia-Wen Lin (National Tsing Hua University)
PDF 

622
The Attempt on Combining Three Talents by KD with Enhanced Boundary in Co-Salient Object Detection
Ziyi Cao (Nanjing University of Information Science and Technology), Shengye Yan (Nanjing University of Information Science and Technology), Wei Zheng (MINIEYE)
PDF Poster 

627
GLPI: A Global Layered Prompt Integration approach for Explicit Visual Prompt
Yufei Gao (Zhengzhou University), Bin Fu (Zhengzhou University), Lei Shi (Zhengzhou University), Chengming Liu (Zhengzhou University), yucheng shi (Zhengzhou University)
PDF Poster Video (Right click to download) 

630
CPDR: Towards Highly-Efficient Salient Object Detection via Crossed Post-decoder Refinement
Yijie Li (Northwestern University, Northwestern University), Hewei Wang (Apple), Aggelos Katsaggelos (Northwestern University)
PDF Poster Video (Right click to download) 

637
3D Point Cloud Network Pruning: When Some Weights Do not Matter
Amrijit Biswas (North South University), Md. Ismail Hossain (North South University), M M Lutfe Elahi (North South University), Ali Cheraghian (CSIRO), Fuad Rahman (University of Arizona), Nabeel Mohammed (North South University), Shafin Rahman (North South University)
PDF Poster Video (Right click to download) 

642
Revitalizing Legacy Video Content: Deinterlacing with Bidirectional Information Propagation
Zhaowei Gao (Beijing Jingwei Hirain Technologies Co., Inc.), Mingyang Song (Disney Research, Disney Research), Christopher Schroers (Disney Research|Studios, Disney), Yang Zhang (Disney Research, Disney)
PDF Poster 

648
3D Blur Kernel on Gaussian Splatting
Yongchao Lin (Inner Mongolia University), Xiangdong Su (Inner Mongolia University), Yuhan Yang (Inner Mongolia University )
PDF Poster Video (Right click to download) 

650
Drawing Insights: Sequential Representation Learning in Comics
Sam Titarsolej (University of Amsterdam), Neil Cohn (Tilburg University), Nanne Van Noord (University of Amsterdam)
PDF 

657
G3FA: Geometry-guided GAN for Face Animation
Alireza Javanmardi (German Research Center for AI), Alain Pagani (German Research Center for Artificial Intelligence), Didier Stricker (Technical University Kaiserslautern)
PDF Poster Video (Right click to download) 

659
GN-FR: Generalizable Neural Radinace Fields for Flare Removal
Gopi Raju Matta (Indian Institute of Technology Madras), Rahul Siddartha (Indian Institute of Technology Madras, Indian Institute of Technology, Madras), RONGALI SIMHACHALA VENKATA GIRISH (Indian Institute of Technology, Madras.), Sumit Sharma (Indian Institute of Technology, Madras), Kaushik Mitra (Indian Institute of Technology, Madras)
PDF Poster Video (Right click to download) 

663
Unsupervised Point Cloud Registration with Self-Distillation
Christian Löwens (Bosch), Thorben Funke (Bosch), André Wagner (Bosch), Alexandru Paul Condurache (Bosch)
PDF Poster Video (Right click to download) 

667
ICAF-4: An Integrated Framework of Category-level Articulated Object Perception and Manipulation for Embodied Intelligence
WenBo Xu (Hefei University of Technology), Li Zhang (University of Science and Technology of China), Qiankun Li (University of Science and Technology of China), Qi Wu (Shanghai Jiaotong University), Lin Yuanbo Wu (Swansea University), Liu Liu (Hefei University of Technology)
PDF Poster Video (Right click to download) 

670
Leveraging Inductive Bias in ViT for Medical Image Diagnosis
Jungmin Ha (Kookmin University), Euihyun-yoon (Kookmin University), Sungsik Kim (Kookmin University), Jinkyu Kim (Korea University), Jaekoo Lee (Kookmin University)
PDF Poster Video (Right click to download) 

678
Content and Style Aware Audio-Driven Facial Animation
QINGJU LIU (Flawless AI), Hyeongwoo Kim (Imperial College London), Gaurav Bharaj (Reality Defender AI)
PDF Poster Video (Right click to download) 

680
May the Forgetting Be with You: Alternate Replay for Learning with Noisy Labels
Monica Millunzi (University of Modena and Reggio Emilia), Lorenzo Bonicelli (University of Modena and Reggio Emilia), Angelo Porrello (University of Modena and Reggio Emilia, AimageLab), Jacopo Credi (Chalmers University of Technology), Petter N. Kolm (NYU Courant), Simone Calderara (University of Modena and Reggio Emilia)
PDF Poster Video (Right click to download) 

681
On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models
Hashmat Shadab Malik (Mohamed bin Zayed University of Artificial Intelligence), Numan Saeed (Mohamed bin Zayed University of Artificial Intelligence), Asif Hanif (Mohamed bin Zayed University of Artificial Intelligence), Muzammal Naseer (Khalifa University of Science, Technology and Research), Mohammad Yaqub (Mohamed bin Zayed University of Artificial Intelligence), Salman Khan (Mohamed bin Zayed University of Artificial Intelligence), Fahad Shahbaz Khan (Mohamed bin Zayed University of Artificial Intelligence)
PDF Poster Video (Right click to download) 

685
Boundary Contrastive Learning for Label-Efficient Medical Image Segmentation
Satoshi Kamiya (Meijo University), Kota Yamashita (Meijo University), Kazuhiro Hotta (Meijo University)
PDF Poster 

686
TransHuPR: Cross-View Fusion Transformer for Human Pose Estimation Using mmWave Radar
Niraj Prakash Kini (National Yang Ming Chiao Tung University), Ruey-Horng Shiue (National Yang Ming Chiao Tung University), ryan chandra (National Yangmin Chiaotung University), Wen-Hsiao Peng (National Yang Ming Chiao Tung University), Ching-Wen Ma (National Yang Ming Chiao Tung University), Jenq-Neng Hwang (University of Washington)
PDF Poster Video (Right click to download) 

689
AggSS: An Aggregated Self-Supervised Approach for Class Incremental Learning
Jayateja Kalla (Indian Institute of Science), Soma Biswas (Indian Institute of Science, Bangalore, India)
PDF Poster Video (Right click to download) 

692
Spatio-Temporal Transformer with Rotary Position Embedding and Bone Priors for 3D Human Pose Estimation
Cheng Chen (University of Electronic Science and Technology of China), Jiang Liu (Southwest Jiaotong University), Liaoyuan Zeng (University of Electronic Science and Technology of China), Fang Duan (University of Bath), Sean McGrath (University of Limerick), Tian Dan (University of Electronic Science and Technology of China)
PDF Poster Video (Right click to download) 

695
Detecting Audio-Visual Deepfakes with Fine-Grained Inconsistencies
Marcella Astrid (University of Luxembourg), Enjie Ghorbel (CRISTAL, ENSI, University of Manouba), Djamila Aouada (University of Luxemburg)
PDF Poster Video (Right click to download) 

697
Time-conditioned Illumination for Inverse Rendering of Outdoor Scenes
Xiaoxue Chen (Tsinghua University, Tsinghua University), Hao Zhao (Tsinghua University, Tsinghua University), Guyue Zhou (Tsinghua University), Ya-Qin Zhang (AIR, Tsinghua University)
PDF Poster 

707
QUD: Unsupervised Knowledge Distillation for Deep Face Recognition
Jan Niklas Kolf (TU Darmstadt), Naser Damer (Fraunhofer Institute for Computer Graphics Research IGD), Fadi Boutros (Fraunhofer Institute for Computer Graphics Research)
PDF Poster Video (Right click to download) 

721
Sign Stitching: A Novel Approach to Sign Language Production
Harry Walsh (University of Surrey), Ben Saunders (University of Surrey), Richard Bowden (University of Surrey)
PDF Poster Video (Right click to download) 

723
ControlEdit: A MultiModal Local Clothing Image Editing Method
Di Cheng (Beijing Institute of Fashion Technology), Yingjie Shi (Beijing Institute of Fashion Technology), sun shixin (Beijing Institute Of Fashion Technology), JiaFu Zhang (Beijing Institute of Fashion Technology), weijing wang (Beijing Institution of Fashion Technology), YULiu (Beijing Institute Of Fashion Technology)
PDF 

727
Optimising Diffusion Models for Histopathology Image Synthesis
Victoria Porter (The Queen's University Belfast), Richard Gault (The Queen's University Belfast), Stephanie G Craig (The Queen's University Belfast), Jacqueline James (The Queen's University Belfast)
PDF Poster Video (Right click to download) 

729
Reconstructing Spheres by Fitting Planes
Erol Ozgur (Institut Pascal Clermont-Ferrand), Mohammad Alkhatib (Institut Pascal Clermont-Ferrand), Youcef Mezouar (Institut Pascal Clermont-Ferrand), Adrien Bartoli (Institut Pascal Clermont-Ferrand)
PDF Poster Video (Right click to download) 

731
AutoDOM: Automated Dimension Overlay for Enhanced Measurement-Guidance
Pushpendu Ghosh (Amazon), Aniket Joshi (Amazon), Soumyajit Chowdhury (Amazon), Promod Yenigalla (Amazon)
PDF Poster Video (Right click to download) 

736
Rectifying Shortcut Learning through Cellular Differentiation in Deep Learning Neurons
Hongjing Niu (University of Science and Technology of China), Hanting Li (University of Science and Technology of China), Guoping Wu (University of Science and Technology of China), Bin Li (University of Science and Technology of China), Feng Zhao (University of Science and Technology of China)
PDF Poster Video (Right click to download) 

737
Pseudo Labelling for Enhanced Masked Auto Encoders
Srinivasa Rao Nandam (University of Surrey), Sara Atito (University of Surrey), Zhenhua Feng (Jiangnan University), Josef Kittler (University of Surrey), Muhammad Awais (University of Surrey)
PDF Poster Video (Right click to download) 

738
CosFairNet:A Parameter-Space based Approach for Bias Free Learning
Rajeev Ranjan Dwivedi (Indian Institute of Science Education and Research Bhopal), Priyadarshini Kumari (Sony AI), Vinod K. Kurmi (IISER Bhopal )
PDF Poster Video (Right click to download) 

740
Frequency Decomposition to Tap the Potential of Single Domain for Generalization
Hongjing Niu (University of Science and Technology of China), Qingyue Yang (University of Science and Technology of China), Pengfei Xia (University of Science and Technology of China), Wei Zhang (University of Science and Technology of China), Bin Li (University of Science and Technology of China), Feng Zhao (University of Science and Technology of China)
PDF Poster Video (Right click to download) 

745
Task-Related Feature Enhancement Network for Neuronal Morphology Classification
Chunli Sun (University of Science and Technology of China), Feng Zhao (University of Science and Technology of China)
PDF Poster Video (Right click to download) 

746
Adapting MIMO video restoration networks to low latency constraints
Valéry Dewil (Ecole Normale Superieure), Zhe Zheng (Ecole Normale Superieure), Arnaud Barral (Ecole Normale Superieure), Lara Raad (Universidad de la Republica), Nao Nicolas (Thales Group), Ioannis Cassagne (Thales Group), Jean-michel Morel (City University of Hong Kong), Gabriele Facciolo (Ecole Normale Superieure Paris-Saclay), Bruno Galerne (Universite d'Orleans), Pablo Arias (Universitat Pompeu Fabra)
PDF Poster Video (Right click to download) 

753
Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning
Hoàng-Ân Lê (Université de Bretagne Sud), Paul Berg (Université de Bretagne Sud), Minh Tan Pham (Université de Bretagne Sud)
PDF Poster Video (Right click to download) 

754
Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization
Nicholas Moratelli (University of Modena and Reggio Emilia), Davide Caffagni (University of Modena and Reggio Emilia), Marcella Cornia (University of Modena and Reggio Emilia), Lorenzo Baraldi (University of Modena and Reggio Emilia ), Rita Cucchiara (University of Modena and Reggio Emilia)
PDF Poster 

755
PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition
Chenhongyi Yang (University of Edinburgh), Zehui Chen (University of Science and Technology of China), Miguel Espinosa (University of Edinburgh), Linus Ericsson (University of Edinburgh), Zhenyu Wang (Peking University), Jiaming Liu (Peking University), Elliot J. Crowley (University of Edinburgh)
PDF Poster Video (Right click to download) 

762
Open-World Semi-Supervised Learning under Compound Distribution Shifts
Shijia Xu (Nanjing University of Science and Technology), Lin Zhao (Nanjing University of Science and Technology), Jialiang Tang (NJUST), Guangyu Li (Nanjing University of Science and Technology), Chen Gong (Nanjing University of Science and Technology)
PDF Poster 

763
Horospherical Learning with Smart Prototypes
Paul Berg (Université de Bretagne Sud), Björn Michele (Université de Bretagne Sud), Minh Tan Pham (Université de Bretagne Sud), Laetitia Chapel (Institut Agro Rennes-Angers), Nicolas Courty (Université de Bretagne Sud)
PDF Poster Video (Right click to download) 

769
Flexible Graph Convolutional Network for 3D Human Pose Estimation
Abu Taib Mohammed Shahjahan (Concordia University), Abdessamad Ben Hamza (Concordia University)
PDF Poster Video (Right click to download) 

775
SAE: Single Architecture Ensemble Neural Networks
Martin Ferianc (University College London, University of London), Hongxiang Fan (Samsung), Miguel R. D. Rodrigues (University College London)
PDF Poster Video (Right click to download) 

779
Outlier detection by ensembling uncertainty with negative objectness
Anja Delić (University of Zagreb), Matej Grcic (Faculty of Electrical Engineering and Computing, University of Zagreb), Siniša Šegvić (UniZg-FER)
PDF Poster Video (Right click to download) 

787
MSA2Net: Multi-scale Adaptive Attention-guided Network for Medical Image Segmentation
Sina Ghorbani Kolahi (Tarbiat Modares University), Seyed Kamal Chaharsooghi (Tarbiat Modares University), Toktam Khatibi (Tarbiat Modares University), Afshin Bozorgpour (University of Regensburg), Reza Azad (RWTH Aachen), Moein Heidari (University of British Columbia), Ilker Hacihaliloglu (University of British Columbia), Dorit Merhof (University of Regensburg)
PDF Poster Video (Right click to download) 

790
FILS: Self-Supervised Video Feature Prediction In Semantic Language Space
Mona Ahmadian (University of Surrey), Frank Guerin (University of Surrey), Andrew Gilbert (University of Surrey)
PDF Poster Video (Right click to download) 

797
Calibration of 2D LiDAR sensors using cylindrical target
Tamás Tófalvi (Eotvos Lorand University), Bandó Kovács (Eotvos Lorand University), Levente Hajder (Eotvos Lorand University)
PDF Poster Video (Right click to download) 

828
Multi-Scale Semantic Enrichment and Dual Angular Margin Contrast for Few-Shot Class Incremental Learning
Riya Verma (Indian Institute of Technology, Madras), Sukhendu Das (Indian Institute of Technology Madras)
PDF Poster Video (Right click to download) 

833
Anomaly Detection Based on Semi-Formula Driven Pre-training Dataset to Represent Subtle Difference and Anomaly Score
Hiroki Kobayashi (Chukyo University), Naoki Murakami (Chukyo University), Naoto Hiramatsu (Chukyo University), Takahiro Suzuki (Chukyo University), Manabu Hashimoto (Chukyo University)
PDF Poster Video (Right click to download) 

853
Budget-aware Dynamic Spatially Adaptive Inference
Georgios Zampokas (Imperial College London), Christos-Savvas Bouganis (Imperial College London), Dimitris Tzovaras (Centre for Research and Technology Hellas)
PDF Poster Video (Right click to download) 

854
CSAD: Unsupervised Component Segmentation for Logical Anomaly Detection
Yu-Hsuan Hsieh (Department of Computer Science, National Tsing Hua University, National Tsinghua University), Shang-Hong Lai (National Tsing Hua University)
PDF Poster Video (Right click to download) 

857
Enhancing Radiology Report Generation: The Impact of Locally Grounded Vision and Language Training
Sergio Sanchez Santiesteban (University of Surrey), Muhammad Awais (University of Surrey), Yi-Zhe Song (University of Surrey), Josef Kittler (University of Surrey)
PDF Poster Video (Right click to download) 

859
Extract More from Less: Efficient Fine-Grained Visual Recognition in Low-Data Regimes
Dmitry Demidov (Mohamed bin Zayed University of Artificial Intelligence), Abduragim Shtanchaev (Mohamed bin Zayed University of Artificial Intelligence), Mihail Minkov Mihaylov (Mohamed bin Zayed University of Artificial Intelligence), Mohammad Almansoori (Mohamed bin Zayed University of Artificial Intelligence)
PDF Poster Video (Right click to download) 

863
CLIP with Generative Latent Replay: a Strong Baseline for Incremental Learning
Emanuele Frascaroli (University of Modena and Reggio Emilia), Aniello Panariello (University of Modena and Reggio Emilia), Pietro Buzzega (University of Modena and Reggio Emilia), Lorenzo Bonicelli (University of Modena and Reggio Emilia), Angelo Porrello (University of Modena and Reggio Emilia, AimageLab), Simone Calderara (University of Modena and Reggio Emilia)
PDF Poster Video (Right click to download) 

865
APTPose: Anatomy-aware Pre-Training for 3D Human Pose Estimation
Qing-Wen Yang (MediaTek Inc.), Kai-Wen Duan (National Tsinghua University), Ting-Yi Lu (National Tsinghua University), Kevin Lin (Microsoft), Cheng-Yen Yang (University of Washington), Lijuan Wang (Microsoft), Jenq-Neng Hwang (University of Washington, Seattle), Shang-Hong Lai (National Tsing Hua University)
PDF Poster Video (Right click to download) 

866
A Deep Belief Network Approach to Scalable Compression of Light Field Data for Auto-Stereoscopic Displays
Sally Khaidem (Indian Institute of Technology, Madras), Mansi Sharma (Thapar Institute of Engineering & Technology)
PDF Poster Video (Right click to download) 

878
Learning conditionally untangled latent spaces using Fixed Point Iteration
Victor Enescu (LIP6), Hichem Sahbi (Sorbonne University)
PDF Poster 

882
A Multimodal Network on Handwritten Chinese Character Error Correction
Haizhao Sun (Beijing University of Posts and Telecommunications), Yu Ning (Beijing University of Posts and Telecommunications), jixv (Beijing University of Posts and Telecommunications), Chuang Zhang (Beijing University of Posts and Telecommunications), Ming Wu (Beijing University of Post and Telecommunication)
PDF Poster Video (Right click to download) 

885
Efficient Data Source Relevance Quantification for Multi-Source Neural Networks
Jakob Gawlikowski (Technical University of Munich (TUM)), Nina Maria Gottschling (German Aerospace Center (DLR))
PDF Poster 

887
Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
Bin Fu (Institute of Computing Technology, Chinese Academy of Sciences), Qiyang Wan (Institute of Computing Technology, Chinese Academy of Sciences), Jialin Li (Institute of Computing Technology, Chinese Academy of Sciences), Ruiping Wang (Institute of Computing Technology, Chinese Academy of Sciences), Xilin Chen (Institute of Computing Technology)
PDF Poster Video (Right click to download) 

895
Self-Evolving Depth-Supervised 3D Gaussian Splatting from Rendered Stereo Pairs
Sadra Safadoust (Koc University), Fabio Tosi (University of Bologna), Fatma Guney (Koc University), Matteo Poggi (University di Bologna)
PDF Poster Video (Right click to download) 

897
topK dice loss for medical image segmentation
Seyed mohsen hosseini (University of Tehran, University of Tehran)
PDF Poster Video (Right click to download) 

900
Direct-Sum Approach to Integrate Losses Via Classifier Subspace
Takumi Kobayashi (National Institute of Advanced Industrial Science and Technology (AIST))
PDF Poster Video (Right click to download) 

902
Knowledge Distillation with Global Filters for Efficient Human Pose Estimation
Kaushik Bhargav Sivangi (University of Glasgow), Fani Deligianni (University of Glasgow)
PDF Poster Video (Right click to download) 

911
A Learnable Color Correction Matrix for RAW Reconstruction
Anqi Liu (Shanghai University), Shiyi Mu (Shanghai University), Shugong Xu (Shanghai University)
PDF Poster Video (Right click to download) 

913
Examining the Threat Landscape: Foundation Models and Model Stealing
Ankita Raj (Indian Institute of Technology, Delhi), Deepankar Varma (Indian Institute of Technology, Delhi), Chetan Arora (Indian Institute of Technology Delhi)
PDF Poster Video (Right click to download) 

922
UnSeGArmaNet: Unsupervised Image Segmentation using Graph Neural Networks with Convolutional ARMA Filters
Kovvuri Sai Gopal Reddy (Shiv Nadar University), Saran Bodduluri (Shiv Nadar University), A. Mudit Adityaja (Shiv Nadar University), Saurabh Shigwan (Shiv Nadar University), Nitin Kumar (Shiv Nadar University), Snehasis Mukherjee (Shiv Nadar University)
PDF Poster Video (Right click to download) 

927
GazeHELL: Gaze Estimation with Hybrid Encoders and Localised Losses with weighing
Shubham Dokania (Mercedes-Benz R&D India), Vasudev Singh (Mercedes Benz Research & Development India), Shuaib Ahmed (Mercedes Benz R&D India )
PDF Poster Video (Right click to download) 

929
TrakAthlete4D: Multi-View On-Field Player Position Tracking in Sports
Nitish Agarwal (KinaTrax), Steven Cadavid (University of Miami)
PDF Poster Video (Right click to download) 

932
Spatiotemporal Vision Transformer for Weakly Supervised Dense Prediction of Dynamic Brain Maps
Behnam Kazemivash (Georgia State University), Armin Iraji (Georgia State University), Sergey Plis (Georgia State University), Vince Calhoun (Georgia State University)
PDF Poster Video (Right click to download) 

933
SceneSAM: Integrating 2D Labels for Weakly Supervised 3D Scene Understanding
Julius Koerner (Technical University of Munich), Dogu Tamgac (Technical University of Munich), David Rozenberszki (Technical University of Munich)
PDF Poster Video (Right click to download) 

936
PV-SLAM: Panoptic Visual SLAM with Loop Closure and Online Bundle Adjustment
Ashok Bandyopadhyay (Indian Institute of Technology, Guwahati), Pranjal Baranwal (Indian Institute of Technology, Guwahati, Indian institute of science, Bangalore), Arijit Sur (Indian Institute of Technology, Guwahati), Rajeev UP (Vikram Sarabhai Space Centre, Indian Space Research Organization, Thiruvananthapuram, India)
PDF Poster Video (Right click to download) 

939
Deep Learning for GPS-Denied SAR Image Focusing and Vehicle Trajectory Estimation
Christopher Beam (University of North Carolina at Charlotte), Andrew R. Willis (University of North Carolina, Charlotte), Kevin M Brink (Air Force Research Laboratory)
PDF Poster Video (Right click to download) 

945
Gaussian Splatting in Mirrors: Reflection-aware Rendering via Virtual Camera Optimization
Zihan Wang (Aalto University), Shuzhe Wang (Aalto University), Matias Turkulainen (Aalto University), Junyuan Fang (University of Helsinki), Juho Kannala (Aalto University)
PDF Poster Video (Right click to download) 

947
Layer-wise Learning of CNNs by Self-tuning Learning Rate and Early Stopping at Each Layer
Melika Sadeghi Tabrizi (University of Tehran, University of Tehran), Ali Karimi (Kharazmi University), Ahmad Kalhor (University of Tehran), Babak N Araabi (University of Tehran, University of Tehran), Mona Ahmadian (University of Surrey)
PDF Poster Video (Right click to download) 

949
On Partial Prototype Collapse in the DINO Family of Self-Supervised Methods
Hariprasath Govindarajan (Qualcomm Inc, QualComm), Per Sidén (Linkoping University), Jacob Roll (Qualcomm Inc, QualComm), Fredrik Lindsten (Linkoping University)
PDF Poster Video (Right click to download) 

954
Beyond Face Matching: A Facial Traits based Privacy Score for Synthetic Face Datasets
Robero Leyva (The university of Warwick), Praveen Selvaraj (University College London, University of London), Andrew Elliott (Alan Turing Institute), Dr Gregory Epiphaniou (University of Warwick), carsten maple (The university of Warwick)
PDF Poster Video (Right click to download) 

957
Putting the Segment Anything Model to the Test with 3D Knee MRI - A Comparison with State-of-the-Art Performance
Oliver Mills (University of Leeds), Nishant Ravikumar (University of Leeds), Philip G Conaghan (University of Leeds), Samuel D Relton (University of Leeds)
PDF Poster Video (Right click to download) 

959
SR+Codec: a Benchmark of Super-Resolution for Video Compression Bitrate Reduction
Evgeney Bogatyrev (Moscow State University, Lomonosov Moscow State University), Ivan Molodetskikh (Moscow State University, Lomonosov Moscow State University), Dmitriy S. Vatolin (Moscow State University, Lomonosov Moscow State University)
PDF Poster Video (Right click to download) 

967
CVAM-Pose: Conditional Variational Autoencoder for Multi-Object Monocular Pose Estimation
Jianyu Zhao (University of Central Lancashire), Wei Quan (University of Central Lancashire), Bogdan Matuszewski (University of Central Lancashire)
PDF Poster Video (Right click to download) 

977
Improving Multimodal Learning with Multi-Loss Gradient Modulation
Konstantinos Kontras (Department of Electrical Engineering, KU Leuven, Belgium, KU Leuven), Christos Chatzichristos (KU Leuven), Matthew B. Blaschko (KU Leuven), Maarten De Vos (KU Leuven)
PDF Poster 

986
Adaptive Weighted Co-Learning for Cross-Domain Few-Shot Learning
Abdullah Alchihabi (Carleton University), Marzi Heidari (Carleton University), Yuhong Guo (Carleton University)
PDF Poster Video (Right click to download) 

987
Guided Attention for Interpretable Motion Captioning
KARIM RADOUANE (University of Montpellier), Julien Lagarde (University of Montpellier), Sylvie RANWEZ (IMT Mines Ales), Andon Tchechmedjiev (IMT Mines Ales)
PDF Poster Video (Right click to download) 

991
iHAST: Integrating Hybrid Attention for Super-Resolution in Spatial Transcriptomics
Xi Li (University of California, Irvine), Jing Zhang (Donald Bren School of Information and Computer Sciences, University of California, Irvine), Ziheng Duan (University of California, Irvine), Yi Dai (University of California, Irvine), Siwei Xu (Donald Bren School of Information and Computer Sciences, University of California, Irvine)
PDF Poster Video (Right click to download) 

998
MV-Match: Multi-View Matching for Domain-Adaptive Identification of Plant Nutrient Deficiencies
Jinhui Yi (University of Bonn), Yanan Luo (University of Bonn), Marion Deichmann (University of Bonn), Gabriel Schaaf (University of Bonn), Juergen Gall (University of Bonn)
PDF Poster Video (Right click to download) 

1013
Open-Vocabulary Temporal Action Localization using Multimodal Guidance
Akshita Gupta (University of Guelph), Aditya Arora (York University), Sanath Narayan (Technology Innovation Institute), Salman Khan (Mohamed bin Zayed University of Artificial Intelligence), Fahad Shahbaz Khan (Mohamed bin Zayed University of Artificial Intelligence), Graham W. Taylor (University of Guelph)
PDF Poster Video (Right click to download) 

1020
Recovering SLAM Tracking Lost by Trifocal Pose Estimation using GPU-HC++
Chiang-Heng Chien (Brown University), Ahmad Abdelfattah (University of Tennessee, Knoxville), Benjamin Kimia (Brown University)
PDF Poster Video (Right click to download) 

If there are any mistakes on this page, please do not hesitate to contact bmvc@bmvc2024.org