The 35th British Machine Vision Conference 2024: Proceedings

9	Federated Learning for Face Recognition via Intra-subject Self-supervised Learning Hansol Kim (Kookmin University), Hoyeol choi (Kakaobank), Youngjun Kwak (KAKAOBANK) PDF Poster Video (Right click to download)
12	CLIP Adaptation by Intra-Modal Overlap Reduction Alexey Kravets (University of Bath), Vinay P Namboodiri (University of Bath) PDF Poster Video (Right click to download)
14	Efficiency-preserving Scene-adaptive Object Detection Zekun Zhang (State University of New York, Stony Brook), Vu Quang Truong (VinAI Research), Minh Hoai (University of Adelaide) PDF Poster Video (Right click to download)
15	Sequential Amodal Segmentation via Cumulative Occlusion Learning Jiayang Ao (University of Melbourne), Qiuhong Ke (Monash University), Krista A. Ehinger (The University of Melbourne) PDF Poster Video (Right click to download)
16	Region-based Entropy Separation for One-shot Test-Time Adaptation Kodai Kawamura (Korea University), Shunya Yamagami (Tokyo University of Science), Go Irie (Tokyo University of Science) PDF Poster Video (Right click to download)
18	MeTTA: Single-View to 3D Textured Mesh Reconstruction with Test-Time Adaptation Kim Yu-Ji (Pohang University of Science and Technology), Hyunwoo Ha (Pohang University of Science and Technology), Kim Youwang (Pohang University of Science and Technology), Jaeheung Surh (Bucketplace), Hyowon Ha (Bucketplace), Tae-Hyun Oh (POSTECH) PDF Poster Video (Right click to download)
19	Few-shot Multispectral Segmentation with Representations Generated by Reinforcement Learning Dilith Jayakody (University of Moratuwa), Thanuja Ambegoda (University of Moratuwa) PDF Poster Video (Right click to download)
22	HDRSplat: Gaussian Splatting for High Dynmaic Range 3D Scene Reconstruction from Raw Images Shreyas Singh (Fractal Analytics ), Aryan Garg (Department of Computer Science, University of Wisconsin - Madison), Kaushik Mitra (Indian Institute of Technology, Madras) PDF Poster Video (Right click to download)
23	Alignment-aware Patch-level Routing for Dynamic Video Frame Interpolation Ban Chen (Samsung Electronics (China) R&D Center), Xin Jin (Samsung R&D Institute China-Nanjing (SRC-N)), LONG HAI WU (University of Science and Technology of China), Jie Chen (SRCN), Ilhyun Cho (Samsung), Cheul-hee Hahm (Samsung) PDF Poster Video (Right click to download)
25	AR-TTA: A Simple Method for Real-World Continual Test-Time Adaptation Damian Sójka (Technical University of Poznan), Bartłomiej Twardowski (IDEAS NCBR), Tomasz Trzcinski (Warsaw University of Technology), Sebastian Cygert (IDEAS NCBR) PDF Poster Video (Right click to download)
26	Improving Depth Gradient Continuity in Transformers: A Comparative Study on Monocular Depth Estimation with CNN Jiawei Yao (University of Washington), Tong Wu (Amazon), Xiaofeng Zhang (Shanghai Jiaotong University) PDF Poster
28	SciPostLayout: A Dataset for Layout Analysis and Layout Generation of Scientific Posters Shohei Tanaka (OMRON SINIC X), Hao Wang (Waseda University), Yoshitaka Ushiku (OMRON SINIC X) PDF Poster Video (Right click to download)
31	COSMo: CLIP Talks on Open-Set Multi-Target Domain Adaptation Munish Monga (Indian Institute of Technology, Bombay), Sachin Kumar Giroh (Indian Institute of Technology, Bombay), Ankit Jha (The LNM Institute of Information Technology), Mainak Singha (Indian Institute of Technology, Bombay), Biplab Banerjee (Indian Institute of Technology, Bombay, Dhirubhai Ambani Institute Of Information and Communication Technology), Jocelyn Chanussot (INRIA) PDF Poster Video (Right click to download)
32	No Captions, No Problem: Captionless 3D-CLIP Alignment with Hard Negatives via CLIP Knowledge and LLMs Cristian Sbrolli (Polytechnic Institute of Milan), Matteo Matteucci (Politecnico di Milano) PDF Poster Video (Right click to download)
33	Self-Supervised Real-World Denoising by Jointly Learning Visible and Invisible Noise Shaoyu Wang (Dalian Martime University), Changze Zhou (Dalian Maritime University), Bolin Song (Dalian Martime University), Yiyang Wang (Dalian Martime University) PDF Poster Video (Right click to download)
34	TalkLoRA: Low-Rank Adaptation for Speech-Driven Animation Jack Saunders (University of Bath), Vinay P Namboodiri (University of Bath) PDF Poster
37	DRAFT: Direct Radiance Fields Editing with Composable Operations Zhihan Cai (Tsinghua University, Tsinghua University), Kailu Wu (Tsinghua University, Tsinghua University), Dapeng Cao (Xi'an Jiaotong University), Feng Chen (University of Hong Kong), Kaisheng Ma (Institute for Interdisciplinary Information Sciences (IIIS), Tsinghua University) PDF Poster Video (Right click to download)
38	Linear Calibration Approach to Knowledge-free Group Robust Classification Ryota Ishizaki (Tokyo University of Science), Shunya Yamagami (Tokyo University of Science), Yuta Goto (Tokyo University of Science), Go Irie (Tokyo University of Science) PDF Poster Video (Right click to download)
39	HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene Reconstruction Haoyu Zhao (Wuhan University), Xingyue Zhao (Xi'an Jiaotong University), Lingting Zhu (The University of Hong Kong), Weixi Zheng (Wuhan University), Yongchao Xu (Wuhan University) PDF Poster Video (Right click to download)
41	Local Implicit Wavelet Transformer for Arbitrary-Scale Super-Resolution Minghong Duan (Fudan University), Linhao Qu (Fudan University), Shaolei Liu (Shanghai Institute of Microsystem and Information Technology), Manning Wang (Fudan University) PDF Poster Video (Right click to download)
42	Spatial-Temporal NAS for Fast Surgical Segmentation Matthew Lee (Medtronic), Felix John Samuel Bragman (Medtronic), Ricardo Sanchez-Matilla (Medtronic), Imanol Luengo (Medtronic), Danail Stoyanov (University College London) PDF Poster
43	Learning to Segment Publicly Accessible Green Spaces with Visual and Semantic Data Jian Gao (Queen's University Belfast), Niall McLaughlin (The Queen's University Belfast), Joanna Sara Valson (The Queen's University Belfast), Neil Anderson (The Queen's University Belfast), Ruth Hunter (The Queen's University Belfast) PDF Poster Video (Right click to download)
45	D³Nav: Data-Driven Driving Agents for Autonomous Vehicles in Unstructured Traffic Aditya Nalgunda Ganesh (Purdue University), Gowri Srinivasa (PES University, Bengaluru, India) PDF Poster Video (Right click to download)
46	FFR-UNet: Feature Filter-Refinement UNet for Medical Image Segmentation Weixin Xu (Beihang University) PDF Poster Video (Right click to download)
47	Group Activity Recognition via Spatio-Temporal Reasoning of Key Instances Haoting He (Xi'an Jiaotong University), Yaochen Li (Xi'an Jiaotong University), Yutong Wang (Xi'an Jiaotong University), Gaojie Li (Xi'an Jiaotong University), Wei Guo (Xi'an Jiaotong University), Runlin Zou (Xi'an Jiaotong University) PDF Poster Video (Right click to download)
53	NCA-Morph: Medical Image Registration with Neural Cellular Automata Amin Ranem (TU Darmstadt), John Kalkhof (TU Darmstadt), Anirban Mukhopadhyay (TU Darmstadt) PDF Poster Video (Right click to download)
54	InterroGate: Learning to Share, Specialize, and Prune Representations for Multi-task Learning Babak Ehteshami Bejnordi (QualComm), Gaurav Kumar (QualComm), Amelie Royer (Kyutai), Christos Louizos (QualComm), Tijmen Blankevoort (Facebook), Mohsen Ghafoorian (Qualcomm) PDF Poster Video (Right click to download)
60	Advancing Medical Image Segmentation: Morphology-Driven Learning with Diffusion Transformer Sungmin Kang (Dongguk University), Jaeha Song (Dongguk University), Jihie Kim (Dongguk University) PDF Poster Video (Right click to download)
64	Multi-Modal Information Bottleneck Attribution with Cross-Attention Guidance Pauline Bourigault (Imperial College London), Emmanuelle Bourigault (University of Oxford), Danilo Mandic (Imperial College London) PDF Poster Video (Right click to download)
66	Noise-Tolerant Few-Shot Unsupervised Adapter for Vision-Language Models Eman Ali (Mohamed bin Zayed University of Artificial Intelligence), Muhammad Haris Khan (Mohamed Bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download)
70	Advancing Anomaly Detection: The IDW dataset and MC algorithm Alexander D. J. Taylor (University of Bath), Jonathan James Morrison (Rolls-Royce Defence Aerospace), Phillip Tregidgo (University of Bristol), Neill D. F. Campbell (University of Bath) PDF Poster
74	ControlDreamer: Blending Geometry and Style in Text-to-3D Yeongtak Oh (Seoul National University), Jooyoung Choi (Seoul National University), Yongsung Kim (Seoul National University), Minjun Park (Seoul National University), Chaehun Shin (Seoul National University), Sungroh Yoon (Seoul National University) PDF Poster Video (Right click to download)
76	SagaGAN: Style Applied using Gram matrix Attribution based on StarGAN v2 Yongseon Yoo (Hanyang University), Seonggyu Kim (Hanyang University), Jong-Min Lee (Hanyang University) PDF Poster Video (Right click to download)
77	PT43D: A Probabilistic Transformer for Generating 3D Shapes from Single Highly-Ambiguous RGB Images Yiheng Xiong (Technical University of Munichn), Angela Dai (Technical University of Munich) PDF Poster Video (Right click to download)
85	Textual Attention RPN for Open-Vocabulary Object Detection Tae-Min Choi (Korea Institute of Science and Technology), Inug Yoon (Korea Advanced Institute of Science & Technology), Jong-Hwan Kim (Korea Advanced Institute of Science and Technology), Juyoun Park (Korea Institute of Science and Technology (KIST) ) PDF Poster Video (Right click to download)
100	Painterly Image Harmonization via Bi-Transformation with Dynamic Kernels Zhangliang Sun (Tsinghua University, Tsinghua University), Hui Zhang (Tsinghua University) PDF Poster Video (Right click to download)
101	Interactive Image Segmentation with Temporal Information Augmented Qiaoqiao Wei (School of Software, Tsinghua University), Hui Zhang (Tsinghua University), Jun-Hai Yong (Tsinghua University, Tsinghua University) PDF Poster Video (Right click to download)
102	Distribution-Aware Calibration for Object Detection with Noisy Bounding Boxes Donghao Zhou (The Chinese University of Hong Kong), Jialin Li (Tencent YouTu Lab), Jinpeng Li (The Chinese University of Hong Kong), Jiancheng Huang (Chinese Academy of Sciences), Qiang Nie (The Hong Kong University of Science and Technology), Yong Liu (Tencent Youtu Lab), Bin-Bin Gao (Tencent), Qiong Wang (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Chinese Academy of Sciences), Pheng-Ann Heng (The Chinese University of Hong Kong), Guangyong Chen (Zhejiang Lab) PDF Poster
103	Prompting Diffusion Representations for Cross-Domain Semantic Segmentation Rui Gong (Amazon), Martin Danelljan (ETH Zurich), Han Sun (EPFL - EPF Lausanne), Julio Delgado Mangas (Meta, Reality labs), Nikolay Marin (Amazon), Luc Van Gool (INSAIT - Sofia Un.) PDF Poster Video (Right click to download)
104	MMPrune4U: Regularizing Multimodal Feature Distortion in Weight Pruning for Deep Neural Network Compression Sudip Das (Valeo), Kaixin Xu (I2R, ASTAR), Nushrat Hussain (Indian Statistical Institute), Ziyuan Zhao (I2R, ASTAR), Arindam Das (Valeo), Weisi Lin (Nanyang Technological University), Ujjwal Bhattacharya (Indian Statistical Institute, Dhirubhai Ambani Institute Of Information and Communication Technology) PDF Poster Video (Right click to download)
108	MoManifold: Learning to Measure 3D Human Motion via Decoupled Joint Acceleration Manifolds Ziqiang Dang (Alibaba Group), Tianxing Fan (Zhejiang University), Boming Zhao (Zhejiang University), Xujie Shen (Zhejiang University), çŽ‹ ç£Š (Guangdong OPPO Mobile Telecommunications Corp.,Ltd.), Guofeng Zhang (Zhejiang University), Zhaopeng Cui (Zhejiang University) PDF Poster Video (Right click to download)
111	Projected Stochastic Gradient Descent with Quantum Annealed Binary Gradients Maximilian Krahn (Aalto University), Michele Sasdelli (The University of Adelaide), Frances Fengyi Yang (University of Adelaide), Vladislav Golyanik (Saarland Informatics Campus, Max-Planck Institute for Informatics), Juho Kannala (Aalto University), Tat-Jun Chin (The University of Adelaide), Tolga Birdal (Imperial College London) PDF Poster Video (Right click to download)
113	Text Removal In E-Commerce Images: A Comparison Of Inpainting Methods Hiya Roy (Rakuten Institute of Technology, The University of Tokyo), Bjorn Stenger (Rakuten Group Inc.) PDF Poster
114	Key-point Guided Deformable Image Manipulation Using Diffusion Model Seok-Hwan Oh (Korea Advanced Institute of Science & Technology), Guil Jung (KAIST), Myeong-Gee Kim (Barreleye, inc.), Sang-yun Kim (KAIST), Young-Min Kim (KAIST), hyeonjik lee (KAIST), Hyuksool Kwon (Seoul National University), Hyeonmin Bae (Korea Advanced Institute of Science and Technology) PDF Poster Video (Right click to download)
115	Multi-modal Crowd Counting via Modal Emulation Chenhao Wang (Harbin Institute of Technology), Xiaopeng Hong (Harbin Institute of Technology), Zhiheng Ma (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Chinese Academy of Sciences), Yupeng Wei (Harbin Institute of Technology), Yabin Wang (Xi'an Jiaotong University), Xiaopeng Fan (Harbin Institute of Technology) PDF Poster Video (Right click to download)
133	MonoGS++: Fast and Accurate Monocular RGB Gaussian SLAM Ren-Wu Li (AMD), Wenjing Ke (AMD), Dong Li (AMD), Lu Tian (AMD), Emad Barsoum (AMD) PDF Poster
135	Acoustic-based 3D Human Pose Estimation Robust to Human Position Yusuke Oumi (Keio University), Yuto Shibata (Keio University), Go Irie (Tokyo University of Science), Akisato Kimura (NTT Corporation), Yoshimitsu Aoki (Keio University), Mariko Isogawa (Keio University) PDF Poster Video (Right click to download)
136	PhysFlow: Skin tone transfer for remote heart rate estimation through conditional normalizing flows Joaquim Comas Martinez (Universitat Pompeu Fabra), Antonia Alomar (Universitat Pompeu Fabra), Adria Ruiz (CSIC-UPC), Federico Sukno (Pompeu Fabra University) PDF Poster Video (Right click to download)
137	InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular Depth Cho-Ying Wu (Bosch), Quankai Gao (University of Southern California), Chin-Cheng Hsu (Resemble AI), Te-Lin Wu (Character.AI), Jing-Wen Chen (University of Southern California), Ulrich Neumann (University of Southern California) PDF Poster Video (Right click to download)
140	Scalable Frame Sampling for Video Classification: A Semi-Optimal Policy Approach with Reduced Search Space Junho Lee (Seoul National University), Jeongwoo Shin (Seoul National University), Seung Woo Ko (LG AI Research), Seongsu Ha (Twelve Labs), Joonseok Lee (Seoul National University) PDF Poster Video (Right click to download)
142	Recovering Global Data Distribution Locally in Federated Learning Ziyu Yao (Peking University) PDF Poster Video (Right click to download)
145	Privacy-preserving datasets by capturing feature distributions with Conditional VAEs Francesco Di Salvo (University of Bamberg), David Tafler (University of Bamberg), Sebastian Doerrich (University of Bamberg), Christian Ledig (University of Bamberg) PDF Poster Video (Right click to download)
147	MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion Angel Villar-Corrales (University of Bonn), Moritz Austermann (Rheinische Friedrich-Wilhelms UniversitÃ¤t Bonn), Sven Behnke (University of Bonn) PDF Poster Video (Right click to download)
150	AISE: Adaptive Input Sampling for Explanation of Black-box Models Evgeny Tsykunov (Intel Corporation), Wonju Lee (Intel Corporation), Minje Park (Intel) PDF Poster
152	Retinex-Inspired Cooperative Game Through Multi-Level Feature Fusion for Robust, Universal Image Enhancement Ruiqi Mao (Northwest Polytechnical University Xi'an), Rongxin Cui (Northwestern Polytechnical University Xi'an) PDF Poster Video (Right click to download)
164	Synthetic-to-Real Domain Generalized Semantic Segmentation for 3D Indoor Point Clouds Yuyang Zhao (National University of Singapore), Na Zhao (Singapore University of Technology and Design), Gim Hee Lee (National University of Singapore) PDF Poster
165	Learning Object Placement via Convolution Scoring Attention Yibin Wang (Fudan University), Yuchao Feng (Westlake University), Jianwei Zheng (Zhejiang University of Technology) PDF
166	Syn-to-Real Unsupervised Domain Adaptation for Indoor 3D Object Detection Yunsong Wang (National University of Singapore), Na Zhao (Singapore University of Technology and Design), Gim Hee Lee (National University of Singapore) PDF Poster Video (Right click to download)
168	Topology-preserving Adversarial Training for Alleviating Natural Accuracy Degradation Xiaoyue Mi (University of the Chinese Academy of Sciences), Fan Tang (Institute of Computing Technology, CAS), Yepeng Weng (Lenovo Group Limited), Danding Wang (Institute of Computing Technology, Chinese Academy of Sciences), Juan Cao (Institute of Computing Technology, Chinese Academy of Sciences), Sheng Tang (Institute of Computing Technology, Chinese Academy of Sciences), Peng Li (Tsinghua University), Yang Liu (Tsinghua University) PDF Poster
180	JEAN: Joint Expression and Audio-guided NeRF-based Talking Face Generation Sai Tanmay Reddy Chakkera (State University of New York at Stony Brook), Aggelina Chatziagapi (Stony Brook University), Dimitris Samaras (Stony Brook University) PDF Poster Video (Right click to download)
183	Hierarchical Prompt Learning for Scene Graph Generation Xuhan Zhu (University of Chinese Academy of Sciences), Yifei Xing (Chinese Academy of Sciences), Ruiping Wang (Institute of Computing Technology, Chinese Academy of Sciences), Yaowei Wang (Harbin Institute of Technology, Shenzhen), Xiangyuan Lan (Peng Cheng Laboratory) PDF Poster Video (Right click to download)
184	Reclaiming Residual Knowledge: A Novel Paradigm to Low-Bit Quantization Roisin Luo (University of Galway), Alexandru Drimbarean (FotoNation), James McDermott (University of Galway), Colm O'Riordan (University of Galway) PDF Poster Video (Right click to download)
185	Motion Avatar: Generate Human and Animal Avatars with Arbitrary Motion Zeyu Zhang (The Australian National University), Yiran Wang (University of Sydney, University of Sydney), Biao Wu (University of Technology Sydney), Shuo Chen (Monash University), Zhiyuan Zhang (University of Adelaide), SHIYA HUANG (University of Adelaide), Wenbo Zhang (University of Adelaide), Meng Fang (University of Liverpool), Ling Chen (University of Technology Sydney), Yang Zhao (La Trobe University) PDF Poster Video (Right click to download)
188	A self-supervised and adversarial approach to hyperspectral demosaicking and RGB reconstruction in surgical imaging Peichao Li (King's College London), Oscar MacCormac (King's College London), Jonathan Shapey (King's College London), Tom Vercauteren (King's College London) PDF Poster Video (Right click to download)
199	A Revisit to the Decoder for Camouflaged Object Detection Seung Woo Ko (LG AI Research), Joopyo Hong (Seoul National University), Suyoung Kim (Seoul National University), Seungjai Bang (Seoul National University), Sungzoon Cho (Seoul National University), Nojun Kwak (Seoul National University), Hyung-Sin Kim (Seoul National University), Joonseok Lee (Seoul National University) PDF Poster Video (Right click to download)
200	Towards Generative Class Prompt Learning for Fine-grained Visual Recognition Soumitri Chattopadhyay (University of North Carolina at Chapel Hill), Sanket Biswas (Computer Vision Center, Universitat Autonoma de Barcelona), Emanuele Vivoli (Universidad Autonoma de Barcelona ), Josep Llados (Computer Vision Center, Universitat Autonoma de Barcelona) PDF Poster Video (Right click to download)
201	Infrared and Visible Image Fusion Using Multi-level Adaptive Fractional Differential Kang Zhang (Nanjing University of Science and Technology), Xinnian Guo (Suqian University) PDF Poster Video (Right click to download)
203	S³-Match: Common-View Aligned Image Matching via Self-Supervised Keypoint Selection Shizhen Li (Xi'an Jiaotong University), Jingcheng Liu (Xi'an Jiaotong University), Jianwu Fang (Xi'an Jiaotong University), DeZheng Gao (Xi'an Jiaotong University), Jianru Xue (Xi'an Jiaotong University) PDF Poster Video (Right click to download)
205	From Black-box to Label-only: a Plug-and-Play Attack Network for Model Inversion Huan Bao (Jinan University), Kaimin Wei (Jinan University), Yao Chen (Jinan University), Hanting Hou (Jinan University), Jinpeng Chen (Beijing University of Post and Telecommunication), Yongdong WU (Jinan University) PDF Poster Video (Right click to download)
207	Feature Splatting for Better Novel View Synthesis with Low Overlap Tomas Berriel Martins (Universidad de Zaragoza), Javier Civera (Universidad de Zaragoza) PDF Poster Video (Right click to download)
210	BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation Kieran Ryan Saunders (Aston University), Luis J Manso (Aston University), George Vogiatzis (Aston University) PDF Poster Video (Right click to download)
211	Align-DETR: Enhancing End-to-end Object Detection with Aligned Loss Zhi Cai (Beijing University of Aeronautics and Astronautics), Songtao Liu (Megvii Technology Inc.), Guodong Wang (Beijing University of Aeronautics and Astronautics), Zeming Li (BYTEDANCE), Zheng Ge (Megvii Technology Inc.), Xiangyu Zhang (MEGVII Technology), Di Huang (Beihang University) PDF Poster Video (Right click to download)
212	Mixstyle-Entropy: Whole Process Domain Generalization with Causal Intervention and Perturbation Luyao Tang (Xiamen University), Yuxuan Yuan (Xiamen University), Chaoqi Chen (The University of Hong Kong), Xinghao Ding (Xiamen University), Yue Huang (Xiamen University) PDF Poster Video (Right click to download)
213	Enabling Local Editing in Diffusion Models by Joint and Individual Component Analysis Theodoros Kouzelis (National Technical University of Athens), Emmanouil Plitsis (University of Athens), Mihalis Nicolaou (The Cyprus Institute), Yannis Panagakis (National and Kapodistrian University of Athens) PDF Poster Video (Right click to download)
215	AttEntropy: On the Generalization Ability of Supervised Semantic Segmentation Transformers to New Objects in New Domains Krzysztof Baron-Lis (Waabi), Matthias Rottmann (University of Wuppertal), Annika MÃ¼tze (Bergische UniversitÃ¤t Wuppertal), Sina Honari (Samsung), Pascal Fua (EPFL - EPF Lausanne), Mathieu Salzmann (Swiss Data Science Center) PDF Poster Video (Right click to download)
216	Erasing Concepts from Text-to-Image Diffusion Models with Few-shot Unlearning Masane Fuchi (Meiji University), Tomohiro Takagi (Meiji University) PDF Poster Video (Right click to download)
217	GeoFormer: A Multi-Polygon Segmentation Transformer Maxim Khomiakov (Technical University of Denmark), Michael Riis Andersen (Technical University of Denmark), Jes Frellsen (Technical University of Denmark) PDF Poster
218	RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance Avideep Mukherjee (Indian Institute of Technology Kanpur), Soumya Banerjee (IIT Kanpur, IIT Kanpur), Piyush Rai (IIT Kanpur, IIT Kanpur), Vinay P Namboodiri (University of Bath) PDF Poster Video (Right click to download)
223	AUPIMO: Redefining Anomaly Localization Benchmarks with High Speed and Low Tolerance JoÃ£o P. C. Bertoldo (PSL University), Dick Ameln (Intel), Ashwin Vaidya (Intel), Samet Akcay (Intel) PDF Poster Video (Right click to download)
227	Cost-Sensitive Learning for Long-Tailed Temporal Action Segmentation Zhanzhong Pang (National University of Singapore), Fadime Sener (Meta), Shrinivas Ramasubramanian (Fujitsu Research and Development Center), Angela Yao (National University of Singapore) PDF Poster Video (Right click to download)
228	Learning Scene-Goal-Aware Motion Representation for Trajectory Prediction Ziyang Ren (Xi'an Jiaotong University), Ping Wei (Xi'an Jiaotong University), Haowen Tang (Xi'an Jiaotong University), Huan Li (Xi'an Jiaotong University), Jin Yang (Xi'an Jiaotong University) PDF Poster Video (Right click to download)
240	SAM Helps SSL: Mask-guided Attention Bias for Self-supervised Learning Kensuke Taguchi (Kyocera Corporation), Takehiko Kawai (Kyocera Corporation), Wataru Imaeda (Kyocera Corporation), Hironobu Fujiyoshi (DENSO CORPORATION) PDF Poster Video (Right click to download)
245	Enhancing 3D Hand Pose Estimation via Dense Ordinal Regression Network Yamin Mao (Samsung), Zhihua Liu (Samsung Research Center, Beijing), Weiming Li (Samsung), SoonYong Cho (Samsung), Qiang Wang (Samsung), Xiaoshuai Hao (Beijing Academy of Artificial Intelligence(BAAl) ) PDF Poster Video (Right click to download)
249	Transferable Learned Image Compression-Resistant Adversarial Perturbations Yang Sui (Rice University), Zhuohang Li (Vanderbilt University), Ding Ding (Tencent Media Lab), Xiang Pan (Tencent), Xiaozhong Xu (Tencent Media Lab), Shan Liu (Tencent Media Lab), Zhenzhong Chen (Wuhan University) PDF Poster Video (Right click to download)
250	Deep Unfolding Network with Spatial-spectral Perception Enhanced for Pan-sharpening Mengjiao Zhao (Zhejiang University), Mengting Ma (Zhejiang University), Xiangdong Li (Zhejiang University), Ao Gao (Zhejiang University), Siyang Song (University of Exeter), Wei Zhang (Zhejiang University) PDF Poster Video (Right click to download)
256	IncreLM: Incremental 3D Line Mapping Xulong Bai (Institute of automation, Chinese academy of science, Chinese Academy of Sciences), Hainan Cui (Chinese Academy of Sciences), Shuhan Shen (Institute of automation, Chinese academy of science) PDF Poster Video (Right click to download)
257	Motion Tracking with Rotated Bounding Boxes on Overhead Fisheye Imagery Jordan Lam (Zhejiang University) PDF Poster
262	Toward Highly Efficient Semantic-Guided Machine Vision for Low-Light Object Detection Xin Feng (Chongqing University of Technology), Junxian Zeng (Chongqing University of Technology), Siping Wang (Chongqing University of Technology), Zhenwei He (Chongqing University of Technology) PDF Poster Video (Right click to download)
263	Improving Object Detection via Local-global Contrastive Learning Danai Triantafyllidou (Huawei Technologies Ltd.), Sarah Parisot (Huawei Technologies Ltd.), Ales Leonardis (University of Birmingham), Steven McDonagh (University of Edinburgh) PDF Poster Video (Right click to download)
267	Depth-Guided Privacy-Preserving Visual Localization Using 3D Sphere Clouds Heejoon Moon (Hanyang University), Jongwoo Lee (Hanyang University), Jeonggon Kim (Hanyang University), Je Hyeong Hong (Hanyang University) PDF Poster Video (Right click to download)
287	A Super-pixel-based Approach to the Stable Interpretation of Neural Networks Shizhan Gong (the Chinese University of Hong Kong), Jingwei Zhang (The Chinese University of Hong Kong), Qi Dou (The Chinese University of Hong Kong), Farzan Farnia (The Chinese University of Hong Kong) PDF Poster Video (Right click to download)
288	PawFACS: Leveraging Semi-Supervised Learning for Pet Facial Action Recognition Anandavardhan Hegde (Samsung), Sudha Velusamy (Samsung), Narayan Kothari (Samsung), Aman Bahuguna (Samsung), Apnesh Rawat (National Institute of Technology Delhi), Hema Sathiamurthy (Indian Institute of Technology, Madras, Dhirubhai Ambani Institute Of Information and Communication Technology), Ankit Raja (Galgotias University ) PDF Poster Video (Right click to download)
290	Are Sparse Neural Networks Better Hard Sample Learners? Qiao Xiao (Eindhoven University of Technology), Boqian Wu (University of Twente), Lu Yin (University of Surrey), Christopher Neil Gadzinski (University of Luxemburg), Tianjin Huang (University of Exeter), Mykola Pechenizkiy (Eindhoven University of Technology), Decebal Constantin Mocanu (University of Luxemburg) PDF Poster Video (Right click to download)
295	MxT: Mamba x Transformer for Image Inpainting Shuang Chen (Durham University), Amir Atapour-Abarghouei (Durham University), Haozheng Zhang (Durham University), Hubert P. H. Shum (Durham University) PDF Poster Video (Right click to download)
297	Generalizing Teacher Networks for Effective Knowledge Distillation Across Student Architectures Kuluhan Binici (National University of Singapore), Weiming Wu (National University of Singapore), Tulika Mitra (National University of Singapore) PDF Poster Video (Right click to download)
299	RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields Mihnea-Bogdan Jurca (Vrije Universiteit Brussel), Remco Royen (Vrije Universiteit Brussel), Ion Giosan (Technical University of Cluj-Napoca), Adrian Munteanu (Vrije Universiteit Brussel) PDF Poster
303	MixMask: Revisiting Masking Strategy for Siamese ConvNets Kirill Vishniakov (M42), Eric Xing (Mohamed bin Zayed Univeristy of AI), Zhiqiang Shen (Mohamed bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download)
304	Interpretable Representation Learning from Videos using Nonlinear Priors Marian Longa (University of Oxford), Joao F. Henriques (University of Oxford) PDF Poster Video (Right click to download)
305	PEEKABOO: Hiding Parts of an Image for Unsupervised Object Localization Hasib Zunair (Concordia University), Abdessamad Ben Hamza (Concordia University) PDF Poster Video (Right click to download)
307	Discovering an Image-Adaptive Coordinate System for Photography Processing Ziteng Cui (The University of Tokyo), Lin Gu (RIKEN), Tatsuya Harada (RIKEN) PDF
308	Effective Message Hiding with Order-Preserving Mechanisms Gao Yu (University of Queensland), Xuchong QIU (Bosch), Zihan Ye (Xi'an Jiaotong-Liverpool University) PDF Poster Video (Right click to download)
317	EIANet: A Novel Domain Adaptation Approach to Maximize Class Distinction with Neural Collapse Principles Zicheng Pan (Griffith University), Xiaohan Yu (Macquarie University), Yongsheng Gao (Griffith University) PDF Poster Video (Right click to download)
318	Mumpy: Multilateral Temporal-view Pyramid Transformer for Video Inpainting Detection Ying Zhang (Ocean University of China), Yuezun Li (Ocean University of China), Bo Peng (Institute of automation, Chinese academy of science, Chinese Academy of Sciences), Jiaran Zhou (Ocean University of China), Huiyu Zhou (University of Leicester), Junyu Dong (Ocean University of China) PDF Poster Video (Right click to download)
319	Annotation by Clicks: A Point-Supervised Contrastive Variance Method for Medical Semantic Segmentation Qing En (Carleton University), Yuhong Guo (Carleton University) PDF Poster Video (Right click to download)
323	Complete the Feature Space: Diffusion-Based Fictional ID Generation for Face Recognition Myeong-Yeon Yi (Seoul National University), DongJae Lee (KAIST), Naeun Ko (Naver corporation), Yonghyun Jeong (NAVER), Sang-goo Lee (Seoul National University), Seunggyu Chang (NAVER Cloud) PDF Poster Video (Right click to download)
328	DisCoM-KD: Cross-Modal Knowledge Distillation via Disentanglement Representation and Adversarial Learning Dino Ienco (National Institute for Agriculture, Environment and Food), Cassio Fraga Dantas (INRAE) PDF Poster
329	Uni-Mlip: Unified Self-Supervision for Medical Vision Language Pre-training Ameera Ali Bawazir (Technology Innovation Institute ), Kebin Wu (Technology Innovation Institute), Wenbin LI (Technology Innovation Institute) PDF Poster Video (Right click to download)
330	Towards Better Zero-Shot Anomaly Detection under Distribution Shift with CLIP Jiyao Gao (Sichuan University), Chengxin He (Sichuan University), Lei Duan (Sichuan University), Jie Zuo (Sichuan University) PDF Poster Video (Right click to download)
335	SignVTCL: Multi-Modal Continuous Sign Language Recognition Enhanced by Visual-Textual Contrastive Learning Hao Chen (Department of Computer Science and Engineering, The Chinese University of Hong Kong), Jiaze Wang (The Chinese University of Hong Kong), Ziyu Guo (Department of Computer Science and Engineering, The Chinese University of Hong Kong), Jinpeng Li (The Chinese University of Hong Kong), Donghao Zhou (The Chinese University of Hong Kong), Bian Wu (Zhejiang University), Chenyong Guan (Gudsen Technology Co. Ltd), Guangyong Chen (Zhejiang Lab), Pheng-Ann Heng (The Chinese University of Hong Kong) PDF Poster Video (Right click to download)
339	FastForensics: Efficient Two-Stream Design for Real-Time Image Manipulation Detection zhangyangxiang (Ocean University of China), Yuezun Li (Ocean University of China), Ao Luo (Southwest Jiaotong University), Jiaran Zhou (Ocean University of China), Junyu Dong (Ocean University of China) PDF Poster Video (Right click to download)
342	Unsupervised Domain Adaptation for Tubular Structure Segmentation Across Different Anatomical Sources Yuxiang An (University of Sydney), Dongnan Liu (University of Sydney), Weidong Cai (University of Sydney) PDF Poster Video (Right click to download)
346	Backdoor Defense through Self-Supervised and Generative Learning Ivan Sabolic (University of Zagreb), Ivan Grubišić (University of Zagreb), Siniša Šegvić (University of Zagreb) PDF Poster Video (Right click to download)
352	DiffusedWrinkles: A Diffusion-Based Model for Data-Driven Garment Animation Raquel Vidaurre (Universidad Rey Juan Carlos), Elena Garces (Adobe Systems), Dan Casas (Universidad Rey Juan Carlos) PDF Poster Video (Right click to download)
358	Trimming the Fat: Efficient Compression of 3D Gaussian Splats through Pruning Muhammad Salman Ali (Kyung Hee University), Maryam Qamar (Kyung Hee University), Sung-Ho Bae (Kyung Hee University), Enzo Tartaglione (Institut Polytechnique de Paris) PDF Poster Video (Right click to download)
361	Seg-HGNN: Unsupervised and Light-Weight Image Segmentation with Hyperbolic Graph Neural Networks Debjyoti Mondal (Samsung), Rahul Mishra (Samsung), Chandan Kumar Pandey (Samsung) PDF Poster Video (Right click to download)
362	Into the Fog: Evaluating Robustness of Multiple Object Tracking Nadezda Kirillova (Technische UniversitÃ¤t Graz), Muhammad Jehanzeb Mirza (Massachusetts Institute of Technology), Horst Bischof (Graz University of Technology), Horst Possegger (Graz University of Technology) PDF Poster Video (Right click to download)
365	Anchor-Based Masked Generative Distillation for Pixel-Level Prediction Tasks Xie Yu (Beijing University of Aeronautics and Astronautics), Wentao Zhang (Beijing University of Aeronautics and Astronautics) PDF Poster Video (Right click to download)
369	Benchmarking and Optimizing Federated Learning with Hardware-related Metrics Kai Pan (Institute of Computing Technology, Chinese Academy of Sciences), Yapeng Tian (University of Texas at Dallas), Yinhe Han (Institute of Computing Technology, Chinese Academy of Sciences), Yiming Gan (Institute of Computing Technology, Chinese Academy of Sciences) PDF Poster Video (Right click to download)
374	Text-Guided Mixup Towards Long-Tailed Image Categorization Richard Franklin (University of Washington), Jiawei Yao (University of Washington), Deyang Zhong (University of Washington), Qi Qian (Zoom), Juhua Hu (University of Washington) PDF Poster Video (Right click to download)
375	A Novel Divide and Merge Approach for Improved Classification of Functional Data wei zhao (University of Manchester), Xiao-Jun Zeng (University of Manchester), Chengdong shi (University of Manchester), Ching-Hsun Tseng (University of Manchester), Yue Chang (University of Manchester) PDF Poster Video (Right click to download)
384	Few-Shot Classification of Interactive Activities of Daily Living (InteractADL) Zane Durante (Stanford University), Robathan Harries (Stanford University), Edward Vendrow (Massachusetts Institute of Technology), Zelun Luo (Stanford University), Yuta Kyuragi (Panasonic R&D Company of America), Kazuki Kozuka (Panasonic Corporation), Li Fei-Fei (Stanford University), Ehsan Adeli (Stanford University) PDF Poster
388	ACIL: Active Class Incremental Learning for Image Classification Aditya Bhattacharya (Florida State University), Debanjan Goswami (Florida State University), Shayok Chakraborty (Florida State University) PDF Poster Video (Right click to download)
391	PatchRot: Self-Supervised Training of Vision Transformers by Rotation Prediction Sachin Chhabra (Arizona State University), Hemanth Venkateswara (Georgia State University), Baoxin Li (Arizona State University) PDF Poster Video (Right click to download)
392	Label Smoothing++: Enhanced Label Regularization for Training Neural Networks Sachin Chhabra (Arizona State University), Hemanth Venkateswara (Georgia State University), Baoxin Li (Arizona State University) PDF Poster Video (Right click to download)
401	Decoupling Forgery Semantics for Generalizable Deepfake Detection Wei Ye (Nanchang University), Xinan He (Nanchang University), Feng Ding (Nanchang University) PDF Poster Video (Right click to download)
406	When Text and Images Don't Mix: Bias-Correcting Language-Image Similarity Scores for Anomaly Detection Adam Goodge (ASTAR), Bryan Hooi (National University of Singapore), Wee Siong Ng (Institute for Infocomm Research, ASTAR) PDF Poster Video (Right click to download)
414	NSSR-DIL: Null-Shot Image Super-Resolution Using Deep Identity Learning Sree Rama Vamsidhar S (Indian Institute of Technology Tirupati), Gorthi Rama Krishna Sai Subrahmanyam (Indian Institute of Technology, Tirupati, INDIA) PDF Poster Video (Right click to download)
416	Taming the Tail: Leveraging Asymmetric Loss and PadÃ© Approximation to Overcome Long-Tailed Class Imbalance Pankhi Kashyap (Google), Pavni Tandon (Indian Institute of Technology, Bombay), Sunny Gupta (Indian Institute of Technology, Bombay), Abhishek Tiwari (Indian Institute of Technology, Bombay, Dhirubhai Ambani Institute Of Information and Communication Technology), Ritwik Kulkarni (Oraicle Biosciences LTD), Kshitij Sharad Jadhav (Indian Institute of Technology, Bombay) PDF Poster Video (Right click to download)
417	Kernel Representation for Dynamic Networks Yichen Zhou (Sea Group), Teck Khim Ng (National University of Singapore) PDF Poster
420	Layout Free Scene Graph to Image Generation RAMESHWAR MISHRA (Indraprastha Institute of Information Technology, Delhi), A. Subramanyam (Indraprastha Institute of Information Technology, Delhi) PDF Poster Video (Right click to download)
421	Rethinking Domain Adaptive Optic Disc and Cup Segmentation in Fundus Image through Dynamic Diffusion Flow Canran Li (University of Sydney), Dongnan Liu (University of Sydney), Weidong Cai (The University of Sydney) PDF Poster Video (Right click to download)
424	RETRO: Reusing teacher projection head for efficient embedding distillation on Lightweight Models via Self-supervised Learning Khanh-Binh Nguyen (Deakin University), Chae Jung Park (National Cancer Center) PDF
425	GLCM-Adapter: Global-Local Content Matching for Few-shot CLIP Adaptation Shuo Wang (University of Science and Technology of China), Xieenlong (University of Science and Technology of China), Jinda Lu (University of Science and Technology of China), Jinghan Li (University of Science and Technology of China), Yanbin Hao (University of Science and Technology of China) PDF Poster Video (Right click to download)
426	Unified Compositional Query Machine with Multimodal Consistency for Video-based Human Activity Recognition Tuyen Tran (Deakin University), Thao Minh Le (Deakin University), Duy Hung Tran (Deakin University), Truyen Tran (Deakin University) PDF Poster Video (Right click to download)
427	Lightweight Human Pose Estimation with Enhanced Knowledge Review Hao Xu (Nanjing University of Information Science and Technology), Shengye Yan (Nanjing University of Information Science and Technology), Wei Zheng (MINIEYE) PDF Poster
432	Channel-Partitioned Windowed Attention And Frequency Learning for Single Image Super-Resolution Dinh Phu Tran (Korea Advanced Institute of Science & Technology), Dao Duy Hung (Korea Advanced Institute of Science & Technology), Daeyoung Kim (Korea Advanced Institute of Science and Technology) PDF Poster Video (Right click to download)
433	Separated and Independent Contrastive Learning on Labeled and Unlabeled Samples: Boosting Performance on Long-tail Semi-supervised Learning Dongyoung Kim (Hallym University), Jeong-Gun Lee (Hallym University), WonSook Lee (University of Ottawa) PDF Poster Video (Right click to download)
437	Difflare: Removing Image Lens Flare with Latent Diffusion Models Tianwen Zhou (Beijing Normal University), Qihao Duan (University of the Chinese Academy of Sciences), Zitong YU (Great Bay University) PDF Poster Video (Right click to download)
440	Explaining Multi-modal Large Language Models by Analyzing their Vision Perception Loris Giulivi (Polytechnic Institute of Milan), Giacomo Boracchi (Polytechnic Institute of Milan) PDF Poster
448	Learning to Project for Cross-Task Knowledge Distillation Dylan Auty (Imperial College London), Roy Miles (Huawei Technologies Ltd.), Benedikt Kolbeinsson (Imperial College London), Krystian Mikolajczyk (Imperial College London) PDF Poster
452	Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty Saining Zhang (Nanyang Technological University), Baijun Ye (Tsinghua University), Xiaoxue Chen (Tsinghua University, Tsinghua University), Yuantao Chen (The Chinese University of Hong Kong,Shenzhen), Zongzheng Zhang (Tsinghua University), Cheng Peng (Beijing Institute of Technology), Yongliang Shi (Tsinghua University, Tsinghua University), Hao Zhao (Tsinghua University, Tsinghua University) PDF Poster Video (Right click to download)
457	LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps Andrey Palaev (Innopolis University), Adil Khan (University of Hull), Syed M Ahsan Kazmi (University of the West of England, Bristol) PDF Poster Video (Right click to download)
472	SAM-EG: Segment Anything Model with Egde Guidance framework for efficient Polyp Segmentation Quoc-Huy Trinh (Aalto University), Hai-Dang Nguyen (Ho Chi Minh city University of Science, Vietnam National University), Nguyen Ngoc Bao Tram (Ho Chi Minh city University of Science, Vietnam National University), Debesh Jha (Northwestern University), Ulas Bagci (Northwestern University), Minh-Triet Tran (Ho Chi Minh city University of Science, Vietnam National University) PDF Poster
480	Disparity Estimation Using a Quad-Pixel Sensor Zhuofeng Wu (Tokyo Institute of Technology, Tokyo Institute of Technology), Doehyung Lee (Tokyo Institute of Technology, Tokyo Institute of Technology), Zihua Liu (Tokyo Institute of Technology, Tokyo Institute of Technology), Kazunori Yoshizaki (Olympus Medical Systems Corporation), Yusuke Monno (Institute of Science Tokyo), Masatoshi Okutomi (Tokyo Institute of Technology) PDF Poster Video (Right click to download)
482	Unsupervised Hashing Network with Hyper Quantization Tree Sungeun Kim (Ajou University), Jongbin Ryu (Ajou University) PDF Poster Video (Right click to download)
486	DAVINCI: A Single-Stage Architecture for Constrained CAD Sketch Inference Ahmet Serdar Karadeniz (University of Luxemburg), Dimitrios Mallis (University of Luxemburg), Nesryne Mejri (University of Luxembourg), Kseniya Cherenkova (University of Luxemburg), Anis Kacem (University of Luxemburg), Djamila Aouada (University of Luxemburg) PDF Poster Video (Right click to download)
492	Multimodal base distributions in conditional flow matching generative models Shane Josias (University of Stellenbosch), Willie Brink (Stellenbosch University) PDF Poster Video (Right click to download)
493	Spike-SLR: An Energy-efficient Parallel Spiking Transformer for Event-based Sign Language Recognition Xinxu Lin (Sichuan University), Mingxuan Liu (Tsinghua University, Tsinghua University), Kezhuo Liu (Tsinghua University, Tsinghua University), Hong Chen (Tsinghua University, Tsinghua University) PDF Poster Video (Right click to download)
499	MotionMAE: Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders Haosen Yang (University of Surrey), Deng Huang (Meituan), Bin Wen (Beijing University of Aeronautics and Astronautics), Jiannan Wu (University of Hong Kong), Hongxun Yao (Harbin Institute of Technology), Yi Jiang (Bytedance), Xiatian Zhu (University of Surrey), Zehuan Yuan (ByteDance Inc.) PDF Poster Video (Right click to download)
500	Future Does Matter: Boosting 3D Object Detection with Temporal Motion Estimation in Point Cloud Sequences Rui Yu (East China University of Science and Technology), Runkai Zhao (University of Sydney, University of Sydney), Cong Nie (Tongji University), Heng Wang (Sony R&D), Siyu Li (East China University of Science and Technology), Songhao Zhu (East China University of Science and Technology) PDF Poster Video (Right click to download)
505	FLARE up your data: Diffusion-based Augmentation Method in Astronomical Imaging Mohammed Talha Alam (Mohamed bin Zayed University of Artificial Intelligence), Raza Imam (Mohamed bin Zayed University of Artificial Intelligence), Mohsen Guizani (Mohamed bin Zayed University of Artificial Intelligence), Fakhri Karray (University of Waterloo) PDF Poster Video (Right click to download)
508	Semantic Image Synthesis of Anime Characters Based on Conditional Generative Adversarial Networks Xuhui Zhu (Chongqing University), feng jiang (Chongqing University), Jing Wen (Chongqing University), yi wang (Chongqing University), qiang gao (Chongqing University) PDF Poster Video (Right click to download)
510	ML-2SN: A Hybrid Two-Stream System for Sitting Posture Detection Kehang Jia (Suzhou University), Gaorui Zhang (Suzhou University), Yixuan Yang (Suzhou University), Guangwei Huang (Suzhou University), Penghuan Wang (Suzhou University), Cheng Cheng (Suzhou University) PDF Poster
517	Interpretable Long-term Action Quality Assessment Xu Dong (University of Surrey), Xinran Liu (University of Surrey), Wanqing Li (University of Wollongong), Anthony Adeyemi-Ejeye (University of Surrey), Andrew Gilbert (University of Surrey) PDF Poster Video (Right click to download)
524	A self-supervised cyclic neural-analytic approach for novel view synthesis and 3D reconstruction Dragos Costea (University Politehnica of Bucharest), Alina Marcu (Institute of Mathematics of the Romanian Academy), Marius Leordeanu (Norwegian Research Center (NORCE)) PDF Poster Video (Right click to download)
528	SOFI: Multi-Scale Deformable Transformer for Camera Calibration with Enhanced Line Queries Sebastian Janampa (University of New Mexico), Marios Pattichis (University of New Mexico) PDF Poster Video (Right click to download)
532	Prompt Generation Networks for Input-Space Adaptation of Frozen Vision Transformers Jochem Loedeman (University of Amsterdam), Maarten C. Stol (BrainCreators ), Tengda Han (Google DeepMind), Yuki M Asano (University of Technology Nuremberg) PDF Poster
533	TraIL-Det: Transformation-Invariant Local Feature Networks for 3D LiDAR Object Detection with Unsupervised Pre-Training Li Li (King's College London, University of London), Tanqiu Qiao (Durham University), Hubert P. H. Shum (Durham University), Toby P. Breckon (Durham University) PDF Poster
534	Enhancing Cardiovascular Disease Prediction through Multi-Modal Self-Supervised Learning Francesco Girlanda (ETH Zurich), Olga V. Demler (ETH Zurich), Bjoern Menze (University of Zurich), Neda Davoudi (ETH Zurich) PDF Poster Video (Right click to download)
537	Out-Of-Distribution Detection for Audio-visual Generalized Zero-Shot Learning: A General Framework Liuyuan Wen (University of Science and Technology of China) PDF Poster Video (Right click to download)
545	Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection Christian Fruhwirth-Reisinger (Graz University of Technology), Wei Lin (Johannes Kepler University Linz), Dušan Malić (Graz University of Technology), Horst Bischof (Graz University of Technology), Horst Possegger (Graz University of Technology) PDF Poster Video (Right click to download)
546	Balancing Calibration and Performance: Stochastic Depth in Segmentation BNNs Linghong Yao (InstaDeep), Denis Hadjivelichkov (University College London), Andromachi Maria Delfaki (University College London), Yuanchang Liu (), Brooks Paige (University College London), Dimitrios Kanoulas (University College London) PDF Poster Video (Right click to download)
557	Hybrid-CSR: Coupling Explicit and Implicit Reconstruction of Cortical Surface shanlin sun (University of California, Irvine), Tung Le (University of California, Irvine), Pooya Khosravi (University of California, Irvine), Chenyu You (State University of New York at Stony Brook), Kun Han (University of California, Irvine), Haoyu Ma (Meta Platforms, Inc), Deying Kong (University of California, Irvine), Xiangyi Yan (University of California, Irvine), Xiaohui Xie (University of California, Irvine) PDF Poster
563	As Firm As Their Foundations: Creating Transferable Adversarial Examples Across Downstream Tasks with CLIP Anjun Hu (University of Oxford), Jindong Gu (University of Oxford), Francesco Pinto (University of Chicago), Konstantinos Kamnitsas (University of Oxford), Philip Torr (University of Oxford) PDF Poster Video (Right click to download)
566	SuperLoRA: Parameter-Efficient Unified Adaptation of Large Foundation Models Xiangyu Chen (University of Kansas), Jing Liu (Mitsubishi Electric Research Labs), Ye Wang (Mitsubishi Electric Research Labs), Pu Perry Wang (Mitsubishi Electric Research Labs), Matthew Brand (Yale University), Guanghui Wang (Toronto Metropolitan University), Toshiaki Koike-Akino (Mitsubishi Electric Research Labs) PDF Poster Video (Right click to download)
568	Beyond Static and Dynamic Quantization - Hybrid Quantization of Vision Transformers Piotr Kluska (International Business Machines), Florian Scheidegger (International Business Machines), A. Cristiano I. Malossi (International Business Machines), Enrique S. Quintana-Orti (Universidad Politecnica de Valencia) PDF Poster
572	Multi-Scope Representation Learning for Causal Relation Discovery with new Challenging Datasets Jiageng Zhu (University of Southern California), Hanchen Xie (Bosch), Jianhua Wu (University of Southern California), Mohamed E. Hussein (USC/ISI), Mahyar Khayatkhoei (USC/ISI), Jiazhi Li (Futurewei Technologies Inc.), Wael AbdAlmageed (Clemson University) PDF Poster Video (Right click to download)
577	AtomGS: Atomizing Gaussian Splatting for High-Fidelity Radiance Field Rong Liu (University of Southern California), Rui Xu (USC Institute for Creative Technologies, University of Southern California), Yue Hu (University of Southern California), Meida Chen (University of Southern California), Andrew Feng (Institute for Creative Technologies, University of Southern California) PDF Poster Video (Right click to download)
579	Neural Collapse Inspired Contrastive Continual Learning Antoine Montmaur (ENSEA), Nicolas Larue (ENSEA), Ngoc-Son Vu (ENSEA) PDF Poster Video (Right click to download)
584	ATLANTIS: A Framework for Automated Targeted Language-guided Augmentation Training for Robust Image Search Inderjeet Singh (Fujitsu Research of Europe Limited), Roman Vainshtein (Fujitsu Research and Development Center Co. Ltm.), Alon Zolfi (Ben Gurion University of the Negev), Asaf Shabtai (Ben-Gurion University of the Negev), Tu Bui (Fujitsu Research and Development Center Co. Ltm.), Jonathan Brokman (Technion - Israel Institute of Technology, Technion - Israel Institute of Technology), Omer Hofman (Fujitsu Research and Development Center Co. Ltm.), Fumiyoshi Kasahara (Fujitsu Research and Development Center Co. Ltm.), Kentaro Tsuji (Fujitsu Research and Development Center Co. Ltm.), Hisashi Kojima (Fujitsu Research and Development Center Co. Ltm.) PDF Poster Video (Right click to download)
595	A Prototype Unit for Image De-raining using Time-Lapse Data Jaehoon Cho (Hyundai Motor Company), Minjung Yoo (Korea Aerospace University), Jini Yang (Korea Aerospace University), Sunok Kim (Korea Aerospace University) PDF Poster Video (Right click to download)
597	FADE: Few-shot/zero-shot Anomaly Detection Engine using Large Vision-Language Model Yuanwei Li (Onfido), Elizaveta Ivanova (Onfido), Martins Bruveris (Onfido) PDF Poster Video (Right click to download)
599	VLAVAD: Vision-Language Models Assisted Unsupervised Video Anomaly Detection Changkang Li (Beijing University of Aeronautics and Astronautics), Yalong Jiang (Beihang University) PDF Poster Video (Right click to download)
601	Training-Free Zero-Shot Semantic Segmentation with LLM Refinement Yuantian Huang (CyberAgent, Inc.), Satoshi Iizuka (University of Tsukuba, Tsukuba University), Kazuhiro Fukui (University of Tsukuba) PDF Poster Video (Right click to download)
606	VEMIC: View-aware Entropy model for Multi-view Image Compression Susmija Jabbireddy (University of Maryland, College Park), Davit Soselia (University of Maryland, College Park), Max Ehrlich (University of Maryland, College Park), Christopher Metzler (University of Maryland, College Park), Amitabh Varshney (University of Maryland, College Park) PDF Poster Video (Right click to download)
609	Guidance-base Diffusion Models for Improving Photoacoustic Image Quality Tatsuhiro Eguchi (Kyushu University, Tokyo Institute of Technology), Shumpei Takezaki (Kyushu University), Mihoko Shimano (National Institute of Informatics), Takayuki Yagi (Tokyo Institute of Technology, Tokyo Institute of Technology), Ryoma Bise (Kyushu University, Faculty of Information Science and Electrical Engineering) PDF Poster Video (Right click to download)
611	STPose: 6D object pose estimation network based on sparse attention and cross-layer connection Shihao Chen (Wuhan University), Xiaobing Li (Guangxi University), Keduo Yan (Guangxi University), Yong Li (Guangxi University), Dongxu Gao (University of Portsmouth) PDF Poster Video (Right click to download)
615	Measuring Physical Plausibility of 3D Human Poses Using Physics Simulation Nathan Louis (University of Michigan - Ann Arbor), Mahzad Khoshlessan (University of Michigan - Ann Arbor), Jason J Corso (University of Michigan) PDF Poster Video (Right click to download)
619	Prompt-guided Multi-modal contrastive learning for Cross-compression-rate Deepfake Detection Ching-Yi Lai (National Tsinghua University), Chiou-ting Hsu (National Tsing Hua University), Chih-Chung Hsu (National Yang Ming Chiao Tung University), Chia-Wen Lin (National Tsing Hua University) PDF
622	The Attempt on Combining Three Talents by KD with Enhanced Boundary in Co-Salient Object Detection Ziyi Cao (Nanjing University of Information Science and Technology), Shengye Yan (Nanjing University of Information Science and Technology), Wei Zheng (MINIEYE) PDF Poster
627	GLPI: A Global Layered Prompt Integration approach for Explicit Visual Prompt Yufei Gao (Zhengzhou University), Bin Fu (Zhengzhou University), Lei Shi (Zhengzhou University), Chengming Liu (Zhengzhou University), yucheng shi (Zhengzhou University) PDF Poster Video (Right click to download)
630	CPDR: Towards Highly-Efficient Salient Object Detection via Crossed Post-decoder Refinement Yijie Li (Northwestern University, Northwestern University), Hewei Wang (Apple), Aggelos Katsaggelos (Northwestern University) PDF Poster Video (Right click to download)
637	3D Point Cloud Network Pruning: When Some Weights Do not Matter Amrijit Biswas (North South University), Md. Ismail Hossain (North South University), M M Lutfe Elahi (North South University), Ali Cheraghian (CSIRO), Fuad Rahman (University of Arizona), Nabeel Mohammed (North South University), Shafin Rahman (North South University) PDF Poster Video (Right click to download)
642	Revitalizing Legacy Video Content: Deinterlacing with Bidirectional Information Propagation Zhaowei Gao (Beijing Jingwei Hirain Technologies Co., Inc.), Mingyang Song (Disney Research, Disney Research), Christopher Schroers (Disney Research\|Studios, Disney), Yang Zhang (Disney Research, Disney) PDF Poster
648	3D Blur Kernel on Gaussian Splatting Yongchao Lin (Inner Mongolia University), Xiangdong Su (Inner Mongolia University), Yuhan Yang (Inner Mongolia University ) PDF Poster Video (Right click to download)
650	Drawing Insights: Sequential Representation Learning in Comics Sam Titarsolej (University of Amsterdam), Neil Cohn (Tilburg University), Nanne Van Noord (University of Amsterdam) PDF
657	G3FA: Geometry-guided GAN for Face Animation Alireza Javanmardi (German Research Center for AI), Alain Pagani (German Research Center for Artificial Intelligence), Didier Stricker (Technical University Kaiserslautern) PDF Poster Video (Right click to download)
659	GN-FR: Generalizable Neural Radinace Fields for Flare Removal Gopi Raju Matta (Indian Institute of Technology Madras), Rahul Siddartha (Indian Institute of Technology Madras, Indian Institute of Technology, Madras), RONGALI SIMHACHALA VENKATA GIRISH (Indian Institute of Technology, Madras.), Sumit Sharma (Indian Institute of Technology, Madras), Kaushik Mitra (Indian Institute of Technology, Madras) PDF Poster Video (Right click to download)
663	Unsupervised Point Cloud Registration with Self-Distillation Christian Löwens (Bosch), Thorben Funke (Bosch), André Wagner (Bosch), Alexandru Paul Condurache (Bosch) PDF Poster Video (Right click to download)
667	ICAF-4: An Integrated Framework of Category-level Articulated Object Perception and Manipulation for Embodied Intelligence WenBo Xu (Hefei University of Technology), Li Zhang (University of Science and Technology of China), Qiankun Li (University of Science and Technology of China), Qi Wu (Shanghai Jiaotong University), Lin Yuanbo Wu (Swansea University), Liu Liu (Hefei University of Technology) PDF Poster Video (Right click to download)
670	Leveraging Inductive Bias in ViT for Medical Image Diagnosis Jungmin Ha (Kookmin University), Euihyun-yoon (Kookmin University), Sungsik Kim (Kookmin University), Jinkyu Kim (Korea University), Jaekoo Lee (Kookmin University) PDF Poster Video (Right click to download)
678	Content and Style Aware Audio-Driven Facial Animation QINGJU LIU (Flawless AI), Hyeongwoo Kim (Imperial College London), Gaurav Bharaj (Reality Defender AI) PDF Poster Video (Right click to download)
680	May the Forgetting Be with You: Alternate Replay for Learning with Noisy Labels Monica Millunzi (University of Modena and Reggio Emilia), Lorenzo Bonicelli (University of Modena and Reggio Emilia), Angelo Porrello (University of Modena and Reggio Emilia, AimageLab), Jacopo Credi (Chalmers University of Technology), Petter N. Kolm (NYU Courant), Simone Calderara (University of Modena and Reggio Emilia) PDF Poster Video (Right click to download)
681	On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models Hashmat Shadab Malik (Mohamed bin Zayed University of Artificial Intelligence), Numan Saeed (Mohamed bin Zayed University of Artificial Intelligence), Asif Hanif (Mohamed bin Zayed University of Artificial Intelligence), Muzammal Naseer (Khalifa University of Science, Technology and Research), Mohammad Yaqub (Mohamed bin Zayed University of Artificial Intelligence), Salman Khan (Mohamed bin Zayed University of Artificial Intelligence), Fahad Shahbaz Khan (Mohamed bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download)
685	Boundary Contrastive Learning for Label-Efficient Medical Image Segmentation Satoshi Kamiya (Meijo University), Kota Yamashita (Meijo University), Kazuhiro Hotta (Meijo University) PDF Poster
686	TransHuPR: Cross-View Fusion Transformer for Human Pose Estimation Using mmWave Radar Niraj Prakash Kini (National Yang Ming Chiao Tung University), Ruey-Horng Shiue (National Yang Ming Chiao Tung University), ryan chandra (National Yangmin Chiaotung University), Wen-Hsiao Peng (National Yang Ming Chiao Tung University), Ching-Wen Ma (National Yang Ming Chiao Tung University), Jenq-Neng Hwang (University of Washington) PDF Poster Video (Right click to download)
689	AggSS: An Aggregated Self-Supervised Approach for Class Incremental Learning Jayateja Kalla (Indian Institute of Science), Soma Biswas (Indian Institute of Science, Bangalore, India) PDF Poster Video (Right click to download)
692	Spatio-Temporal Transformer with Rotary Position Embedding and Bone Priors for 3D Human Pose Estimation Cheng Chen (University of Electronic Science and Technology of China), Jiang Liu (Southwest Jiaotong University), Liaoyuan Zeng (University of Electronic Science and Technology of China), Fang Duan (University of Bath), Sean McGrath (University of Limerick), Tian Dan (University of Electronic Science and Technology of China) PDF Poster Video (Right click to download)
695	Detecting Audio-Visual Deepfakes with Fine-Grained Inconsistencies Marcella Astrid (University of Luxembourg), Enjie Ghorbel (CRISTAL, ENSI, University of Manouba), Djamila Aouada (University of Luxemburg) PDF Poster Video (Right click to download)
697	Time-conditioned Illumination for Inverse Rendering of Outdoor Scenes Xiaoxue Chen (Tsinghua University, Tsinghua University), Hao Zhao (Tsinghua University, Tsinghua University), Guyue Zhou (Tsinghua University), Ya-Qin Zhang (AIR, Tsinghua University) PDF Poster
707	QUD: Unsupervised Knowledge Distillation for Deep Face Recognition Jan Niklas Kolf (TU Darmstadt), Naser Damer (Fraunhofer Institute for Computer Graphics Research IGD), Fadi Boutros (Fraunhofer Institute for Computer Graphics Research) PDF Poster Video (Right click to download)
721	Sign Stitching: A Novel Approach to Sign Language Production Harry Walsh (University of Surrey), Ben Saunders (University of Surrey), Richard Bowden (University of Surrey) PDF Poster Video (Right click to download)
723	ControlEdit: A MultiModal Local Clothing Image Editing Method Di Cheng (Beijing Institute of Fashion Technology), Yingjie Shi (Beijing Institute of Fashion Technology), sun shixin (Beijing Institute Of Fashion Technology), JiaFu Zhang (Beijing Institute of Fashion Technology), weijing wang (Beijing Institution of Fashion Technology), YULiu (Beijing Institute Of Fashion Technology) PDF
727	Optimising Diffusion Models for Histopathology Image Synthesis Victoria Porter (The Queen's University Belfast), Richard Gault (The Queen's University Belfast), Stephanie G Craig (The Queen's University Belfast), Jacqueline James (The Queen's University Belfast) PDF Poster Video (Right click to download)
729	Reconstructing Spheres by Fitting Planes Erol Ozgur (Institut Pascal Clermont-Ferrand), Mohammad Alkhatib (Institut Pascal Clermont-Ferrand), Youcef Mezouar (Institut Pascal Clermont-Ferrand), Adrien Bartoli (Institut Pascal Clermont-Ferrand) PDF Poster Video (Right click to download)
731	AutoDOM: Automated Dimension Overlay for Enhanced Measurement-Guidance Pushpendu Ghosh (Amazon), Aniket Joshi (Amazon), Soumyajit Chowdhury (Amazon), Promod Yenigalla (Amazon) PDF Poster Video (Right click to download)
736	Rectifying Shortcut Learning through Cellular Differentiation in Deep Learning Neurons Hongjing Niu (University of Science and Technology of China), Hanting Li (University of Science and Technology of China), Guoping Wu (University of Science and Technology of China), Bin Li (University of Science and Technology of China), Feng Zhao (University of Science and Technology of China) PDF Poster Video (Right click to download)
737	Pseudo Labelling for Enhanced Masked Auto Encoders Srinivasa Rao Nandam (University of Surrey), Sara Atito (University of Surrey), Zhenhua Feng (Jiangnan University), Josef Kittler (University of Surrey), Muhammad Awais (University of Surrey) PDF Poster Video (Right click to download)
738	CosFairNet:A Parameter-Space based Approach for Bias Free Learning Rajeev Ranjan Dwivedi (Indian Institute of Science Education and Research Bhopal), Priyadarshini Kumari (Sony AI), Vinod K. Kurmi (IISER Bhopal ) PDF Poster Video (Right click to download)
740	Frequency Decomposition to Tap the Potential of Single Domain for Generalization Hongjing Niu (University of Science and Technology of China), Qingyue Yang (University of Science and Technology of China), Pengfei Xia (University of Science and Technology of China), Wei Zhang (University of Science and Technology of China), Bin Li (University of Science and Technology of China), Feng Zhao (University of Science and Technology of China) PDF Poster Video (Right click to download)
745	Task-Related Feature Enhancement Network for Neuronal Morphology Classification Chunli Sun (University of Science and Technology of China), Feng Zhao (University of Science and Technology of China) PDF Poster Video (Right click to download)
746	Adapting MIMO video restoration networks to low latency constraints Valéry Dewil (Ecole Normale Superieure), Zhe Zheng (Ecole Normale Superieure), Arnaud Barral (Ecole Normale Superieure), Lara Raad (Universidad de la Republica), Nao Nicolas (Thales Group), Ioannis Cassagne (Thales Group), Jean-michel Morel (City University of Hong Kong), Gabriele Facciolo (Ecole Normale Superieure Paris-Saclay), Bruno Galerne (Universite d'Orleans), Pablo Arias (Universitat Pompeu Fabra) PDF Poster Video (Right click to download)
753	Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning Hoàng-Ân Lê (Université de Bretagne Sud), Paul Berg (Université de Bretagne Sud), Minh Tan Pham (Université de Bretagne Sud) PDF Poster Video (Right click to download)
754	Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization Nicholas Moratelli (University of Modena and Reggio Emilia), Davide Caffagni (University of Modena and Reggio Emilia), Marcella Cornia (University of Modena and Reggio Emilia), Lorenzo Baraldi (University of Modena and Reggio Emilia ), Rita Cucchiara (University of Modena and Reggio Emilia) PDF Poster
755	PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition Chenhongyi Yang (University of Edinburgh), Zehui Chen (University of Science and Technology of China), Miguel Espinosa (University of Edinburgh), Linus Ericsson (University of Edinburgh), Zhenyu Wang (Peking University), Jiaming Liu (Peking University), Elliot J. Crowley (University of Edinburgh) PDF Poster Video (Right click to download)
762	Open-World Semi-Supervised Learning under Compound Distribution Shifts Shijia Xu (Nanjing University of Science and Technology), Lin Zhao (Nanjing University of Science and Technology), Jialiang Tang (NJUST), Guangyu Li (Nanjing University of Science and Technology), Chen Gong (Nanjing University of Science and Technology) PDF Poster
763	Horospherical Learning with Smart Prototypes Paul Berg (Université de Bretagne Sud), Björn Michele (Université de Bretagne Sud), Minh Tan Pham (Université de Bretagne Sud), Laetitia Chapel (Institut Agro Rennes-Angers), Nicolas Courty (Université de Bretagne Sud) PDF Poster Video (Right click to download)
769	Flexible Graph Convolutional Network for 3D Human Pose Estimation Abu Taib Mohammed Shahjahan (Concordia University), Abdessamad Ben Hamza (Concordia University) PDF Poster Video (Right click to download)
775	SAE: Single Architecture Ensemble Neural Networks Martin Ferianc (University College London, University of London), Hongxiang Fan (Samsung), Miguel R. D. Rodrigues (University College London) PDF Poster Video (Right click to download)
779	Outlier detection by ensembling uncertainty with negative objectness Anja Delić (University of Zagreb), Matej Grcic (Faculty of Electrical Engineering and Computing, University of Zagreb), Siniša Šegvić (UniZg-FER) PDF Poster Video (Right click to download)
787	MSA2Net: Multi-scale Adaptive Attention-guided Network for Medical Image Segmentation Sina Ghorbani Kolahi (Tarbiat Modares University), Seyed Kamal Chaharsooghi (Tarbiat Modares University), Toktam Khatibi (Tarbiat Modares University), Afshin Bozorgpour (University of Regensburg), Reza Azad (RWTH Aachen), Moein Heidari (University of British Columbia), Ilker Hacihaliloglu (University of British Columbia), Dorit Merhof (University of Regensburg) PDF Poster Video (Right click to download)
790	FILS: Self-Supervised Video Feature Prediction In Semantic Language Space Mona Ahmadian (University of Surrey), Frank Guerin (University of Surrey), Andrew Gilbert (University of Surrey) PDF Poster Video (Right click to download)
797	Calibration of 2D LiDAR sensors using cylindrical target Tamás Tófalvi (Eotvos Lorand University), Bandó Kovács (Eotvos Lorand University), Levente Hajder (Eotvos Lorand University) PDF Poster Video (Right click to download)
828	Multi-Scale Semantic Enrichment and Dual Angular Margin Contrast for Few-Shot Class Incremental Learning Riya Verma (Indian Institute of Technology, Madras), Sukhendu Das (Indian Institute of Technology Madras) PDF Poster Video (Right click to download)
833	Anomaly Detection Based on Semi-Formula Driven Pre-training Dataset to Represent Subtle Difference and Anomaly Score Hiroki Kobayashi (Chukyo University), Naoki Murakami (Chukyo University), Naoto Hiramatsu (Chukyo University), Takahiro Suzuki (Chukyo University), Manabu Hashimoto (Chukyo University) PDF Poster Video (Right click to download)
853	Budget-aware Dynamic Spatially Adaptive Inference Georgios Zampokas (Imperial College London), Christos-Savvas Bouganis (Imperial College London), Dimitris Tzovaras (Centre for Research and Technology Hellas) PDF Poster Video (Right click to download)
854	CSAD: Unsupervised Component Segmentation for Logical Anomaly Detection Yu-Hsuan Hsieh (Department of Computer Science, National Tsing Hua University, National Tsinghua University), Shang-Hong Lai (National Tsing Hua University) PDF Poster Video (Right click to download)
857	Enhancing Radiology Report Generation: The Impact of Locally Grounded Vision and Language Training Sergio Sanchez Santiesteban (University of Surrey), Muhammad Awais (University of Surrey), Yi-Zhe Song (University of Surrey), Josef Kittler (University of Surrey) PDF Poster Video (Right click to download)
859	Extract More from Less: Efficient Fine-Grained Visual Recognition in Low-Data Regimes Dmitry Demidov (Mohamed bin Zayed University of Artificial Intelligence), Abduragim Shtanchaev (Mohamed bin Zayed University of Artificial Intelligence), Mihail Minkov Mihaylov (Mohamed bin Zayed University of Artificial Intelligence), Mohammad Almansoori (Mohamed bin Zayed University of Artificial Intelligence) PDF Poster Video (Right click to download)
863	CLIP with Generative Latent Replay: a Strong Baseline for Incremental Learning Emanuele Frascaroli (University of Modena and Reggio Emilia), Aniello Panariello (University of Modena and Reggio Emilia), Pietro Buzzega (University of Modena and Reggio Emilia), Lorenzo Bonicelli (University of Modena and Reggio Emilia), Angelo Porrello (University of Modena and Reggio Emilia, AimageLab), Simone Calderara (University of Modena and Reggio Emilia) PDF Poster Video (Right click to download)
865	APTPose: Anatomy-aware Pre-Training for 3D Human Pose Estimation Qing-Wen Yang (MediaTek Inc.), Kai-Wen Duan (National Tsinghua University), Ting-Yi Lu (National Tsinghua University), Kevin Lin (Microsoft), Cheng-Yen Yang (University of Washington), Lijuan Wang (Microsoft), Jenq-Neng Hwang (University of Washington, Seattle), Shang-Hong Lai (National Tsing Hua University) PDF Poster Video (Right click to download)
866	A Deep Belief Network Approach to Scalable Compression of Light Field Data for Auto-Stereoscopic Displays Sally Khaidem (Indian Institute of Technology, Madras), Mansi Sharma (Thapar Institute of Engineering & Technology) PDF Poster Video (Right click to download)
878	Learning conditionally untangled latent spaces using Fixed Point Iteration Victor Enescu (LIP6), Hichem Sahbi (Sorbonne University) PDF Poster
882	A Multimodal Network on Handwritten Chinese Character Error Correction Haizhao Sun (Beijing University of Posts and Telecommunications), Yu Ning (Beijing University of Posts and Telecommunications), jixv (Beijing University of Posts and Telecommunications), Chuang Zhang (Beijing University of Posts and Telecommunications), Ming Wu (Beijing University of Post and Telecommunication) PDF Poster Video (Right click to download)
885	Efficient Data Source Relevance Quantification for Multi-Source Neural Networks Jakob Gawlikowski (Technical University of Munich (TUM)), Nina Maria Gottschling (German Aerospace Center (DLR)) PDF Poster
887	Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models Bin Fu (Institute of Computing Technology, Chinese Academy of Sciences), Qiyang Wan (Institute of Computing Technology, Chinese Academy of Sciences), Jialin Li (Institute of Computing Technology, Chinese Academy of Sciences), Ruiping Wang (Institute of Computing Technology, Chinese Academy of Sciences), Xilin Chen (Institute of Computing Technology) PDF Poster Video (Right click to download)
895	Self-Evolving Depth-Supervised 3D Gaussian Splatting from Rendered Stereo Pairs Sadra Safadoust (Koc University), Fabio Tosi (University of Bologna), Fatma Guney (Koc University), Matteo Poggi (University di Bologna) PDF Poster Video (Right click to download)
897	topK dice loss for medical image segmentation Seyed mohsen hosseini (University of Tehran, University of Tehran) PDF Poster Video (Right click to download)
900	Direct-Sum Approach to Integrate Losses Via Classifier Subspace Takumi Kobayashi (National Institute of Advanced Industrial Science and Technology (AIST)) PDF Poster Video (Right click to download)
902	Knowledge Distillation with Global Filters for Efficient Human Pose Estimation Kaushik Bhargav Sivangi (University of Glasgow), Fani Deligianni (University of Glasgow) PDF Poster Video (Right click to download)
911	A Learnable Color Correction Matrix for RAW Reconstruction Anqi Liu (Shanghai University), Shiyi Mu (Shanghai University), Shugong Xu (Shanghai University) PDF Poster Video (Right click to download)
913	Examining the Threat Landscape: Foundation ModelsÂ andÂ ModelÂ Stealing Ankita Raj (Indian Institute of Technology, Delhi), Deepankar Varma (Indian Institute of Technology, Delhi), Chetan Arora (Indian Institute of Technology Delhi) PDF Poster Video (Right click to download)
922	UnSeGArmaNet: Unsupervised Image Segmentation using Graph Neural Networks with Convolutional ARMA Filters Kovvuri Sai Gopal Reddy (Shiv Nadar University), Saran Bodduluri (Shiv Nadar University), A. Mudit Adityaja (Shiv Nadar University), Saurabh Shigwan (Shiv Nadar University), Nitin Kumar (Shiv Nadar University), Snehasis Mukherjee (Shiv Nadar University) PDF Poster Video (Right click to download)
927	GazeHELL: Gaze Estimation with Hybrid Encoders and Localised Losses with weighing Shubham Dokania (Mercedes-Benz R&D India), Vasudev Singh (Mercedes Benz Research & Development India), Shuaib Ahmed (Mercedes Benz R&D India ) PDF Poster Video (Right click to download)
929	TrakAthlete4D: Multi-View On-Field Player Position Tracking in Sports Nitish Agarwal (KinaTrax), Steven Cadavid (University of Miami) PDF Poster Video (Right click to download)
932	Spatiotemporal Vision Transformer for Weakly Supervised Dense Prediction of Dynamic Brain Maps Behnam Kazemivash (Georgia State University), Armin Iraji (Georgia State University), Sergey Plis (Georgia State University), Vince Calhoun (Georgia State University) PDF Poster Video (Right click to download)
933	SceneSAM: Integrating 2D Labels for Weakly Supervised 3D Scene Understanding Julius Koerner (Technical University of Munich), Dogu Tamgac (Technical University of Munich), David Rozenberszki (Technical University of Munich) PDF Poster Video (Right click to download)
936	PV-SLAM: Panoptic Visual SLAM with Loop Closure and Online Bundle Adjustment Ashok Bandyopadhyay (Indian Institute of Technology, Guwahati), Pranjal Baranwal (Indian Institute of Technology, Guwahati, Indian institute of science, Bangalore), Arijit Sur (Indian Institute of Technology, Guwahati), Rajeev UP (Vikram Sarabhai Space Centre, Indian Space Research Organization, Thiruvananthapuram, India) PDF Poster Video (Right click to download)
939	Deep Learning for GPS-Denied SAR Image Focusing and Vehicle Trajectory Estimation Christopher Beam (University of North Carolina at Charlotte), Andrew R. Willis (University of North Carolina, Charlotte), Kevin M Brink (Air Force Research Laboratory) PDF Poster Video (Right click to download)
945	Gaussian Splatting in Mirrors: Reflection-aware Rendering via Virtual Camera Optimization Zihan Wang (Aalto University), Shuzhe Wang (Aalto University), Matias Turkulainen (Aalto University), Junyuan Fang (University of Helsinki), Juho Kannala (Aalto University) PDF Poster Video (Right click to download)
947	Layer-wise Learning of CNNs by Self-tuning Learning Rate and Early Stopping at Each Layer Melika Sadeghi Tabrizi (University of Tehran, University of Tehran), Ali Karimi (Kharazmi University), Ahmad Kalhor (University of Tehran), Babak N Araabi (University of Tehran, University of Tehran), Mona Ahmadian (University of Surrey) PDF Poster Video (Right click to download)
949	On Partial Prototype Collapse in the DINO Family of Self-Supervised Methods Hariprasath Govindarajan (Qualcomm Inc, QualComm), Per SidÃ©n (Linkoping University), Jacob Roll (Qualcomm Inc, QualComm), Fredrik Lindsten (Linkoping University) PDF Poster Video (Right click to download)
954	Beyond Face Matching: A Facial Traits based Privacy Score for Synthetic Face Datasets Robero Leyva (The university of Warwick), Praveen Selvaraj (University College London, University of London), Andrew Elliott (Alan Turing Institute), Dr Gregory Epiphaniou (University of Warwick), carsten maple (The university of Warwick) PDF Poster Video (Right click to download)
957	Putting the Segment Anything Model to the Test with 3D Knee MRI - A Comparison with State-of-the-Art Performance Oliver Mills (University of Leeds), Nishant Ravikumar (University of Leeds), Philip G Conaghan (University of Leeds), Samuel D Relton (University of Leeds) PDF Poster Video (Right click to download)
959	SR+Codec: a Benchmark of Super-Resolution for Video Compression Bitrate Reduction Evgeney Bogatyrev (Moscow State University, Lomonosov Moscow State University), Ivan Molodetskikh (Moscow State University, Lomonosov Moscow State University), Dmitriy S. Vatolin (Moscow State University, Lomonosov Moscow State University) PDF Poster Video (Right click to download)
967	CVAM-Pose: Conditional Variational Autoencoder for Multi-Object Monocular Pose Estimation Jianyu Zhao (University of Central Lancashire), Wei Quan (University of Central Lancashire), Bogdan Matuszewski (University of Central Lancashire) PDF Poster Video (Right click to download)
977	Improving Multimodal Learning with Multi-Loss Gradient Modulation Konstantinos Kontras (Department of Electrical Engineering, KU Leuven, Belgium, KU Leuven), Christos Chatzichristos (KU Leuven), Matthew B. Blaschko (KU Leuven), Maarten De Vos (KU Leuven) PDF Poster
986	Adaptive Weighted Co-Learning for Cross-Domain Few-Shot Learning Abdullah Alchihabi (Carleton University), Marzi Heidari (Carleton University), Yuhong Guo (Carleton University) PDF Poster Video (Right click to download)
987	Guided Attention for Interpretable Motion Captioning KARIM RADOUANE (University of Montpellier), Julien Lagarde (University of Montpellier), Sylvie RANWEZ (IMT Mines Ales), Andon Tchechmedjiev (IMT Mines Ales) PDF Poster Video (Right click to download)
991	iHAST: Integrating Hybrid Attention for Super-Resolution in Spatial Transcriptomics Xi Li (University of California, Irvine), Jing Zhang (Donald Bren School of Information and Computer Sciences, University of California, Irvine), Ziheng Duan (University of California, Irvine), Yi Dai (University of California, Irvine), Siwei Xu (Donald Bren School of Information and Computer Sciences, University of California, Irvine) PDF Poster Video (Right click to download)
998	MV-Match: Multi-View Matching for Domain-Adaptive Identification of Plant Nutrient Deficiencies Jinhui Yi (University of Bonn), Yanan Luo (University of Bonn), Marion Deichmann (University of Bonn), Gabriel Schaaf (University of Bonn), Juergen Gall (University of Bonn) PDF Poster Video (Right click to download)
1013	Open-Vocabulary Temporal Action Localization using Multimodal Guidance Akshita Gupta (University of Guelph), Aditya Arora (York University), Sanath Narayan (Technology Innovation Institute), Salman Khan (Mohamed bin Zayed University of Artificial Intelligence), Fahad Shahbaz Khan (Mohamed bin Zayed University of Artificial Intelligence), Graham W. Taylor (University of Guelph) PDF Poster Video (Right click to download)
1020	Recovering SLAM Tracking Lost by Trifocal Pose Estimation using GPU-HC++ Chiang-Heng Chien (Brown University), Ahmad Abdelfattah (University of Tennessee, Knoxville), Benjamin Kimia (Brown University) PDF Poster Video (Right click to download)

If there are any mistakes on this page, please do not hesitate to contact bmvc@bmvc2024.org