About This Whitepaper Presentation Session
This session features 7 whitepaper presentations, each lasting 5 minutes with 1 minute for questions.
Presented Papers
- ByDeWay: Boost Your multimodal LLM with DEpth prompting in a Training-Free Way
  Rajarshi Roy, Devleena Das, Ankesh Banerjee, Arjya Bhattacharjee, Kousik Dasgupta, Subarna Tripathi
- Reasoning-Enhanced Prompt Strategies for Multi-Label Classification
  Jinze Yu, Guanghui Wang
- A Multi-Stage Pipeline for Accurate Handwritten Information Extraction from Financial Forms
  Guanghui Wang, Xing Zhang, Jinze Yu, Tomal Deb, Xuefeng Liu, Peiyang He
- MGT: Extending Virtual Try-Off to Multi-Garment Scenarios
  Riza Velioglu, Petra Bevandić, Robin Chan, Barbara Hammer
- From Pixels to Context: Adapting Generative Models for Advertising at Scale
  HyunHee Chung, Taeyoung Na
- Cross-lingual Visual Text Stylization and Synthesis Incorporating Text Rendering and Diffusion Model
  Minmin Shen, Caren Chen
- Toward Scalable Video Narration: A Training-free Approach using Multimodal Large Language Models
  Tz-Ying Wu, Tahani Trigui, Sharath Nittur Sridhar, Anand Bodas, Subarna Tripathi