Whitepaper Presentations

Oct 20 3:00 PM HST :calendar:
Audience level: Whitepaper Presentations

About This Whitepaper presentation

This session features 7 whitepaper presentations, each lasting 5 minutes with 1 minute for questions.

Presented Papers

  • ByDeWay: Boost Your multimodal LLM with DEpth prompting in a Training-Free Way Rajarshi Roy, Devleena Das, Ankesh Banerjee, Arjya Bhattacharjee, Kousik Dasgupta, Subarna Tripathi

  • Reasoning-Enhanced Prompt Strategies for Multi-Label Classification Jinze Yu, Guanghui Wang

  • A Multi-Stage Pipeline for Accurate Handwritten Information Extraction from Financial Forms Guanghui Wang, Xing Zhang, Jinze Yu, Tomal Deb, Xuefeng Liu, Peiyang He

  • MGT: Extending Virtual Try-Off to Multi-Garment Scenarios Riza Velioglu, Petra Bevandić, Robin Chan, Barbara Hammer

  • From Pixels to Context: Adapting Generative Models for Advertising at Scale HyunHee Chung, Taeyoung Na

  • Cross-lingual Visual Text Stylization and Synthesis Incorporating Text Rendering and Diffusion Model Minmin Shen, Caren Chen

  • Toward Scalable Video Narration: A Training-free Approach using Multimodal Large Language Models Tz-Ying Wu, Tahani Trigui, Sharath Nittur Sridhar, Anand Bodas, Subarna Tripathi