Publication IGOR: Image-GOal Representations are the Atomic Control Units for Foundation Models in Embodied AI Xiaoyu Chen, Junliang Guo, Tianyu He, Chuheng Zhang, Pushi Zhang, Derek Yang, Li Zhao, Jiang Bian October 2024
Publication A Large-Scale Human-Centric Benchmark for Referring Expression Comprehension in the LMM Era Fangyun Wei October 2024
Publication Boosting Text-to-Video Generative Model with MLLMs Feedback Xun Wu, Shaohan Huang, Furu Wei October 2024
Publication ElasTST: Towards Robust Varied-Horizon Forecasting with Elastic Time-Series Transformer Shun Zheng, Xumeng Wen, Jiang Bian October 2024
Publication Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning Rui Wang, Junliang Guo, Xu Tan October 2024
Publication Scaling the Codebook Size of VQ-GAN to 100,000 with a Utilization Rate of 99% Fangyun Wei, Dong Chen October 2024
Publication BPQP: A Differentiable Convex Optimization Framework for Efficient End-to-End Learning Xiao Yang, Xu Yang, Weiqing Liu, Lewen Wang, Jiang Bian October 2024
Publication EEG2Video: Towards Decoding Dynamic Visual Perception from EEG Signals Yansen Wang, Zilong Wang October 2024