Digital Library


Search: "[ keyword: multimodal learning ]" (1)
    Performance Analysis of Video–Audio Action Recognition Using a Cross-Attention-Based Multimodal Fusion Architecture
    Jun Hwa Kim The Transactions of the Korea Information Processing Society, Vol. 15, No. 2, pp. 113-120, Feb. 2026
    https://doi.org/10.3745/TKIPS.2026.15.2.113
    Keywords: multimodal learning, Action Recognition, audio-visual fusion, Cross attention, Transformer