Multimodal-action-recognition
Public
Code for selecting an action based on multimodal inputs. In this case, the inputs are voice and text.
Topics: cross-attention, multimodal-action-recognition, multimodal-data, multimodal-deep-learning, multimodal-fusion, multimodal-learning, multimodality
Created: 2021-05-20T00:37:37
Updated: 2025-03-19T16:57:19
Stars: 73
Stars Increase: 0