akashe/Multimodal-action-recognition
Code on selecting an action based on multimodal inputs. Here in this case inputs are voice and text.
Python
Stargazers
- anirudhssundarGeorgia Institute of Technology
- bdockbockd
- billhhhUAE
- BPrasad123Bangalore
- Breeze-ZeroNUS
- chenbiaolong
- cvding
- elmahyaiAvailable to work
- gxh-rayBei Jing, CHN
- HariWu1995
- iAuAi
- increase24Shanghai
- iuhiyuh
- jiany-ctrl
- JolsonTsci学生
- Kryptonite1992
- Lansuan
- learning511
- m-ali-awanIslamabad,Pakistan
- muzihuolePeople's Public Security University of China
- nahidalam
- nanxingzhang
- NEUdeephangzhou
- Psjs
- seongminpActionPower
- Smiler36
- solitude6060Taiwan/Taipei
- SoyeonHHAjou University
- Strand2013China
- TheodoreGalanosAustrian Institute of Technology
- wangxiao5791509Anhui University (安徽大学)
- xgjzhls
- xiao2motrio.ai
- Yybcbjy
- Zinference
- zoombapup