FoundationVision/Groma
[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
Python · Apache-2.0
Watchers
- BillionerdCul-de-sac
- cerviny (Shanghai)
- CloseGoingAwayTetra LTD
- coder-drinker翼悟科技
- d3p10yUltraBear Education
- DamaZonia0.0.0.0
- e-kiss-meo’flye
- err-nilTaiwan
- FarmingTongMindShare
- fryukiLINK REIT
- fskeoTU Berlin
- HS991023Student
- IAm20cm (Very ashamed, I have only done a tiny bit of work)
- jbluvTraffic Tech
- lilong-wen
- Lycokie@Google
- machuofan
- maigoneUCloud
- masemxiaoIBIS
- mistyr0se@Shopify
- MOGUIJOE快手
- MonsterDoveBlessed Be the Fruit
- N0wwaOrganization Strategy
- nicbairex-UnionStack
- ntt720Amoy
- Obsidian6sPudong, Shanghai
- paramedickParameters Lab
- Peiqiqi520Soul App
- reikoloBusan, Korea
- S8XYOPPO
- SpicygumL (Shanghai)
- vamokoHPE
- wensiyuansixRoamResearch Inc.
- WinDB3llState Ltd.
- xupercoinThe Coin Company