wenh18's Stars
OpenInterpreter/open-interpreter
A natural language interface for computers
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
BlinkDL/ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
THUDM/CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
xlang-ai/OpenAgents
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
promptslab/Promptify
Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research
facundoolano/google-play-scraper
Node.js scraper to get data from Google Play
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
mbzuai-oryx/Video-ChatGPT
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
ClaudiuGeorgiu/PlaystoreDownloader
A command line tool to download Android applications directly from the Google Play Store by specifying their package name (an initial one-time configuration is required)
google-deepmind/android_env
RL research on Android devices.
MobileLLM/Personal_LLM_Agents_Survey
Paper list for Personal LLM Agents
Farama-Foundation/miniwob-plusplus
MiniWoB++: a web interaction benchmark for reinforcement learning
YunheWang/HomePage
Yunhe Wang's HomePage
the-themis-benchmarks/home
The Themis Benchmark for evaluating automated GUI testing
Chaos96/fourierft
Bosszhe/EMIFF
EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection
XYIheng/AndroidTesting
Android Testing Literature
X-LANCE/Mobile-Env
A Universal Platform for Training and Evaluation of Mobile Interaction
AndroidArenaAgent/AndroidArena
coinse/droidagent
DroidAgent: Intent-Driven Mobile GUI Testing with Autonomous LLM Agents
testinging6/GPTDroid
wenh18/AdaptiveNet_artifact
boostvolt/icon-dataset
142,416 structured images for icon classification and recognition
m8than/RWKV-LM-LoRA
RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
yikang-li/CalAgent
Calendar Agent using Wechat and GPT
Midysen/googleplay
lori930/lori930.github.io
sunshinewhy/OpenAgents
OpenAgents: An Open Platform for Language Agents in the Wild