Pinned Repositories
CogAGENT
CogKTR
CogKTR: A Knowledge-Enhanced Text Representation Toolkit for Natural Language Understanding. EMNLP 2022
CogKGE
CogKGE: A Knowledge Graph Embedding Toolkit and Benchmark for Representing Multi-source and Heterogeneous Knowledge. ACL 2022
abstract-state-seqmodel
Code for EMNLP 2023 paper "Emergence of Abstract State Representations in Embodied Sequence Modeling"
agent-attack
[Arxiv 2024] Adversarial Attacks on Multimodal Agents
Agent-Smith
[ICML2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
alignment-handbook
Robust recipes to align language models with human and AI preferences
artifacts
AutoDroid
Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"
babyai
BabyAI platform. A testbed for training agents to understand and execute language commands.
Quester-one's Repositories
Quester-one/abstract-state-seqmodel
Code for EMNLP 2023 paper "Emergence of Abstract State Representations in Embodied Sequence Modeling"
Quester-one/agent-attack
[Arxiv 2024] Adversarial Attacks on Multimodal Agents
Quester-one/Agent-Smith
[ICML2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
Quester-one/alignment-handbook
Robust recipes to align language models with human and AI preferences
Quester-one/artifacts
Quester-one/AutoDroid
Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"
Quester-one/babyai
BabyAI platform. A testbed for training agents to understand and execute language commands.
Quester-one/CogVLM
A state-of-the-art open visual language model | Multimodal pretrained model
Quester-one/gpt_academic
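Provides a graphical interactive interface for ChatGPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins, project analysis and self-explanation for Python, C++, and other codebases, PDF/LaTeX paper translation and summarization, parallel querying of multiple LLMs, and local models such as Tsinghua's chatglm2. Compatible with Fudan MOSS, llama, rwkv, newbing, claude, claude2, and others.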
Quester-one/LanguageAgentTreeSearch
Quester-one/LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
Quester-one/R-Judge
R-Judge: Benchmarking Safety Risk Awareness for LLM Agents
Quester-one/SmartPlay
SmartPlay is a benchmark for Large Language Models (LLMs). It uses a variety of games to test important LLM capabilities as agents. SmartPlay is designed to be easy to use and to support future development of LLMs.
Quester-one/Synapse
Trajectory-as-Exemplar Prompting with Memory for Computer Control
Quester-one/gym
A toolkit for developing and comparing reinforcement learning algorithms.
Quester-one/gym-minigrid
Minimalistic gridworld package for OpenAI Gym
Quester-one/HarmBench
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Quester-one/label-words-are-anchors
Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
Quester-one/llm-reasoners
A library for advanced large language model reasoning
Quester-one/llm-transparency-tool
LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing the internal workings of Transformer-based language models. Check out the demo at https://huggingface.co/spaces/facebook/llm-transparency-tool-demo
Quester-one/lm-arithmetic
Code for the paper "A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis"
Quester-one/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
Quester-one/pyvene
Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions
Quester-one/ToRA
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools.
Quester-one/transformer-debugger
Quester-one/TravelPlanner
[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"
Quester-one/tree-of-thought-llm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Quester-one/VisualAgentBench
Towards Large Multimodal Models as Visual Foundation Agents
Quester-one/webarena
Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"
Quester-one/WebShop
[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents