OpenRouter Runner is a monolith inference engine, built with Modal, used for lots of the open source models hosted in a fallback capacity on openrouter.ai.
- vLLM
- HF Transformers
cd modal
- Select a modal app, like the runner
- Follow the steps in the project README.
Interested in contributing? Please read our contributing guide and follow our code of conduct.