hasura-pod42

A Discord bot to answer questions based on docs using the latest ChatGPT API, built on Hasura GraphQL Engine and LangChain.

You can try the bot on our Discord, read more about the annoucement here.

Its features include:

Asynchronous architecture based on the Hasura events system with rate limiting and retries.
Performant Discord bot built on Hasura's streaming subscriptions.
Ability to ingest your content to the bot.
Prompt to make GPT-3 answer with sources while minimizing bogus answers.

Made with ❤️ by Hasura

Motivation

We at Hasura always believe that we are better at caring for plumbing so they can focus on their core problems. Hence, when text-davinci-003 came out, we saw an opportunity to resolve our user's query on Discord.

We had the following objectives when creating the bot,

Use Hasura's docs/blogs/learning courses.
Always list sources when answering.
Better to say "I don't know" over an incorrect answer.
Capture user feedback and iterate quickly.

Installation
- Setup Hasura Pod42
Architecture
Comparison: text-davinci-003 vs gpt-3.5-turbo

Installation

Steps to Setup Hasura Pod42

Setup pod42-server
You can use the one-click to deploy on Hasura Cloud to get started quickly:

Architecture

Pod42 is based on 3-factor architecture

Discord Bot:

Uses real-time GraphQL API from Hasura.
Minimal state and code.
Instant feedback.
Easily Scalable.

Tasks:

Collect questions from users and persist them via Hasura's GraphQL API.
Listen for answers in real-time using subscriptions.

Hasura:

Completely asynchronous orchestrator using event triggers and subscriptions.
Event triggers handle retry and rate limit to webhook.
Subscriptions allow us to deliver instant answers to the Discord bot.

Tasks:

Trigger workflow when a new question comes.
When the Answer arrives, notify the clients through subscription.

Serverless Functions/Containers:

Stateless easily be deployed as a function on the cloud.

Tasks:

Fetch top K-related docs excerpts from the vector store.
Combine them in one document along with the question to OpenAI.
Persist the answer using Hasura's GraphQL API.

Comparison: text-davinci-003 vs gpt-3.5-turbo

For Hasura's use case, we want to emphasize the correctness of the answers; it's better for us if Pod42 says "I Don't Know" instead of bluffing an answer. We see that gpt-3.5-turbo does much better in that regard. It's also more verbose, but many new users like the details, and in terms of latency, gpt-3.5-turbo was ~60% faster.

Also, We found that passing information part user role in the prompt is more effective at the moment vs. the system role.

All the examples use the same prompt and vector store data.