OpenAI Streaming Hooks

Talk directly to OpenAI Completion APIs and stream the response back in real-time in the browser--no server required.

Provides custom React Hooks capable of calling OpenAI Completions APIs with streaming support enabled by Server-Sent Events (SSE).

The library then decorates every Completion response with metadata about the transaction such as:

The number of tokens used in the response
The time it took to complete the request
Each chunk of the stream
The timestamp each chunk was received
The timestamp from when the Completion was finished

Example

Example code here

See section on running the example for more information.

Use

Install the OpenAI Streaming Hooks library via a package manager like npm or yarn:

npm install --save openai-streaming-hooks

Import the hook and use it:

import { useChatCompletion, GPT35 } from 'openai-streaming-hooks';

const Component = () => {
  const [messages, submitQuery] = useChatCompletion({
    model: GPT35.TURBO,
    apiKey: 'your-api-key',
  });
  ...
};

Supported Types of Completions

There are two main types of completions available from OpenAI that are supported here:

Text Completions, which includes models like text-davinci-003.
Chat Completions, which includes models like gpt-4 and gpt-3.5-turbo.

There are some pretty big fundamental differences in the way these models are supported on the API side. Chat Completions consider the context of previous messages when making the next completion. Text Completions only consider the context passed into the explicit message it is currently answering.

For more information on chat vs. text completion models, see LangChain's excellent blog post on the topic.

Chat Completions

An individual message in a Chat Completion's messages list looks like:

interface ChatMessage {
  content: string;                // The content of the completion
  role: string;                   // The role of the person/AI in the message
  timestamp: number;              // The timestamp of when the completion finished
  meta: {
    loading: boolean;             // If the completion is still being executed
    responseTime: string;         // The total elapsed time the completion took
    chunks: ChatMessageToken[];   // The chunks returned as a part of streaming the execution of the completion
  };
}

Each chunk corresponds to a token streamed back to the client in the completion. A ChatMessageToken is the base incremental shape that content in the stream returned from the OpenAI API looks like:

interface ChatMessageToken {
  content: string;    // The partial content, if any, received in the chunk
  role: string;       // The role, if any, received in the chunk
  timestamp: number;  // The time the chunk was received
}

Submitting a Query

Call the submitQuery function to spawn a request whose response will be streamed back to the client from the OpenAI Chat Completions API. A query takes a list of new messages to append to the existing messages list and submit to OpenAI.

A sample message list might look like:

const newMessages = [
  { role: 'system', content: 'You are a short story bot, you write short stories for kids' },
  { role: 'user', content: 'Write a story about a lonely bunny' },
];

When the query is submitted, a blank message is appended to the end of the messages list with its meta.loading state set to true. This message will be where the content that is streamed back to the client is collected in real-time.

New chunks of the message will appear in the meta.chunks list and your React component will be updated every time a new chunk appears automatically.

💡 Chunks correspond directly to tokens.

By counting the number of chunks, you can count the number of tokens that a response used.

If the submitQuery function is called without any parameters or with an empty list, no request will be sent and instead the messages list will be set back to empty.

Running the Example

Clone this package locally and navigate to it:

git clone https://github.com/jonrhall/openai-streaming-hooks.git
cd openai-streaming-hooks

Export your OpenAI API Key as environment variable VITE_OPENAI_API_KEY:

export VITE_OPENAI_API_KEY=your-key-here

Run the example dev server:

npm run example

Navigate to https://localhost:5179 to see the live example.