Structured extraction in Typescript, powered by llms, designed for simplicity, transparency, and control.
Dive into the world of Python-based structured extraction, by OpenAI's function calling API and Pydantic, the most widely used data validation library for Python. Instructor stands out for its simplicity, transparency, and user-centric design. Whether you're a seasoned developer or just starting out, you'll find Instructor's approach intuitive and steerable.
!!! tip "Support in other languages"
Check out ports to other languages below:
- [Python](https://www.github.com/jxnl/instructor)
- [Elixir](https://github.com/thmsmlr/instructor_ex/)
If you want to port Instructor to another language, please reach out to us on [Twitter](https://twitter.com/jxnlco) we'd love to help you get started!
import Instructor from "@/instructor"
import OpenAI from "openai"
import { z } from "zod"
const UserSchema = z.object({
age: z.number(),
name: z.string()
})
type User = z.infer<typeof UserSchema>
const oai = new OpenAI({
apiKey: process.env.OPENAI_API_KEY ?? undefined,
organization: process.env.OPENAI_ORG_ID ?? undefined
})
const client = Instructor({
client: oai,
mode: "FUNCTIONS"
})
const user: User = await client.chat.completions.create({
messages: [{ role: "user", content: "Jason Liu is 30 years old" }],
model: "gpt-3.5-turbo",
response_model: { schema: UserSchema }
})
console.log(user)
// { age: 30, name: "Jason Liu" }
The question of using Instructor is fundamentally a question of why to use Pydantic.
-
Powered by OpenAI — Instructor is powered by OpenAI's function calling API. This means you can use the same API for both prompting and extraction.
-
Customizable — Zod is highly customizable. You can define your own validators, custom error messages, and more.
-
Ecosystem Zod is the most widely used data validation library for Typescript.
-
Battle Tested — Zod is downloaded over 24M times per month, and supported by a large community of contributors.
If you'd like to see more check out our cookbook.
Installing Instructor is a breeze.
If you want to help out, checkout some of the issues marked as good-first-issue
or help-wanted
. Found here. They could be anything from code improvements, a guest blog post, or a new cook book.
This project is licensed under the terms of the MIT License.
- Add
llm_validator
- Logging for Distillation / Finetuning
- Support Streaming
- Optional/Maybe types
- Add Tutorials, include in docs
- Text Classification
- Search Queries
- Query Decomposition
- Citations
- Knowledge Graph
- Self Critique
- Image Extracting Tables
- Moderation
- Entity Resolution
- Action Item and Dependency Mapping
These translations provide a structured approach to creating TypeScript schemas with Zod, mirroring the functionality and intent of the original Python examples.