
Langri-La

Langrila is a useful tool for using ChatGPT via the OpenAI API or Azure in an easy way. This library puts emphasis on a simple architecture for readability.

Dependencies

Required

  • Python >=3.10,<3.13
  • openai
  • tiktoken

Optional

  • chromadb or qdrant-client (for retrieval)

Contribution

Coding policy

  1. Sticking to simplicity: This library is motivated by simplifying the architecture for readability. Thus, multiple inheritance and nested inheritance should be avoided as much as possible, at least for basic modules.
  2. Making responsibility independent: The responsibility of each module must be closed within the module itself, meaning a module must not affect other modules.
  3. Implementing minimum modules: The more functions each module has, the more complex the source code becomes. Langrila focuses on implementing only the minimum necessary functions in each module.

Branch management rule

  • Topic branches are checked out from the main branch.
  • Topic branches should be small.

Installation

clone

git clone git@github.com:taikinman/langrila.git

pip

cd langrila
pip install -e .

poetry

poetry add --editable /path/to/langrila

Usage example

Prerequisites

  1. Pre-configure the environment variables needed to use the OpenAI API or Azure OpenAI Service, as sketched below.
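
Each module takes the name of an environment variable rather than the secret itself. For a quick local test you could set the variables in-process (a minimal sketch; in practice, prefer exporting them in your shell or loading a .env file):

import os

# The variable names below are the ones used in the examples throughout this README.
os.environ["API_KEY"] = "<your-openai-api-key>"

# # For Azure OpenAI Service, e.g.:
# os.environ["AZURE_API_KEY"] = "<your-azure-api-key>"
# os.environ["ENDPOINT"] = "<your-azure-endpoint>"
# os.environ["DEPLOY_ID"] = "<your-deployment-name>"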

Basic usage

For OpenAI

from langrila import OpenAIChatModule

# For conventional model
chat = OpenAIChatModule(
    api_key_env_name="API_KEY", # env variable name
    model_name="gpt-3.5-turbo-16k-0613",
    # organization_id_env_name="ORGANIZATION_ID", # env variable name
    timeout=60,
    max_retries=2,
)

message = "Please give me only one advice to improve the quality of my sleep."
response = chat(message)

# # for asynchronous processing
# response = await chat(message, arun=True)

response.model_dump()

>>> {'message': {'role': 'assistant',
  'content': 'Establish a consistent sleep schedule by going to bed and waking up at the same time every day, even on weekends.'},
 'usage': {'prompt_tokens': 21, 'completion_tokens': 23},
 'prompt': [{'role': 'user',
   'content': 'Please give me only one advice to improve the quality of my sleep.'}]}

For the newest models released on 2024/01/25:

chat = OpenAIChatModule(
    api_key_env_name="API_KEY", # env variable name
    model_name="gpt-4-0125-preview", # you can specify the newest model
    timeout=60,
    max_retries=2,
    # seed=42, # as needed
    # response_format={"type":"json_object"}, # as needed
)

For Azure

chat = OpenAIChatModule(
    api_key_env_name="AZURE_API_KEY", # env variable name
    model_name="gpt-3.5-turbo-16k-0613",
    api_type="azure",
    api_version="2023-07-01-preview",
    deployment_id_env_name="DEPLOY_ID", # env variable name
    endpoint_env_name="ENDPOINT", # env variable name
)
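
Invocation is then identical to the OpenAI example above:

response = chat("Please give me only one advice to improve the quality of my sleep.")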

Vision model

from PIL import Image

# In this example, I use a picture of "osechi-ryori"
image = Image.open("path/to/your/local/image/file")

chat = OpenAIChatModule(
    api_key_env_name="API_KEY",
    model_name="gpt-4-vision-preview",
)

# stream, astream are also runnable
prompt = "What kind of food is in the picture?"
response = await chat(prompt, images=image, arun=True) # multiple image input is also allowed
response.model_dump()

>>> {'message': {'role': 'assistant',
  'content': 'The image shows a traditional Japanese New Year\'s food called "osechi-ryori." It is typically presented in special boxes called "jubako," which are often lacquered and stacked for a beautiful presentation. Osechi-ryori consists of various dishes, each with a special meaning for the New Year. The foods are often colorful and include items such as sweet black soybeans (kuromame), fish cakes (kamaboko), simmered burdock root (kinpira gobo), marinated herring roe (kazunoko), and other delicacies like prawns, chestnuts, and sweet omelet (tamagoyaki). Each dish is chosen for its auspicious significance and is intended to bring good luck in the coming year.'},
 'usage': {'prompt_tokens': 101, 'completion_tokens': 159},
 'prompt': [{'role': 'user',
   'content': [{'type': 'text',
     'text': 'What kind of food is in the picture?'},
    {'type': 'image_url',
     'image_url': {'url': 'data:image/jpeg;base64,/9j/4AAQSkZJRgABA...',
      'detail': 'low'}}]}]}

Batch processing

prompts = [
    "Please give me only one advice to improve the quality of my sleep.",
    "Please give me only one advice to improve my memory.",
    "Please give me only one advice on how to make exercise a habit.",
    "Please give me only one advice to help me not get bored with things so quickly.",
]

await chat.abatch_run(prompts, batch_size=4)

>>> [CompletionResults(message={'role': 'assistant', 'content': 'Establish a consistent sleep schedule by going to bed and waking up at the same time every day, even on weekends.'}, usage=Usage(prompt_tokens=21, completion_tokens=23, total_tokens=44), prompt=[{'role': 'user', 'content': 'Please give me only one advice to improve the quality of my sleep.'}]),
 CompletionResults(message={'role': 'assistant', 'content': 'One advice to improve memory is to practice regular physical exercise. Exercise has been shown to enhance memory and cognitive function by increasing blood flow to the brain and promoting the growth of new brain cells. Aim for at least 30 minutes of moderate-intensity exercise, such as brisk walking or jogging, most days of the week.'}, usage=Usage(prompt_tokens=18, completion_tokens=64, total_tokens=82), prompt=[{'role': 'user', 'content': 'Please give me only one advice to improve my memory.'}]),
 CompletionResults(message={'role': 'assistant', 'content': "Start small and be consistent. Start with just a few minutes of exercise each day and gradually increase the duration and intensity over time. Consistency is key, so make it a priority to exercise at the same time every day or on specific days of the week. By starting small and being consistent, you'll be more likely to stick with it and make exercise a long-term habit."}, usage=Usage(prompt_tokens=21, completion_tokens=76, total_tokens=97), prompt=[{'role': 'user', 'content': 'Please give me only one advice on how to make exercise a habit.'}]),
 CompletionResults(message={'role': 'assistant', 'content': 'One advice to help you not get bored with things so quickly is to cultivate a sense of curiosity and explore new perspectives. Instead of approaching tasks or activities with a fixed mindset, try to approach them with an open mind and a desire to learn something new. Embrace the mindset of a beginner and seek out different ways to engage with the task at hand. By continuously seeking novelty and finding new angles to approach things, you can keep your interest alive and prevent boredom from setting in.'}, usage=Usage(prompt_tokens=24, completion_tokens=96, total_tokens=120), prompt=[{'role': 'user', 'content': 'Please give me only one advice to help me not get bored with things so quickly.'}])]

You can also run the vision model with batch processing.

prompts = [
    "What is this image?",
    "What is this image?",
]

images = [
    image1, # PIL image of Osechi-ryori
    image2, # PIL image of Mt.Fuji
]

response = await chat.abatch_run(prompts, images=images)

>>> [CompletionResults(message={'role': 'assistant', 'content': 'The image shows a traditional Japanese New Year\'s food called "osechi-ryori." It is presented in special boxes called "jubako," which are often lacquered and stacked for the occasion. Osechi-ryori consists of various dishes, each with a special meaning for the New Year. The dishes are typically colorful and are made to be eaten over the first few days of the New Year, as it\'s considered good luck to not cook during that time. The food items are often sweet, sour, or dried, so they can last for several days without refrigeration.'}, usage=Usage(prompt_tokens=97, completion_tokens=120, total_tokens=217), prompt=[{'role': 'user', 'content': [{'type': 'text', 'text': 'What is this image?'}, {'type': 'image_url', 'image_url': {'url': 'data:image/jpeg;base64,/9j/4AAQSkZJRgABA...', 'detail': 'low'}}]}]),
 CompletionResults(message={'role': 'assistant', 'content': "The image shows a scenic view of Mount Fuji, Japan's highest mountain, with its iconic snow-capped peak. In the foreground, there are lush green slopes, possibly covered with forests, and there are clouds surrounding the middle section of the mountain, adding to the dramatic effect of the scene. The clear blue sky above and the natural beauty suggest this might be a popular spot for sightseeing and photography."}, usage=Usage(prompt_tokens=97, completion_tokens=81, total_tokens=178), prompt=[{'role': 'user', 'content': [{'type': 'text', 'text': 'What is this image?'}, {'type': 'image_url', 'image_url': {'url': 'data:image/jpeg;base64,/9j/4AAQSkZJRgABA...', 'detail': 'low'}}]}])]

Stream

prompt = "Please give me only one advice to improve the quality of my sleep."
response = chat(prompt, stream=True)

for c in response:
    # print(c, end="\r") # for flush
    print(c)

# # For async process
# response = chat(prompt, stream=True, arun=True)

# async for c in response:
#     print(c)

>>> Establish
Establish a
Establish a consistent
Establish a consistent sleep
Establish a consistent sleep schedule
Establish a consistent sleep schedule by
Establish a consistent sleep schedule by going
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Establish a consistent sleep schedule by going to bed and waking up at the same time every day, even on weekends.
message={'role': 'assistant', 'content': 'Establish a consistent sleep schedule by going to bed and waking up at the same time every day, even on weekends.'} usage=Usage(prompt_tokens=21, completion_tokens=30, total_tokens=51) prompt=[{'role': 'user', 'content': 'Please give me only one advice to improve the quality of my sleep.'}] # return the CompletionResults instance at the end
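
Because the generator yields the cumulative text as strings and the CompletionResults instance as the final item, the final result can be captured like this (a minimal sketch based on the output above):

final = None
for c in chat(prompt, stream=True):
    if isinstance(c, str):
        print(c, end="\r")  # cumulative text so far
    else:
        final = c  # CompletionResults instance yielded at the end

print(final.usage)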

Function Calling

For OpenAI

from langrila import ToolConfig, ToolParameter, ToolProperty, OpenAIFunctionCallingModule

def get_weather(city: str, date: str) -> str:
    return f"The weather in {city} on {date} is sunny."

tool_config = ToolConfig(
    name="get_weather",
    description="Get weather at current location.",
    parameters=ToolParameter(
        properties=[
            ToolProperty(
                name="city",
                type="string",
                description="City name",
            ),
            ToolProperty(
                name="date",
                type="string",
                description="Date",
            ),
        ],
        required=["city", "date"],
    ),
)


client = OpenAIFunctionCallingModule(
    api_key_env_name="API_KEY",
    model_name="gpt-3.5-turbo-1106",
    tools=[get_weather],
    tool_configs=[tool_config],
    seed=42,
)


response = await client("Please tell me the weather in Tokyo on 2023/12/13", arun=True)
response.model_dump()

>>> {'usage': {'prompt_tokens': 71, 'completion_tokens': 24},
 'results': [{'call_id': 'call_NRCqCcPhbRWygXq26h7Pll9n',
   'funcname': 'get_weather',
   'args': '{"city":"Tokyo","date":"2023/12/13"}',
   'output': 'The weather in Tokyo on 2023/12/13 is sunny.'}],
 'prompt': [{'role': 'user',
   'content': 'Please tell me the weather in Tokyo on 2023/12/13'}]}

For Azure

formatter = OpenAIFunctionCallingModule(
    api_key_env_name="AZURE_API_KEY", # env variable name
    model_name="gpt-3.5-turbo-16k-0613",
    api_type="azure",
    api_version="2023-07-01-preview",
    endpoint_env_name="ENDPOINT", # env variable name
    deployment_id_env_name="DEPLOY_ID", # env variable name
    timeout=60,
    max_retries=2,
    tools=[get_weather],
    tool_configs=[tool_config],
)
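
The call itself is the same as in the OpenAI example:

response = await formatter("Please tell me the weather in Tokyo on 2023/12/13", arun=True)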

Conversation memory

For standard chat model

from langrila import JSONConversationMemory

chat = OpenAIChatModule(
    api_key_env_name="API_KEY", # env variable name
    model_name="gpt-3.5-turbo-16k-0613",
    conversation_memory=JSONConversationMemory("./conversation_memory.json"),
    timeout=60,
    max_retries=2,
)


message = "Do you know Rude who is the character in Final Fantasy 7."
response = chat(message)
response.model_dump()
>>> {'message': {'role': 'assistant',
  'content': 'Yes, I am familiar with Rude, who is a character in the popular video game Final Fantasy 7. Rude is a member of the Turks, an elite group of operatives working for the Shinra Electric Power Company. He is known for his bald head, sunglasses, and calm demeanor. Rude often partners with another Turk named Reno, and together they carry out various missions throughout the game.'},
 'usage': {'prompt_tokens': 22, 'completion_tokens': 81},
 'prompt': [{'role': 'user',
   'content': 'Do you know Rude who is the character in Final Fantasy 7.'}]}


message = "What does he think about Tifa?"
response = chat(message)
response.model_dump()
>>> {'message': {'role': 'assistant',
  'content': "Rude's thoughts and feelings about Tifa, another character in Final Fantasy 7, are not explicitly stated in the game. However, it is known that Rude respects Tifa as a formidable fighter and acknowledges her skills. In the game, Rude and Tifa have a few encounters where they engage in combat, but there is no indication of any personal feelings or opinions Rude may have towards Tifa beyond their professional interactions."},
 'usage': {'prompt_tokens': 119, 'completion_tokens': 88},
 'prompt': [{'role': 'user',
   'content': 'Do you know Rude who is the character in Final Fantasy 7.'},
  {'role': 'assistant',
   'content': 'Yes, I am familiar with Rude, who is a character in the popular video game Final Fantasy 7. Rude is a member of the Turks, an elite group of operatives working for the Shinra Electric Power Company. He is known for his bald head, sunglasses, and calm demeanor. Rude often partners with another Turk named Reno, and together they carry out various missions throughout the game.'},
  {'role': 'user', 'content': 'What does he think about Tifa?'}]}

For vision chat model

chat = OpenAIChatModule(
    api_key_env_name="API_KEY",
    model_name="gpt-4-vision-preview",
    conversation_memory=JSONConversationMemory("./conversation_memory.json"),
)

prompt = "What kind of food is in the picture?"
response = await chat(prompt, images=image, arun=True)

prompt = "Why did Japanese start eating this at New Year's?"
response = await chat(prompt, arun=True)
response.model_dump()

>>> {'message': {'role': 'assistant',
  'content': "The tradition of eating osechi-ryori during the New Year's celebration in Japan has its origins in the Heian period (794-1185). The practice is deeply rooted in the Shinto belief of toshigami (year gods), who are said to visit during the New Year to bring blessings for the coming year. Preparing osechi-ryori is a way to honor these deities.\n\nThe specific dishes that make up osechi-ryori are chosen for their auspicious meanings and symbolism, which are meant to ensure good fortune, health, and prosperity in the new year. Each dish has a particular significance, such as happiness, fertility, longevity, and success.\n\nAnother practical reason for the development of osechi-ryori was the need to prepare food ahead of time. During the first few days of the New Year, it was traditionally believed that using a hearth or cooking would offend the visiting gods. Therefore, osechi-ryori dishes are made to last for several days without spoiling, allowing people to avoid cooking during this period and instead focus on the New Year's festivities and rituals.\n\nOver time, this practice has evolved, and while the traditional meanings remain, osechi-ryori has also become a special meal for families to enjoy together while celebrating the arrival of the New Year."},
 'usage': {'prompt_tokens': 248, 'completion_tokens': 273},
 'prompt': [{'role': 'user',
   'content': [{'type': 'text',
     'text': 'What kind of food is in the picture?'},
    {'type': 'image_url',
     'image_url': {'url': 'data:image/jpeg;base64,/9j/4AAQSkZJRgABA...',
      'detail': 'low'}}]},
  {'role': 'assistant',
   'content': 'The image shows a traditional Japanese New Year\'s food called "osechi-ryori." It is typically presented in special boxes called "jubako," which are often lacquered and stacked for a beautiful presentation. Osechi-ryori consists of various dishes, each with a special meaning for the New Year. Common items include sweet black soybeans (kuromame), which symbolize health; herring roe (kazunoko), which represents fertility; and shrimp, which are associated with long life. The food is colorful and arranged meticulously to celebrate the beginning of the new year with wishes for prosperity and happiness.'},
  {'role': 'user',
   'content': "Why did Japanese start eating this at New Year's?"}]}

Conversation memory with Cosmos DB

from langrila import OpenAIChatModule
from langrila.memory.cosmos import CosmosConversationMemory

chat = OpenAIChatModule(
    api_key_env_name="API_KEY", # env variable name
    model_name="gpt-3.5-turbo-16k-0613",
    conversation_memory=CosmosConversationMemory(
        endpoint_env_name="COSMOS_ENDPOINT", # env variable names
        key_env_name="COSMOS_KEY",
        db_env_name="COSMOS_DB_NAME",
        container_env_name="COSMOS_CONTAINER_NAME",
    ),
    api_type="azure",
    api_version="2023-07-01-preview",
    deployment_id_env_name="DEPLOY_ID", # env variable name
    endpoint_env_name="ENDPOINT", # env variable name
)

Embedding

from langrila import OpenAIEmbeddingModule

embedder = OpenAIEmbeddingModule(
    api_key_env_name="API_KEY",  # env variable name
    model_name="text-embedding-ada-002",
)

message = "Please give me only one advice to improve the quality of my sleep."

results = embedder(message)
results.model_dump()

>>> {'text': ['Please give me only one advice to improve the quality of my sleep.'],
 'embeddings': [[-0.010138720273971558,
   0.03176884725689888,
   0.025842785835266113,
   -0.02733718417584896,
   0.009971244260668755,
   ...]],
 'usage': {'prompt_tokens': 14, 'completion_tokens': 0}}
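
The returned embeddings can be used directly for similarity computations. For example, a minimal cosine-similarity sketch (assuming numpy is installed and that the result object exposes the embeddings field shown above as an attribute):

import numpy as np

# Embed two texts and compare them with cosine similarity.
a = np.array(embedder("How can I sleep better?").embeddings[0])
b = np.array(embedder("Please give me advice to improve my sleep.").embeddings[0])

similarity = float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))
print(similarity)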

For the 3rd-generation embedding models:

embedder = OpenAIEmbeddingModule(
    api_key_env_name="API_KEY",  # env variable name
    model_name="text-embedding-3-small", # or large
    dimensions=512,
)

message = "Please give me only one advice to improve the quality of my sleep."

results = embedder(message)

Assembling modules for a specific use case

Using a prompt template

from langrila import BaseModule, OpenAIChatModule, PromptTemplate
from typing import Optional
        
SAMPLE_TEMPLATE = """Please answer Yes or No to the following questions.

[Question]
{question}

[Answer]
"""

class TemplateChat(BaseModule):
    def __init__(
        self,
        api_key_env_name: str,
        model_name: str,
        max_tokens: int = 2048,
        timeout: int = 60,
        max_retries: int = 2,
        context_length: Optional[int] = None,
    ):
        self.chat = OpenAIChatModule(
            api_key_env_name=api_key_env_name,
            model_name=model_name,
            max_tokens=max_tokens,
            timeout=timeout,
            max_retries=max_retries,
            context_length=context_length,
        )


    def run(self, prompt: str, prompt_template: Optional[PromptTemplate] = None):
        if isinstance(prompt_template, PromptTemplate):
            prompt = prompt_template.set_args(question=prompt).format()
        response = self.chat(prompt)
        return response


prompt_template = PromptTemplate(
    template=SAMPLE_TEMPLATE,
)

chat = TemplateChat(
    api_key_env_name="API_KEY", 
    model_name="gpt-3.5-turbo-1106",
)

prompt = "Are you GPT-4?"
response = chat(prompt, prompt_template=prompt_template)
response.model_dump()

>>> {'message': {'role': 'assistant', 'content': 'No'},
 'usage': {'prompt_tokens': 30, 'completion_tokens': 1},
 'prompt': [{'role': 'user',
   'content': 'Please answer Yes or No to the following questions.\n\n[Question]\nAre you GPT-4?\n\n[Answer]\n'}]}

Retrieval

Currently, only Chroma and Qdrant are supported for basic retrieval.

For Chroma

from langrila import OpenAIEmbeddingModule
from langrila.database.chroma import ChromaCollectionModule, ChromaRetrievalModule

#######################
# create collection
#######################

embedder = OpenAIEmbeddingModule(
    api_key_env_name="API_KEY",
)

collection = ChromaCollectionModule(
    persistence_directory="chroma", 
    collection_name="sample", 
    embedder=embedder
)

documents = [
    "Langrila is a useful tool to use ChatGPT with OpenAI API or Azure in an easy way.",
    "LangChain is a framework for developing applications powered by language models.", 
    "LlamaIndex (GPT Index) is a data framework for your LLM application."
]

collection(documents=documents)

#######################
# retrieval
#######################

# If the collection module is already instantiated:
# retriever = collection.as_retriever(n_results=2, threshold_similarity=0.8)

retriever = ChromaRetrievalModule(
    embedder=embedder,
    persistence_directory="chroma",
    collection_name="sample",
    n_results=2,
    threshold_similarity=0.8,
)

query = "What is Langrila?"
retriever(query, where=None).model_dump()

>>> {'ids': ['0'],
 'documents': ['Langrila is a useful tool to use ChatGPT with OpenAI API or Azure in an easy way.'],
 'metadatas': [None],
 'similarities': [0.8371831512206707],
 'usage': {'prompt_tokens': 6, 'completion_tokens': 0}}

For Qdrant

from langrila import OpenAIEmbeddingModule
from langrila.database.qdrant import QdrantLocalCollectionModule, QdrantLocalRetrievalModule

#######################
# create collection
#######################

embedder = OpenAIEmbeddingModule(
    api_key_env_name="API_KEY",
)

collection = QdrantLocalCollectionModule(
    persistence_directory="qdrant", 
    collection_name="sample", 
    embedder=embedder
)

documents = [
    "Langrila is a useful tool to use ChatGPT with OpenAI API or Azure in an easy way.",
    "LangChain is a framework for developing applications powered by language models.", 
    "LlamaIndex (GPT Index) is a data framework for your LLM application."
]

collection(documents=documents)

#######################
# retrieval
#######################

# If the collection module is already instantiated:
# retriever = collection.as_retriever(n_results=2, threshold_similarity=0.8)

retriever = QdrantLocalRetrievalModule(
    embedder=embedder,
    persistence_directory="qdrant",
    collection_name="sample",
    n_results=2,
    threshold_similarity=0.8,
)

query = "What is Langrila?"
retriever(query, filter=None).model_dump()

>>> {'ids': [0],
 'documents': ['Langrila is a useful tool to use ChatGPT with OpenAI API or Azure in an easy way.'],
 'metadatas': [None],
 'similarities': [0.8371831512206701],
 'usage': {'prompt_tokens': 6, 'completion_tokens': 0}}

Specific use case

The library supports a variety of use cases by combining modules like these and defining new ones. For example, the following module combines basic retrieval with a prompt template.

from typing import Optional

from langrila import BaseModule, OpenAIChatModule, OpenAIEmbeddingModule, PromptTemplate
from langrila.database.chroma import ChromaRetrievalModule


class RetrievalChatWithTemplate(BaseModule):
    def __init__(
        self,
        api_type: str = "azure",
        api_version: str = "2023-07-01-preview",
        api_version_embedding: str = "2023-05-15",
        endpoint_env_name: str = "ENDPOINT",
        api_key_env_name: str = "API_KEY",
        deployment_id_env_name: str = "DEPLOY_ID",
        deployment_id_name_embedding: str = "DEPLOY_ID_EMBEDDING",
        max_tokens: int = 2048,
        timeout: int = 60,
        max_retries: int = 2,
        context_length: Optional[int] = None,
        model_name: str = "gpt-3.5-turbo-16k-0613",
        path_to_index: str = "path-to-your-chroma-index",
        index_collection_name: str = "your-collection-name",
    ):
        chatmodel_kwargs = {
            "api_type": api_type,
            "api_version": api_version,
            "api_key_env_name": api_key_env_name,
            "endpoint_env_name": endpoint_env_name,
            "deployment_id_env_name": deployment_id_env_name,
        }

        embedding_kwargs = {
            "api_type": api_type,
            "api_version": api_version_embedding,
            "api_key_env_name": api_key_env_name,
            "endpoint_env_name": endpoint_env_name,
            "deployment_id_env_name": deployment_id_name_embedding,
            "model_name": "text-embedding-ada-002"
        }

        self.chat = OpenAIChatModule(
            **chatmodel_kwargs,
            max_tokens=max_tokens,
            timeout=timeout,
            max_retries=max_retries,
            model_name=model_name,
            context_length=context_length,
            conversation_memory=None,
        )

        self.retriever = ChromaRetrievalModule(
            embedder=OpenAIEmbeddingModule(**embedding_kwargs),
            persistence_directory=path_to_index,
            collection_name=index_collection_name,
            n_results=5,
            threshold_similarity=0.8,
        )

    def run(self, prompt: str, prompt_template: Optional[PromptTemplate] = None):
        retrieval_results = self.retriever(prompt)
        relevant_docs = "\n\n".join(retrieval_results.documents)

        if isinstance(prompt_template, PromptTemplate):
            # Pass both the original question and the retrieved context, so the
            # question is not lost when the formatted template replaces the prompt
            # (assumes set_args accepts multiple keyword arguments).
            prompt = prompt_template.set_args(question=prompt, relevant_docs=relevant_docs).format()

        response = self.chat(prompt)
        return response
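
A minimal usage sketch (the template below is hypothetical; its placeholders must match the keywords passed to set_args in run, and the Azure environment variables are assumed to be configured as in the earlier examples):

RETRIEVAL_TEMPLATE = """Please answer the question based only on the following context.

[Context]
{relevant_docs}

[Question]
{question}

[Answer]
"""

chat = RetrievalChatWithTemplate()
prompt_template = PromptTemplate(template=RETRIEVAL_TEMPLATE)

response = chat("What is Langrila?", prompt_template=prompt_template)
response.model_dump()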