puppetm4st3r
Passionate about AI and natural language processing (NLP), focusing on innovating and adding value at the intersection of technology and business.
Chile
Pinned Repositories
convert_checkpoint_to_lsg
Efficient Attention for Long Sequence Processing
chainlit
Build Conversational AI in minutes ⚡️
text-generation-inference
Large Language Model Text Generation Inference
baai_m3_simple_server
This code sets up a simple yet robust server using FastAPI for handling asynchronous requests for embedding generation and reranking tasks using the BAAI M3 multilingual model.
convert_checkpoint_to_lsg
Efficient Attention for Long Sequence Processing
local_function_calling
This repository contains a Python implementation that allows you to use gorilla-llm/gorilla-openfunctions-v2 LLM to perform function calling using the OpenAI protocol. It provides a way to extend the capabilities of the local model by enabling it to generate function arguments and execute functions based on the provided specifications.
aphrodite-engine
PygmalionAI's large-scale inference engine
puppetm4st3r's Repositories
puppetm4st3r/baai_m3_simple_server
This code sets up a simple yet robust server using FastAPI for handling asynchronous requests for embedding generation and reranking tasks using the BAAI M3 multilingual model.
puppetm4st3r/local_function_calling
This repository contains a Python implementation that allows you to use gorilla-llm/gorilla-openfunctions-v2 LLM to perform function calling using the OpenAI protocol. It provides a way to extend the capabilities of the local model by enabling it to generate function arguments and execute functions based on the provided specifications.
puppetm4st3r/convert_checkpoint_to_lsg
Efficient Attention for Long Sequence Processing