ksopyla
Ph.D., NLP researcher, data scientist, and entrepreneur. Polish ML Community Discord: https://discord.gg/a96XHkUHm5
@PromptSail @Polish-ML-Community Olsztyn, Poland
ksopyla's Stars
ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
open-webui/open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
unslothai/unsloth
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
encode/starlette
The little ASGI framework that shines. 🌟
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
axolotl-ai-cloud/axolotl
Go ahead and axolotl questions
MadcowD/ell
A language model programming library.
python-websockets/websockets
Library for building WebSocket servers and clients in Python
ReactiveX/RxPY
ReactiveX for Python
asyncapi/spec
The AsyncAPI specification allows you to create machine-readable definitions of your asynchronous APIs.
adbar/trafilatura
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
pytorch/torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
microsoft/Phi-3CookBook
This is a Phi-3 book for getting started with Phi-3, a family of open-source AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
character-ai/prompt-poet
Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.
PromtEngineer/Verbi
A modular voice assistant application for experimenting with state-of-the-art transcription, response generation, and text-to-speech models. Supports OpenAI, Groq, ElevenLabs, CartesiaAI, and Deepgram APIs, plus local models via Ollama. Ideal for research and development in voice technology.
Neph0s/awesome-llm-role-playing-with-persona
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
chujiezheng/chat_templates
Chat Templates for 🤗 HuggingFace Large Language Models
salesforce/DialogStudio
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI
choosewhatulike/trainable-agents
Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"
knights-analytics/hugot
ONNX transformer pipelines in Golang
Nealcly/MuTual
A Dataset for Multi-Turn Dialogue Reasoning
alibaba/ChatLearn
A flexible and efficient training framework for large-scale alignment tasks
chrisociepa/allamo
Simple, hackable and fast implementation for training/finetuning medium-sized LLaMA-based models
Sahandfer/PersonaPaper
This is a repository for sharing papers in the field of persona-based conversational AI. The related source code for each paper is linked if available.
open-compass/BotChat
Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.
PacktPublishing/Up-and-Running-with-Concurrency-in-Go-Golang-
Up and Running with Concurrency in Go (Golang), by Packt Publishing
open-compass/CriticBench
[NeurIPS 2024] A comprehensive benchmark for evaluating critique ability of LLMs