11zhouxuan's Stars
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
astral-sh/uv
An extremely fast Python package and project manager, written in Rust.
ItzCrazyKns/Perplexica
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
optuna/optuna
A hyperparameter optimization framework
huggingface/tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
NirDiamant/RAG_Techniques
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.
pyppeteer/pyppeteer
Headless chrome/chromium automation library (unofficial port of puppeteer)
huggingface/speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
stanford-futuredata/ColBERT
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
AnswerDotAI/RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
anthropics/anthropic-quickstarts
A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
GanjinZero/awesome_Chinese_medical_NLP
中文医学NLP公开资源整理:术语集/语料库/词向量/预训练模型/知识图谱/命名实体识别/QA/信息抽取/模型/论文/etc
mkhorasani/Streamlit-Authenticator
A secure authentication module to manage user access in a Streamlit application.
tensorflow/text
Making text a first-class citizen in TensorFlow.
THUDM/LongWriter
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
homanp/langchain-ui
🧬 The open source chat-ai toolkit
amazon-science/RAGChecker
RAGChecker: A Fine-grained Framework For Diagnosing RAG
TEN-framework/ASTRA.ai
TEN(theten.ai) agent is an open-source multimodal AI agent that can speak, see, and access a knowledge base(RAG).
agent-husky/Husky-v1
Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and knowledge-based reasoning tasks.
zyushun/Adam-mini
Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793
abusix/ahocorapy
Pure python Aho-Corasick library.
wharris/esmre
Python extension module for accelerating regular expressions using libesm
TableBench/TableBench
Official repository for paper "TableBench: A Comprehensive and Complex Benchmark for Table Question Answering"
daac-tools/python-daachorse
🐎 A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure. (Python wrapper for daachorse)