lqvito's Stars
xyflow/xyflow
React Flow | Svelte Flow - Powerful open source libraries for building node-based UIs with React (https://reactflow.dev) or Svelte (https://svelteflow.dev). Ready out-of-the-box and infinitely customizable.
twintproject/twint
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
nltk/nltk
NLTK Source
pytube/pytube
A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.
OpenLMLab/MOSS
An open-source tool-augmented conversational language model from Fudan University
josdejong/jsoneditor
A web-based tool to view, edit, format, and validate JSON
mistralai/mistral-src
Reference implementation of Mistral AI 7B v0.1 model.
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
zilliztech/GPTCache
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
InternLM/InternLM
Official release of InternLM2.5 7B base and chat models. 1M context support
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
princeton-vl/infinigen
Infinite Photorealistic Worlds using Procedural Generation
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
aiwaves-cn/agents
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
SuperDuperDB/superduperdb
🔮 SuperDuperDB: Bring AI to your database! Build, deploy and manage any AI application directly with your existing data infrastructure, without moving your data. Including streaming inference, scalable model training and vector search.
adbar/trafilatura
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
dvlab-research/LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
refuel-ai/autolabel
Label, clean and enrich text datasets with LLMs.
LC1332/Chat-Haruhi-Suzumiya
Chat-Haruhi-Suzumiya (Chat凉宫春日), an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.
laekov/fastmoe
A fast MoE impl for PyTorch
google-research/FLAN
alibaba/data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Farama-Foundation/chatarena
ChatArena (or Chat Arena) provides Multi-Agent Language Game Environments for LLMs. The goal is to develop the communication and collaboration capabilities of AIs.
Xwin-LM/Xwin-LM
Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment
Duxiaoman-DI/XuanYuan
XuanYuan (轩辕): Du Xiaoman's Chinese financial dialogue large language model
yule-BUAA/MergeLM
Codebase for Merging Language Models (ICML 2024)
miso-belica/jusText
Heuristic-based boilerplate removal tool
choosewhatulike/trainable-agents
Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"