little1d's Stars
little1d/NLIS
natural-language-image-search
little1d/lapsrn-baseline
modelscope/agentscope
Start building LLM-empowered multi-agent applications in an easier way.
0xPlaygrounds/rig
⚙️🦀 Build portable, modular & lightweight Fullstack Agents
camel-ai/agent-trust
The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?"
open-sciencelab/Social_Science
Multi-Agent System for Science of Science
flarum/flarum
Simple forum software for building great communities.
turquoise1231/Mathematical-Fundamentals-of-Network-Engineering
XDU-SCE 网工数基2024
facebookresearch/spiritlm
Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
hyp1231/awesome-llm-powered-agent
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
OpenBMB/AgentVerse
🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation
OpenBMB/ChatDev
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
thunlp/OpenPrompt
An Open-Source Framework for Prompt-Learning.
KashiwaByte/multimodal-arxiv-daily
🎓Automatically Update Multimodal and Computational Argumentation Papers Daily using Github Actions (Update Every 12th hours)
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
WePOINTS/WePOINTS
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
PFCCLab/Starter
【HACKATHON 预备营】飞桨启航计划集训营
hcffffff/ProSide
Github repository for NLPCC'24 paper ProSide: Knowledge Projector and Sideway for Pre-trained Language Models
gpt-omni/mini-omni2
Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
THUDM/GLM-4-Voice
GLM-4-Voice | 端到端中英语音对话模型
Standard-Intelligence/hertz-dev
first base model for full-duplex conversational audio
datawhalechina/hack-rnns
本仓库将带大家从零开始,用pytorch的线性层搭建传统的NLP神经网络
Henry-23/VideoChat
实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cascaded solutions (ASR-LLM-TTS-THG). Customizable appearance and voice, supporting voice cloning, with initial package delay as low as 3s.
LeMei/Multimodal-Affective-Computing-Survey
declare-lab/multimodal-deep-learning
This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as multimodal sentiment analysis.
ZonglinY/MOOSE-Chem
Official Implementation for <MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses>
Nijikadesu/Transformer-From-Scratch
Implement Transformer using PyTorch.
limafang/tiny-graphrag