eugene-yh's Stars
chromalock/TI-32
A mod for TI-84 calculators to turn them into cheating devices.
Qucs/qucs
Qucs Project official mirror
rtqichen/torchdiffeq
Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.
ahkab/ahkab
a SPICE-like electronic circuit simulator written in Python
xjasonlyu/tun2socks
tun2socks - powered by gVisor TCP/IP stack
fcbond/hkcancor
Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).
modelscope/data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Agrover112/awesome-semantic-search
A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.
INL/BlackLab
Linguistic search for large annotated text corpora, based on Apache Lucene
DmitryKey/luke
This is mavenised Luke: Lucene Toolbox Project
AntixK/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
masanorihirano/llm-japanese-dataset
Japanese chat dataset for building LLMs
textexploration/mtas
Multi Tier Annotation Search
ginuerzh/gost
GO Simple Tunnel - a simple tunnel written in golang
megagonlabs/ginza
A Japanese NLP Library using spaCy as framework based on Universal Dependencies
ray-project/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
jiangxinyang227/LLM-tuning
Fine-tuning of models such as LLaMA and ChatGLM
sambanova/bloomchat
This repo contains the data preparation, tokenization, training and inference code for BLOOMChat. BLOOMChat is a 176 billion parameter multilingual chat model based on BLOOM.
google-research/FLAN
brightmart/nlp_chinese_corpus
Large Scale Chinese Corpus for NLP
meertensinstituut/mtas
Multi Tier Annotation Search
as-ideas/TransformerTTS
🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
transitive-bullshit/chatgpt-well-known-plugin-finder
Checks Alexa's top 1M websites for the presence of OpenAI's new .well-known/ai-plugin.json files
NoUnique/pymecab-ko
🐍 pymecab-ko. You can find the original versions here: https://bitbucket.org/eunjeon/mecab-ko, https://github.com/SamuraiT/mecab-python3
WorksApplications/sudachi.rs
Sudachi in Rust 🦀 and new generation of SudachiPy
CVI-SZU/Linly
Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pre-training and instruction fine-tuning datasets
facebookresearch/cc_net
Tools to download and cleanup Common Crawl data
sysid/sse-starlette
mpetazzoni/sseclient
Pure-Python Server-Sent Events (SSE) client
jbaudisch/ssec
Client for Server-Sent Events (SSE)