oshizo's Stars
webbigdata-jp/JTransBench
A tool to easily benchmark Japanese translation skills
10sedecim/J-RAD
Japanese Rhetoric Annotation Dataset
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
shadcn-ui/ui
Beautifully designed components that you can copy and paste into your apps. Accessible. Customizable. Open Source.
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
kaityo256/sevendayshpc
Become a supercomputer programmer in one week!
segment-any-text/wtpsplit
Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.
confident-ai/deepeval
The LLM Evaluation Framework
pfnet-research/pfmt-bench-fin-ja
pfmt-bench-fin-ja: Preferred Multi-turn Benchmark for Finance in Japanese
lukas-blecher/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
nath1295/LLMFlex
A Python package for developing AI applications with local LLMs.
mendableai/firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
clovaai/CRAFT-pytorch
Official implementation of Character Region Awareness for Text Detection (CRAFT)
shisa-ai/shisa-v2
Japanese / English Bilingual LLM
pfnet-research/japanese-lm-fin-harness
Japanese Language Model Financial Evaluation Harness
llm-jp/llm-jp-eval
jiaweizzhao/GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
FailSpy/abliterator
Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens
microsoft/promptflow
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
fkiliver/SakuraTranslator
HKUNLP/ChunkLlama
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
deepdoctection/deepdoctection
A Repo For Document AI
XiongjieDai/GPU-Benchmarks-on-LLM-Inference
Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?
qdrant/fastembed
Fast, accurate, lightweight Python library for generating state-of-the-art embeddings
mistralai/mistral-finetune
ContextualAI/gritlm
Generative Representational Instruction Tuning
ritaranx/BMRetriever
This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".
jina-ai/reader
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
mizuumi/JDocQA
luyug/GradCache
Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint