oshizo

oshizo's Stars

webbigdata-jp/JTransBench
A tool to easily benchmark Japanese translation skills
Language:Python6
10sedecim/J-RAD
Japanese Rhetoric Annotation Dasaset
Language:Python2
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Language:Python265k45k
shadcn-ui/ui
Beautifully designed components that you can copy and paste into your apps. Accessible. Customizable. Open Source.
Language:TypeScript65.3k3.8k
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
Language:Python13.1k1.1k
kaityo256/sevendayshpc
一週間でなれる！スパコンプログラマ
Language:HTML68228
segment-any-text/wtpsplit
Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.
Language:Python63437
confident-ai/deepeval
The LLM Evaluation Framework
Language:Python2.5k177
pfnet-research/pfmt-bench-fin-ja
pfmt-bench-fin-ja: Preferred Multi-turn Benchmark for Finance in Japanese
Language:Python8
lukas-blecher/LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Language:Python11.5k955
nath1295/LLMFlex
A python package for developing AI applications with local LLMs.
Language:Python13616
mendableai/firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Language:TypeScript8.3k606
clovaai/CRAFT-pytorch
Official implementation of Character Region Awareness for Text Detection (CRAFT)
Language:Python3k862
shisa-ai/shisa-v2
Japanese / English Bilingual LLM
Language:Python8
pfnet-research/japanese-lm-fin-harness
Japanese Language Model Financial Evaluation Harness
Language:Shell514
llm-jp/llm-jp-eval
Language:Python8430
jiaweizzhao/GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Language:Python1.3k135
FailSpy/abliterator
Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens
Language:Python24124
microsoft/promptflow
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
Language:Python8.9k801
fkiliver/SakuraTranslator
Language:C#963
HKUNLP/ChunkLlama
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
Language:Python30415
deepdoctection/deepdoctection
A Repo For Document AI
Language:Python2.4k122
XiongjieDai/GPU-Benchmarks-on-LLM-Inference
Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?
Language:Jupyter Notebook66925
qdrant/fastembed
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
Language:Python1.2k86
mistralai/mistral-finetune
Language:Python2.6k186
ContextualAI/gritlm
Generative Representational Instruction Tuning
Language:Jupyter Notebook48934
ritaranx/BMRetriever
This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".
Language:Python122
jina-ai/reader
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
Language:TypeScript5.9k457
mizuumi/JDocQA
191
luyug/GradCache
Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint
Language:Python33519