ohsuz

ʕ•̫͡•ʔ-̫͡-ʕ•͓͡•ʔ-̫͡-ʕ•̫͡•ʔ-̫͡-ʕ•͓͡•ʔ-̫͡-ʔ

ohsuz's Stars

mlflow/mlflow
Open source platform for the machine learning lifecycle
Language:Python19.1k 305 4k4.3k
SakanaAI/AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
Language:Jupyter Notebook8.5k 105 1161.2k
OpenBMB/ToolBench
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
Language:Python4.9k 49 304430
meta-llama/llama-stack-apps
Agentic components of the Llama Stack APIs
4k 120 63640
awslabs/multi-agent-orchestrator
Flexible and powerful framework for managing multiple AI agents and handling complex conversations
Language:Python3.6k 32 76267
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
2.7k 50 3175
ekzhu/datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
Language:Python2.6k 48 166296
mlabonne/llm-datasets
High-quality datasets, tools, and concepts for LLM fine-tuning.
2.2k 35 1187
zjunlp/EasyEdit
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
Language:Jupyter Notebook2k 24 365242
Nutlope/llamatutor
An AI personal tutor built with Llama 3.1
Language:TypeScript1.5k 18 5239
microsoft/responsible-ai-toolbox
Responsible AI Toolbox is a suite of tools providing model and data exploration and assessment user interfaces and libraries that enable a better understanding of AI systems. These interfaces and libraries empower developers and stakeholders of AI systems to develop and monitor AI more responsibly, and take better data-driven actions.
Language:TypeScript1.4k 33 286376
Tebmer/Awesome-Knowledge-Distillation-of-LLMs
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.
706 11 540
ChenghaoMou/text-dedup
All-in-one text de-duplication
Language:Python641 4 7171
tianyi-lab/Reflection_Tuning
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
Language:Python343 4 530
tianyi-lab/Cherry_LLM
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models
Language:Python323 3 3021
braintrustdata/autoevals
AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices.
Language:Python299 3 1423
davanstrien/awesome-synthetic-datasets
awesome synthetic (text) datasets
Language:Jupyter Notebook251 7 211
huridocs/pdf-document-layout-analysis
A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of different parts of PDF pages, identifying the elements such as texts, titles, pictures, tables and so on.
Language:Python215 8 2224
sambanova/toolbench
ToolBench, an evaluation suite for LLM tool manipulation capabilities.
Language:Python144 1 411
tianyi-lab/Superfiltering
[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
Language:Python135 2 612
microsoft/llmops-workshop
Learn how to build solutions with Large Language Models.
Language:Jupyter Notebook130 7 347
ZHZisZZ/modpo
[ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
Language:Python61 2 34
hist0613/arxivbot
Language:Python57 3 282
dsdanielpark/hf-transllm
LLMtranslator translates and generates text in multiple languages.
Language:Jupyter Notebook41 3 13
Azure/slm-innovator-lab
This lab is a 1-day/2-day end-to-end SLM workshop led and developed by AI GBB. Attendees will learn how to quickly and easily perform the data preparation-fine tuning-serving-LLMOps series of processes using Azure ML Studio and AI Studio, and will be able to expand the workload based on this.
Language:Jupyter Notebook31 5 013
J-Seo/KoCommonGEN-V2
KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models
Language:Python25 4 01
dsdanielpark/arxiv2text
Converting PDF files to text, mainly with a focus on arXiv papers.
Language:Jupyter Notebook14 2 21
aeolian83/paper_translator
Language:Python9 1 00
DojunPark/multidimensional_MTE
Language:Jupyter Notebook2 1 01
Marker-Inc-Korea/Logickor-Gemma2-Eval
Logickor self-evaluation code (with gemma2)
Language:Python1

ohsuz

ohsuz's Stars

mlflow/mlflow

SakanaAI/AI-Scientist

OpenBMB/ToolBench

meta-llama/llama-stack-apps

awslabs/multi-agent-orchestrator

Zjh-819/LLMDataHub

ekzhu/datasketch

mlabonne/llm-datasets

zjunlp/EasyEdit

Nutlope/llamatutor

microsoft/responsible-ai-toolbox

Tebmer/Awesome-Knowledge-Distillation-of-LLMs

ChenghaoMou/text-dedup

tianyi-lab/Reflection_Tuning

tianyi-lab/Cherry_LLM

braintrustdata/autoevals

davanstrien/awesome-synthetic-datasets

huridocs/pdf-document-layout-analysis

sambanova/toolbench

tianyi-lab/Superfiltering

microsoft/llmops-workshop

ZHZisZZ/modpo

hist0613/arxivbot

dsdanielpark/hf-transllm

Azure/slm-innovator-lab

J-Seo/KoCommonGEN-V2

dsdanielpark/arxiv2text

aeolian83/paper_translator

DojunPark/multidimensional_MTE

Marker-Inc-Korea/Logickor-Gemma2-Eval