ohsuz's Stars
mlflow/mlflow
Open source platform for the machine learning lifecycle
SakanaAI/AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
OpenBMB/ToolBench
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
meta-llama/llama-stack-apps
Agentic components of the Llama Stack APIs
awslabs/multi-agent-orchestrator
Flexible and powerful framework for managing multiple AI agents and handling complex conversations
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
ekzhu/datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
mlabonne/llm-datasets
High-quality datasets, tools, and concepts for LLM fine-tuning.
zjunlp/EasyEdit
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
Nutlope/llamatutor
An AI personal tutor built with Llama 3.1
microsoft/responsible-ai-toolbox
Responsible AI Toolbox is a suite of tools providing model and data exploration and assessment user interfaces and libraries that enable a better understanding of AI systems. These interfaces and libraries empower developers and stakeholders of AI systems to develop and monitor AI more responsibly, and take better data-driven actions.
Tebmer/Awesome-Knowledge-Distillation-of-LLMs
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.
ChenghaoMou/text-dedup
All-in-one text de-duplication
tianyi-lab/Reflection_Tuning
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
tianyi-lab/Cherry_LLM
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models
braintrustdata/autoevals
AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices.
davanstrien/awesome-synthetic-datasets
awesome synthetic (text) datasets
huridocs/pdf-document-layout-analysis
A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of different parts of PDF pages, identifying the elements such as texts, titles, pictures, tables and so on.
sambanova/toolbench
ToolBench, an evaluation suite for LLM tool manipulation capabilities.
tianyi-lab/Superfiltering
[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
microsoft/llmops-workshop
Learn how to build solutions with Large Language Models.
ZHZisZZ/modpo
[ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
hist0613/arxivbot
dsdanielpark/hf-transllm
LLMtranslator translates and generates text in multiple languages.
Azure/slm-innovator-lab
This lab is a 1-day/2-day end-to-end SLM workshop led and developed by AI GBB. Attendees will learn how to quickly and easily perform the data preparation-fine tuning-serving-LLMOps series of processes using Azure ML Studio and AI Studio, and will be able to expand the workload based on this.
J-Seo/KoCommonGEN-V2
KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models
dsdanielpark/arxiv2text
Converting PDF files to text, mainly with a focus on arXiv papers.
aeolian83/paper_translator
DojunPark/multidimensional_MTE
Marker-Inc-Korea/Logickor-Gemma2-Eval
Logickor self-evaluation code (with gemma2)