sharkwyf

First to the key

Pinned Repositories

agenta
The LLMOps platform to build robust LLM apps. Easily experiment and evaluate different prompts, models, and workflows.
Language: TypeScript | 0 0 00
ASE
Language: Python | 0 0 00
basalt_2022
Language: Python | 0 1 00
block-recurrent-transformer
PyTorch implementation of "Block Recurrent Transformers" (Hutchins & Schlag et al., 2022)
Language: Python | 0 0 00
cgdt
[AAAI'2024] Critic-Guided Decision Transformer for Offline Reinforcement Learning
Language: Python | 9 2 00
Continuous-AdvTrain
Language: Python | 0 0 00
decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
Language: Python | 0 0 00
DeepSpeedExamples
Example models using DeepSpeed
Language: Python | 0 0 00
DI-engine
OpenDILab Decision AI Engine
Language: Python | 0 0 00
dify
An Open-Source Assistants API and GPTs alternative. Dify.AI is an LLM application development platform. It integrates the concepts of Backend as a Service and LLMOps, covering the core tech stack required for building generative AI-native applications, including a built-in RAG engine.
Language: TypeScript | 0 0 00

sharkwyf's Repositories

sharkwyf/cgdt
[AAAI'2024] Critic-Guided Decision Transformer for Offline Reinforcement Learning
Language: Python | 9 2 00
sharkwyf/RepoAgent
An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.
Language: Python | 1 0 00
sharkwyf/safe-rlhf
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Language: Python | 1 0 00
sharkwyf/agenta
The LLMOps platform to build robust LLM apps. Easily experiment and evaluate different prompts, models, and workflows.
Language: TypeScript | 0 0 00
sharkwyf/ASE
Language: Python | 0 0 00
sharkwyf/Continuous-AdvTrain
Language: Python | 0 0 00
sharkwyf/DeepSpeedExamples
Example models using DeepSpeed
Language: Python | 0 0 00
sharkwyf/DI-engine
OpenDILab Decision AI Engine
Language: Python | 0 0 00
sharkwyf/dify
An Open-Source Assistants API and GPTs alternative. Dify.AI is an LLM application development platform. It integrates the concepts of Backend as a Service and LLMOps, covering the core tech stack required for building generative AI-native applications, including a built-in RAG engine.
Language: TypeScript | 0 0 00
sharkwyf/dreamerv3
Mastering Diverse Domains through World Models
Language: Python | 0 0 00
sharkwyf/FrozenBiLM
[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
Language: Python | 0 0 00
sharkwyf/gpt-researcher
LLM-based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.
sharkwyf/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
Language: Python | 0 0
sharkwyf/HarmBench
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Language: Jupyter Notebook | 0 0
sharkwyf/IVR
Author's implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"
Language: Python | 0 0
sharkwyf/langflow
⛓️ Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity.
Language: Python | 0 0
sharkwyf/latent-adversarial-training
sharkwyf/LLaMA-Factory
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
Language: Python | 0 0
sharkwyf/lmm-r1
Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
sharkwyf/neuralmmo
Baselines for Neural MMO -- new users should treat this repo as a starter project
Language: Python | 0 0
sharkwyf/notion-feeder
🕸 A Node app for creating a Feed Reader in Notion.
Language: JavaScript | 0 0
sharkwyf/PDT
Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer
Language: Python | 0 0
sharkwyf/R1-V
Witness the aha moment of VLM with less than $3.
sharkwyf/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Language: Python | 0 0
sharkwyf/SimPO
SimPO: Simple Preference Optimization with a Reference-Free Reward
Language: Python | 0 0
sharkwyf/Stable-Alignment
Efficient, effective, and stable alternative to RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".
Language: Python | 0 0
sharkwyf/trl
Train transformer language models with reinforcement learning.
Language: Python | 0 0
sharkwyf/verl
verl: Volcano Engine Reinforcement Learning for LLMs
Language: Python
sharkwyf/VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
Language: Python | 0 0
sharkwyf/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language: Python | 0 0