reward

There are 120 repositories under reward topic.

tigerneil/awesome-deep-rl
For deep RL and the future of AI.
Language:HTML1.4k 108 3217
aleju/mario-ai
Playing Mario with Deep Reinforcement Learning
Language:Lua687 49 8142
yanm1ng/hexo-theme-vexo
🍟 Vexo is a Hexo theme inspired by Vue's official website.
Language:JavaScript618 15 103127
henry-fun/hanshan-lottery
An amazing lottery app created for the world
Language:HTML353 11 690
greedying/tctip
Language:JavaScript340 28 21161
drallgood/jpasskit
jPasskit is an Java™ implementation of the Apple™ PassKit Web Service.
Language:Java280 29 167110
ecency/ecency-mobile
Ecency Mobile - reimagined social blogging, contribute and get rewarded (for Android and iOS)
Language:TypeScript242 12 1k71
Prem-ium/BingRewards
🤖 Automate Bing Searches 🔍, Quizzes 🧪, Polls 📝, & more for Bing Rewards. 💸
Language:Python236 17 6458
Miraclemarvel55/ChatGLM-RLHF
对ChatGLM直接使用RLHF提升或降低目标输出概率|Modify ChatGLM output with only RLHF
Language:Python191 1 726
alison-carrera/mabalgs
:bust_in_silhouette: Multi-Armed Bandit Algorithms Library (MAB) :cop:
Language:Python130 4 526
WFCD/warframe-drop-data
:moneybag: Warframe Drop Data in an easier to parse format.
Language:JavaScript127 18 3722
NiuTrans/Vision-LLM-Alignment
This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.
Language:Python96 3 96
bulwark-crypto/Bulwark
The primary development repository for the Bulwark project
Language:C++58 23 6257
iGoodie/TwitchSpawn
👾 TwitchSpawn is a Minecraft mod, which is designed for Twitch streamers using 3rd party streaming tools! (comes with its own language!)
Language:Java52 3 5225
powerpool-finance/powerindex
📈📉Power Index is an ecosystem product of PowerPool. The main feature of Power Index is a possibility to create special pools with unique governance and design.
Language:JavaScript51 6 311
ihoey/Playing-reward
超好看的打赏功能~ 演示地址
Language:CSS49 2 332
khinthandarkyaw98/Optimizing-UAV-trajectory-for-maximum-data-rate-via-Q-Learning
During our participation in the Internship Exchange Program, my friend and I collaborated with the guidance of our esteemed supervisor from NTHU.
Language:Python33 2 32
ssbuild/chatglm_rlhf
chatglm_rlhf_finetuning
Language:Python28 2 61
anarkrypto/P2PoW
A P2P Delegated Proof of Work solution for Nano cryptocurrency
Language:JavaScript27 6 13
ssbuild/llm_rlhf
realize the reinforcement learning training for gpt2 llama bloom and so on llm model
Language:Python26 2 72
dp770/aws_deepracer_worksheet
Worksheet and Utilities for AWS DeepRacer – one of the most exciting ways of building strong skills in reinforcement learning and through a hands-on approach. This repository offers: 1) Functionally-rich and flexible reward function 2) Utilities with Jupiter notes for Racing Line calculation and visualisation of track 3) Scripts to parse RoboMaker training and evaluation logs to CSV file 4) Sample Excel file for car behaviour analysis as well as designing and planning new reward curves 5) Coordinates and AWS DeepRacer tracks and images.
Language:Python23 4 18
Miraclemarvel55/LLaMA-MOSS-RLHF-LoRA
用RLHF可选LoRA对LLaMA和MOSS进行训练|Training LLaMA or MOSS with RLHF [LoRA]
Language:Python21 2 31
netcoinfoundation/netcoin
Netcoin - Digital currency with personal interest rate and fair weight stake mining
Language:C++20 17 832
piconnectdev/wepi
🐀 Building a federated alternative to reddit in rust
Language:Rust18 1 00
2008Choco/DragonEggDrop
Spigot plugin. Overhaul the dragons summoned in The End. Configurable templates, loot and particles. (Modern fork of PixelStix's DragonEggDrop)
Language:Java15 3 3913
aaksham/frozenlake
Value & Policy Iteration for the frozenlake environment of OpenAI
Language:Python15 2 011
winston1017/Platformer
2D Platformer game for Android with automatic randomizing level generator! Currently set at 10 levels but 2 lines of code and you will have another.
Language:C#15 5 04
HANDZCZ/genshin-stats
Repository that host code to show my genshin stats. Claims daily reward and active primo codes.
Language:Mako14 2 115
Arcadier/Discount-Coupon-Generator
:gift: Create and reward your consumers with discount coupons to boost sales and build a loyal user base.
Language:JavaScript11 1 02
corbosiny/AIVO-StreetFigherReinforcementLearning
Creating an environment to quickly train a variety of Deep Reinforcement Learning algorithms on Street Fighter 2 using tournaments between learning agents
Language:Python11 3 01
denvash/jesta-android-app
Jesta 💎 is a social app where people can do favors for each other, in exchange for rewards. 🤝
Language:Kotlin11 2 60
mjwpl/LoopLords
Loop Lords is an application designed to help users manage their recurring tasks efficiently. It aims to remind users of their cyclical tasks before the deadline, reward them for completing tasks within the cycle, prioritize tasks based on their last completion date (e.g., diet), and assist users in breaking habits (e.g., computer gaming addiction)
Language:C#11 1 00
my-cloud/ruthenium
Golang implementation of the Ruthenium protocol
Language:Go11 2 1361
NPW-Project/NewPowerCoin
New Power Coin - A new masternode-enabled cryptocurrency that drives online traffic into a new era of decentralization.
Language:C++11 4 85
CarsonScott/Dual-Process-Reinforcement
An intelligent agent that adaptively changes its thought processes to maximize cumulative reward
10 3 01
citizenweb3/staking
Non custodial staking service for web3
Language:Shell10 3 561

reward

tigerneil/awesome-deep-rl

aleju/mario-ai

yanm1ng/hexo-theme-vexo

henry-fun/hanshan-lottery

greedying/tctip

drallgood/jpasskit

ecency/ecency-mobile

Prem-ium/BingRewards

Miraclemarvel55/ChatGLM-RLHF

alison-carrera/mabalgs

WFCD/warframe-drop-data

NiuTrans/Vision-LLM-Alignment

bulwark-crypto/Bulwark

iGoodie/TwitchSpawn

powerpool-finance/powerindex

ihoey/Playing-reward

khinthandarkyaw98/Optimizing-UAV-trajectory-for-maximum-data-rate-via-Q-Learning

ssbuild/chatglm_rlhf

anarkrypto/P2PoW

ssbuild/llm_rlhf

dp770/aws_deepracer_worksheet

Miraclemarvel55/LLaMA-MOSS-RLHF-LoRA

netcoinfoundation/netcoin

piconnectdev/wepi

2008Choco/DragonEggDrop

aaksham/frozenlake

winston1017/Platformer

HANDZCZ/genshin-stats

Arcadier/Discount-Coupon-Generator

corbosiny/AIVO-StreetFigherReinforcementLearning

denvash/jesta-android-app

mjwpl/LoopLords

my-cloud/ruthenium

NPW-Project/NewPowerCoin

CarsonScott/Dual-Process-Reinforcement

citizenweb3/staking