Pinned Repositories
adaptive-mbr
Code of "Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding" 2024
annotation-efficient-po
Code of "Annotation-Efficient Preference Optimization for Language Model Alignment"
diverse-mbr
Code of "Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding" 2024
model-based-mbr
Code of "Model-Based Minimum Bayes Risk Decoding for Text Generation" 2024
regularized-bon
Code of "Regularized Best-of-N Sampling to Mitigate Reward Hacking for Language Model Alignment" (2024).
Atari-iterative-width
Dominated Action Sequence Detection for Online Blind Planning applied in Arcade Learning Environment (Atari)
Best-Papers
Best Papers nominees from top conferences related to Artificial Intelligence
Optimal-Options-ICML-2019
Code for generating options for planning and reinforcement learning
Parallel-Best-First-Searches
The source code for the HDA*, PBNF algorithm, and friends.
search-ja
ヒューリスティック探索入門
jinnaiyuu's Repositories
jinnaiyuu/Best-Papers
Best Papers nominees from top conferences related to Artificial Intelligence
jinnaiyuu/search-ja
ヒューリスティック探索入門
jinnaiyuu/Optimal-Options-ICML-2019
Code for generating options for planning and reinforcement learning
jinnaiyuu/Parallel-Best-First-Searches
The source code for the HDA*, PBNF algorithm, and friends.
jinnaiyuu/Atari-iterative-width
Dominated Action Sequence Detection for Online Blind Planning applied in Arcade Learning Environment (Atari)
jinnaiyuu/distributed-fast-downward
Distributed Fast Downward: classical planner for parallel/distributed environments
jinnaiyuu/Hash-Distributed-Astar
Hash Distributed A*
jinnaiyuu/combinatorial_instances
Instance generators for combinatorial search domains: 15-puzzle, 24-puzzle, grid-pathfinding, multiple sequenece alignment
jinnaiyuu/covering-options
covering-options
jinnaiyuu/ods
Mission: To provide a high-quality open content data structures textbook that is both mathematically rigorous and provides complete implementations.
jinnaiyuu/tensorforce
TensorForce: A TensorFlow library for applied reinforcement learning
jinnaiyuu/Asymmetric-k-center
Implementation of an O(log* k) approximation algorithm (Archer 2001) for asymmetric k-center problem.
jinnaiyuu/b-pro
jinnaiyuu/BPIDA-appendix
supplemental material for SoSC 2017 paper https://aaai.org/ocs/index.php/SOCS/SOCS17/paper/view/15801
jinnaiyuu/ContinuousSPM
Significant pattern mining for continuous variables (reimplementation of Sugiyama&Borgwardt https://arxiv.org/abs/1702.08694)
jinnaiyuu/DASP-RL
DASP applied to RL
jinnaiyuu/free-programming-books
jinnaiyuu/icml2016-minecraft
Implementation of "Control of Memory, Active Perception, and Action in Minecraft"
jinnaiyuu/LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut the weaknesses through ranking and integrate the strengths through fusing generation to enhance the capability of LLMs.
jinnaiyuu/mbr-decoding
jinnaiyuu/mp-lamp
Distributed Significant Pattern Mining for Binary/Continuous Variable Features
jinnaiyuu/open-llm-leaderboard-local
Open LLM Leaderboard のローカル実行用スクリプト
jinnaiyuu/optuna
A hyperparameter optimization framework
jinnaiyuu/pybrisque
A python implementation of BRISQUE Image Quality Assessment
jinnaiyuu/Set-Cover
An implementation of a greedy algorithm for the set cover optimization problem (Chvatal, V 1979)
jinnaiyuu/temperature-monitor
Temperature monitor for cluster. It pretty much depends on each hardware so that just pulling this code won't work.
jinnaiyuu/TensorFlow-Examples
TensorFlow Tutorial and Examples for Beginners with Latest APIs
jinnaiyuu/trl
Train transformer language models with reinforcement learning.