jinnaiyuu

Artificial Intelligence, Planning, Reinforcement Learning, and Symbol Grounding

Tokyo

Pinned Repositories

adaptive-mbr
Code of "Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding" 2024
Language:Python1 1 00
annotation-efficient-po
Code of "Annotation-Efficient Preference Optimization for Language Model Alignment"
Language:Python4 0 00
diverse-mbr
Code of "Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding" 2024
Language:Python2 1 00
model-based-mbr
Code of "Model-Based Minimum Bayes Risk Decoding for Text Generation" 2024
Language:Jupyter Notebook3 1 00
regularized-bon
Code of "Regularized Best-of-N Sampling to Mitigate Reward Hacking for Language Model Alignment" (2024).
Language:Python7 0 00
Atari-iterative-width
Dominated Action Sequence Detection for Online Blind Planning applied in Arcade Learning Environment (Atari)
Language:C++6 2 00
Best-Papers
Best Papers nominees from top conferences related to Artificial Intelligence
20 5 01
Optimal-Options-ICML-2019
Code for generating options for planning and reinforcement learning
Language:Python11 2 03
Parallel-Best-First-Searches
The source code for the HDA*, PBNF algorithm, and friends.
Language:C++8 2 00
search-ja
ヒューリスティック探索入門
Language:TeX17 2 04

jinnaiyuu's Repositories

jinnaiyuu/Best-Papers
Best Papers nominees from top conferences related to Artificial Intelligence
20 5 01
jinnaiyuu/search-ja
ヒューリスティック探索入門
Language:TeX17 2 04
jinnaiyuu/Optimal-Options-ICML-2019
Code for generating options for planning and reinforcement learning
Language:Python11 2 03
jinnaiyuu/Parallel-Best-First-Searches
The source code for the HDA*, PBNF algorithm, and friends.
Language:C++8 2 00
jinnaiyuu/Atari-iterative-width
Dominated Action Sequence Detection for Online Blind Planning applied in Arcade Learning Environment (Atari)
Language:C++6 2 00
jinnaiyuu/distributed-fast-downward
Distributed Fast Downward: classical planner for parallel/distributed environments
Language:C++6 3 01
jinnaiyuu/Hash-Distributed-Astar
Hash Distributed A*
Language:C++3 2 1081
jinnaiyuu/combinatorial_instances
Instance generators for combinatorial search domains: 15-puzzle, 24-puzzle, grid-pathfinding, multiple sequenece alignment
Language:Roff1 2 00
jinnaiyuu/covering-options
covering-options
Language:Python1 2 12
jinnaiyuu/ods
Mission: To provide a high-quality open content data structures textbook that is both mathematically rigorous and provides complete implementations.
Language:TeX1 3 00
jinnaiyuu/tensorforce
TensorForce: A TensorFlow library for applied reinforcement learning
Language:Python1 3 0
jinnaiyuu/Asymmetric-k-center
Implementation of an O(log* k) approximation algorithm (Archer 2001) for asymmetric k-center problem.
Language:Python2 0
jinnaiyuu/b-pro
Language:C++2 0
jinnaiyuu/BPIDA-appendix
supplemental material for SoSC 2017 paper https://aaai.org/ocs/index.php/SOCS/SOCS17/paper/view/15801
Language:Cuda0 0
jinnaiyuu/ContinuousSPM
Significant pattern mining for continuous variables (reimplementation of Sugiyama&Borgwardt https://arxiv.org/abs/1702.08694)
Language:C2 0
jinnaiyuu/DASP-RL
DASP applied to RL
Language:C++2 0
jinnaiyuu/free-programming-books
2 0
jinnaiyuu/icml2016-minecraft
Implementation of "Control of Memory, Active Perception, and Action in Minecraft"
Language:Java2 0
jinnaiyuu/LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut the weaknesses through ranking and integrate the strengths through fusing generation to enhance the capability of LLMs.
Language:Python
jinnaiyuu/mbr-decoding
1 0
jinnaiyuu/mp-lamp
Distributed Significant Pattern Mining for Binary/Continuous Variable Features
Language:C++4 01
jinnaiyuu/open-llm-leaderboard-local
Open LLM Leaderboard のローカル実行用スクリプト
jinnaiyuu/optuna
A hyperparameter optimization framework
Language:Python2 0
jinnaiyuu/pybrisque
A python implementation of BRISQUE Image Quality Assessment
Language:Python1 0
jinnaiyuu/Set-Cover
An implementation of a greedy algorithm for the set cover optimization problem (Chvatal, V 1979)
Language:Python2 0
jinnaiyuu/temperature-monitor
Temperature monitor for cluster. It pretty much depends on each hardware so that just pulling this code won't work.
Language:Shell2 0
jinnaiyuu/TensorFlow-Examples
TensorFlow Tutorial and Examples for Beginners with Latest APIs
Language:Jupyter Notebook2 0
jinnaiyuu/trl
Train transformer language models with reinforcement learning.
Language:Python0 0